DigitalPebble's Blog
Monday, 27 September 2010

SimilarPages is out!

›
It's always nice to see clients emerging of stealth mode and showing the fruits of their labour to the public. Our friends at http://www...

Apache Nutch 1.2 released

›
[quoting the announcement by Chris Mattmann] The Apache Nutch project is pleased to announce the release of Apache Nutch 1.2. The release...
Friday, 3 September 2010

Field-based Weighting Schemes for Text Classification

›
Our Text Classification API uses a representation of documents based on fields, a bit like in Lucene. This is quite useful as it allows ...
Saturday, 28 August 2010

Behemoth talk from BerlinBuzzwords 2010

›
The talk I gave on Behemoth at BerlinBuzzwords has been filmed (I do not dare watching it) and is available on http://blip.tv/file/3809855...
Friday, 27 August 2010

Tom White on Hadoop 0.21

›
An excellent summary from Tom White on the release 0.21 of Hadoop http://www.cloudera.com/blog/2010/08/what%e2%80%99s-new-in-apache-hadoop...
Thursday, 26 August 2010

Using Payloads with DisMaxQParser in SOLR

›
Payloads are a good way of controlling the scores in SOLR/Lucene. This post by Grant Ingersoll gives a good introduction to payloads, I al...
7 comments:
Thursday, 19 August 2010

Tika on FeatherCast

›
Apache Tika recently split off from the Lucene project and became a separate top level Apache project. Chris Mattmann is talking about wh...
‹
›
Home
View web version
Powered by Blogger.