Sunday, 12 June 2011
Nutch 1.3 released + BerlinBuzzwords presentation
Nutch 1.3 has been released and contains quite a few changes, some of which have been retrofitted from Nutch 2.0 in trunk. The main modif...
Friday, 27 May 2011
Parsing the Enron email dataset using Tika and Hadoop
In order to parse a large collection of emails, such as the Enron Email Dataset, we might choose to use Apache Hadoop, a scalable computin...
Saturday, 7 May 2011
Nutch talk at Berlin Buzzwords 2011
I'll be giving a talk on Apache Nutch at Berlin Buzzwords. This talk will give an overview of Apache Nutch. I will describe its main ...
Tuesday, 22 March 2011
Search for US properties with SOLR and Maptimize
Our client 5k50 has recently opened a preview of its real-estate search system, which is based on Apache SOLR and Maptimize. Maptimize ...
Saturday, 19 March 2011
DigitalPebble is hiring!
We are looking for a candidate with the following skills and expertise: strong background in NLP and Java GATE, experience of writing plu...
Monday, 21 February 2011
Watson, the computer Behemoth in Jeopardy!
Alex Popescu's excellent blog mentioned the DeepQA project and IBM's supercomputer Watson. Watson's recent appearance on the US...
Friday, 21 January 2011
BerlinBuzzwords 2011
There is a CFP for BerlinBuzzwords 2011, which will be held on 6/7 June. As the website says: Berlin Buzzwords 2011 is ...