DigitalPebble's Blog
Sunday, 12 June 2011

Nutch 1.3 released + BerlinBuzzwords presentation

›
Nutch 1.3 has been released and contains quite a few changes , some of which have been retrofitted from Nutch 2.0 in trunk. The main modif...
Friday, 27 May 2011

Parsing the Enron email dataset using Tika and Hadoop

›
In order to parse a large collection of emails, such as the Enron Email Dataset , we might choose to use Apache Hadoop , a scalable computin...
5 comments:
Saturday, 7 May 2011

Nutch talk at Berlin Buzzwords 2011

›
I'll be giving a talk on Apache Nutch at Berlin Buzzwords. This talk will give an overview of Apache Nutch. I will describe its main ...
Tuesday, 22 March 2011

Search for US properties with SOLR and Maptimize

›
Our clients 5k50 have recently opened a preview of their real-estate search system which is based on Apache SOLR and Maptimize. Maptimize ...
Saturday, 19 March 2011

DigitalPebble is hiring!

›
We are looking for a candidate with the following skills and expertise : strong background in NLP and Java GATE, experience of writing plu...
Monday, 21 February 2011

Watson, the computer Behemoth in Jeopardy!

›
Alex Popescu's excellent blog mentioned the DeepQA project and IBM's supercomputer Watson. Watson's recent appearance on the US...
Friday, 21 January 2011

BerlinBuzzwords 2011

›
There is a CFP for BerlinBuzzwords 2011 which will be on 6/7 June. As the website says : Berlin Buzzwords 2011 is ...
‹
›
Home
View web version
Powered by Blogger.