DigitalPebble's Blog
Showing posts with label mahout. Show all posts
Showing posts with label mahout. Show all posts
Wednesday, 5 September 2012

Using Behemoth on the CommonCrawl dataset

›
Behemoth is an open-source platform for document processing based on Hadoop which provides an excellent way to process document collection...
4 comments:
Saturday, 19 March 2011

DigitalPebble is hiring!

›
We are looking for a candidate with the following skills and expertise : strong background in NLP and Java GATE, experience of writing plu...
›
Home
View web version
Powered by Blogger.