July 18, 2009 Leave a comment
Here it is – Sematext’s new and shinny blog.
We’ll be writing about topics that are dear and important to us – search (both web search and enterprise search), text analytics, natural language processing (sentiment detection, named entity recognition…), machine learning, information gathering (e.g. web crawling), information extraction, e-discovery, recommendation engines, etc. There will be a lot of talk about tools we use regularly – Lucene, Solr, Nutch, Mahout and Taste, Hadoop, HBase and friends, and more.