November 5, 2014 Leave a comment
Going to Lucene/Solr Revolution next week — November 11-14 — in Washington, DC? If so…Sematext will be there exhibiting AND giving two talks! If you are going, stop by our table to say hello. We can show you the latest versions of SPM Performance Monitoring, Logsene Log Management and Analytics, Site Search Analytics, and, of course, talk about metrics, centralized log management, Lucene, Solr, Elasticsearch, and just about any other search-related topic you might be interested in. After all, not only have we blogged, given talks and spread the word in all sorts of ways, we’ve also written books on these subjects!
Both of the Sematext engineer talks take place on Friday, November 14. They are:
Radu Gheorghe will talk about “Tuning Solr for Logs” at 10:15 am
Summary: Performance tuning is always nice for keeping your applications snappy and your costs down. This is especially the case for logs, social media and other stream-like data that can easily grow into terabyte territory. While you can always use SolrCloud to scale out of performance issues, this talk is about optimizing. The following questions about Solr settings will be answered. How often should you commit and merge? How can you have one collection per day/month/year/etc? What are the performance trade-offs for these options? There will also be a discussion around choosing the appropriate hardware. Radu will talk about optimizing the infrastructure when pushing logs to Solr. This includes tuning Apache Flume to handle large flows of logs and overall design options that also apply to other shippers, like Logstash.
Rafal Kuc will talk about “Solr Anti-Patterns” at 10:55 am
Summary: Working as a consultant, software engineer and helping people in various ways, Rafał has seen multiple patterns in how Solr is used and how it should be used. Consulting on best practices is common, but talking about what NOT to do is not. This talk will point out common mistakes and roads that should be avoided at all costs, covering use cases and guidelines around general configuration pitfalls, data modeling and what to avoid when making your data indexable, and mistakes made when it comes to queries and searching for indexed data. Each use case will be illustrated by a before and after analysis where changes in metrics will be shown to bring a know-how worth remembering.
20% Discount Code
If you currently use a Sematext product or have been a client in the past and want to go, drop us a line for more info.
Hope to see you in DC!