↧
Storing log messages in Hadoop - Scalable Log Data Management with Hadoop,...
In part 1 of this article series we described the various challenges of dealing with large amounts of logging data in a heavily distributed software ecosystem. After evaluating different approaches, we...
View ArticleRealtime Search for Hadoop - Scalable Log Data Management with Hadoop, Part 3
In the previous part of this article series we focused on the efficient storage of log data in Hadoop. We described how to store the data in Hadoop’s MapFiles, and we tweaked the configuration settings...
View ArticleManaging distributed Solr Servers - Scalable Log Data Management with Hadoop,...
We use the open-source search server Solr for real-time search on data stored in a Hadoop cluster. For our terabyte-scale dataset, we had to implement distributed search on multiple Lucene index...
View Article