Integrate Elasticsearch into Hadoop to effectively visualize and analyze your data
About This Book
- Build production-ready analytics applications by integrating the Hadoop ecosystem with Elasticsearch
- Learn complex Elasticsearch queries and develop Kibana dashboards for real-time monitoring and visualization of your data
- Use Elasticsearch and Kibana to search data in Hadoop easily with this comprehensive, step-by-step guide
Who This Book Is For
This book is targeted at Java developers with a basic knowledge of Hadoop. No prior Elasticsearch experience is expected.
What You Will Learn
- Set up the Elasticsearch-Hadoop environment
- Import HDFS data into Elasticsearch with MapReduce jobs
- Perform full-text search and aggregations efficiently using Elasticsearch
- Visualize data and create interactive dashboards using Kibana
- Check and detect anomalies in streaming data using Storm and Elasticsearch
- Ingest real-time streaming data into Elasticsearch and classify it
- Get production-ready for Elasticsearch-Hadoop based projects
- Integrate with Hadoop ecosystem tools such as Pig, Storm, Hive, and Spark
In Detail
The Hadoop ecosystem is a de facto standard for processing terabytes and petabytes of data. Lucene-based Elasticsearch is becoming an industry standard for its full-text search and aggregation capabilities. Elasticsearch-Hadoop bridges Elasticsearch and the Hadoop ecosystem so that you get the best of both worlds. Combined with Kibana, this stack makes it easy to draw insights quickly from the massive amounts of data in your Hadoop ecosystem.
In this book, you'll learn to use Elasticsearch, Kibana, and Elasticsearch-Hadoop effectively to analyze and understand your HDFS and streaming data.
You begin with an in-depth understanding of the Hadoop, Elasticsearch, Marvel, and Kibana setup.