If you’re reading this, it’s because you heard about Storm somehow, and you’re interested
in better understanding what it does, how you can use it to solve various problems,
and how it works.
This book will get you started with Storm in a straightforward way.
The first few chapters will give you a general overview of the technologies involved,
some concepts you should understand so we all speak the same language, and how to
install and configure Storm. The second half of the book takes you deep into spouts,
bolts, and topologies (more about these in a moment). The last few chapters address
more advanced features that we consider important and interesting, such as
using Storm with languages that are not JVM-based.
Even as big data is turning the world upside down, the next phase of the revolution is already taking shape: real-time data analysis. This hands-on guide introduces you to Storm, a distributed, JVM-based system for processing streaming data. Through simple tutorials, sample Java code, and a complete real-world scenario, you’ll learn how to build fast, fault-tolerant solutions that process results as soon as the data arrives.
Discover how easy it is to set up Storm clusters for solving various problems, including continuous data computation, distributed remote procedure calls, and data stream processing.
- Learn how to program Storm components: spouts for data input and bolts for data transformation
- Discover how data is exchanged between spouts and bolts in a Storm topology
- Make spouts fault-tolerant with several commonly used design strategies
- Explore bolts: their life cycle, strategies for design, and ways to implement them
- Scale your solution by defining each component’s level of parallelism
- Study a real-time web analytics system built with Node.js, a Redis server, and a Storm topology
- Write spouts and bolts with non-JVM languages such as Python, Ruby, and JavaScript