Difference between revisions of "Storm"

From Knoesis wiki
Jump to: navigation, search
Line 1: Line 1:
 +
 
== Overview ==
 
== Overview ==
  
Line 11: Line 12:
 
== Sample Projects  ==
 
== Sample Projects  ==
  
* [[Storm_URL_Extraction|Simple URL Extraction using Regular Expressions]]
+
* [[Storm_URL_Resolution|Twitter URL Extraction and Resolution]]
* [[Storm_URL_Resolution|URL Extraction and Resolution]]
+
 
+
 
+
----
+
  
  
References: <references />
+
== References ==
 +
<references />

Revision as of 03:55, 7 October 2011

Overview

Storm is an open-source computing platform that provides a set of language-agnostic primitives to perform distributed computation on real-time data. Storm performs transformations on streams, or "unbounded sequence[s] of tuples", using the spout and bolt primitives.<ref>http://engineering.twitter.com/2011/08/storm-is-coming-more-details-and-plans.html</ref> Spouts are sources of streams. Bolts are single-step transformations on that stream. Spouts deliver streams to bolts. Bolts may manipulate those streams and deliver them as tuples to other bolts. Bolts can be grouped, which allows data to be pushed to a matching task. The complete set of stream transformations are called a topology.


Deployment Environment

Storm relies on ZooKeeper for coordinating the cluster.<ref>https://github.com/nathanmarz/storm/wiki/Setting-up-a-Storm-cluster</ref> The master node in the cluster is called "Nimbus".<ref>https://github.com/nathanmarz/storm/wiki/Setting-up-development-environment</ref> Storm may be deployed to Amazon Web Services using the Storm-Deploy project. Storm may also be tested and executed locally using Maven.


Sample Projects


References

<references />