Difference between revisions of "Storm"

From Knoesis wiki
Jump to: navigation, search
(Created page with "== Overview == Storm is an open-source computing platform that provides a set of language-agnostic primitives to perform distributed computation on real-time data. Storm perform...")
 
Line 6: Line 6:
 
== Deployment Environment ==
 
== Deployment Environment ==
  
Storm relies on [http://zookeeper.apache.org/ ZooKeeper] for coordinating the cluster.<ref>https://github.com/nathanmarz/storm/wiki/Setting-up-a-Storm-cluster</ref> The master node in the cluster is called "Nimbus".<ref>https://github.com/nathanmarz/storm/wiki/Setting-up-development-environment</ref>
+
Storm relies on [http://zookeeper.apache.org/ ZooKeeper] for coordinating the cluster.<ref>https://github.com/nathanmarz/storm/wiki/Setting-up-a-Storm-cluster</ref> The master node in the cluster is called "Nimbus".<ref>https://github.com/nathanmarz/storm/wiki/Setting-up-development-environment</ref> Storm may be deployed to [http://aws.amazon.com/ Amazon Web Services] using the [https://github.com/nathanmarz/storm-deploy Storm-Deploy] project. Storm may also be tested and executed locally using [https://github.com/nathanmarz/storm/wiki/Maven Maven].
 +
 
 +
 
 +
== Sample Projects  ==
 +
 
 +
* [[Storm_URL_Extraction|Simple URL Extraction using Regular Expressions]]
 +
* [[Storm_URL_Resolution|URL Extraction and Resolution]]
  
  

Revision as of 03:51, 7 October 2011

Overview

Storm is an open-source computing platform that provides a set of language-agnostic primitives to perform distributed computation on real-time data. Storm performs transformations on streams, or "unbounded sequence[s] of tuples", using the spout and bolt primitives.<ref>http://engineering.twitter.com/2011/08/storm-is-coming-more-details-and-plans.html</ref> Spouts are sources of streams. Bolts are single-step transformations on that stream. Spouts deliver streams to bolts. Bolts may manipulate those streams and deliver them as tuples to other bolts. Bolts can be grouped, which allows data to be pushed to a matching task. The complete set of stream transformations are called a topology.


Deployment Environment

Storm relies on ZooKeeper for coordinating the cluster.<ref>https://github.com/nathanmarz/storm/wiki/Setting-up-a-Storm-cluster</ref> The master node in the cluster is called "Nimbus".<ref>https://github.com/nathanmarz/storm/wiki/Setting-up-development-environment</ref> Storm may be deployed to Amazon Web Services using the Storm-Deploy project. Storm may also be tested and executed locally using Maven.


Sample Projects




References: <references />