Big Data Technologies
Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high availability, the library itself is designed to detect and handle failures at the application layer, thereby delivering a highly available service on top of a cluster of computers, each of which may be prone to failure. Hadoop is for data at rest.
You can find out more about Hadoop on the Apache Hadoop website.
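The best-known of Hadoop's programming models is MapReduce. The sketch below is not the Hadoop API; it is a single-process Python illustration of the map, shuffle and reduce phases that Hadoop would distribute across a cluster. All function names here are hypothetical and chosen only for this example.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group emitted values by key, the step a framework performs between map and reduce."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run the three phases in order, in one process (assumption: no cluster involved)."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


print(word_count(["big data", "big clusters"]))
```

Because each call to `map_phase` and `reduce_phase` depends only on its own input, a framework is free to run them on different machines, which is what lets the model scale from one server to many.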
Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. Storm is for data in motion.
You can find out more about Storm on the Apache Storm website.
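To make "data in motion" concrete, here is a minimal sketch of stream-style processing using Python generators. It is not Storm's API: the function names are hypothetical, and they only borrow Storm's term "bolt" for a processing step. The point it illustrates is that each item is handled as it arrives and an updated result is emitted immediately, rather than after the whole data set has been read.

```python
from collections import Counter
from typing import Iterable, Iterator, Tuple


def split_bolt(sentences: Iterable[str]) -> Iterator[str]:
    """Turn a stream of sentences into a stream of words, one item at a time."""
    for sentence in sentences:
        for word in sentence.split():
            yield word


def count_bolt(words: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Emit the running count for each word as soon as that word arrives."""
    counts: Counter = Counter()
    for word in words:
        counts[word] += 1
        yield word, counts[word]


# A finite list stands in for the stream here; the same code works on an
# endless generator because nothing waits for the input to finish.
for word, running_total in count_bolt(split_bolt(["big data", "big streams"])):
    print(word, running_total)
```

Contrast this with a batch word count, which can only report totals once all input has been consumed.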
ElasticSearch is an open source, distributed, RESTful search engine built on top of Apache Lucene.
You can find out more about ElasticSearch on the ElasticSearch website.
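"RESTful" means that searches are plain HTTP requests carrying JSON. The sketch below builds, but deliberately does not send, a full-text `match` query against the `_search` endpoint using only the Python standard library. The host `http://localhost:9200`, the index name `articles` and the field name `title` are assumptions made up for this illustration.

```python
import json
from urllib.request import Request


def build_search_request(base_url: str, index: str, field: str, text: str) -> Request:
    """Build (but do not send) a full-text search request for one index."""
    body = {"query": {"match": {field: text}}}
    return Request(
        url=f"{base_url}/{index}/_search",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical host, index and field; adjust to a real cluster before sending.
request = build_search_request("http://localhost:9200", "articles", "title", "big data")
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```

Passing the returned object to `urllib.request.urlopen` would send it to a running cluster; that step is left out so the example runs without one.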
Lily is a data management platform combining big data, indexing and search with online, real-time usage tracking, audience analytics and content recommendations. Lily builds on Apache HBase, Hadoop and Solr.