Technologies - datacrunchers.eu

Hadoop

Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures. Hadoop is for data in rest.

You can find out more about Hadoop on the Apache Hadoop website.

Storm

Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. Storm is for data in motion.

You can find out more about Storm on the website.

ElasticSearch

ElasticSearch is an open source, distributed, RESTful, search Engine built on top of Apache Lucene.

You can find out more about ElasticSearch on the website.

Lily

Lily is a data management platform combining big data, indexing and search with on-line, real-time usage tracking, audience analytics and content recommendations. Lily builds on Apache HBase, Hadoop and Solr.

You can find out more about Lily Project on the website or on our NGDATA partner page.

Redis

Redis is a fast Key Value store with support for SETs and Lists.

Big Data Technologies

Hadoop

Storm

ElasticSearch

Lily

Redis

Cassandra

Hbase

Puppet

Azure

Amazon Webservices