Strategy

DISCOVERY TO STRATEGY

Let us analyze your organization to define your Big Data strategy

Cluster Management

CLUSTER MANAGEMENT

Simplify the installation, configuration and operation of Hadoop and Storm clusters

Training

TRAINING

Train your developers and data analysts in getting started with Big Data

Search

SEARCH

Search on a gigantic scale, using ElasticSearch.

DataCrunchers helps organizations in becoming
Data Driven.

Data Mining

DATA MINING

Extract valuable insights from your stored data

Building Solutions

BUILDING SOLUTIONS

Design and implementation of data processing applications

Application Monitoring

APPLICATION MONITORING

Use your log files to monitor your applications in real-time

Data Repository

DATA REPOSITORY

Bring together all enterprise data and disclose it through an API

What is Big Data?

We think of Big Data as a set of concepts and technologies that allow the rapid and efficient processing of large data sets with a focus on performance, resiliency and agility.

Big Data is not a product, software package nor a W3C standard. Big Data solutions challenge traditional technology and promote new thinking and technologies to solve today’s business problems.

We describe the Big Data business drivers as Volume, Velocity, Variety and Agility.

The Big Data Buzz

After having heard all the buzz around Big Data we imagine you are left with questions: What is Big Data really about? What can it do for me? How can I get value out of large data sets? What is my return on investment? Which technology should I use from the abundant list of technologies around? How to implement a Big Data project? What kind of people do I need to adopt Big Data? Our Mission DataCrunchers wants to enable you in adopting Big Data, ensuring that you are building a robust and long term Big Data Solution. Our consultants have built numerous BIg Data Solutions and understand how to return value on your investment. We enable our customers in setting up clusters, teaching and guiding them to use technologies on top of them, and help you design and implement Big Data Solutions you can support yourself afterwards.

Technologies

Hadoop

Hadoop is a key part in our Big Data solutions. We use Hadoop, MapReduce, HDFS, as well as Hive and Pig.

Storm

Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing.

ElasticSearch

ElasticSearch is an open source, distributed, RESTful, search Engine built on top of Apache Lucene.

Lily

Lily is a data management platform combining big data, indexing and search with on-line, real-time usage tracking, audience analytics and content recommendations. Lily builds on Apache HBase, Hadoop and Solr.

More technologies

We use many more technologies. For server and cluster management, we use Puppet , Amazon AWS, Microsoft Windows Azure. For our database needs we find Cassandra, Redis and Hbase a good fit.
See more technologies.