How to “Crunch” Your Data Stored in HDFS

Olivier Mallassi from Octo Consulting writes that HDFS stores huge amount of data, but storing it is worthless if you cannot analyze it.

There are two high-level languages that have been developed to work on top of Hadoop:

  • Pig which provides a specific script language. Unfortunately, I do not have enough time to investigate this further for the moment. I will try to do so in a near future.
  • Hive which provides a limited SQL-like language (for instance, Date type is not supported…).

Read the Full Story.