Tag Archives: Hadoop

Makalu 1.6 Release

Posted by on

Makalu
Deerwalk is pleased to announce the 1.6 release of Makalu, a data analytics platform.

QM in Cascading

Cascading is an application framework that allows for rapid development of data analytics on Apache Hadoop. It is an abstraction of the popular MapReduce software framework that supports distributed computing on large data sets on clusters of computers. By moving to this new framework, Deerwalk will be able to rapidly deliver new sets of Quality Metrics. It also allows us to more easily create integrated unit tests of our metrics to ensure the highest quality.

  • Read More

Designing Hadoop/HBase Based Solutions

Posted by on

A good knowledge of data structures always helps in designing good systems, whether we are working with relational databases or NOSQL databases. However, this knowledge is much more important when working with NOSQL systems. One important fact to remember is that while a lot of optimization for SQL based solutions is during query time, NOSQL solutions (especially Hadoop/HBase) are design time optimized. You design an optimized schema and the queries are almost straight forward.

  • Read More