Links

   Quran Explorer - Interactive Audio Recitations & Translations

Friday, July 15, 2011

MapReduce powering Cloud Computing

Hello Nerd-migos !!!!

The cloud is here... r u (still) waiting for the rain !!!!

Forget the buzzword and the business hype, the cloud is real (hehe).

U need to (first) appreciate Virtual Machines/Computing (at least in theory - for non-nerds), Qemu, VMware, Xen, Virtual Box, ....


Now for fellow Nerd-migos.. we need to acquint with some of the following stuff

MapReduce:

"MapReduce is a patented software framework introduced by Google in 2004 to support distributed computing on large data sets on clusters of computers.[2]
The framework is inspired by the map and reduce functions commonly used in functional programming,[3] although their purpose in the MapReduce framework is not the same as their original forms.[4]
MapReduce libraries have been written in C++, C#, Erlang, Java, OCaml, Perl, Python, PHP, Ruby, F#, R and other programming languages." - Wikipedia


Notable (IMHO) implementation of this paradigm (apart from google itself) are

A
Hadoop

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures.

B
OpenStack (+ Cloudera) is the (leading - IMHO) OpenSource Cloud attempt !!!!!

What synergy.. Apache's Hadoop powers RackSpace's OpenStack which is leveraged by Cloudera's ingenuity..


And by the way, did u know that Amazons' Elastic Cloud is a fork of Hadoop ????????