Giving something back

Let's face it, we've all been there. Here we post the things we think it can help the community to have a deep understanding and to solve common problems.

Tag – Mapreduce

MapReduce and Hadoop

Posted by Fabrizio Albertoni


MapReduce is a programming model for processing and generating large datasets allowing massive scalability, inspired on the map and reduce primitives found in many functional programming languages. Basically, it refers to two separate tasks: the map job, which takes data and converts it into another set of data where each element is broken into key/value pairs, and the reduce job, which takes those pairs and combines them into a smaller set of data.

Programs that implement this paradigm can be automatically parallelized and executed on clusters of computers, since the input data can be partitioned. A system like ...

Read More