MapReduce VS Spark - Inverted Index Example

Comparing MapReduce to Spark using an inverted index example.

Requirements

The repository contains both the MapReduce and the Spark project: MRInvertedIndex and SparkInvertedIndex.

com/stdatalabs/SparkInvertedIndex
- Driver.scala -- Spark code to build inverted index
com/stdatalabs/MRInvertedIndex
- InvertedIndexMapper.java -- Reads the files in the input directory and outputs (word, filename) key-value pairs
- InvertedIndexReducer.java -- Reads the list of (word, filename) key-value pairs and outputs (word, (filename, count))
- InvertedIndexDriver.java -- Driver program for MapReduce jobs
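
The mapper and reducer roles described above can be sketched in plain Java without any Hadoop dependency. This is only an illustration of the data flow, not the code in this repository: the class name `InvertedIndexSketch`, its methods, and the sample file names are hypothetical, and the tokenization rule (lowercase, split on non-word characters) is an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, in-memory imitation of the map -> shuffle -> reduce flow.
class InvertedIndexSketch {

    // "Map" step: emit one (word, filename) pair per token in the file's text.
    // Assumption: tokens are lowercased and split on non-word characters.
    static List<String[]> map(String filename, String text) {
        List<String[]> pairs = new ArrayList<>();
        for (String token : text.toLowerCase().split("\\W+")) {
            if (!token.isEmpty()) {
                pairs.add(new String[] {token, filename});
            }
        }
        return pairs;
    }

    // "Shuffle" and "reduce" steps: group the pairs by word, then count
    // how often each filename occurs for that word.
    static Map<String, Map<String, Integer>> reduce(List<String[]> pairs) {
        Map<String, Map<String, Integer>> index = new TreeMap<>();
        for (String[] pair : pairs) {
            index.computeIfAbsent(pair[0], word -> new TreeMap<>())
                 .merge(pair[1], 1, Integer::sum);
        }
        return index;
    }

    // Runs the map step over every (filename, text) entry, then reduces.
    static Map<String, Map<String, Integer>> build(Map<String, String> files) {
        List<String[]> pairs = new ArrayList<>();
        for (Map.Entry<String, String> file : files.entrySet()) {
            pairs.addAll(map(file.getKey(), file.getValue()));
        }
        return reduce(pairs);
    }

    public static void main(String[] args) {
        // Hypothetical sample input; the real dataset lives in dataset/shakespeare.
        Map<String, String> files = new TreeMap<>();
        files.put("hamlet.txt", "To be, or not to be");
        files.put("macbeth.txt", "What's done cannot be undone");
        build(files).forEach((word, postings) ->
                System.out.println(word + "\t" + postings));
    }
}
```

In this sketch the word `be` ends up mapped to `hamlet.txt` with a count of 2 and `macbeth.txt` with a count of 1, which mirrors the (word, (filename, count)) shape the reducer is described as producing. In a real MapReduce job the grouping by word is done by the framework's shuffle phase rather than by a single in-memory map.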

A comparison between MapReduce and Apache Spark RDD code using an inverted index example. Discussed in the blog post "MapReduce VS Spark - Inverted Index Example".
