CS492 Special Topics in Computer Science:
Distributed Algorithms and Systems (2008 Fall)

Lab 1

Notice

     Check Noah borad!


Introduction
    This is for introduction to Hadoop and MapReduce.

    Hadoop is a platform for distributed computations based on Google's Map Reduce (paper)

    The goal of this project is to become familiar with a popular model for programming distributed systems.

    

    You will do this project in teams of two

    We assigned 6 clusters(1G mem) for a team.

    You need to install hadoop and run a simple mapreduce example.

    Turn in the documentation including installation path in your machine, how to execute, and feedback about using hadoop via e-mail.

    TA: Seoyeon Kang (sykang golbaengi an.kaist.ac.kr)


    Due: September 18th at 18:00



Related links


To do
    1. Install hadoop

     All steps are in the Hadoop's home page


    2. Run the example - wordcount 2.0