Introduction
With the work of optimizing the performance of MapReduce.
I do some solid work to understand the core source code of MapReduce.
And do some programming with the core code.
So there is something being introduced here to clean the way of compiling Hadoop on Ubuntu.
1.Initiation
At first, the Eclipse and the work-space must be configured appropriately with the code of Hadoop(I use the version CDH5 modified by Cloudera).
Then, there are some installation work with Maven and the configuration of the Repositories of Maven.
2.Compile
With some modify of the MapReduce core source code, the code must be packaged and deployed to every node of the cluster.
The command is below:
1 | mvn install -DskipTests |
You may with some errors telling you that some tools must be installed before compiling.
Just like Findbugs and Protobuf, they can be easily installed by yourself.