Hello World from OSS Silicon Valley


HowToUse/Hadoop/3.0


  • The added line is THIS COLOR.
  • The deleted line is THIS COLOR.
#contents

*Prerequisite [#s6b92995]
-CentOS installation (You can refer [[HowToUse/CentOS/6.5]])
-Java installation (You can refer [[HowToUse/Java/1.8]])

*Install&Setup [#md469a67]
:Step.1|
Create user account for Hadoop.

 $ sudo useradd hadoop
 $ sudo passwd hadoop

:Step.2|
Download source files from [[here:http://hadoop.apache.org/]] and unarchive the file.

 $ wget http://www.apache.org/dyn/closer.cgi/hadoop/common/hadoop-3.0.0-alpha1/hadoop-3.0.0-alpha1-src.tar.gz
 $ tar xzvf hadoop-3.0.0-alpha1-src.tar.gz
 $ wget http://ftp.meisei-u.ac.jp/mirror/apache/dist/hadoop/common/hadoop-3.0.0-alpha1/hadoop-3.0.0-alpha1.tar.gz
 $ tar xzvf hadoop-3.0.0-alpha1.tar.gz
 $ mv hadoop-3.0.0-alpha1 /usr/local
 $ chown -R hadoop:hadoop /usr/local/hadoop-3.0.0-alpha1

:Step.3|
Setup environmental variables in .bashrc.

 $ vi ~/.bashrc

 export JAVA_HOME=/usr/lib/jvm/java-1.8.0
 export HADOOP_INSTALL=/usr/share/hadoop-3.0.0-alpha1
 export PATH=$HADOOP_INSTALL/bin:$JAVA_HOME/bin:$PATH
 export HADOOP_CLASSPATH=${JAVA_HOME}/lib/tools.jar


*HowToUse [#sa395ad0]

:Step.1|
xxx
Create Hadoop code.

 $ vi WordCount.java

You can see sample code from [[here:https://github.com/osssv/osssv-helloworld/blob/master/hadoop/3.0/src/WordCount.java]]

:Step.2|
Compile sample code and create jar file.

 $ hadoop com.sun.tools.javac.Main WordCount.java
 $ jar cf wc.jar WordCount*.class

:Step.3|
Prepare input files for WordCount.

 $ mkdir input
 $ vi input/file01
 $ vi input/file02

You can see sample code from [[here:https://github.com/osssv/osssv-helloworld/tree/master/hadoop/3.0/input]]

:Step.4|
Execute hadoop job for WordCount.

 $ hadoop jar wc.jar WordCount ../input/ ../output

Then you will see output in output directory as below.

 [hadoop@localhost work]$ ls output/
 _SUCCESS  part-r-00000


 $ vi output/part-r-00000

 Bye     1
 Goodbye 1
 Hadoop  2
 Hello   2
 World   2


*Author [#p28646ff]
S.Yatsuzuka