Hadoop best performs on a cluster of multiple nodes/servers, however, it can run perfectly on a single machine, even a Mac, so we can use it for development. Also, Spark is a popular tool to process data in Hadoop. The purpose of this blog is to show you the steps to install Hadoop and Spark on a Mac.
Download Hadoop available at: Unpack the download file on your local filesystem and take note of the directory created, lets call this the installation directory for. In this video, cloudyrathor explain the installation process on ubuntu 16.04 with Hadoop-2.9.0 and try to explain as easy as possible also try to do the inst. # Java property: hadoop.security.logger # export HADOOPSECURITYLOGGER=INFO,NullAppender # Default process priority level # Note that sub-processes will also run at this level! # export HADOOPNICENESS=0 # Default name for the service level authorization file # Java property: hadoop.policy.file # export HADOOPPOLICYFILE='hadoop.
This tutorial outlines the basic steps to get started with Hadoop on Mac OS. This is a single-node cluster with YARN as the resource manager. Xcode can be installed via Apple appstore. Xcode is Apple’s Integrated Development Environment (IDE). Xcode is a large suite of software development tools and libraries from Apple.
Operating System: Mac OSX Yosemite 10.11.3
Hadoop Version 2.7.2 Spark 1.6.1
Pre-requisites
1. Install Java
Open a terminal window to check what Java version is installed.
$ java -version
If Java is not installed, go to https://java.com/en/download/ to download and install latest JDK. If Java is installed, use following command in a terminal window to find the java home path
$ /usr/libexec/java_home
Next we need to set JAVA_HOME environment on mac
$ echo export “JAVA_HOME=$(/usr/libexec/java_home)” >> ~/.bash_profile $ source ~/.bash_profile
2. Enable SSH as Hadoop requires it.
Go to System Preferences -> Sharing -> and check “Remote Login”.
Generate SSH Keys
$ ssh-keygen -t rsa -P “” $ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
Open a terminal window, and make sure we can do this.
$>ssh localhost
Download Hadoop Distribution
Download the latest hadoop distribution (2.7.2 at the time of writing)
http://www.apache.org/dyn/closer.cgi/hadoop/common/hadoop-2.7.2/hadoop-2.7.2.tar.gz
Create Hadoop Folder
Open a new terminal window, and go to the download folder, (let’s use “~/Downloads”), and find hadoop-2.7.2.tar
$ cd ~/Downloads
$ tar xzvf hadoop-2.7.2.tar $ mv hadoop-2.7.2 /usr/local/hadoop
Hadoop Configuration Files
Go to the directory where your hadoop distribution is installed.
$ cd /usr/local/hadoop
Then change the following files
$ vi etc/hadoop/hdfs-site.xml
$ vi etc/hadoop/core-site.xml Smith Micro Clip Studio Paint EX 1.6.
$ vi etc/hadoop/yarn-site.xml
$ vi etc/hadoop/mapred-site.xml
Start Hadoop Services
Format HDFS
$ cd /usr/local/hadoop $ bin/hdfs namenode -format
Start HDFS
$ sbin/start-dfs.sh
Start YARN
$ sbin/start-yarn.sh
Validation
Check HDFS file Directory
$ bin/hdfs dfs -ls /
If you don’t like to include the bin/ every time you run a hadoop command, you can do the following
$ vi ~/.bash_profile
append this line to the end of the file “export PATH=$PATH:/usr/local/hadoop/bin” $ source ~/.bash_profile
Now try to add the following two folders in HDFS that is needed for MapReduce job, but this time, don’t include the bin/.
$ hdfs dfs -mkdir /user
$ hdfs dfs -mkdir /user/{your username} ![]()
You can also open a browser and access Hadoop by using the following URL
http://localhost:50070/
Next: Spark
Installing Spark is a little easier. You can download the latest Spark here:
http://spark.apache.org/downloads.html
It’s a little tricky on choosing which package type. We want to choose “pre-build with user provided Hadoop [can use with most Hadoop distributions]” type, and the downloaded file name is spark-1.6.1-bin-without-hadoop.tgz
After spark is downloaded, we need to untar it. Open a terminal window and do the following:
$ cd ~/Downloads
$ tar xzvf spark-1.6.1-bin-without-hadoop.tgz $ mv spark-1.6.1-bin-without-hadoop /usr/local/spark
Add spark bin folder to PATH
$ vi ~/.bash_profile
append this line to the end of the file “export PATH=$PATH:/usr/local/spark/bin” $ source ~/.bash_profile
What about Scala?
Spark is written in Scala, so even though we can use Java to write Spark code, we want to install Scala as well. Windows 101 iso download byte file count.
Download Scala from here: http://www.scala-lang.org/download/
Choose the first one to download Scala in binary, and the downloaded file is scala-2.11.8.tar
Untar Scala and move it to a dedicated folder
Hadoop Tutorial
$ cd ~/Downloads
$ tar xzvf scala-2.11.8.tar $ mv scala-2.11.8 /usr/local/scala Hadoop Mac Os X
Add Scala bin folder to PATH
$ vi ~/.bash_profile
append this line to the end of the file “export PATH=$PATH:/usr/local/scala/bin” $ source ~/.bash_profile
Now you should be able to do the following to access Spark shell for Scala
$ spark-shell Audobe audition for mac.
Mac Hadoop Install
That’s it! Happy coding!
Comments are closed.
|
AuthorWrite something about yourself. No need to be fancy, just an overview. Archives
December 2020
Categories |