How to Install Hadoop on Mac OS X 10.10 Yosemite Step-by-Step Guide

November 2, 2014 | By Mark Goo.

Hadoop 2.X Quick Start on macOS 10.10 Yosemite

Hi! The Tutorial shows you Step-by-Step How to Install and Getting-Started with Apache Hadoop/Map-Reduce vanilla in Pseudo-Distributed mode on Mac OS X 10.10 Yosemite Desktop.

Hadoop is a distributed master-slave that consists of the Hadoop Distributed File System (HDFS) for storage and Map-Reduce for computational capabilities.

The Guide Describe a System-Wide Setup with Root Privileges but you Can Easily Convert the Procedure to a Local One.

Apache Hadoop Require the Java JDK 6+ Installed so If Needed Just Update your Mac 10.10 Yosemite.

The Contents and Details of How to Install Hadoop on macOS 10.10 Yosemite are Expressly Essentials to Give Focus Only to the Essentials Instructions and Commands.

Hadoop Getting-Started on macOS 10.10 Yosemite - Featured

  1. Download Latest Apache Hadoop Stable Release:

    Apache Hadoop Binary tar.gz
  2. Double-Click on Archive to Extract

  3. Open Terminal Window
    (Press “Enter” to Execute Commands)

    Install Hadoop for macOS 10.10 Yosemite - Open Terminal
  4. Relocate Apache Hadoop Directory

    sudo mv $HOME/Downloads/hadoop* /usr/local
    sudo ln -s /usr/local/hadoop-[2.x] /usr/local/hadoop
    sudo mkdir /usr/local/hadoop/tmp
    sudo chown -R root:admin /usr/local/hadoop*
  5. Check if Java JDK 6/7 is Installed

    java --version

    How to Install Required Oracle JDK on MacOS X

    Install Oracle JDK for Mac
  6. Set JAVA_HOME in Hadoop Env File.

    sudo su

    If Got “User is Not in Sudoers file” then see: How to Enable sudo

    mkdir /usr/local/hadoop/conf
    nano /usr/local/hadoop/conf/hadoop-env.sh

    Append:

    export JAVA_HOME=$(/usr/libexec/java_home)
    

    Ctrl+x to Save & Exit :)

  7. Configuration for Pseudo-Distributed mode

    nano /usr/local/hadoop/conf/core-site.xml

    The Content Should Look Like:

    <?xml version="1.0"?>
    <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
    <configuration>
    <property>
    <name>hadoop.tmp.dir</name>
    <value>/usr/local/hadoop/tmp</value>
    </property>
    <property>
    <name>fs.default.name</name>
    <value>hdfs://localhost:8020</value>
    </property>
    </configuration>
    

    Next:

    nano /usr/local/hadoop/conf/hdfs-site.xml

    The Content Should Look Like:

    <?xml version="1.0"?>
    <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
    <configuration>
    <property>
    <name>dfs.replication</name>
    <value>1</value>
    </property>
    <property>
    <!-- specify this so that running 'hdfs namenode -format'
    formats the right dir -->
    <name>dfs.name.dir</name>
    <value>/usr/local/hadoop/cache/hadoop/dfs/name</value>
    </property>
    </configuration>
    

    Latest:

    nano /usr/local/hadoop/conf/mapred-site.xml

    The Content Should Look Like:

    <?xml version="1.0"?>
    <?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
    <configuration>
    <property>
    <name>mapred.job.tracker</name>
    <value>localhost:8021</value>
    </property>
    </configuration>
    
  8. SetUp Local Path & Environment.

    exit
    cd  
    nano .profile

    Inserts:

    HADOOP_HOME=/usr/local/hadoop
    export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
    

    The JAVA_HOME is Set Following Oracle Java JDK6+ Installation Version…

    Then Load New Setup:

    source $HOME/.profile
  9. SetUp Needed Local SSH Connection.

    Enable SSH Connection:

    System Preferences > Sharing

    Mac Yosemite 10.10 Hadoop Quick Start - Enabling Remote Login

    Testing Connection:

    ssh 127.0.0.1
  10. Formatting HDFS

    hdfs namenode -format

    Install Hadoop for macOS 10.10 Yosemite - Terminal Apache Hadoop HDFS Formatting Success

  11. Starting Up Hadoop Database

    start-all.sh
  12. Apache Hadoop Database Quick Start Guide

  13. Eclipse Hadoop 2.X Integration with Free Plugin.

    Hadoop 2.X Eclipse Plugin SetUp

QuickChic Theme • Powered by WordPress