Install Hortonworks in Virtual Box for Spark

by | May 30, 2018 | Apache Spark, Big Data | 0 comments



First install virtual box and once installed go to the virtual machine of Hortonworks and run it, this will appear an installation of this machine in virtual box.

Configure the features of the machine, comment that minimally needs 8GB of RAM.


Hortonworks Configuration

Once the machine is ripped we will get an HTTP in the shell indicating the Web path to access. Example

Note: In the version HDP 2.6.5 path is

Hortonworks Loaded

Accessing the browser will get the following Web.

Hortonworks Initial interfaceOnce here we can access directly to “NEW to HDP” with user/pass = raj_ops/Raj_ops or “ADVANCED HDP” and in each one of the services you can see the user and the default key.


Interpreter settings (Python or Scala)

Enter Zeppelin ( and at the top right there is a configuration icon to configure the interpreters.

Once there modify the Python path so that the system is able to find it.


Use with Zeppelin

Zeppelin provides a notebook (type Jupiter) where we can execute our code in different languages, just indicate in the first line the interpreter to use:

%spark2 => para scala
%spark2.pyspark => para python

Note: Change the Ambari password


Submit a Comment

Your email address will not be published. Required fields are marked *