Porting your hadoop app to horton works hdp

Preview:

Citation preview

Porting your Hadoop app to HortonWorks HDP

Based on one man's experience

Let's talk about distributions

Ones that I played with include1. Cloudera2. MapR3. HortonWorks4. Intel (future)

Extended Q&A :)

What stands out about HortonWorks

(From their site)

Birthplace of HadoopMany Apache contributorsAll open sourceSpinoff from Yahoo

Where I come from

I want to put my eDiscovery application on HortonWorks

● All Java and standard Hadoop, but● Uses Cloudera on EC2● Uses custom machine images● Uses custom managing software written in

Java

Find an RHEL (required) on EC2

Start it

That's all the install commands

It finds the machine (you give ips)

Good luck finding

Choose services to install

Java is installed automatically

It's a 6GB root (can be increased)

Keep installing through the browser

Customize your Nagios

Start deploying the services

Watch the progress

HBase install failed

Remove the services I don't need

Re-install attempt

This time it worked

Working cluster

Command-line check

More info from Rohit

1. I want instructions, please -- Coming(Many customers want their own control scripts)2. Re-starting clusters on EC2? - Better new ones3. My own customer startup scripts? -- Should use the instructions that will be provided

Q&A with Rohit Bakhshi

Join.me