HDP & Core Hadoop Cheat Sheet

Hortonworks Sandbox

http://127.0.0.1:8888/ (logins are hue/1111 for Hue and admin/admin for Ambari)

ssh root@127.0.0.1 -p 2222 (default password is "hadoop")

scp -P 2222 example.jar mruser@127.0.0.1:/home/mruser/mrJars (password is "Sprint2000" for a Linux user and "hadoop" for a user created in Hue)

Networking & VirtualBox

If you make changes to /etc/sysconfig/network-scripts/ifcfg-eth0, make sure you also rm /etc/udev/rules.d/70-persistent-net.rules
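
The cleanup step above can be wrapped in a small guard so it is safe to rerun. This is only a sketch: reset_persistent_net_rules is a hypothetical helper name, the optional path argument exists just so it can be tried on a scratch file, and it assumes you run it as root on the sandbox VM. On the CentOS-based sandbox udev is expected to regenerate the rules file at the next boot, but verify that on your image.

```shell
# Sketch only: remove the persistent-net rules file if it exists.
# reset_persistent_net_rules is a hypothetical helper, not part of HDP.
reset_persistent_net_rules() {
  # Default to the path named in the note above; an override is allowed for dry runs.
  rules="${1:-/etc/udev/rules.d/70-persistent-net.rules}"
  if [ -e "$rules" ]; then
    rm -f "$rules" && echo "removed $rules"
  else
    echo "nothing to remove at $rules"
  fi
}

# Usage (as root, after editing ifcfg-eth0), followed by a reboot of the VM:
#   reset_persistent_net_rules
```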

HDP File Locations

Benchmarking & Performance/Scalability Testing

Repo Help

Apache Ambari

Other Stuff

YARN

Kill a Hadoop job:

yarn application -kill $ApplicationId

You can get a list of all ApplicationIds by running:

yarn application -list
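
The two commands above can be combined to kill jobs by name. A minimal sketch follows, assuming each application row of the listing starts with an ID of the form application_<timestamp>_<sequence>; app_ids_matching is a hypothetical helper name, and "wordcount" in the usage comment is just an example pattern.

```shell
# Sketch only: print the IDs of listed applications whose row contains a given string.
# app_ids_matching is a hypothetical helper; it reads `yarn application -list` output on stdin.
app_ids_matching() {
  pattern="$1"
  # Keep rows whose first field looks like an application ID, then filter by substring.
  awk -v pat="$pattern" \
    '$1 ~ /^application_[0-9]+_[0-9]+$/ && index($0, pat) > 0 { print $1 }'
}

# Usage on a cluster node (this kills every match, so review the ID list first):
#   yarn application -list | app_ids_matching "wordcount" |
#     while read -r id; do yarn application -kill "$id"; done
```

The substring match runs against the whole row, so a pattern can also be a user or queue name; tighten it if that is too broad for your cluster.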

Random Notes

dfs.datanode.max.transfer.threads - default is 1024, but raise it to at least 4096, or at least 16K, for HBase
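
To see whether a node still needs this change, the current value can be compared against a floor. This is a sketch under assumptions: check_transfer_threads is a hypothetical helper name, and 4096 in the usage comment is simply the lower target from the note above. The value is read with hdfs getconf, which reports what the local client configuration resolves to.

```shell
# Sketch only: report whether a transfer-thread setting meets a minimum.
# check_transfer_threads is a hypothetical helper, not an HDP or Hadoop command.
check_transfer_threads() {
  current="$1"   # value of dfs.datanode.max.transfer.threads
  minimum="$2"   # floor to compare against, e.g. 4096
  if [ "$current" -lt "$minimum" ]; then
    echo "raise dfs.datanode.max.transfer.threads: $current < $minimum"
    return 1
  fi
  echo "ok: $current >= $minimum"
}

# Usage on a node with the HDFS client installed:
#   check_transfer_threads "$(hdfs getconf -confKey dfs.datanode.max.transfer.threads)" 4096
```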
