Links & Cheat Sheets for Hadoop & Big Data

Parent Page for Hadoop & Big Data Links & Cheat Sheets



http://127.0.0.1:8888/  (HUE: hue/1111  and   AMBARI: admin/admin)

ssh root@127.0.0.1 -p 2222 (default password is "hadoop")

scp -P 2222 example.jar mruser@127.0.0.1:/home/mruser/mrJars (password is "Sprint2000" for linux user and "hadoop" for one created in hue)

Networking & VirtualBox

if you make changes to /etc/sysconfig/network-scripts/ifcfg-eth0, then make sure you rm /etc/udev/rules.d/70-persistent-net.rules

HDP File Locations

Benchmarking & Performance/Scalability Testing

Repo Help

Other Stuff

YARN

Kill a hadoop job:

yarn application -kill $ApplicationId

You can get a list of all ApplicationId's doing:

yarn application -list

Random Notes

dfs.datanode.max.transfer.threads - default is 1024, but bump to >= 4096 or >= 16K for HBase