Hortonworks Sandbox
...
- Benchmarking and Stress Testing a Hadoop Cluster
- Teragen & Terasort on HDP (the argument is how many lines of 100 bytes you want - 10,000,000,000 would net you 1TB of data)
- HDP 1.3.2 (correctly identified that 100000000 would give you 10GB)
- HDP 2.0.9.0 (incorrectly identifies that 10000 would give you 10GB, but that is really only 1MB)
...
Apache Ambari
- Examples of Ambari REST API
- Ambari Shell for CLI commands; https://cwiki.apache.org/confluence/display/AMBARI/Ambari+Shell
Other Stuff
- Script to iterate down a dir tree and copy everything into HDFS; http://one-line-it.blogspot.com/2013/05/hadoop-copy-directly-to-hdfs-from.html
- How to set the number of mappers and reducers of Hadoop in command line
- Details on HDFS Balancer command; http://www.swiss-scalability.com/2013/08/hadoop-hdfs-balancer-explained.html
- Oracle JDK 7 Archive Download Page
- SAP HANA and HDP Integration; http://hortonworks.com/wp-content/uploads/2013/09/Demo-Tutorial-Leveraging_SAP_HANA__HDP_Jan_2014.pdf
- Creating & registering custom Ambari alerts; https://community.hortonworks.com/articles/38149/how-to-create-and-register-custom-ambari-alerts.html
- Managing Hadoop DR with distcp and snapshots; https://community.hortonworkscloudera.com/contentt5/kbentry/71775/managing-hadoop-drCommunity-Articles/Managing-Hadoop-DR-with-distcp-and-snapshots.html/ta-p/248362
YARN
Kill a hadoop job:
yarn application -kill $ApplicationId
...