DO NOT RESTART VM
When using the VM (downloaded or cloud-based), please do NOT restart the OS that is running in the VM. For cloud-based VMs, just leave it running. For downloaded VMs, please use VMware's suspend/resume functionality. If you do restart the VM and your environment does not restore properly, try the following command on the outer (non-Docker) host, which is usually named ubuntu.
sed -i -- 's/\/root\/.sys\/create/#\/root\/.sys\/create/' /etc/rc.local;sed -i -- 's/#\/root\/.sys\/restart/\/root\/.sys\/restart/' /etc/rc.local
This command might also be useful for anyone choosing to import the VMware image into VirtualBox, which is NOT recommended (don't forget to open the .vmwarevm package first).
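To see what the sed command above changes before touching the real /etc/rc.local, it can be tried on a scratch file first. The following is a minimal sketch, assuming GNU sed (as found on Ubuntu); the sample file contents are hypothetical and only illustrate the pattern, since the actual lines in /etc/rc.local on the VM may differ.

```shell
# Sketch: preview the two sed substitutions on a scratch file.
# ASSUMPTION: the sample contents below are made up for illustration;
# the real /etc/rc.local on the VM may look different.
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
#!/bin/sh
/root/.sys/create
#/root/.sys/restart
exit 0
EOF

# Same two expressions as the command above, pointed at the scratch copy.
# First: comment out the line that calls /root/.sys/create.
sed -i -- 's/\/root\/.sys\/create/#\/root\/.sys\/create/' "$tmp"
# Second: uncomment the line that calls /root/.sys/restart.
sed -i -- 's/#\/root\/.sys\/restart/\/root\/.sys\/restart/' "$tmp"

# Show the result so it can be compared with the original.
cat "$tmp"
```

In this sketch the create line ends up commented out and the restart line ends up active. Note that the first substitution is not idempotent: running it a second time prepends another # to the already commented line, so it is probably best to run the command only once.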
General Links
- Hortonworks Product Documentation
- Hadoop Distribution Components & Versions
- Any (and all) of Lester's blog posts & wiki pages with 'hadoop' label
HDP Developer: Apache Pig and Hive
- Course Labs
- Sqoop
- Flume
- Pig
- Hive
- Project Wiki
- stinger.next to the rescue (but you do have stinger.NOW tuning options available, well, "now")
- INSERT/UPDATE/DELETE, ACID & Transactions
- ACID and Transactions in Hive
- Alan Gates' 2015 Hadoop Summit "Adding Insert, Update, and Delete to Hive" talk
- ORC
- Oozie
HDP Developer: Storm and Trident Fundamentals
- Course Labs
- Storm
- Kafka
- HDP Tutorial: Transporting Real-Time Event Stream with Apache Kafka
- Some insight into why Kafka is so fast (i.e., using memory as a cache for writes to disk) is available in this SlideShare preso
HDP Developer: Custom YARN Application
- Course Labs
- Writing YARN Applications
- YARN JavaDoc (Hadoop 2.4.x, since the current rev uses HDP 2.1.x)
- Slider
- Apache Site
- 2015 Hadoop Summit Presentation: Authoring and Hosting Applications on YARN using Slider
HDP Operations: Install and Manage with Apache Hadoop
- HDFS
- FS Commands
- Heterogeneous Storage
- Storage Types and Storage Policies
- Use Case from eBay's presentation at 2015 Hadoop Summit
- High Availability
- Snapshots
- MapReduce
- Solr
- Hardware and General Configuration
HDP Operations: Apache HBase Advanced Management
- Reference Guide
- To introduce "Polyglot Persistence" – what the world needs now is another nosql preso (like i need a hole in my head)
- A pretty solid walk through of the architecture and major moving parts is presented here, but do overlook the source and the eventual plug for the non-ASF MapR-DB