Instructor Links for HDP Training
General Links
- Hortonworks Product Documentation
- Hadoop Distribution Components & Versions
- Any (and all) of Lester's blog posts & wiki pages with 'hadoop' label
HDP Developer: Apache Pig and Hive
- Course Labs
- Sqoop
- Flume
- Pig
- Hive
- Project Wiki
- stinger.next to the rescue (but you do have stinger.NOW tuning options available, well, "now")
- INSERT/UPDATE/DELETE, ACID & Transactions
- ACID and Transactions in Hive
- Alan Gates' 2015 Hadoop Summit "Adding Insert, Update, and Delete to Hive" talk
- ORC
- Oozie
HDP Developer: Storm and Trident Fundamentals
- Course Labs
- Storm
- Kafka
- HDP Tutorial: Transporting Real-Time Event Stream with Apache Kafka
- Some insight on why so fast (i.e. using memory to support a cache for writing to disk) is available on this SlideShare preso
HDP Developer: Java
HDP Developer: Custom YARN Application
- Course Labs
- Writing YARN Applications
- YARN JavaDoc (Hadoop 2.4.x since current Rev using HDP 2.1.x)
- Slider
- Apache Site
- 2015 Hadoop Summit Presentation: Authoring and Hosting Applications on YARN using Slider
HDP Operations: Install and Manage with Apache Hadoop
- HDFS
- FS Commands
- Heterogeneous Storage
- Storage Types and Storage Policies
- Use Case from eBay's presentation at 2015 Hadoop Summit
- High Availability
- Snapshots
- MapReduce
- Solr
- Hardware and General Configuration
HDP Operations: Apache HBase Advanced Management
- Reference Guide
- HBase Shell Commands
- To introduce "Polyglot Persistence" – what the world needs now is another nosql preso (like i need a hole in my head)
- A pretty solid walk through of the architecture and major moving parts is presented here, but do overlook the source and the eventual plug for the non-ASF MapR-DB
- Range Prefix Scans