NiFi Cheat Sheet

Pretty solid NiFi intro blog post >> https://www.freecodecamp.org/news/nifi-surf-on-your-dataflow-4f3343c50aa2/

NiFi (and other HDF/CDF components) SE training deck >> https://docs.google.com/presentation/d/1EuxZeintuYUzAQ83mfTJcAdFfPgP2DxoUl4lLvptueE/edit?usp=sharing (Cloudera-internal)

UpdateAttribute Processor

ETL-ish references

UpdateRecord Processor

Blog about Wait/Notify processors >> http://ijokarumawak.github.io/nifi/2017/02/02/nifi-notify-batch/

Pausing a particular FlowFile a period of time before being processed again.  Some options include:

Load-balancing a list of files with ListSFTP and FetchSFTP >> https://community.hortonworks.com/articles/97773/how-to-retrieve-files-from-a-sftp-server-using-nif.html

Scheduling & invoking a flow as a batch activity

Nifi & Kafka partitioning line up >> https://community.hortonworks.com/content/kbentry/57262/integrating-apache-nifi-and-apache-kafka.html

NiFi Expression Language Cheat Sheet >> https://www.nifi.rocks/documents/nifi-expression-language-cheat-sheet.pdf

Retry Loop Template >> https://community.cloudera.com/t5/Support-Questions/NiFi-Flowfile-retries/td-p/171724

RDBMS integration

Custom processor development >> https://community.hortonworks.com/content/kbentry/4318/build-custom-nifi-processor.html

Leveraging external scripts (STDOUT responses become a FlowFile to be leveraged in the rest of the flow definition)

"Repositories" Details and Configuration

HDF/NIFI Best practices for setting up a high performance NiFi installation >> https://community.hortonworks.com/articles/7882/hdfnifi-best-practices-for-setting-up-a-high-perfo.html

Thoughts on Yield Duration and Penalty Duration settings for processors >> https://medium.com/@ben2460/nifi-scheduling-a522a1c9e740

Controller Services scoping/availability >> https://community.hortonworks.com/articles/90259/understanding-controller-service-availability-in-a.html

SDLC with NiFi Registry

Example of using InvokeHTTP processor to read/write to Dropbox REST services; https://pierrevillard.com/2016/03/13/get-data-from-dropbox-using-apache-nifi/

Hosting REST services ON NiFi

Offloading (flowfiles from) a node >> https://community.cloudera.com/t5/Community-Articles/Offload-NiFi-Cluster-Nodes-using-the-UI-NiFi-1-8-0/ta-p/249070

Atlas Integration