NiFi Cheat Sheet

Pretty solid NiFi intro blog post >> https://www.freecodecamp.org/news/nifi-surf-on-your-dataflow-4f3343c50aa2/

NiFi (and other HDF/CDF components) SE training deck >> https://docs.google.com/presentation/d/1EuxZeintuYUzAQ83mfTJcAdFfPgP2DxoUl4lLvptueE/edit?usp=sharing (Cloudera-internal)

UpdateAttribute Processor

ETL-ish references

UpdateRecord Processor

Blog about Wait/Notify processors >> http://ijokarumawak.github.io/nifi/2017/02/02/nifi-notify-batch/

Pausing a particular FlowFile for a period of time before it is processed again. Some options include:

Load-balancing a list of files with ListSFTP and FetchSFTP >> https://community.hortonworks.com/articles/97773/how-to-retrieve-files-from-a-sftp-server-using-nif.html

Scheduling & invoking a flow as a batch activity

How NiFi & Kafka partitioning line up >> https://community.hortonworks.com/content/kbentry/57262/integrating-apache-nifi-and-apache-kafka.html

NiFi Expression Language Cheat Sheet >> https://www.nifi.rocks/documents/nifi-expression-language-cheat-sheet.pdf

Retry Loop Template >> https://community.cloudera.com/t5/Support-Questions/NiFi-Flowfile-retries/td-p/171724

RDBMS integration

Custom processor development >> https://community.hortonworks.com/content/kbentry/4318/build-custom-nifi-processor.html

Leveraging external scripts (STDOUT responses become a FlowFile to be leveraged in the rest of the flow definition)
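
As a rough illustration of the STDOUT idea above, here is a minimal sketch of an external script that a processor such as ExecuteProcess could invoke; whatever the script prints to STDOUT would become the FlowFile content. The function names (build_records, emit) and the sample record fields are made up for this example, and the processor configuration itself is not shown.

```python
import json
import sys
from datetime import datetime, timezone


def build_records(count=3):
    """Return a list of small sample records (hypothetical data for illustration)."""
    now = datetime.now(timezone.utc).isoformat()
    return [{"id": i, "generated_at": now} for i in range(count)]


def emit(records, stream=sys.stdout):
    """Write one JSON document per line to the given stream (STDOUT by default)."""
    for record in records:
        stream.write(json.dumps(record) + "\n")
    # Flush so the calling process sees the full output before the script exits.
    stream.flush()


if __name__ == "__main__":
    # Only STDOUT carries data here; keep diagnostics on STDERR so they do not
    # end up mixed into the FlowFile content.
    emit(build_records())
```

Emitting one JSON document per line keeps the output easy to split or parse with record-oriented processors further down the flow, though any text format written to STDOUT would work the same way.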

"Repositories" Details and Configuration

HDF/NIFI Best practices for setting up a high performance NiFi installation >> https://community.hortonworks.com/articles/7882/hdfnifi-best-practices-for-setting-up-a-high-perfo.html

Thoughts on Yield Duration and Penalty Duration settings for processors >> https://medium.com/@ben2460/nifi-scheduling-a522a1c9e740

Controller Services scoping/availability >> https://community.hortonworks.com/articles/90259/understanding-controller-service-availability-in-a.html

SDLC with NiFi Registry

Example of using the InvokeHTTP processor to read/write to Dropbox REST services >> https://pierrevillard.com/2016/03/13/get-data-from-dropbox-using-apache-nifi/

Hosting REST services ON NiFi

Offloading (flowfiles from) a node >> https://community.cloudera.com/t5/Community-Articles/Offload-NiFi-Cluster-Nodes-using-the-UI-NiFi-1-8-0/ta-p/249070

Atlas Integration


