Nginx => hadoop HDFS using Fluentd Fluentd is a json everywhere log collector. It transmits logs as json streams so that log processing can be easily managed and processed. Hadoop HDFS is a distributed filesystem which can be used to store any amount of logs and run mapreduce jobs for faster log processing. We will be using fluent-webhdfs-plugin to send logs over to httpfs interface 1. Install had