Moving Beyond Data Lakes

Technologies from Hadoop, Pig, Hive, HBase, and other NoSQL point solutions on to Spark, Flink, Drill, and Kafka were each built to handle individual aspects of the three V’s of big data (volume, variety, and velocity). If a storage system can scale linearly, then we can put the applications on top of the storage platform. If the application runs where the data is stored, then we don't have to worry about moving the data later to perform analytics. The messaging model delivered by Kafka and MapR Streams can achieve rates of about one million events per second with a minor investment. These technologies take a little time to understand and get comfortable with, but may be worth the investment. You can read more at: https://www.oreilly.com/ideas/using-microservices-to-evolve-beyond-the-data-lake
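
To make the messaging model mentioned above a little more concrete, here is a minimal sketch in Python of the idea behind a stream topic: an append-only log that several services read independently, each tracking its own position. This is an in-memory illustration only, not the Kafka or MapR Streams API; the `TopicLog` class, its methods, and the consumer names are all hypothetical, and it says nothing about real throughput.

```python
from collections import defaultdict


class TopicLog:
    """Hypothetical in-memory stand-in for a single stream topic."""

    def __init__(self):
        self._events = []                 # append-only event log
        self._offsets = defaultdict(int)  # next unread position per consumer

    def publish(self, event):
        """Append an event and return its offset in the log."""
        self._events.append(event)
        return len(self._events) - 1

    def poll(self, consumer, max_events=10):
        """Return up to max_events unread events for this consumer."""
        start = self._offsets[consumer]
        batch = self._events[start:start + max_events]
        self._offsets[consumer] = start + len(batch)
        return batch


log = TopicLog()
for i in range(3):
    log.publish({"id": i, "type": "click"})

# Two services read the same events independently of each other.
analytics_batch = log.poll("analytics")
billing_first = log.poll("billing", max_events=2)
billing_rest = log.poll("billing")
```

Because each consumer keeps its own offset, the hypothetical analytics and billing services can process the same events at different speeds without the data being copied or moved between them, which is the property the paragraph above relies on.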