Apache Kudu support in Apache Apex

Introduction The last few years has seen HDFS as a great enabler that would help organizations store extremely large amounts of data on commodity hardware. However over the last couple of years the technology landscape changed rapidly and new age engines like Apache Spark, Apache Apex and Apache Flink have started enabling more powerful use cases on a distributed data store paradigm. This has quickly brought out the short-comings of an immutable data store. The primary short comings are: Immutability resulted in complex lambda architectures when HDFS is used as a store by a query engine When data...

RSS Subscribe


See more posts from the Blogs Archive.