When the big data movement started it was mostly focused on batch processing. Distributed data storage and querying tools like MapReduce, Hive, and Pig were all designed to process data in batches ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. In this episode, Thomas Betts chats with ...
Originally developed at LinkedIn, Apache Kafka is one of the most mature platforms for event streaming. Kafka is used for high-performance data pipelines, streaming analytics, data integration, and ...
The latest trends and issues around the use of open source software in the enterprise. This is a guest post for the Computer Weekly Open Source Inside blog written by by Ben Slater in his role as ...
Following the path of Hadoop and Spark, Apache Kafka is becoming a rising star in the Big Data ecosystem, playing a leading role in two brand-new open source offerings from LinkedIn, where Kafka ...
Streaming is hot. The demand for real-time data processing is rising, and streaming vendors are proliferating and competing. Apache Kafka is a key component in many data pipeline architectures, mostly ...
Confluent, founded by the creators of Apache™ Kafka™, announced the release of open source Confluent Platform 2.0, based on an updated Apache Kafka 0.9 core. Representing a big leap forward in the ...