Are you having difficulty keeping up to date with all the frequent changes and updates in the streaming data space? Then the 'Streaming Data Monthly Digest' (updated daily!) has the solution you’re looking for. Below is a list of web resources related to streaming data in general for March 2017.
I update this list daily, without focusing on any particular tool, whether open source or commercial. Many of the web resources listed below are future events such as meetups and conference talks. Related slides and videos will be added as they are made available.
No streaming data processor can claim to be a silver bullet! Each has its own strengths and weaknesses and its own sweet spot for particular use cases.
March 1st, 2017
- [Slides] Kafka's Role in Implementing Oracle's Big Data Reference Architecture, Robin Moffatt
- [Article] How Hedge Funds Use Twitter to Gain an Edge in Trading, Emel Akan
- [Article] Big Data Predictions 2017, Oracle
- [Blog] Announcing real-time Geospatial Analytics in Azure Stream Analytics, Samartha Chandrashekar
- [Blog] Kafka REST and JQuery Helper, Jesse Anderson
March 2nd, 2017
- [Meetup] Apache Flink's Stateful Operators And Table SQL API. Apache Flink Meetup, Amsterdam. Slides
- [Meetup] Building streaming applications using Kafka*[Connect + Core + Streams], Slim Baltagi. Chicago Real-Time Streaming Analytics Meetup
- [Blog] Face-off in Message Queue Reviews: IBM MQ vs. RabbitMQ vs. Apache Kafka vs. Apache ActiveMQ
- [Meetup] Bonner Microservices Meetup: #7 Apache Kafka & Event Sourcing
- [Video] Streaming with @mapr and @streamsets talking about
- [Video] Streaming all the things with Akka Streams by Johan Andrén
March 3rd, 2017
- [Book] Kafka Streams in Action, William P. Bejeck Jr.
March 4th, 2017
March 5th, 2017
- [Blog] Configuring and Running Apache Kafka in IBM BigInsights, Nisanth Simon
March 6th, 2017
- [Blog] Kafka Connect Elasticsearch: Consuming and Indexing with Kafka Connect, Rafal Kuć
- [Blog] Handling the Extremes: Scaling and Streaming in Finance, Jim Scott, MapR.
- [Blog] Real time processing with Kafka Streams based microservices on Application Container Cloud, Abhishek Gupta, Oracle
March 7th, 2017
- [Webinar] Generalized Streaming Pipelines with Spark Streaming, Kafka, IBM
- [Blog] Kafka and the Oracle database; making the connection – Part 3, Mike Donovan, Dbvisit
- [Meetup] Dean Wampler - Stream All the Things! Austin Apache Kafka Meetup. Slides
- [Blog] Whatever happened to Durability?
- [Blog] Using Control Streams to Manage Apache Flink Applications, Scott Kidder
- [Article] How Kafka Redefined Data Processing for the Streaming Age, Alex Woodie
- [Blog] The Arrival of Streaming: Confluent raises $50 million series C, Jay Kreps, Confluent.
March 8th, 2017
- [Conference talk] Power of the Log: LSM & Append Only Data Structures, Ben Stopford, QCon London 2017. Slides
- [Article] “Engineers love a good challenge”: Confluent’s Jay Kreps on attracting talent, Praseeda Nair.
- [Webinar] Intro to Big Data AppHub: S3 to HDFS Sync App & HDFS to Kafka Sync App Templates. Video, Slides
- [Blog] Drivetribe’s Modern Take On CQRS With Apache Flink®, Aris Koliopoulos
- [Whiteboard Walkthrough] Keeping Big Data Containers Lightweight: Persisting Streams, Tables and Files, Ted Dunning
March 9th, 2017
- [Blog] Log compaction - Highlights in the Apache Kafka and Stream Processing Community - March 2017, Gwen Shapira
- [Slides] Kafka & Couchbase Integration Patterns, Manuel Hurtado
- [Webinar] What's new in Confluent 3.2 and Apache Kafka 0.10.2, Clarke Patterson, Confluent. Slides
March 10th, 2017
- [Blog] Kafka Basics, Producer, Consumer, Partitions, Topic, Offset, Messages, Serkan Sunel
- [Slides] Enabling Rapid Business Insight into Data with Stream Analytics and GoldenGate, Robin Moffatt
- [Video] Apache Beam: From the Dataflow SDK to the Apache Big Data Ecosystem (Google Cloud Next '17), Jesse Anderson
- [Blog] Apache Flink Cluster Setup on CentOS
March 11th, 2017
- [Meetup] Apache Flink Meetup Zurich kickoff founded
- [Blog] Streaming Oracle database change data to Kafka; hello Dbvisit Replicate Connector for Kafka – Part 4, Mike Donovan, Dbvisit
- [Blog] Apache Spark Streaming – A Comprehensive Guide
March 12th, 2017
- [Slides] Dive into Spark Streaming, Gerard Maas
- [Slides] Akka Stream for image processing, Fabian Gutierrez
March 13th, 2017
- [Article] Apache Kafka Gives Large-Scale Image Processing a Boost, Ben Cotton
- [Blog] Interactive Queries in Apache Kafka Streams, Florian Troßbach
- [Slides] Building Streaming And Fast Data Applications With Spark, Mesos, Akka, Cassandra And Kafka, Sean Glover, Lightbend
- [Article] Hadoop Has Failed Us, Tech Experts Say, Alex Woodie
- [Blog] There's so much more to Apache Kafka than just Hadoop, Clarke Patterson, Confluent
- [Blog] Kinetica Joins Confluent Partner Program and Releases Confluent Certified Connector for Apache Kafka™, Kinetica
March 14th, 2017
- [Blog] Taking control of your Apache Storm cluster with tag-aware scheduling, Panos Katseas
- [Blog] Rheos, Connie Yang, eBay.
March 15th, 2017
- [News] Confluent Delivers First-ever Certification Program for Apache Kafka
- [Slides] Applying Machine Learning to Live Patient Data, Carol McDonald, MapR
- [Blog] ETL Integration Tools for Apache Kafka
March 14th-16th, 2017
- [Conference] Strata + Hadoop World. Stream processing and analytics
- [Tutorial] Building real-time data pipelines with Apache Kafka, Ian Wrigley, Confluent
- [Tutorial] Architecting a next-generation data platform, Jonathan Seidman (Cloudera), Ted Malaska (Blizzard), Mark Grover (Cloudera), Gwen Shapira (Confluent). Slides
- [Conference Talk] Unified, portable, efficient: Batch and stream processing with Apache Beam (incubating), Kenneth Knowles (Google).
- [Conference Talk] The rise of real time: Apache Kafka and the streaming revolution, Jay Kreps (Confluent)
- [Conference Talk] Operating Kafka at petabyte scale, Michael Edwards
- [Conference Talk] Building reliable real-time services with Apache DistributedLog, Sijie Guo
- [Conference Talk] The evolution of massive-scale data processing, Tyler Akidau (Google).
- [Conference Talk] From rivulets to rivers: Elastic stream processing in Heron, Bill Graham (Twitter), Avrilia Floratou (Microsoft), Ashvin Agrawal (Microsoft).
- [Conference Talk] Anomaly detection in real-time data streams using Heron, Arun Kejariwal (Machine Zone), Karthik Ramasamy (Twitter)
- [Conference Talk] Watermarks: Time and progress in Apache Beam (incubating) and beyond, Slava Chernyak (Google).
- [Conference Talk] Apache Flink: The latest and greatest, Jamie Grier (Data Artisans).
- [Conference Talk] Developing streaming applications with Apache Apex, David Yan (DataTorrent).
- [Conference Talk] One cluster does not fit all: Architecture patterns for multicluster Apache Kafka deployments, Gwen Shapira (Confluent)
- [Conference Talk] Stream me up, Scotty: Transitioning to the cloud using a streaming data platform, Gwen Shapira (Confluent), Bob Lehmann (Monsanto)
- [Conference Talk] Mistakes were made, but not by us: Lessons from a year of supporting Apache Kafka, Ryan Pridgeon (Confluent), Dustin Cote (Confluent)
- [Conference Talk] Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at LinkedIn, Shirshanka Das
- [Conference Talk] When data center is not enough, Gwen Shapira. Slides
March 15th, 2017
- [Meetup] Apache Flink: Use Cases. Paris Apache Flink Meetup
- [Meetup] Realtime Tensorflow AI + Spark ML/Streaming, Kafka, Beam, Flink, Advanced Spark and Tensorflow meetup
March 16th, 2017
- [Blog] An Introduction to Oracle Stream Analytics by Robin Moffatt, Jürgen Kress
March 17th, 2017
- [Conference Talk] From Messaging to Logs with Apache Kafka, Jorge Quilcate. Oracle User Group Norway. Slides
- [Blog] Stream Oracle data to Kafka; Q&A with the Dbvisit Replicate Connector for Kafka – Part 5, Mike Donovan
March 18th, 2017
- [Blog] Kafka, Why and how we have used it? Sapna Upreti, Nodexperts
March 20th, 2017
- [Blog] Building a distributed Runtime for Interactive Queries in Apache Kafka with Vert.x, Florian Troßbach
- [Blog] Running Docker based Kafka Streams microservices on Oracle Container Cloud, Abhishek Gupta, Oracle
March 21st, 2017
- [Webinar] Getting Started with Streaming Data and Stream Processing with Apache Kafka, David Tucker
- [Blog] Kafka Streams state stores…
March 22nd, 2017
- [Meetup] Flink Forward San Fran Sneak Peek Double Feature! Chicago Apache Flink Meetup
- [Meetup] How Apache Kafka can change your life, Brighton Java
- [Blog] Flafka: Big Data Solution for Data Silos, Randall V Shane
- [Blog] Implementing The Schema Registry, Callum Leahy
March 23rd, 2017
- [Meetup] Meet Kafka, the distributed log, Ippon
- [Webinar] Data Pipelines Made Simple with Apache Kafka, Ewen Cheslack-Postava, Confluent. Slides
March 24th, 2017
- [Blog] Retrying consumer architecture in the Apache Kafka
- [Blog] Queryable State in Apache Flink® 1.2.0: An Overview & Demo, Michael Winters and Ufuk Celebi
March 25th, 2017
- [Blog] Building on top of Reactive Streams, Nicolas A. Perez
- [Blog] Kafka with Docker: A Docker introduction, Nikolaos Georgiou
- [Slides + Video] Stream Processing & Analytics with Flink @Uber, Danny Yuan
March 27th, 2017
- [Article] Streaming Analytics Picks Up Where Hadoop Lakes Leave Off, Alex Woodie
March 28th, 2017
- [Blog] Applying Kafka Streams to the Purchase Transaction Flow, Bill Bejeck
- [Blog] Making Spark and Kafka Data Pipelines Manageable with Tuning, Larry Murdock
March 29th, 2017
- [News] Hazelcast joins Confluent Partner Program
- [Slides] Building Event-Driven Services with Apache Kafka, Ben Stopford, Confluent.
March 30th, 2017
- [Meetup] Stream Processing with Apache Kafka and .NET, Matt Howlett, South Bay.NET
- [Blog] Benchmarking Kafka Performance Part 1: Write Throughput
- [Slides + Audio] Real-time Recommendations using Spark Streaming, Elliot Chow
- [Video] Reactive Kafka with Akka Streams, Krzysztof Ciesielski, SoftwareMill.
- [Slides] User Behavior Analysis with Session Windows and Apache Kafka's Streams API, Michael G. Noll
March 31st, 2017
- [Conference talk] Streaming and Microservices for Fast Data on MapR, Dale Kim, MapR. Strata + Hadoop World