Using Apache Pulsar to Provide Real-Time IoT Analytics on the Edge

Using Apache Pulsar to Provide Real-Time IoT Analytics on the Edge

Wednesday, May 22
11:50 AM - 12:30 PM
Marquis Salon 9

The business value of data decreases rapidly after it is created, particularly in use cases such as fraud prevention, cybersecurity, and real-time system monitoring. The high-volume, high-velocity datasets used to feed these use cases often contain valuable, but perishable, insights that must be acted upon immediately.

In order to maximize the value of their data enterprises must fundamentally change their approach to processing real-time data to focusing reducing their decision latency on the perishable insights that exist within their real-time data streams. Thereby enabling the organization to act upon them while the window of opportunity is open.

Generating timely insights in a high-volume, high-velocity data environment is challenging for a multitude of reasons. As the volume of data increases, so does the amount of time required to transmit it back to the datacenter and process it. Secondly, as the velocity of the data increases, the faster the data and the insights derived from it lose value.

In this talk, we will present a solution based on Apache Pulsar Functions that significantly reduces decision latency by using probabilistic algorithms to perform analytic calculations on the edge.


David Kjerrumgaard
Director - Solution Architecture
David is a Director of Solution Architecture at Streamlio, and also a contributor to the Apache NiFi, and Apache Pulsar projects. He was formerly the Practice Director at Hortonworks, where he was responsible for the development of best practices and solutions for the professional services team, with a focus on HDF-related technologies including Kafka, NiFi, and Storm. He is a co-author of “Practical Hive: A Guide to Hadoop’s Data Warehouse System”, and holds a B.S and Master’s Degree in Computer Science from Kent State University.