Philly ETE 2015 – Helena Edelson – Streaming Big Data with Spark, Spark Streaming, Kafka, Cassandra and Akka

Tags: , , , , , , ,

Download (PDF, 5.51MB)


This talk presents Apache Spark, Spark Streaming, Apache Kafka, Apache Cassandra and Akka as supporting Lambda architecture in the context of a fault tolerant, streaming big data pipeline. We will walk through the Fault Tolerance story with these technologies to build applications, and how to easily implement and integrate them in a Scala Akka application for real-time delivery of meaning at high velocity, in highly distributed and concurrent environments.

About Helena:

Helena is a committer on several open source projects including Akka (Akka Cluster), Apache Spark, the Spark Cassandra Connector, Spring Integration and Spring AMQP. She has been a Senior Software Engineer, Senior Cloud Engineer, Principal Engineer and Architect over the last 15 years. Her primary academic background is in science, specifically biology, where her studies of energy pathways in large-scale dynamic systems and biological system modeling led her to tech. For the last several Helena has been working in big data in the domains of cyber security and technology, and in Scala for 6 years. She is currently a Senior Engineer on the Analytics team at DataStax, working with Apache Spark, Cassandra, Kafka, Scala and Akka. Most recently she has been a speaker at international Big Data and Scala conferences including Strata, Spark Summit, Scala Days and Code Neuro.