Data I/O 2013 – Web-scale Data Processing: Practical approaches for low-latency and batch – Edward Capriolo
ARVE Error:
autoplay
ARVE Error: loop
loop
not valid
not validautoplay
not validPodcast: Play in new window | Download (Duration: 59:47 — 137.9MB) | Embed
In this talk, Hive and Cassandra author (and Hive committer and PMC member) Edward Capriolo will discuss common big-data software challenges and how they can be solved using both batch and stream processing. Technology focus will primarily be on Apache Kafka for publish-subscribe messaging, Storm for stream processing, and Apache Cassandra as a NoSQL data store.