Data technologies and SaaS platforms are evolving fast to meet the challenge of sophisticated use cases. They are changing the landscape of the modern data stack, from OLTP, data lakehouse to analytical systems, and empowering a new ecosystem in data engineering where batch and realtime systems are converging. But what should you choose? Can we do more with less? Come to this track to learn some fundamental, powerful yet versatile building blocks and their core engineering principles that you can leverage to build a simple yet efficient and scalable data architecture.
From this track
Laying the Foundations for a Kappa Architecture - The Yellow Brick Road
Tuesday Jun 13 / 10:35AM EDT
In the ever changing landscape of big data, focus is slowly moving away from batch and towards realtime analytics. Data Science workflows are evolving to adapt to this changing landscape.
Sherin Thomas
Staff Software Engineer @Chime
Streaming from Apache Iceberg - Building Low-Latency and Cost-Effective Data Pipelines
Tuesday Jun 13 / 11:50AM EDT
Apache Flink is a very popular stream processing engine featuring sophisticated state management, even-time semantics, exactly-once state consistency. For low latency processing, Flink jobs typically consume data from streaming sources like Apache Kafka.
Steven Wu
Software Engineer @Apple and Apache Iceberg PMC
The Rise of the Serverless Data Architectures
Tuesday Jun 13 / 01:40PM EDT
For a while, it looked like Serverless was just a convenient way to run stateless functions in the cloud. But in the last year we’ve seen the rapid rise in serverless data stores.
Gwen Shapira
Founder @Nile, PMC Member @Kafka
Building a Large Scale Real-Time Ad Events Processing System
Tuesday Jun 13 / 02:55PM EDT
Two years ago, we embarked on building DoorDash's ad platform from the ground up. Today, our platform handles over 2 trillion events every day and our advertising business has experienced significant growth in recent years, becoming a key area of focus for the company.
Chao Chu
Software Engineer @DoorDash
Enabling Remote Query Execution Through DuckDB Extensions
Tuesday Jun 13 / 04:10PM EDT
DuckDB is a high-performance, embeddable analytical database system that has gained massive popularity in the last few years.
Stephanie Wang
Founding Engineer @MotherDuck
Unconference: Modern Data Architecture & Engineering
Tuesday Jun 13 / 05:25PM EDT
What is an unconference? An unconference is a participant-driven meeting. Attendees come together, bringing their challenges and relying on the experience and know-how of their peers for solutions.
Ben Linders
Independent Consultant in Agile, Lean, Quality and Continuous Improvement
Track Host
Allen Wang
Senior Staff Engineer @DoorDash