Meetup
Future Data Practitioner Roles!
Come join us as we talk data at our very first SF meetup! This event is a casual meetup for data engineers and scientists. We’ll talk shop and have food and drinks! The tech industry is taking a hit with all the layoffs happening. We want to use this opportunity to host an event for …
SF Data Council Meetup w/ Airbyte, Census, & lakeFS
We’re getting the group back together in San Francisco! The Data Council Community is meeting up to discuss the latest in data infrastructure. This meeting will be hosted from 6 to 8 PM at the San Francisco Mindspace HQ, just a 2-minute walk from the Montgomery BART station. Each talk will be about …
How Akamai processes 10Gb/s of events in real-time using Kafka and Spark
Details
17:30 – 18:00 – Mingling and food 🙂
18:00 – 18:20 – Opening session
18:20 – 19:00 – How to implement Kafka Exactly-Once – Yulia Antonovsky, Senior II Software Engineer @ Akamai
19:00 – 19:40 – Deep dive into Spark 3 Data source read API – Kineret Raviv, Principal Software Developer @ Akamai …
State of Data Engineering
Join us as we discuss the current impact of Data Engineering and what the future holds.
Chaos Engineering – Managing Stages in a Complex Data Flow
Learn how to apply the principles of chaos engineering to make more resilient data pipelines.
Data Lifecycle Management – Applying Engineering Best Practices for Data
Learn better Data Lifecycle Management practices with lakeFS.
Level Up Your Lakehouse with Data Source Control & Cross Collection Consistency
Learn how to leverage Delta Lake and lakeFS to reach cross-collection consistency when operating on multi-statement transactions.
DevOps and Drinks: Ensuring data quality in a data lake environment with lakeFS
Learn how lakeFS simplifies maintaining high-quality data lakes in two ways: 1. by providing no-copy, isolated data development environments and 2. by enabling CI/CD workflows that allow for automated testing of data.
DoK Talks 103: Performant and Version-Aware Analytics With Spark & lakeFS on K8s
Learn how running lakeFS on Kubernetes can bring a new level of scalability and resilience to data pipelines.