Machine learning design patterns: between Beam and a hard place
By Beam Summit Team

Machine learning design patterns: between Beam and a hard place

In a recent book entitled Machine Learning Design Patterns, we captured best practices and solutions to recurring problems in machine learning. Many of these design patterns are best implemented using Beam. The obvious example is the Transform design pattern, which allows you to replicate arbitrary operations from the training graph in the serving graph while keeping both training and serving code efficient and maintainable. Indeed, the tf.transform package makes this easy.

Read More
Implementing Cloud Agnostic Machine Learning Workflows with Apache Beam on Kubernetes
By Beam Summit Team

Implementing Cloud Agnostic Machine Learning Workflows with Apache Beam on Kubernetes

The need for a highly efficient data processing workflow is fast becoming a necessity in every organization implementing and deploying Machine Learning models at scale. In most cases, ML teams leverage the managed service solutions already in place by the cloud infrastructure provider they choose. While this approach is good enough for most teams to get going, the long-term cost of keeping the platform running may be prohibitively higher over time.

Read More
Unified Streaming and Batch Pipelines at LinkedIn using Beam
By Beam Summit Team

Unified Streaming and Batch Pipelines at LinkedIn using Beam

Many use cases at LinkedIn require real-time processing and periodic backfilling of data. Running a single codebase for both needs is an emerging requirement. In this talk, we will share how we leverage Apache Beam to unify Samza stream and Spark batch processing. We will present the first unified production use case Standardization. By leveraging Beam on Spark for its backfilling, we reduced the backfilling time by 93% while only using 50% of resources.

Read More
Get your early bird tickets for Beam Summit 2022
By Beam Summit Team

Get your early bird tickets for Beam Summit 2022

Registration is now open for Beam Summit 2022. Online participation is free. Please get a free ticket in order to be able to access the sessions. Early bird pricing for in-person passes is $290 USD for 2-day pass and $350 USD for 3-day pass. If you would like to attend in person but cannot afford a ticket, please apply for a scholarship. Check out ticket options and prices.

Read More
Beam Summit 2022 CFP is open. We want to hear from you!
By Beam Summit Team

Beam Summit 2022 CFP is open. We want to hear from you!

The “Call for Papers” for Beam Summit 2022 is open. We are looking for speakers that can provide technical insights, but also for people that can tell interesting stories on how and where “Apache Beam” is used. The following session types are suggested (if you have a good reason to deviate from the format please add this to your description): Session 25m: Live session of 25 minutes, ideal for a use-case.

Read More