Change data capture is a popular method for unobtrusively ingesting data from SQL sources. In this talk, we will show how to easily incorporate your SQL data sources in near-real-time into Databricks and Delta Lake on Google Cloud.
We will provide a short introduction to change-data-capture, Google Datastream (serverless CDC on Google Cloud), Databricks, and Delta Lake. In addition, we will also give a walk-through of our new open source Spark Structured Streaming connector which provides an easy-to-use / configure method of linking Datastream to Delta Lake.
About Delta Lake
Delta Lake is an open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python. Join this group to attend community office hour events, Q&A live sessions, and tech talks around Delta Lake and the ecosystem.
This email was sent by: The Linux Foundation
548 Market St, PMB 57274, San Francisco, CA 94104-5401, United States
No comments:
Post a Comment
Keep a civil tongue.