Harini Mallawaarachchi

Use delta tables in Apache Spark


Tables in a Microsoft Fabric lakehouse are based on the open-source Delta Lake format for Apache Spark. Delta Lake adds relational semantics to both batch and streaming data operations. This makes it possible to build a lakehouse architecture in which Apache Spark processes and queries tables whose data is stored as files in a data lake.
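To make the idea of a table backed by files concrete, here is a minimal PySpark sketch. It assumes an existing Spark session with Delta Lake support is passed in as `spark` (Fabric notebooks normally provide one). The helper names, the table name, and the columns are hypothetical and are not part of the exercise itself.

```python
# Sketch only: `spark` is assumed to be an existing SparkSession with
# Delta Lake support. All names below are illustrative, not from the lab.

def save_as_delta_table(spark, rows, columns, table_name):
    """Write rows to a managed Delta table, replacing any existing data."""
    df = spark.createDataFrame(rows, columns)
    # format("delta") stores the data as Parquet files plus a transaction log
    df.write.format("delta").mode("overwrite").saveAsTable(table_name)
    return df


def query_table(spark, table_name):
    """Read the table back using Spark SQL."""
    return spark.sql(f"SELECT * FROM {table_name}")


# Possible usage in a notebook cell (hypothetical data):
# save_as_delta_table(spark, [(1, "Widget", 2.50)], ["id", "name", "price"], "products")
# query_table(spark, "products").show()
```

The session is passed in as a parameter rather than created inside the functions, so the same helpers can be used with whatever session the notebook environment supplies.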


This exercise should take approximately 40 minutes to complete.



Note: You need a Microsoft school or work account to complete this exercise. If you don’t have one, you can sign up for a trial of Microsoft Office 365 E3 or higher.


Create a workspace

Create a lakehouse and upload files

Explore data in a dataframe

Create delta tables

Use SQL to create a table

Explore table versioning

Use delta tables for streaming data

Clean up resources



