top of page
  • Writer's pictureHarini Mallawaarachchi

Create a Lakehouse

Updated: Jan 24


Large-scale data analytics solutions have traditionally been built around a data warehouse, in which data is stored in relational tables and queried using SQL. The growth in “big data” (characterized by high volumes, variety, and velocity of new data assets) together with the availability of low-cost storage and cloud-scale distributed compute technologies has led to an alternative approach to analytical data storage; the data lake.


In a data lake, data is stored as files without imposing a fixed schema for storage. Increasingly, data engineers and analysts seek to benefit from the best features of both of these approaches by combining them in a data lakehouse; in which data is stored in files in a data lake and a relational schema is applied to them as a metadata layer so that they can be queried using traditional SQL semantics.


In Microsoft Fabric, a lakehouse provides highly scalable file storage in a OneLake store (built on Azure Data Lake Store Gen2) with a metastore for relational objects such as tables and views based on the open source Delta Lake table format. Delta Lake enables you to define a schema of tables in your lakehouse that you can query using SQL.


This lab takes approximately 30 minutes to complete.




Note: You need a Microsoft school or work account to complete this exercise. If you don’t have one, you can sign up for a trial of Microsoft Office 365 E3 or higher.


Create a workspace

Create a Lakehouse

Upload a file

Explore shortcuts

Load file data into a table

Use SQL to query tables

Create a visual query

Create a report

Clean up resources


Tip

To learn more about lakehouses in Microsoft Fabric, see the What is a lakehouse? reference documentation.


9 views0 comments

Recent Posts

See All

Comments


bottom of page