top of page
  • Writer's pictureHarini Mallawaarachchi

L19-Use Spark in Azure Databricks


Azure Databricks is a Microsoft Azure-based version of the popular open-source Databricks platform. Azure Databricks is built on Apache Spark, and offers a highly scalable solution for data engineering and analysis tasks that involve working with data in files. One of the benefits of Spark is support for a wide range of programming languages, including Java, Scala, Python, and SQL; making Spark a very flexible solution for data processing workloads including data cleansing and manipulation, statistical analysis and machine learning, and data analytics and visualization.



Before you start

You'll need an Azure subscription in which you have administrative-level access.

Review the Exploratory data analysis on Azure Databricks article in the Azure Synapse Analytics documentation.



Create a cluster

Explore data using a notebook




0 views0 comments

Recent Posts

See All

Comments


bottom of page