Uncategorized

Databricks Platform Features – Deep Dive into Delta Lake using PySpark Data Frames

Databricks Platform Features – Deep Dive into Delta Lake using PySpark Data Frames Do you want to learn Spark Delta Lake hands-on? Here are the YouTube Videos premiered for free. Also, the lectures and content along with the support are available as part of our Udemy course (go to the bottom or look at the …

Databricks Platform Features – Deep Dive into Delta Lake using PySpark Data Frames Read More »

Data Engineering using Databricks on AWS and Azure

Data Engineering using Databricks on AWS and Azure Course Details As part of this course, you will be learning Data Engineering using Databricks. Getting Started with Databricks Setup Local Development Environment to develop Data Engineering Applications using Databricks Using Databricks CLI to manage files, jobs, clusters, etc related to Data Engineering Applications Spark Application Development …

Data Engineering using Databricks on AWS and Azure Read More »

Solutions – Managing Partitioning Tables using Postgresql

Partitioning Tables¶ Here is the exercise to get comfort with partitioning. We will be using range partitioning. Use retail database. Make sure orders table already exists. In [1]: %%HTML <iframe width=”560″ height=”315″ src=”https://www.youtube.com/embed/uAkrpaJmbx0?rel=0&amp;controls=1&amp;showinfo=0″ frameborder=”0″ allowfullscreen></iframe> In [1]: %load_ext sql In [2]: %env DATABASE_URL=postgresql://itversity_retail_user:retail_password@localhost:5432/itversity_retail_db env: DATABASE_URL=postgresql://itversity_retail_user:retail_password@localhost:5432/itversity_retail_db Exercise 1¶ Create table orders_part with the same columns as orders. Partition the …

Solutions – Managing Partitioning Tables using Postgresql Read More »