Lesson Category: Data Engineering

Basics of programming using Scala

What is Scala? Scala is JVM based functional programming language. Why Scala? Even though Scala is there for more than a decade (founded in 2000), it have gained lot of momentum with Spark. Spark is completely developed using Scala On top of Spark (used for data processing), Scala also have frameworks such as Play to …

Basics of programming using Scala Read More »

Linux Fundamentals

While Windows is popular in PCs, Linux is popular in running applications. It is highly desired for every IT professional or aspirant to have good Linux knowledge. Important linux commands to deal with file system Ability to troubleshoot the applications Regular expressions to validate the data Automate mundane tasks using shell scripting and more As …

Linux Fundamentals Read More »

Using PC Professionally

As IT professional or aspirant it is highly desired to use relevant tools. As part of this module we will be touching those important points which can improve your productivity. This module is relatively small than others. We will only list the tools and purpose of them. We will also make recommendations as there are …

Using PC Professionally Read More »

Python Fundamentals

As part of these sessions we have covered fundamentals of python to be a skilled Data Engineer using Big Data eco system. Click here to go to reference material. Basics of Python Programming Functions and Collections Operations on Collections Itertools and Pandas Pandas – DataFrames Panda SQL and JDBC Programming \

Kickoff Sessions

To cover important aspects of Linux, Database and Programming we have conducted 3 kick off sessions. Linux Essentials Database Essentials Programming Essentials Setup environment for Python Setup PyCharm IDE Setup Spark and integrate with Python and PyCharm