Overview of Big Data eco system and Basics of Python 3

Current Status
Not Enrolled
Get Started

As part of this course, we will be seeing

  • Overview of Big Data cluster
  • HDFS – Hadoop Distributed File System
  • YARN and Map Reduce
  • Overview of Hive and SQL interfaces
  • Overview of Spark
  • Overview of Data Ingestion technologies – Sqoop, Flume, Kafka etc
  • Basic Python
  • Declaring Variables
  • Conditional statements and loops
  • Functions
  • Operators
  • Object-Oriented Concepts
  • Collections
  • Tuples
  • Pandas (Data Frames)

Why python?

Python is designed to be highly readable. It uses English keywords frequently whereas the other languages use punctuations.  Python is Interpreted, Interactive and object-oriented.

History of python

Guido van Rossum made initial attempts in development of Python in the late eighties and early nineties. The first version of Python(Python 1.0) was released in November 1994. Python 2.0 was released in 2000. Python 3.0 was released in 2008. The current version of python in use is 3.6.x.

Python Features

  • Easy to learn, read and maintain
  • Portable, Extendable and Scalable
  • It supports functional and structured programming methods as well as OOP.
  • It can be used as a scripting language or can be compiled to byte-code for building large applications
  •  supports dynamic type checking and automatic garbage collection.

Share this post