Agenda
- Introduction
- Setup Python
- REPL
- Basic Programming Constructs
- Functions and Lambda Functions
- Collections— List, Set, Dict
- Basic Map Reduce operations
- Basic 1/0 operations
Introduction
- Python is interpreter based programming language
- Adaptability of Python is very high in Data Engineering and Data Science fields
- Spark APIs are well integrated with Python
- Highly relevant for Cloudera and Hortonworks Spark certifications
Setup Python
- Mac— pre-installed (if you need additional packages better to install pip following instructions)
- Linux— pre-installed (if you need additional packages better to install pip following instructions)
- Windows —Google and install using MSI
- https://labs.itversity.com — pre-installed
- For building applications, better to install IDE such as pycharm
- For certification preparation, practice using REPL
REPL
- REPL— Read, Evaluate, Print and Loop
- Use python command to launch REPL
- exit() or Ctrl-D to come out
- Ctrl+l to clear the screen
- We can get the help using help command
- Use up arrow to see the history