Course Objectives
By the end of this course, you will be able to confidently extract data from a diverse array of sources and clean, transform, and format it efficiently.
Agenda
- Python for Data Wrangling
- Lists, Sets, Strings, Tuples, and Dictionaries
- Advanced Data Structures
- Basic File Operations in Python
- NumPy Arrays
- Pandas DataFrames
- Statistics and Visualization with NumPy and Pandas
- Using NumPy and Pandas to Calculate Basic Descriptive Statistics on the DataFrame
- Subsetting, Filtering, and Grouping
- Detecting Outliers and Handling Missing Values
- Concatenating, Merging, and Joining
- Useful Methods of Pandas
- Reading Data from Different Text-Based (and Non-Text-Based) Sources
- Introduction to BeautifulSoup4 and Web Page Parsing
- Advanced List Comprehension and the zip Function
- Data Formatting
- Basics of Web Scraping and the BeautifulSoup Library
- Reading Data from XML
- Refresher of RDBMS and SQL
- Using an RDBMS (MySQL/PostgreSQL/SQLite)
- Applying Your Knowledge to a Real-life Data Wrangling Task
- An Extension to Data Wrangling
FREE
Course Type: Instructor Led