Maxtrain.com - info@maxtrain.com - 513-322-8888 - 866-595-6863
Applied Python for Data Science
Ohio TechCred Approved Credential: Programming for Data Science with Python
Description
Applied Python for Data Science
This intermediate course provides students with Python data science skills that can immediately be applied in real life. The course focuses on Pandas as the primary tool, using related packages such as NumPy and Seaborn to enhance processing and visualization.
Applied Python for Data Science
Working in a hands-on, applied learning environment, participants will learn to:
- Advanced Data Ingestion & Preparation: Efficiently import, clean, and export complex datasets using Pandas, preparing data for deeper analysis and reuse.
- Sophisticated Data Selection & Indexing: Confidently navigate and subset data using advanced indexing techniques, Boolean logic, and multi-indexing for hierarchical datasets
- Data Aggregation & Summarization: Apply groupby() operations and aggregation functions to analyze trends, patterns, and summaries across large datasets
- Data Transformation & Reshaping: Transform, merge, and reshape datasets to support more effective analysis and streamlined analytical workflows.
- Functional Data Processing: Apply user-defined and third-party functions to Pandas objects to extend analytical capabilities and customize data processing.
- Advanced Data Visualization: Create clear, informative, and visually compelling data visualizations using advanced Matplotlib features and Seaborn enhancements.
- NumPy for Analytical Efficiency: Utilize NumPy arrays and operations to manipulate large numerical datasets and improve analytical performance.
- Applied Scientific Computing with SciPy: Gain practical exposure to key SciPy subpackages to support statistical analysis, optimization, and scientific workflows
Prerequisites
Attending students are required to have a background in basic Python.
Take Before: Students should have attended or have incoming skills equivalent to those in the following courses:
- Fast Track to Python for Data Science
- Python Fundamentals for Data Science
Audience
- This course is designed for data professionals who already have foundational Python and Pandas skills and want to apply Python more effectively to real-world data analysis
problems. Typical roles include data analysts, data scientists, engineers, and researchers.
Applied Python for Data Science Outline
Pandas input and output
Reading data into Pandas dataframes and exporting to various formats.
- General input considerations
- Reading CSV Files
- Data cleaning
- Reading other data formats
- Exporting data
Pandas filtering and sorting
Selecting subsets of dataframes for focused analysis.
- Indexing rows and columns
- Multi-indexing
- Selection by conditions
- Sorting data
Pandas grouping and aggregation
Consolidating data and providing sums and other aggregate values
- Using groupby()
- Aggregate functions
- Using data summaries
- Alternate approaches
Pandas Data Transformation
Manipulating datasets for simpler analysis
- Applying functions to data
- Renaming columns and indexes
- Inserting and removing data
- Combining and merging dataframes
- Reshaping datasets
Advanced Matplotlib
Going beyond the basics with Matplotlib
- Components of a figure
- Multiple plots
- Complex plots
- Matplotlib options and settings
- Customing styles (and everything else)
SeabornFiltering a DataFrame
Learning how Seaborn supplements and improves on Matplotlib
- What does Seaborn provide?
- Using themes
- Advanced plot types
- Fine-tuning the details
Using NumPy
Loading large datasets into NumPy arrays for further analysis
- NumPy basics
- Creating arrays
- Indexing and slicing
- Builtin functions()
- Reading and writing data
Useful SciPy subpackages
A look at some of the 20-odd SciPy subpackages
- What is SciPy?
- scipy.stats
- scipy.interpolate
- scipy.optimize
The following are required:
The Anaconda Python distribution (or equivalent), including
- iPython
- Jupyter
- Pandas
- Matplotlib
- Seaborn
- Numpy
- SciPy
|
$2195.00
|
2 Days Course |

