Course Outline
Introduction
- Building effective algorithms in pattern recognition, classification and regression.
Setting up the Development Environment
- Python libraries
- Online vs offline editors
Overview of Feature Engineering
- Input and output variables (features)
- Pros and cons of feature engineering
Types of Problems Encountered in Raw Data
- Unclean data, missing data, etc.
Pre-Processing Variables
- Dealing with missing data
Handling Missing Values in the Data
Working with Categorical Variables
Converting Labels into Numbers
Handling Labels in Categorical Variables
Transforming Variables to Improve Predictive Power
- Numerical, categorical, date, etc.
Cleaning a Data Set
Machine Learning Modelling
Handling Outliers in Data
- Numerical variables, categorical variables, etc.
Summary and Conclusion
Requirements
- Python programming experience.
- Experience with Numpy, Pandas and scikit-learn.
- Familiarity with Machine Learning algorithms.
Audience
- Developers
- Data scientists
- Data analysts
Testimonials (2)
the ML ecosystem not only MLFlow but Optuna, hyperops, docker , docker-compose
Guillaume GAUTIER - OLEA MEDICAL
Course - MLflow
I enjoyed participating in the Kubeflow training, which was held remotely. This training allowed me to consolidate my knowledge for AWS services, K8s, all the devOps tools around Kubeflow which are the necessary bases to properly tackle the subject. I wanted to thank Malawski Marcin for his patience and professionalism for training and advice on best practices. Malawski approaches the subject from different angles, different deployment tools Ansible, EKS kubectl, Terraform. Now I am definitely convinced that I am going into the right field of application.