Careertail
About UsCoursesCareer PathsBlogOpportunities
Log In
Courses>Data Science>Data Cleansing Master Class in Python
DevelopmentData Cleansing Master Class in Python
Price:Paid
Length:3.5 hours
Content type:video
level:intermediate
Updated:05 March 2024
Published:21 August 2022
Similar courses
Opportunities
Courses>Data Science>Data Cleansing Master Class in Python
Data Cleansing Master Class in Python
4.4 (235.0)
3.5 hours
235 students
What you will learn
1You'll learn data imputation and advanced data cleansing techniques.
2You'll learn how to apply real-world data cleansing techniques to your data.
3You'll learn advanced data cleansing techniques.
4You'll learn how to prepare data in a way that avoids data leakage, and in turn, incorrect model evaluation.
Target audiences
1You are serious about become a machine learning engineer in the real-world.
Requirements
1You'll need a really solid foundation in Python.
2You'll need to understand the basics of machine learning.
FAQ
You can view and review the lecture materials indefinitely, like an on-demand channel.
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don't have an internet connection, some instructors also let their students download course lectures. That's up to the instructor though, so make sure you get on their good side!
Description

Welcome to Data Cleansing Master Class in Python.

Data preparation may be the most important part of a machine learning project. It is the most time consuming part, although it seems to be the least discussed topic. Data preparation, sometimes referred to as data preprocessing, is the act of transforming raw data into a form that is appropriate for modeling.

Machine learning algorithms require input data to be numbers, and most algorithm implementations maintain this expectation. Therefore, if your data contains data types and values that are not numbers, such as labels, you will need to change the data into numbers. Further, specific machine learning algorithms have expectations regarding the data types, scale, probability distribution, and relationships between input variables, and you may need to change the data to meet these expectations.

In the course you'll learn: 

  • The importance of data preparation for predictive modeling machine learning projects.

  • How to prepare data in a way that avoids data leakage, and in turn, incorrect model evaluation.

  • How to identify and handle problems with messy data, such as outliers and missing values.

  • How to identify and remove irrelevant and redundant input variables with feature selection methods.

  • How to know which feature selection method to choose based on the data types of the variables.

  • How to scale the range of input variables using normalization and standardization techniques.

  • How to encode categorical variables as numbers and numeric variables as categories.

  • How to transform the probability distribution of input variables.

  • How to transform a dataset with different variable types and how to transform target variables.

  • How to project variables into a lower-dimensional space that captures the salient data relationships.

This course is a hands on-guide. It is a playbook and a workbook intended for you to learn by doing and then apply your new understanding to the feature engineering in Python. To get the most out of the course, I would recommend working through all the examples in each tutorial. If you watch this course like a movie you'll get little out of it.

In the applied space machine learning is programming and programming is a hands on-sport.

Thank you for your interest in Data Cleansing Master Class in Python.

Let's get started!

Similar courses
Opportunities
Make the most out of your online education
Careertail
Copyright © 2021 Careertail.
All rights reserved
Quick Links
Get StartedLog InAbout UsCourses
Company
BlogContactsPrivacy PolicyCookie PolicyTerms and Conditions
Stay up to date
Trustpilot
Careertail
Courses>Data Science>Data Cleansing Master Class in Python
DevelopmentData Cleansing Master Class in Python
Price:Paid
Length:3.5 hours
Content type:video
level:intermediate
Updated:05 March 2024
Published:21 August 2022
Similar courses
Opportunities
Courses>Data Science>Data Cleansing Master Class in Python
Data Cleansing Master Class in Python
4.4 (235.0)
3.5 hours
235 students
What you will learn
1You'll learn data imputation and advanced data cleansing techniques.
2You'll learn how to apply real-world data cleansing techniques to your data.
3You'll learn advanced data cleansing techniques.
4You'll learn how to prepare data in a way that avoids data leakage, and in turn, incorrect model evaluation.
Target audiences
1You are serious about become a machine learning engineer in the real-world.
Requirements
1You'll need a really solid foundation in Python.
2You'll need to understand the basics of machine learning.
FAQ
You can view and review the lecture materials indefinitely, like an on-demand channel.
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don't have an internet connection, some instructors also let their students download course lectures. That's up to the instructor though, so make sure you get on their good side!
Description

Welcome to Data Cleansing Master Class in Python.

Data preparation may be the most important part of a machine learning project. It is the most time consuming part, although it seems to be the least discussed topic. Data preparation, sometimes referred to as data preprocessing, is the act of transforming raw data into a form that is appropriate for modeling.

Machine learning algorithms require input data to be numbers, and most algorithm implementations maintain this expectation. Therefore, if your data contains data types and values that are not numbers, such as labels, you will need to change the data into numbers. Further, specific machine learning algorithms have expectations regarding the data types, scale, probability distribution, and relationships between input variables, and you may need to change the data to meet these expectations.

In the course you'll learn: 

  • The importance of data preparation for predictive modeling machine learning projects.

  • How to prepare data in a way that avoids data leakage, and in turn, incorrect model evaluation.

  • How to identify and handle problems with messy data, such as outliers and missing values.

  • How to identify and remove irrelevant and redundant input variables with feature selection methods.

  • How to know which feature selection method to choose based on the data types of the variables.

  • How to scale the range of input variables using normalization and standardization techniques.

  • How to encode categorical variables as numbers and numeric variables as categories.

  • How to transform the probability distribution of input variables.

  • How to transform a dataset with different variable types and how to transform target variables.

  • How to project variables into a lower-dimensional space that captures the salient data relationships.

This course is a hands on-guide. It is a playbook and a workbook intended for you to learn by doing and then apply your new understanding to the feature engineering in Python. To get the most out of the course, I would recommend working through all the examples in each tutorial. If you watch this course like a movie you'll get little out of it.

In the applied space machine learning is programming and programming is a hands on-sport.

Thank you for your interest in Data Cleansing Master Class in Python.

Let's get started!

Similar courses
Opportunities
Make the most out of your online education
Careertail
Copyright © 2021 Careertail.
All rights reserved
Quick Links
Get StartedLog InAbout UsCourses
Company
BlogContactsPrivacy PolicyCookie PolicyTerms and Conditions
Stay up to date
Trustpilot