Preprocessing

 

In this section we’re going to cover what I believe are two of the most important topics in this course: data management and preprocessing. An unfortunate truth is that a lot of the real world data you’ll be working with will be incomplete, inconsistent or just mismanaged. Preprocessing enables you and your company to address any possible error and turns raw data into usable, manageable data that can aid in future decision making.   

So, in the following guides we’re going to discuss the importance of proper data management, how to import and segment a data frame, and then we’ll finish with a high level dissection of each step involved in preprocessing data.