Questions to ask as you delve into data cleaning
- Which data do we need for our analysis?
- Do we need to create any new variables?
- Are the data of the correct type?
- Do we need to combine data frames?
- Do the data need to be reshaped?
- Are there missing data?
- Are there outliers?