A foundational pack of prompts to help you clean, validate, and transform your data for robust machine learning and analytical workflows. Ensure data quality from the start.
Generate a basic cleaning process for numerical datasets, including handling missing values and outliers.
Define basic data integrity rules for a given dataset and outline how to validate them.
Explain common techniques for transforming categorical features into numerical formats for machine learning.