Data Cleaning & AI Preparation - 31/03/2026 06:45 EDT
Description
Budget: $2 - $8/hr
I have between four and six structured datasets, all in familiar CSV or Excel formats, that need to be cleaned, pre-processed, and shaped into reliable training material for upcoming AI models. The work goes beyond basic munging: I need missing-value strategies applied, outliers handled, data types standardized, categorical variables encoded, and any useful feature engineering you spot along the way.
Once the data are in good shape, please hand back both a well-commented Jupyter Notebook that walks through each step and a matching, modular Python script (.py) so I can rerun the pipeline head-less later. Everything should finish within one week, and the code must be reproducible on a fresh environment with common libraries such as pandas, NumPy and scikit-learn.
Deliverables • Cleaned and transformed datasets (CSV or pickle) • Stand-alone Python script mirroring the notebook logic • Brief README with setup instructions and library versions
I’ll be on hand for quick clarifications and sample data as soon as you’re ready to start.
Skills
Want AI to find more roles like this?
Upload your CV once. Get matched to relevant assignments automatically.