A scalable Python framework for cleaning and enriching CSV data with additional features like weather data. This framework is designed to assist in preparing datasets for machine learning applications.
- Data Cleaning: Removes missing, invalid, or duplicate records.
- Weather Data Enrichment: Fetches real-time weather data based on geographic coordinates using the OpenWeatherMap API.
- Progress Monitoring: Real-time progress bars using
tqdm
. - Output: Returns a cleaned and enriched Pandas DataFrame for immediate use in machine learning workflows.
- Python 3.7 or higher
- Pip
- Clone the repository:
git clone https://github.com/your_username/data-enrichment-framework.git cd data-enrichment-framework