Skip to content

sameer-labs/csv-data-cleaning-automation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

CSV Data Cleaning Automation

Overview

This project automates the cleaning and validation of CSV datasets using Python. It detects missing values, invalid entries, and produces a cleaned output file along with a summary report.

Features

  • Detects missing or invalid values
  • Cleans and standardises CSV data
  • Generates a summary report
  • Handles messy real-world data safely

Input

  • CSV file containing customer demographic data

Output

  • Cleaned CSV file
  • Text summary report

How to Run

python src/clean_csv.py

About

This automation streamlines the data cleaning process, making it faster and more efficient than manual methods. By automating the cleaning of CSV data, users can save time and resources while ensuring data quality.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors