🧹 Data Cleaning & EDA Tools

This repository contains reusable tools for quick and effective exploratory data analysis (EDA) and data cleaning, built with Python and Pandas. Ideal for analysts, data scientists, or anyone who works with messy tabular data regularly.

📂 Structure

notebooks/: Interactive .ipynb version with full markdown explanations
scripts/: .py script version of the cleaning functions for reuse in projects

🛠 Features

Load and preview tabular data
Summarize data shape, types, and nulls
Count unique values and detect cardinality issues
Fill or drop missing values using strategies (mean, median, value)
Clean and standardize text columns
Rename and normalize column names
Export cleaned datasets

🧪 Usage

In Notebook:

# Load and explore
df = pd.read_csv('your_file.csv')
df.shape
df.isnull().sum()

# Clean text
df = clean_text_cols(df, ['description', 'notes'])

# Fill missing values
df = fill_nulls(df, strategy='mean')

As Script:

Import the .py module and reuse functions.

from scripts.data_cleaning_tools import fill_nulls, clean_text_cols

📦 Requirements

pandas
numpy
re (standard lib)

👤 Author

James Witcher
LinkedIn

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.DS_Store		.DS_Store
README.md		README.md
data_clean.ipynb		data_clean.ipynb
data_cleaning_tools.ipynb		data_cleaning_tools.ipynb
data_cleaning_tools_expanded.py		data_cleaning_tools_expanded.py
data_cleaning_tools_expanded.txt		data_cleaning_tools_expanded.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧹 Data Cleaning & EDA Tools

📂 Structure

🛠 Features

🧪 Usage

In Notebook:

As Script:

📦 Requirements

👤 Author

About

Uh oh!

Releases

Packages

Languages

jwitcher3/data_cleaning_tools

Folders and files

Latest commit

History

Repository files navigation

🧹 Data Cleaning & EDA Tools

📂 Structure

🛠 Features

🧪 Usage

In Notebook:

As Script:

📦 Requirements

👤 Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages