Skip to content

This commit aggregates all four required projects for the freeCodeCamp Data Analysis with Python Developer Certification).\n\nProjects completed:\n- NumPy Mean, Variance, Standard Deviation Calculator\n- Pandas Demographic Data Analyzer\n- Matplotlib/Seaborn Medical Data Visualizer\n- SciPy Sea Level Predictor

Notifications You must be signed in to change notification settings

Rabbani218/fcc-data-analysis-with-python-Collection

Repository files navigation

freeCodeCamp - Data Analysis with Python Certification Projects

Python Pandas NumPy Seaborn License

This repository is a complete collection of all five projects required for the Data Analysis with Python Developer Certification from freeCodeCamp.org.

This 300-hour certification program provided hands-on experience in data analysis by applying foundational Python libraries. Each project in this collection demonstrates proficiency in data manipulation, cleaning, visualization, time-series analysis, and statistical calculation.


🚀 Projects Included

Here is a summary of the projects included in this repository, each in its own directory.

1. Mean-Variance-Standard Deviation Calculator

  • Description: A function that uses NumPy to calculate the mean, variance, standard deviation, max, min, and sum of a 3x3 matrix across both axes and for the flattened array.
  • Key Libraries: NumPy
  • Concepts: Array manipulation, statistical calculations, matrix operations.

2. Demographic Data Analyzer

  • Description: An analysis of 1994 US Census data using Pandas. This project involved answering several descriptive questions about the dataset by filtering and grouping the data.
  • Key Libraries: Pandas
  • Concepts: Data cleaning, data filtering, .groupby(), .value_counts(), boolean masking.

3. Medical Data Visualizer

  • Description: A visualization project using Matplotlib and Seaborn to analyze medical examination data. Tasks included creating a 'overweight' column, normalizing data, and generating a categorical plot and a correlation heatmap.
  • Key Libraries: Pandas, Matplotlib, Seaborn
  • Concepts: Data visualization, data cleaning, correlation matrix (heatmap), categorical plots (catplot).

4. Sea Level Predictor

  • Description: A time-series analysis and prediction project. This involved plotting global sea level change since 1880, calculating two separate lines of best fit (one for all data, one for recent data), and predicting sea level rise through 2050.
  • Key Libraries: Pandas, Matplotlib, SciPy
  • Concepts: Linear regression (scipy.stats.linregress), time-series analysis, data prediction.

5. Page View Time Series Visualizer

  • Description: A project to visualize and analyze time-series data from freeCodeCamp.org forum page views. This involved cleaning the data and creating line plots and box plots to identify yearly and monthly trends.
  • Key Libraries: Pandas, Matplotlib, Seaborn
  • Concepts: Time-series data, date parsing, box plots, line plots, data visualization.

🛠️ Technology Stack

The primary technologies used across these projects include:

  • Python: The core programming language.
  • Pandas: For data manipulation, cleaning, and analysis.
  • NumPy: For high-performance scientific computing and array operations.
  • Matplotlib: For creating static, animated, and interactive visualizations.
  • Seaborn: A high-level interface for drawing attractive and informative statistical graphics.
  • SciPy: For scientific and technical computing, specifically linear regression.

🏁 Getting Started

To run any of these projects locally, please follow the steps below.

Prerequisites

  • Python 3.8 or higher
  • Git

Installation & Usage

  1. Clone the repository:

    git clone [https://github.com/](https://github.com/)[YOUR_USERNAME]/fcc-data-analysis-with-python-Collection.git
    cd fcc-data-analysis-with-python-Collection
  2. Navigate to a project directory:

    # Example:
    cd 02-demographic-data-analyzer
  3. Install dependencies: It is highly recommended to use a virtual environment.

    # Create a virtual environment
    python -m venv venv
    source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
    
    # Install required packages
    pip install -r requirements.txt
  4. Run the project: Each project's main logic is run from main.py, which also imports the unit tests from test_module.py.

    python main.py

📄 License

This project portfolio is licensed under the MIT License. See the LICENSE file for more details.

✨ Acknowledgments

A special thank you to Quincy Larson and the entire freeCodeCamp.org team for creating this comprehensive and challenging curriculum, which provides invaluable, accessible education for all.

About

This commit aggregates all four required projects for the freeCodeCamp Data Analysis with Python Developer Certification).\n\nProjects completed:\n- NumPy Mean, Variance, Standard Deviation Calculator\n- Pandas Demographic Data Analyzer\n- Matplotlib/Seaborn Medical Data Visualizer\n- SciPy Sea Level Predictor

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages