Webscraping python notebook project using Jupyter notebook for learning data analysis, pandas, matplotlib, and BeautifulSoup

tylerjamesf/chocolate-soup

Chocolate Soup 🍫

Webscraping python notebook project using Jupyter notebook for learning data analysis, pandas, matplotlib, and BeautifulSoup

Table of Contents

Scope

Data

Data provided by Codecademy: https://content.codecademy.com/courses/beautifulsoup/cacao/index.html

Questions

Some questions we thought about when we found this dataset were:

  • Where are the best cacao beans grown?
  • Which countries produce the highest-rated bars?
  • What's the relationship between cacao solids percentage and rating?

Analysis

  • Scrape the page into a BeautifulSoup object using the `html.parser` parser
  • Loop over the parsed data in Python to build lists
  • Create a pandas DataFrame from the lists
  • Evaluate the DataFrame using .groupby, .nlargest, and .mean
  • Plot the data as a scatter plot and a histogram using matplotlib
  • Draw a best-fit line over the scatter plot
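
The steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the inline `SAMPLE_HTML`, the class names `Company`, `CocoaPercent`, and `Rating`, and the helper names `column_text`, `build_frame`, `top_companies`, `best_fit`, and `plot_fit` are all assumptions made for the example. The best-fit line here uses `numpy.polyfit`, which may differ from the approach in the notebook.

```python
from bs4 import BeautifulSoup
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Tiny stand-in for the scraped page. The class names below are assumptions
# about the markup, not something stated in this README.
SAMPLE_HTML = """
<table>
  <tr><td class="Company">Company</td><td class="CocoaPercent">Cocoa Percent</td><td class="Rating">Rating</td></tr>
  <tr><td class="Company">A. Maker</td><td class="CocoaPercent">70%</td><td class="Rating">3.5</td></tr>
  <tr><td class="Company">A. Maker</td><td class="CocoaPercent">80%</td><td class="Rating">2.5</td></tr>
  <tr><td class="Company">B. Maker</td><td class="CocoaPercent">60%</td><td class="Rating">4.0</td></tr>
</table>
"""


def column_text(soup, css_class):
    """Return the text of every cell with the given class, skipping the header cell."""
    cells = soup.find_all(class_=css_class)
    return [cell.get_text(strip=True) for cell in cells[1:]]


def build_frame(html):
    """Parse the HTML with html.parser and collect the columns into a DataFrame."""
    soup = BeautifulSoup(html, "html.parser")
    return pd.DataFrame({
        "Company": column_text(soup, "Company"),
        "CocoaPercent": [float(p.rstrip("%")) for p in column_text(soup, "CocoaPercent")],
        "Rating": [float(r) for r in column_text(soup, "Rating")],
    })


def top_companies(df, n=10):
    """Mean rating per company, highest first."""
    return df.groupby("Company")["Rating"].mean().nlargest(n)


def best_fit(df):
    """Slope and intercept of a degree-1 least-squares fit of rating on cocoa percent."""
    slope, intercept = np.polyfit(df["CocoaPercent"], df["Rating"], 1)
    return slope, intercept


def plot_fit(df, slope, intercept):
    """Scatter plot of rating against cocoa percent with the fitted line on top."""
    fig, ax = plt.subplots()
    ax.scatter(df["CocoaPercent"], df["Rating"])
    xs = np.sort(df["CocoaPercent"].to_numpy())
    ax.plot(xs, slope * xs + intercept, color="red")
    ax.set_xlabel("Cocoa solids (%)")
    ax.set_ylabel("Rating")
    return fig


df = build_frame(SAMPLE_HTML)
top = top_companies(df, n=2)
slope, intercept = best_fit(df)
```

To try this against the real data, the page HTML could be fetched with the Requests library (for example `requests.get(url).content`) and passed to `build_frame`, provided the class names match the page's markup. Calling `plot_fit(df, slope, intercept)` returns the figure, and a histogram of ratings could be drawn in a similar way with `ax.hist(df["Rating"])`.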

Additional Considerations

None

General Info

Technologies

  • Python 3
  • Jupyter Notebook
  • Pandas Library
  • Matplotlib Library
  • Requests Library
  • bs4 Library

Setup

  • Download the notebook and open it in Jupyter Notebook
