Web-scraping project in a Jupyter notebook, built for learning data analysis with pandas, matplotlib, and BeautifulSoup.
Data provided by Codecademy: https://content.codecademy.com/courses/beautifulsoup/cacao/index.html
Some questions we thought about when we found this dataset were:
- Where are the best cacao beans grown?
- Which countries produce the highest-rated bars?
- What's the relationship between cacao solids percentage and rating?
- scrape data into a soup object using the HTML parser, `html.parser`
- loop over the scraped data in Python to build lists
- create a pandas DataFrame from the lists
- use `.groupby`, `.nlargest`, and `.mean` to evaluate the DataFrame
- plot data on scatter plots and histograms using matplotlib
- draw a best-fit line over the scatter plot
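
The steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: `SAMPLE_HTML` is a made-up stand-in for the downloaded page (in the notebook the HTML would come from the Requests library), its four rows are invented for illustration, and the class names (`Company`, `CompanyLocation`, `CocoaPercent`, `Rating`), the helper `scrape_column`, and the use of `numpy.polyfit` for the best-fit line are all assumptions.

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from bs4 import BeautifulSoup

# Hypothetical stand-in for the real page; rows and class names are assumptions.
SAMPLE_HTML = """
<table>
  <tr><td class="Company">Company</td><td class="CompanyLocation">Location</td>
      <td class="CocoaPercent">Cocoa Percent</td><td class="Rating">Rating</td></tr>
  <tr><td class="Company">Alpha</td><td class="CompanyLocation">France</td>
      <td class="CocoaPercent">70%</td><td class="Rating">3.5</td></tr>
  <tr><td class="Company">Beta</td><td class="CompanyLocation">France</td>
      <td class="CocoaPercent">75%</td><td class="Rating">4.0</td></tr>
  <tr><td class="Company">Gamma</td><td class="CompanyLocation">U.S.A.</td>
      <td class="CocoaPercent">60%</td><td class="Rating">3.0</td></tr>
  <tr><td class="Company">Delta</td><td class="CompanyLocation">Peru</td>
      <td class="CocoaPercent">80%</td><td class="Rating">3.5</td></tr>
</table>
"""


def scrape_column(soup, css_class, convert=str):
    """Collect the text of every cell with the given class, skipping the header cell."""
    cells = soup.find_all(attrs={"class": css_class})
    return [convert(cell.get_text().strip()) for cell in cells[1:]]


# Scrape the HTML into a soup object with the built-in parser.
soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

# Loop the cell data into plain Python lists.
companies = scrape_column(soup, "Company")
locations = scrape_column(soup, "CompanyLocation")
percents = scrape_column(soup, "CocoaPercent", lambda text: float(text.strip("%")))
ratings = scrape_column(soup, "Rating", float)

# Build a DataFrame from the lists.
df = pd.DataFrame({
    "Company": companies,
    "Location": locations,
    "CocoaPercent": percents,
    "Rating": ratings,
})

# Average rating per location, then the two highest averages.
mean_by_location = df.groupby("Location")["Rating"].mean()
top_locations = mean_by_location.nlargest(2)

# Degree-1 least-squares fit of rating against cocoa percentage.
slope, intercept = np.polyfit(df["CocoaPercent"], df["Rating"], 1)

# Histogram of ratings, and a scatter plot with the fitted line drawn over it.
fig, (ax_hist, ax_scatter) = plt.subplots(1, 2, figsize=(10, 4))
ax_hist.hist(df["Rating"])
ax_hist.set_xlabel("Rating")
ax_scatter.scatter(df["CocoaPercent"], df["Rating"])
xs = np.sort(df["CocoaPercent"].to_numpy())
ax_scatter.plot(xs, slope * xs + intercept)
ax_scatter.set_xlabel("Cocoa percent")
ax_scatter.set_ylabel("Rating")
```

With the real page, `SAMPLE_HTML` would be replaced by the content fetched from the Codecademy URL, and the class names would need to be checked against the page source.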
None
- Python 3
- Jupyter Notebook
- Pandas Library
- Matplotlib Library
- Requests Library
- bs4 Library
- download the notebook and open it with Jupyter Notebook