Introduction to Web Scraping with Python and Beautiful Soup

Published 2017-01-06
Web scraping is a very powerful tool to learn for any data professional. With web scraping the entire internet becomes your database. In this tutorial, we show you how to parse a web page into a data file (CSV) using a Python package called BeautifulSoup.

In this example, we web scrape graphics cards from NewEgg.com.

Find the updated version of this tutorial here:    • Web Scraping Tutorial with Python and...  

Python Code:
code.datasciencedojo.com/datasciencedojo/tutorials…

Sublime:
www.sublimetext.com/3

Anaconda:
www.anaconda.com/distribution/#download-section

JavaScript beautifier:
beautifier.io/

If you are not seeing the command line, follow this tutorial:
www.tenforums.com/tutorials/72024-open-command-win…

Table of Contents:
0:00 - Introduction
1:28 - Setting up Anaconda
3:00 - Installing Beautiful Soup
3:43 - Setting up urllib
6:07 - Retrieving the Web Page
10:47 - Evaluating Web Page
11:27 - Converting Listings into Line Items
16:13 - Using jsbeautiful
16:31 - Reading Raw HTML for Items to Scrape
18:34 - Building the Scraper
22:11 - Using the "findAll" Function
27:26 - Testing the Scraper
29:07 - Creating the .csv File
32:18 - End Result

--

At Data Science Dojo, we believe data science is for everyone. Our data science trainings have been attended by more than 10,000 employees from over 2,500 companies globally, including many leaders in tech like Microsoft, Google, and Facebook. For more information please visit: hubs.la/Q01Z-13k0

💼 Learn to build LLM-powered apps in just 40 hours with our Large Language Models bootcamp: hubs.la/Q01ZZGL-0

💼 Get started in the world of data with our top-rated data science bootcamp: hubs.la/Q01ZZDpt0

💼 Master Python for data science, analytics, machine learning, and data engineering: hubs.la/Q01ZZD-s0

💼 Explore, analyze, and visualize your data with Power BI desktop: hubs.la/Q01ZZF8B0

--

Unleash your data science potential for FREE! Dive into our tutorials, events & courses today!

📚 Learn the essentials of data science and analytics with our data science tutorials: hubs.la/Q01ZZJJK0

📚 Stay ahead of the curve with the latest data science content, subscribe to our newsletter now: hubs.la/Q01ZZBy10

📚 Connect with other data scientists and AI professionals at our community events: hubs.la/Q01ZZLd80

📚 Checkout our free data science courses: hubs.la/Q01ZZMcm0

📚 Get your daily dose of data science with our trending blogs: hubs.la/Q01ZZMWl0

--

📱 Social media links

Connect with us: www.linkedin.com/company/data-science-dojo

Follow us: twitter.com/DataScienceDojo

Keep up with us: www.instagram.com/data_science_dojo/

Like us: www.facebook.com/datasciencedojo

Find us: www.threads.net/@data_science_dojo

--

Also, join our communities:

LinkedIn: www.linkedin.com/groups/13601597/

Twitter: twitter.com/i/communities/1677363761399865344

Facebook: www.facebook.com/groups/AIandMachineLearningforEve…

Vimeo: vimeo.com/datasciencedojo

Discord: discord.com/invite/tj8ken4Err

_

Want to share your data science knowledge? Boost your profile and share your knowledge with our community: hubs.la/Q01ZZNCn0

#webscraping #python #beautifulsoup