13.30 Download the information of the site using Beautiful Soup

(Comments)

Download the information of the site using Beautiful Soup

Intro

The beautiful soup has a better way of parsing HTML in comparison with request

Example

The example of beautiful soup can be seen here in the Trinket attachment

The more detail of the lesson can be found also in AI Sweigart web

Summary

  • Web pages are plaintext files formatted as HTML.
  • HTML can be parsed with the BeautifulSoup module.
  • BeautifulSoup is imported with the name bs4.
  • Pass the string with the HTML to the bs4.BeautfiulSoup() function to get a Soup object.
  • The Soup object has a select() method that can be passed a string of the CSS selector for an HTML tag.
  • You can get a CSS selector string from the browser's developer tools by right-clicking the element and selecting Copy CSS Path.
  • The select() method will return a list of matching Element objects.
Currently unrated

Comments

Riddles

22nd Jul- 2020, by: Editor in Chief
524 Shares 4 Comments
Generic placeholder image
20 Oct- 2019, by: Editor in Chief
524 Shares 4 Comments
Generic placeholder image
20Aug- 2019, by: Editor in Chief
524 Shares 4 Comments

Economics

10Aug- 2019, by: Editor in Chief
424 Shares 4 Comments
Generic placeholder image
10Aug- 2015, by: Editor in Chief
424 Shares 4 Comments

More News  »

Kim Kardashian West has filed for divorce from Kanye West

Recent news

Reported from CNN, Kim Kardashian West has filed for divorce from Kanye West, a court clerk for Los Angeles Superior Court confirmed to CNN on Friday.

read more
1 week, 3 days ago

Formula in finding the area of a circle

Recent news

What is a circle and what the formula to find them? 

read more
1 month ago

Indonesian book for kids

Recent news

Indonesian books for kids

read more
1 month ago

Test the formula from Mathjax

Recent news

When \(a \ne 0\), there are two solutions to \(ax^2 + bx + c = 0\) and they are
\[x = {-b \pm \sqrt{b^2-4ac} \over 2a}.\]

read more
1 month ago

13.39 Using the request module to download the website

Recent news
1 month, 3 weeks ago

13.30 Download the information of the site using Beautiful Soup

Recent news

Download the information of the site using Beautiful Soup

Intro

The beautiful soup has a better way of parsing HTML in comparison with request

read more
1 month, 3 weeks ago

Debug error Could not install packages due to an EnvironmentError: [Errno 13]

Recent news
1 month, 3 weeks ago

13.38 Download the site with the module webbrowser

Recent news
1 month, 3 weeks ago

More News »

Generic placeholder image

Collaboratively administrate empowered markets via plug-and-play networks. Dynamically procrastinate B2C users after installed base benefits. Dramatically visualize customer directed convergence without