I am trying to scrap a website and extracting the data from it. For Demo purpose, I am using http://www.w3schools.com/sql/sql_select.asp website as it is one for simple and popular website for learning basic of programming languages. I am now concentrating on the values which are on left pane on this site. Let’s go to extract these keywords. For the web parsing, we should install beautiful soup. If we go to “Inspect element”, the HTML tag for the website looks like this: We now came to know that, the names or text, we need to extract is lying under “<a>” tag. To get all the data from “<a>” tag, use the following code. from bs4 import BeautifulSoup import urllib2 html="http://www.w3schools.com/sql/sql_select.asp" WebParse = urllib2.urlopen (html).read() soup = BeautifulSoup (WebParse) for ul in soup.findAll ('a'): print ul And if we investigate, we can see there are lot of r...