w3resource

Python Web Scraping: Display the name of the most recently added dataset on data.gov


Write a Python program to display the name of the most recently added dataset on data.gov.

Sample Solution:

Python Code:

from lxml import html
import requests
response = requests.get('http://catalog.data.gov/dataset?q=&sort=metadata_created+desc')
doc = html.fromstring(response.text)
title = doc.cssselect('h3.dataset-heading')[0].text_content()
print("The name of the most recently added dataset on data.gov:")
print(title.strip())

Sample Output:

The name of the most recently added dataset on data.gov:
Small Cell Locations
 

Flowchart:

Python Web Scraping Flowchart: Display the name of the most recently added dataset on data.gov

Python Code Editor:

Have another way to solve this solution? Contribute your code (and comments) through Disqus.

Previous: Write a Python program to convert an address (like "1600 Amphitheatre Parkway, Mountain View, CA") into geographic coordinates (like latitude 37.423021 and longitude -122.083739).
Next: Write a Python program to extract h1 tag from example.com

What is the difficulty level of this exercise?



Follow us on Facebook and Twitter for latest update.