Skip to content

Latest commit

 

History

History
27 lines (18 loc) · 1009 Bytes

File metadata and controls

27 lines (18 loc) · 1009 Bytes

Python Developer Sample Project

The task is to create a simple web scraper and display the results on a webpage.

Python libraries:

  • requests
  • BeautifulSoup4
  • flask

You will be scraping the Auto's section of the New York Daily News website.

Scrape the list of vehicles reviewed (only results page 1) and store them locally (sqlite? json file? etc).

Then, using flask, create a simple webpage endpoint that displays an HTML table showing the title (year/make/model) and summary of the vehicle. You can have 1 table for Trucks, and one for Sport.

Bonus features:

  • Crawl pagination (results page 2, 3, etc of the car lists)
  • Crawl vehicle price (how is it loaded into the page? :) )
  • Allow the crawler / flask app to be run inside a docker container

How to run

docker-compose up --build -d

Browser to http://localhost and click to Sport or Truck Models links