Web Scraping of O’Reilly Artificial Intelligence Conference 2019 San Jose Using Python

SUMMARY: The purpose of this project is to practice web scraping by extracting specific pieces of information from a website. The web scraping Python code leverages the BeautifulSoup module.

INTRODUCTION: The O’Reilly Artificial Intelligence (AI) Conference covers the full range of topics in leveraging the AI technologies for developing software applications and creating innovative solutions. This web scraping script will automatically traverse through the entire web page and collect all links to the PDF and PPTX documents. The script will also download the documents as part of the scraping process.

https://conferences.oreilly.com/artificial-intelligence/ai-ca-2019/public/schedule/proceedings

The source code and HTML output can be found here on GitHub.