Gathering Open Data: Web Scraping and APIs

LecturerJun.-Prof. Dr. Tristan Becker
Junior Professor in Business Administration, esp. Management Science
TU Dresden
DateSeptember 11 & 12, 2025
with classes from 9:00 a.m. to 4:00 p.m. each day
Room/AddressGeorg Schumann-Bau (SCH/B37)
TU Dresden 
Seminar contentThe internet contains vast amounts of open data. In most cases, it is impossible to simply download the desired data in a structured form, but the data is distributed across many web pages. Web Scraping is a technique to automate the extraction of desired data. There are numerous applications, such as gathering price data from online shops, collecting information from social media websites like Facebook or Twitter, gathering data from job networks, and collecting general information on sports results or movie scores. By applying web scraping, the data from a large number of web pages can be quickly collected and saved in a structured data set. The data holds potential for all kinds of research projects using, e.g., statistical or optimization methods.
In this course, we will explore the fundamentals of web scraping with Python 3. We will learn how to access APIs with Python and look at the basics of web scraping. This includes an overview of fundamental elements that make up websites, libraries for web scraping (such as requests, Beautiful Soup, Scrapy, Selenium Webdriver), and a brief discussion about data storage. Further, we will examine some examples of scraping real websites.
PrerequisitesWe recommend basic programming skills in Python 3.
CertificateDoctoral candidates from the Faculty of Business and Economics, TU Dresden can earn a certificate according to § 9 of the Ph.D. doctoral regulations (PromO 2018):
Doctoral candidates of Business Administration: § 9 (1) Nr. 5 or 6
Doctoral candidates of Business Information Systems: § 9 (1) Nr. 6
Doctoral candidates of Economics: § 9 (1) Nr. 6

Doctoral candidates from other universities can earn a certificate as well.
AssignmentStudents have to complete a brief web scraping assignment by picking a website and applying the Web Scraping skills from this course to compile a data set (e.g., collect weather data, sports results, or price data). They must submit both their code and data.
RegistrationParticipation is limited (max. 15). 
To register send an e-mail to Dr. Uta Schwarz: uta.schwarz@tu-dresden.de
Phone: +49 351 463-33141

Schreibe einen Kommentar

Deine E-Mail-Adresse wird nicht veröffentlicht. Erforderliche Felder sind mit * markiert