Data acquisition, extraction, and storage (2024–2025)
Course material for Data acquisition, extraction, and storage of the IASD master.
- Web content acquisition
- Structured content extraction from the Web
- Lab: Scrapy
- Handling Relational Data
- Lab: relational data
- Processing other (non-HTML, non-tabular) data formats
- Distributed Computing with MapReduce and Beyond
- Provenance in Databases: Principles and Applications
- Probabilistic Databases