Data acquisition, extraction, and storage (2023–2024)
Course material for Data acquisition, extraction, and storage of the IASD master. Course created in 2023–2024.
- Web content acquisition
- Structured content extraction from the Web
- Relational Database Management
- DB or no DB?
- Distributed Computing with MapReduce and Beyond
- Processing other (non-HTML, non-tabular) data formats
- Inverted Index Construction