Web Search and Data Mining (WiSDoM) is an area that aims at extracting knowledge from the largest source of information created by humans: The Web! Throughout this course we will see how this extracted knowledge can solve complex tasks with advanced Computer Vision, Natural Language and Information Retrieval algorithms. The main topics of this course are:
This course includes intensive hands-on laboratories where key CV, NLP and IR algorithms are examined.
We suggest you to use the account in the lab cluster. However, if you would like to have your own setup, you can follow this guide:
Tutorial 0 - Environment setup instructions
Tutorial 1 - Text representation
Tutorial 2 - OpenSearch
Tutorial 3 - Transformer encoder
Tutorial 4 - Vision understanding
Tutorial 5 - Vision and language models
Tutorial 6 - Dialog intent detection
Tutorial 7 - Model fine tuning (draft) You will also need this extra code.
Tutorial 8 - Language generation (draft)
Joao Magalhaes ([email protected] - remove the ‘x’ character to send an email)