Language, Vision, and Interaction - Einzelansicht

Funktionen:

Veranstaltungsart	Seminar	Veranstaltungsnummer
SWS	2	Semester	SoSe 2022
Einrichtung	Department Linguistik	Sprache	englisch
Belegungsfrist	01.04.2022 - 10.05.2022 Belegung über PULS

Gruppe 1:

Vormerken: jetzt belegen / abmelden

		Tag	Zeit	Rhythmus	Dauer	Raum	Lehrperson	Ausfall-/Ausweichtermine	Max. Teilnehmer/-innen
	Seminar	Do	10:00 bis 12:00	wöchentlich	21.04.2022 bis 28.07.2022	2.14.2.22	Prof. Dr. Schlangen
Einzeltermine: 21.04.2022 28.04.2022 05.05.2022 12.05.2022 19.05.2022 02.06.2022 09.06.2022 16.06.2022 23.06.2022 30.06.2022 07.07.2022 14.07.2022 21.07.2022 28.07.2022

Kommentar	In this practical course, we will look at current models that work at the intersection of natural language processing and computer vision, specifically those that turn images into words. We will do this in the context of a (somewhat) practical application. We will build an interactive system that can serve as the "eyes" of the user, through and with which the user can investigate a virtual environment. This takes the existing language & vision models out of their usual laboratory environment, in which they are tested on automated metrics, and into the real of actual use; we will see how well they do. The students will gain familiarity with current state-of-the-art models in Language and Vision, as well as some insight into modelling dialogue. On the software engineering side, questions of how to actually deploy deep learning models will become relevant as well. The project will be group work, with the division of tasks to be determined in the first meetings.

Kommentar

In this practical course, we will look at current models that work at
the intersection of natural language processing and computer vision,
specifically those that turn images into words. We will do this in the
context of a (somewhat) practical application. We will build an
interactive system that can serve as the "eyes" of the user, through and
with which the user can investigate a virtual environment. This takes
the existing language & vision models out of their usual laboratory
environment, in which they are tested on automated metrics, and into the
real of actual use; we will see how well they do.

The students will gain familiarity with current state-of-the-art models
in Language and Vision, as well as some insight into modelling dialogue.
On the software engineering side, questions of how to actually deploy
deep learning models will become relevant as well. The project will be
group work, with the division of tasks to be determined in the first
meetings.

Strukturbaum

Keine Einordnung ins Vorlesungsverzeichnis vorhanden. Veranstaltung ist aus dem Semester SoSe 2022 , Aktuelles Semester: WiSe 2024/25