Skip to main content
King Abdullah University of Science and Technology
KAUST
Main navigation
Home
multimodal alignment
Towards Scalable and Efficient Semantic Video Search
Mattia Soldan, Ph.D. Student, Electrical and Computer Engineering
Jul 13, 18:00
-
19:00
B4 L5 R5209
video-language grounding
semantic video retrieval
multimodal alignment
This dissertation advances fine-grained, content-aware video retrieval by developing novel models and frameworks for Video-Language Grounding, enabling accurate alignment between natural language queries and specific temporal segments in unstructured video content.