Zero-Shot Video Moment Retrieval From Frozen Vision-Language Models

Zero-Shot Video Moment Retrieval From Frozen Vision-Language Models

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language ModelsПодробнее

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models

Large Language Models Are Zero Shot ReasonersПодробнее

Large Language Models Are Zero Shot Reasoners

Zero-Shot Visual Question AnsweringПодробнее

Zero-Shot Visual Question Answering

412 - Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-shot LearningПодробнее

412 - Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-shot Learning

OpenAI's CLIP for Zero Shot Image ClassificationПодробнее

OpenAI's CLIP for Zero Shot Image Classification

Contextual Emotion Recognition using Large Vision Language ModelsПодробнее

Contextual Emotion Recognition using Large Vision Language Models

Fast Zero Shot Object Detection with OpenAI CLIPПодробнее

Fast Zero Shot Object Detection with OpenAI CLIP

Zero-Shot Building Attribute Extraction From Large-Scale Vision and Language ModelsПодробнее

Zero-Shot Building Attribute Extraction From Large-Scale Vision and Language Models

CVPR #18542 - New Frontiers for Zero-Shot Image Captioning EvaluationПодробнее

CVPR #18542 - New Frontiers for Zero-Shot Image Captioning Evaluation

OpenAI CLIP: ConnectingText and Images (Paper Explained)Подробнее

OpenAI CLIP: ConnectingText and Images (Paper Explained)

Video Moment Retrieval With Cross Modal Neural Architecture SearchПодробнее

Video Moment Retrieval With Cross Modal Neural Architecture Search

Modality-Aware Representation Learning for Zero-Shot Sketch-Based Image RetrievalПодробнее

Modality-Aware Representation Learning for Zero-Shot Sketch-Based Image Retrieval

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language KПодробнее

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language K

Extending CLIP Model to Video Retrieval and Action Recognition [VLR-16824] | Final ProjectПодробнее

Extending CLIP Model to Video Retrieval and Action Recognition [VLR-16824] | Final Project

[CVPR 2023] Hierarchical Video-Moment Retrieval and Step-CaptioningПодробнее

[CVPR 2023] Hierarchical Video-Moment Retrieval and Step-Captioning

Video Moment Retrieval app using Tensorflow and Django.Подробнее

Video Moment Retrieval app using Tensorflow and Django.

Query-Dependent Video Representation for Moment Retrieval and Highlight DetectionПодробнее

Query-Dependent Video Representation for Moment Retrieval and Highlight Detection

【S3E8】Learning visual language models for video understandingПодробнее

【S3E8】Learning visual language models for video understanding