Audiovisual Self-Supervised Learning

SANE2023 | Arsha Nagrani - Audio-Visual Learning for Video Understanding

[ICML2024] EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning

[Interspeech 2021] AVLnet: Learning Audio-Visual Language Representations from Instructional Videos

Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization

CVPR 2023 Paper: Learning AV Source Localization via False Negative Aware Contrastive Learning

Boosting Positive Segments for Weakly-Supervised Audio-Visual Video Parsing

[CVPR 2024] AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection

Audio-visual self-supervised baby learning

[Interspeech 2021] Cascaded Multilingual Audio-Visual Learning from Videos

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning

Speech emotion recognition using self-supervised learning with domain-specific audiovisual tasks

Visually Guided Sound Source Separation and Localization using Self-Supervised Motion Representatio

Fellowship: Robust Self Supervised Audio Visual Speech Recognition

Comparing Learning Methodologies for Self Supervised Audio Visual Representation Learning

EI Seminar - Kristen Grauman - Audio-Visual Learning in 3D Environments

Self-Supervised Learning & Foundation Models? MIT Short Answer

Fellowship: Robust self supervised audio visual speech recognition.

[ECCVW22 VOLI] Poster 7: Self-Supervised Representation Learning from Videos of Audible Interactions

Recent Progress in Audio-Visual Language Learning with Jim Glass

IROS 2023 AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness