| Perfect match: Improved cross-modal embeddings for audio-visual synchronisation | Sep 21, 2018 | Binary Classification, Cross-Modal Retrieval | —Unverified | 0 | 0 |
| Sub-millisecond Video Synchronization of Multiple Android Smartphones | Jul 2, 2021 | Video Synchronization | —Unverified | 0 | 0 |
| Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models | Apr 6, 2025 | Audio Generation, GPU | —Unverified | 0 | 0 |
| ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement | Dec 21, 2022 | Audio-Visual Speech Recognition, Resynthesis | —Unverified | 0 | 0 |
| ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | Jan 1, 2023 | Audio-Visual Speech Recognition, Resynthesis | —Unverified | 0 | 0 |
| Applying Automated Machine Translation to Educational Video Courses | Jan 9, 2023 | Machine Translation, Speech Synthesis | —Unverified | 0 | 0 |
| Video alignment using unsupervised learning of local and global features | Apr 13, 2023 | Dynamic Time Warping, Human Detection | —Unverified | 0 | 0 |
| AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Dec 19, 2024 | Video Generation, Video Synchronization | —Unverified | 0 | 0 |
| Self-supervised learning for audio-visual speaker diarization | Feb 13, 2020 | Self-Supervised Learning, speaker-diarization | —Unverified | 0 | 0 |
| Bronchoscopic video synchronization for interactive multimodal inspection of bronchial lesions | Mar 20, 2023 | Computed Tomography (CT), Video Synchronization | —Unverified | 0 | 0 |