| CoMAC: Conversational Agent for Multi-Source Auxiliary Context with Sparse and Symmetric Latent Interactions | Mar 25, 2025 | Response Generationtext similarity | CodeCode Available | 0 |
| TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification | Mar 9, 2025 | Robot NavigationSTS | CodeCode Available | 1 |
| TAIL: Text-Audio Incremental Learning | Mar 6, 2025 | AudioCapsIncremental Learning | —Unverified | 0 |
| Interpretable Text Embeddings and Text Similarity Explanation: A Primer | Feb 20, 2025 | Similarity Explanationtext similarity | —Unverified | 0 |
| Semantics-aware Test-time Adaptation for 3D Human Pose Estimation | Feb 15, 2025 | 3D human pose and shape estimation3D Human Pose Estimation | —Unverified | 0 |
| FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization | Jan 17, 2025 | Anomaly DetectionImage-text matching | CodeCode Available | 2 |
| SHYI: Action Support for Contrastive Learning in High-Fidelity Text-to-Image Generation | Jan 15, 2025 | Contrastive LearningImage Generation | —Unverified | 0 |
| Taxonomy-Aware Evaluation of Vision-Language Models | Jan 1, 2025 | Fine-Grained Image ClassificationLanguage Modeling | —Unverified | 0 |
| Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning | Dec 31, 2024 | Caption GenerationDecoder | —Unverified | 0 |
| DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation | Dec 24, 2024 | Comment GenerationFew-Shot Learning | —Unverified | 0 |