| RL from Physical Feedback: Aligning Large Motion Models with Humanoid Control | Jun 15, 2025 | Humanoid ControlMotion Generation | —Unverified | 0 |
| Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence | Jun 9, 2025 | Depth EstimationMonocular Depth Estimation | —Unverified | 0 |
| Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels | Jun 5, 2025 | Semantic correspondence | —Unverified | 0 |
| MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation | Jun 3, 2025 | Contrastive LearningMotion Synthesis | —Unverified | 0 |
| Cora: Correspondence-aware image editing using few step diffusion | May 29, 2025 | Image-to-Image TranslationSemantic correspondence | CodeCode Available | 1 |
| Semantic Correspondence: Unified Benchmarking and a Strong Baseline | May 23, 2025 | BenchmarkingSemantic correspondence | CodeCode Available | 1 |
| TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text-Video Retrieval | Apr 7, 2025 | Contrastive LearningRetrieval | CodeCode Available | 0 |
| SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations | Mar 28, 2025 | ObjectSemantic correspondence | —Unverified | 0 |
| Semantix: An Energy Guided Sampler for Semantic Style Transfer | Mar 28, 2025 | Appearance TransferSemantic correspondence | —Unverified | 0 |
| Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach | Mar 27, 2025 | Semantic correspondence | —Unverified | 0 |
| FPGS: Feed-Forward Semantic-aware Photorealistic Style Transfer of Large-Scale Gaussian Splatting | Mar 11, 2025 | Semantic correspondenceStyle Transfer | —Unverified | 0 |
| Evaluation of Multilingual Image Captioning: How far can we get with CLIP models? | Feb 10, 2025 | Image CaptioningSemantic correspondence | CodeCode Available | 0 |
| Towards Affordance-Aware Articulation Synthesis for Rigged Objects | Jan 21, 2025 | Semantic correspondence | —Unverified | 0 |
| Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation | Jan 9, 2025 | DiversityImage Generation | —Unverified | 0 |
| Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space | Jan 1, 2025 | Instance SegmentationObject | CodeCode Available | 0 |
| Bridging Viewpoint Gaps: Geometric Reasoning Boosts Semantic Correspondence | Jan 1, 2025 | Geometric MatchingSemantic correspondence | —Unverified | 0 |
| Self-Supervised Spatial Correspondence Across Modalities | Jan 1, 2025 | Geometric MatchingSemantic correspondence | —Unverified | 0 |
| Manga Generation via Layout-controllable Diffusion | Dec 26, 2024 | Semantic correspondence | —Unverified | 0 |
| DenseMatcher: Learning 3D Semantic Correspondence for Category-Level Manipulation from a Single Demo | Dec 6, 2024 | ObjectSemantic correspondence | —Unverified | 0 |
| A Framework For Image Synthesis Using Supervised Contrastive Learning | Dec 5, 2024 | Contrastive LearningGenerative Adversarial Network | —Unverified | 0 |
| Distillation of Diffusion Features for Semantic Correspondence | Dec 4, 2024 | 3D ReconstructionData Augmentation | —Unverified | 0 |
| Multi-Level Correlation Network For Few-Shot Image Classification | Dec 4, 2024 | Few-Shot Image Classificationimage-classification | CodeCode Available | 0 |
| CleanDIFT: Diffusion Features without Noise | Dec 4, 2024 | Semantic correspondence | CodeCode Available | 2 |
| OmniCreator: Self-Supervised Unified Generation with Universal Editing | Dec 3, 2024 | DenoisingSemantic correspondence | —Unverified | 0 |
| Integration of Contextual Descriptors in Ontology Alignment for Enrichment of Semantic Correspondence | Nov 28, 2024 | Semantic correspondence | —Unverified | 0 |
| Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation | Oct 3, 2024 | Few-Shot Semantic SegmentationImage Generation | CodeCode Available | 1 |
| DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion | Sep 15, 2024 | Semantic correspondence | —Unverified | 0 |
| MOSMOS: Multi-organ segmentation facilitated by medical report supervision | Sep 4, 2024 | Contrastive LearningOrgan Segmentation | —Unverified | 0 |
| CONDA: Condensed Deep Association Learning for Co-Salient Object Detection | Sep 2, 2024 | Co-Salient Object Detectionobject-detection | —Unverified | 0 |
| SGOR: Outlier Removal by Leveraging Semantic and Geometric Information for Robust Point Cloud Registration | Jul 8, 2024 | Point Cloud RegistrationSemantic correspondence | CodeCode Available | 0 |
| GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding | Jun 14, 2024 | Activity RecognitionMMR total | —Unverified | 0 |
| Advanced Multimodal Deep Learning Architecture for Image-Text Matching | Jun 13, 2024 | Deep LearningImage-text matching | —Unverified | 0 |
| Zero-shot Image Editing with Reference Imitation | Jun 11, 2024 | Semantic correspondence | CodeCode Available | 5 |
| Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models | Jun 11, 2024 | Appearance TransferImage Generation | —Unverified | 0 |
| Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | May 27, 2024 | SegmentationSemantic correspondence | CodeCode Available | 2 |
| CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation | May 24, 2024 | Generalized Referring Expression SegmentationObject | CodeCode Available | 1 |
| Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis | May 16, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 |
| Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation | May 15, 2024 | Contrastive Learningcross-modal alignment | CodeCode Available | 1 |
| Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform | Apr 17, 2024 | DecoderSemantic correspondence | —Unverified | 0 |
| Independently Keypoint Learning for Small Object Semantic Correspondence | Apr 3, 2024 | DecoderObject | —Unverified | 0 |
| Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation | Jan 15, 2024 | Robot ManipulationSemantic correspondence | —Unverified | 0 |
| FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields | Jan 10, 2024 | Semantic correspondenceStyle Transfer | —Unverified | 0 |
| Making Visual Sense of Oracle Bones for You and Me | Jan 1, 2024 | Semantic correspondence | CodeCode Available | 0 |
| Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration | Jan 1, 2024 | Representation LearningSemantic correspondence | CodeCode Available | 0 |
| Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps | Dec 20, 2023 | Representation LearningSemantic correspondence | —Unverified | 0 |
| CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer | Dec 14, 2023 | Cross-Lingual TransferCross-Modal Retrieval | —Unverified | 0 |
| Patch-wise Graph Contrastive Learning for Image Translation | Dec 13, 2023 | Contrastive LearningGraph Neural Network | CodeCode Available | 1 |
| StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On | Dec 4, 2023 | Semantic correspondenceVirtual Try-on | CodeCode Available | 3 |
| Match me if you can: Semi-Supervised Semantic Correspondence Learning with Unpaired Images | Nov 30, 2023 | Semantic correspondence | CodeCode Available | 0 |
| Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence | Nov 28, 2023 | Animal Pose EstimationPose Estimation | CodeCode Available | 1 |