| Deep Momentum Multi-Marginal Schrödinger Bridge | Mar 3, 2023 | Position | CodeCode Available | 1 |
| Deep Domain Confusion: Maximizing for Domain Invariance | Dec 10, 2014 | Domain AdaptationModel Selection | CodeCode Available | 1 |
| Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes | Jan 19, 2021 | Deep Reinforcement LearningPosition | CodeCode Available | 1 |
| ADIFF: Explaining audio difference using natural language | Feb 6, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 |
| DALNet: A Rail Detection Network Based on Dynamic Anchor Line | Aug 22, 2023 | DiversityLane Detection | CodeCode Available | 1 |
| Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings | Dec 16, 2024 | Disaster Responsegeo-localization | CodeCode Available | 1 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | ObjectPosition | CodeCode Available | 1 |
| Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks | Apr 20, 2024 | Pose PredictionPosition | CodeCode Available | 1 |
| DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training | Aug 1, 2024 | DenoisingGraph Matching | CodeCode Available | 1 |
| EgoNN: Egocentric Neural Network for Point Cloud Based 6DoF Relocalization at the City Scale | Oct 24, 2021 | Position | CodeCode Available | 1 |
| Anomaly Detection Requires Better Representations | Oct 19, 2022 | 3D Anomaly Detection and SegmentationAnomaly Detection | CodeCode Available | 1 |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Jan 1, 2023 | Instance SegmentationMulti-Object Tracking | CodeCode Available | 1 |
| ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences | Jun 5, 2022 | Position | CodeCode Available | 1 |
| Consensus Learning with Deep Sets for Essential Matrix Estimation | Jun 25, 2024 | Position | CodeCode Available | 1 |
| ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices | Jun 4, 2025 | Position | CodeCode Available | 1 |
| Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining | Dec 13, 2022 | PositionPrivacy Preserving | CodeCode Available | 1 |
| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image GenerationPosition | CodeCode Available | 1 |
| Comment on paper: Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems | Jun 11, 2024 | AllPosition | CodeCode Available | 1 |
| Cobiveco: Consistent biventricular coordinates for precise and intuitive description of position in the heart -- with MATLAB implementation | Feb 4, 2021 | Position | CodeCode Available | 1 |
| ColdNAS: Search to Modulate for User Cold-Start Recommendation | Jun 6, 2023 | Neural Architecture SearchPosition | CodeCode Available | 1 |
| Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks | Jan 6, 2022 | Audio ClassificationClassification | CodeCode Available | 1 |
| A Closer Look at Parameter-Efficient Tuning in Diffusion Models | Mar 31, 2023 | Efficient Diffusion PersonalizationPosition | CodeCode Available | 1 |
| CLEX: Continuous Length Extrapolation for Large Language Models | Oct 25, 2023 | 4kPosition | CodeCode Available | 1 |
| ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths | Jun 12, 2022 | ChunkingDocument Classification | CodeCode Available | 1 |
| Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions | Apr 17, 2024 | Position | CodeCode Available | 1 |
| COGS: A Compositional Generalization Challenge Based on Semantic Interpretation | Oct 12, 2020 | PositionSemantic Parsing | CodeCode Available | 1 |
| Collect-and-Distribute Transformer for 3D Point Cloud Analysis | Jun 2, 2023 | Point Cloud ClassificationPosition | CodeCode Available | 1 |
| Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings | Nov 25, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| 3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW | Jun 4, 2023 | PositionSegmentation | CodeCode Available | 1 |
| Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding | Nov 12, 2023 | ObjectPosition | CodeCode Available | 1 |
| CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning | Jan 25, 2024 | Multiple-choicePosition | CodeCode Available | 1 |
| ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes | Jan 19, 2022 | 3D Canonicalization3D Geometry Perception | CodeCode Available | 1 |
| CoMoGAN: continuous model-guided image-to-image translation | Mar 11, 2021 | Image-to-Image TranslationPosition | CodeCode Available | 1 |
| Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Sep 3, 2018 | Face HallucinationHallucination | CodeCode Available | 1 |
| Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images | Nov 26, 2022 | Diabetic Retinopathy GradingPosition | CodeCode Available | 1 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| 3D human pose estimation in video with temporal convolutions and semi-supervised training | Nov 28, 2018 | 3D Human Pose EstimationMonocular 3D Human Pose Estimation | CodeCode Available | 1 |
| CAPE: Camera View Position Embedding for Multi-View 3D Object Detection | Mar 17, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| CTIN: Robust Contextual Transformer Network for Inertial Navigation | Dec 3, 2021 | DecoderMulti-Task Learning | CodeCode Available | 1 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2kPosition | CodeCode Available | 1 |
| DeepBall: Deep Neural-Network Ball Detector | Feb 19, 2019 | General ClassificationObject | CodeCode Available | 1 |
| Deep Deformable 3D Caricatures with Learned Shape Control | Jul 29, 2022 | CaricaturePosition | CodeCode Available | 1 |
| DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function | Jan 2, 2020 | Position | CodeCode Available | 1 |
| Can an AI Win Ghana's National Science and Maths Quiz? An AI Grand Challenge for Education | Jan 30, 2023 | MathPosition | CodeCode Available | 1 |
| Causal Imitative Model for Autonomous Driving | Dec 7, 2021 | Autonomous DrivingImitation Learning | CodeCode Available | 1 |
| On the Connection between Local Attention and Dynamic Depth-wise Convolution | Jun 8, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Depth Based Semantic Scene Completion with Position Importance Aware Loss | Jan 29, 2020 | 3D Semantic SegmentationPosition | CodeCode Available | 1 |
| Depth Estimation From Indoor Panoramas With Neural Scene Representation | Jan 1, 2023 | Depth EstimationPosition | CodeCode Available | 1 |
| Camera Pose Auto-Encoders for Improving Pose Regression | Jul 12, 2022 | Positionregression | CodeCode Available | 1 |
| Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers | May 25, 2022 | Federated LearningPosition | CodeCode Available | 1 |