| On the Connection between Local Attention and Dynamic Depth-wise Convolution | Jun 8, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Depth Based Semantic Scene Completion with Position Importance Aware Loss | Jan 29, 2020 | 3D Semantic SegmentationPosition | CodeCode Available | 1 |
| Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks | Apr 20, 2024 | Pose PredictionPosition | CodeCode Available | 1 |
| Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes | Jan 19, 2021 | Deep Reinforcement LearningPosition | CodeCode Available | 1 |
| Differentiable Physics Simulations with Contacts: Do They Have Correct Gradients w.r.t. Position, Velocity and Control? | Jul 8, 2022 | Position | CodeCode Available | 1 |
| Differentiable short-time Fourier transform with respect to the hop length | Jul 26, 2023 | Position | CodeCode Available | 1 |
| Delta Hedging Liquidity Positions on Automated Market Makers | Aug 4, 2022 | Position | CodeCode Available | 1 |
| Conditional Positional Encodings for Vision Transformers | Feb 22, 2021 | AutoMLClassification | CodeCode Available | 1 |
| DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Dec 27, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 1 |
| DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions | Sep 7, 2023 | PositionSpatial Reasoning | CodeCode Available | 1 |
| Depth Estimation From Indoor Panoramas With Neural Scene Representation | Jan 1, 2023 | Depth EstimationPosition | CodeCode Available | 1 |
| DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function | Jan 2, 2020 | Position | CodeCode Available | 1 |
| Efficient DOA Estimation Method for Reconfigurable Intelligent Surfaces Aided UAV Swarm | Mar 19, 2022 | Position | CodeCode Available | 1 |
| Efficient Object Localization Using Convolutional Networks | Nov 16, 2014 | ObjectObject Localization | CodeCode Available | 1 |
| A Case for Rejection in Low Resource ML Deployment | Aug 12, 2022 | DiversityPosition | CodeCode Available | 1 |
| Electricity Theft Detection with self-attention | Feb 14, 2020 | Missing ValuesPosition | CodeCode Available | 1 |
| EllSeg: An Ellipse Segmentation Framework for Robust Gaze Tracking | Jul 19, 2020 | PositionSegmentation | CodeCode Available | 1 |
| End-to-end Learning Improves Static Object Geo-localization in Monocular Video | Apr 10, 2020 | geo-localizationMulti-Object Tracking | CodeCode Available | 1 |
| 3D Feature Tracking via Event Camera | Jan 1, 2024 | Motion CompensationPatch Matching | CodeCode Available | 1 |
| A Deep Recurrent Survival Model for Unbiased Ranking | Apr 30, 2020 | Information Retrievalmodel | CodeCode Available | 1 |
| Everybody Compose: Deep Beats To Music | Jun 9, 2023 | Position | CodeCode Available | 1 |
| Exploiting Inductive Bias in Transformer for Point Cloud Classification and Segmentation | Apr 27, 2023 | 3D Object Classification3D Part Segmentation | CodeCode Available | 1 |
| Deep Domain Confusion: Maximizing for Domain Invariance | Dec 10, 2014 | Domain AdaptationModel Selection | CodeCode Available | 1 |
| Exploring Text-transformers in AAAI 2021 Shared Task: COVID-19 Fake News Detection in English | Jan 7, 2021 | Fake News DetectionPosition | CodeCode Available | 1 |
| DeepBall: Deep Neural-Network Ball Detector | Feb 19, 2019 | General ClassificationObject | CodeCode Available | 1 |
| Deep Deformable 3D Caricatures with Learned Shape Control | Jul 29, 2022 | CaricaturePosition | CodeCode Available | 1 |
| Deep Momentum Multi-Marginal Schrödinger Bridge | Mar 3, 2023 | Position | CodeCode Available | 1 |
| DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting | Aug 16, 2023 | Graph Neural NetworkGraph Regression | CodeCode Available | 1 |
| Dynamic Local Feature Aggregation for Learning on Point Clouds | Jan 7, 2023 | Point Cloud ClassificationPosition | CodeCode Available | 1 |
| Fast Risk Assessment for Autonomous Vehicles Using Learned Models of Agent Futures | May 27, 2020 | Autonomous VehiclesPosition | CodeCode Available | 1 |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Jul 31, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings | Dec 16, 2024 | Disaster Responsegeo-localization | CodeCode Available | 1 |
| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image GenerationPosition | CodeCode Available | 1 |
| Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images | Nov 26, 2022 | Diabetic Retinopathy GradingPosition | CodeCode Available | 1 |
| CTIN: Robust Contextual Transformer Network for Inertial Navigation | Dec 3, 2021 | DecoderMulti-Task Learning | CodeCode Available | 1 |
| ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences | Jun 5, 2022 | Position | CodeCode Available | 1 |
| Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Sep 3, 2018 | Face HallucinationHallucination | CodeCode Available | 1 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining | Dec 13, 2022 | PositionPrivacy Preserving | CodeCode Available | 1 |
| Consensus Learning with Deep Sets for Essential Matrix Estimation | Jun 25, 2024 | Position | CodeCode Available | 1 |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Jan 1, 2023 | Instance SegmentationMulti-Object Tracking | CodeCode Available | 1 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2kPosition | CodeCode Available | 1 |
| CoMoGAN: continuous model-guided image-to-image translation | Mar 11, 2021 | Image-to-Image TranslationPosition | CodeCode Available | 1 |
| Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding | Nov 12, 2023 | ObjectPosition | CodeCode Available | 1 |
| Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings | Nov 25, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| Collect-and-Distribute Transformer for 3D Point Cloud Analysis | Jun 2, 2023 | Point Cloud ClassificationPosition | CodeCode Available | 1 |
| Comment on paper: Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems | Jun 11, 2024 | AllPosition | CodeCode Available | 1 |
| ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices | Jun 4, 2025 | Position | CodeCode Available | 1 |
| Cobiveco: Consistent biventricular coordinates for precise and intuitive description of position in the heart -- with MATLAB implementation | Feb 4, 2021 | Position | CodeCode Available | 1 |
| CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning | Jan 25, 2024 | Multiple-choicePosition | CodeCode Available | 1 |