| Deep Domain Confusion: Maximizing for Domain Invariance | Dec 10, 2014 | Domain AdaptationModel Selection | CodeCode Available | 1 |
| Deep Deformable 3D Caricatures with Learned Shape Control | Jul 29, 2022 | CaricaturePosition | CodeCode Available | 1 |
| Deep Momentum Multi-Marginal Schrödinger Bridge | Mar 3, 2023 | Position | CodeCode Available | 1 |
| DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting | Aug 16, 2023 | Graph Neural NetworkGraph Regression | CodeCode Available | 1 |
| CTIN: Robust Contextual Transformer Network for Inertial Navigation | Dec 3, 2021 | DecoderMulti-Task Learning | CodeCode Available | 1 |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Jul 31, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2kPosition | CodeCode Available | 1 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Contextual-Aware Position Encoding for Sequential Recommendation | Feb 13, 2025 | PositionRecommendation Systems | CodeCode Available | 1 |
| Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Sep 3, 2018 | Face HallucinationHallucination | CodeCode Available | 1 |
| 2-D SSM: A General Spatial Layer for Visual Transformers | Jun 11, 2023 | Inductive BiasPosition | CodeCode Available | 1 |
| ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences | Jun 5, 2022 | Position | CodeCode Available | 1 |
| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image GenerationPosition | CodeCode Available | 1 |
| Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images | Nov 26, 2022 | Diabetic Retinopathy GradingPosition | CodeCode Available | 1 |
| DALNet: A Rail Detection Network Based on Dynamic Anchor Line | Aug 22, 2023 | DiversityLane Detection | CodeCode Available | 1 |
| Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings | Dec 16, 2024 | Disaster Responsegeo-localization | CodeCode Available | 1 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | ObjectPosition | CodeCode Available | 1 |
| DeepBall: Deep Neural-Network Ball Detector | Feb 19, 2019 | General ClassificationObject | CodeCode Available | 1 |
| Enquire One's Parent and Child Before Decision: Fully Exploit Hierarchical Structure for Self-Supervised Taxonomy Expansion | Jan 27, 2021 | PositionTaxonomy Expansion | CodeCode Available | 1 |
| DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function | Jan 2, 2020 | Position | CodeCode Available | 1 |
| Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes | Jan 19, 2021 | Deep Reinforcement LearningPosition | CodeCode Available | 1 |
| Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks | Apr 20, 2024 | Pose PredictionPosition | CodeCode Available | 1 |
| On the Connection between Local Attention and Dynamic Depth-wise Convolution | Jun 8, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Dense Prediction Transformer for Scale Estimation in Monocular Visual Odometry | Oct 4, 2022 | Autonomous VehiclesMonocular Visual Odometry | CodeCode Available | 1 |
| Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark | May 6, 2021 | Crowd Countingobject-detection | CodeCode Available | 1 |
| Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding | Nov 12, 2023 | ObjectPosition | CodeCode Available | 1 |
| Differentiable short-time Fourier transform with respect to the hop length | Jul 26, 2023 | Position | CodeCode Available | 1 |
| Diffusion Action Segmentation | Mar 31, 2023 | Action SegmentationDenoising | CodeCode Available | 1 |
| Conditional Positional Encodings for Vision Transformers | Feb 22, 2021 | AutoMLClassification | CodeCode Available | 1 |
| CoMoGAN: continuous model-guided image-to-image translation | Mar 11, 2021 | Image-to-Image TranslationPosition | CodeCode Available | 1 |
| DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions | Sep 7, 2023 | PositionSpatial Reasoning | CodeCode Available | 1 |
| A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character Recognition | Mar 16, 2021 | BenchmarkingPosition | CodeCode Available | 1 |
| ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices | Jun 4, 2025 | Position | CodeCode Available | 1 |
| Dynamic Local Feature Aggregation for Learning on Point Clouds | Jan 7, 2023 | Point Cloud ClassificationPosition | CodeCode Available | 1 |
| Comment on paper: Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems | Jun 11, 2024 | AllPosition | CodeCode Available | 1 |
| EEGEyeNet: a Simultaneous Electroencephalography and Eye-tracking Dataset and Benchmark for Eye Movement Prediction | Nov 6, 2021 | EEGElectroencephalogram (EEG) | CodeCode Available | 1 |
| Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings | Nov 25, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| EgoNN: Egocentric Neural Network for Point Cloud Based 6DoF Relocalization at the City Scale | Oct 24, 2021 | Position | CodeCode Available | 1 |
| ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction | Dec 6, 2021 | Camera Pose EstimationGPU | CodeCode Available | 1 |
| Electricity Theft Detection with self-attention | Feb 14, 2020 | Missing ValuesPosition | CodeCode Available | 1 |
| ETC: Encoding Long and Structured Inputs in Transformers | Apr 17, 2020 | PositionQuestion Answering | CodeCode Available | 1 |
| Everybody Compose: Deep Beats To Music | Jun 9, 2023 | Position | CodeCode Available | 1 |
| ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes | Jan 19, 2022 | 3D Canonicalization3D Geometry Perception | CodeCode Available | 1 |
| All-to-key Attention for Arbitrary Style Transfer | Dec 8, 2022 | AllPosition | CodeCode Available | 1 |
| COGS: A Compositional Generalization Challenge Based on Semantic Interpretation | Oct 12, 2020 | PositionSemantic Parsing | CodeCode Available | 1 |
| A Low-Cost, Flexible and Portable Volumetric Capturing System | Sep 3, 2019 | Position | CodeCode Available | 1 |
| Cobiveco: Consistent biventricular coordinates for precise and intuitive description of position in the heart -- with MATLAB implementation | Feb 4, 2021 | Position | CodeCode Available | 1 |
| ColdNAS: Search to Modulate for User Cold-Start Recommendation | Jun 6, 2023 | Neural Architecture SearchPosition | CodeCode Available | 1 |
| A More Fine-Grained Aspect-Sentiment-Opinion Triplet Extraction Task | Mar 29, 2021 | Aspect-Based Sentiment AnalysisAspect-Sentiment-Opinion Triplet Extraction | CodeCode Available | 1 |
| PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation | May 16, 2025 | 3D Human Pose EstimationPose Estimation | CodeCode Available | 1 |