| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Jan 1, 2023 | Instance SegmentationMulti-Object Tracking | CodeCode Available | 1 |
| ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences | Jun 5, 2022 | Position | CodeCode Available | 1 |
| Consensus Learning with Deep Sets for Essential Matrix Estimation | Jun 25, 2024 | Position | CodeCode Available | 1 |
| Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions | Apr 17, 2024 | Position | CodeCode Available | 1 |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Jul 31, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining | Dec 13, 2022 | PositionPrivacy Preserving | CodeCode Available | 1 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2kPosition | CodeCode Available | 1 |
| DALNet: A Rail Detection Network Based on Dynamic Anchor Line | Aug 22, 2023 | DiversityLane Detection | CodeCode Available | 1 |
| DeepBall: Deep Neural-Network Ball Detector | Feb 19, 2019 | General ClassificationObject | CodeCode Available | 1 |
| Deep Deformable 3D Caricatures with Learned Shape Control | Jul 29, 2022 | CaricaturePosition | CodeCode Available | 1 |
| Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes | Jan 19, 2021 | Deep Reinforcement LearningPosition | CodeCode Available | 1 |
| Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks | Apr 20, 2024 | Pose PredictionPosition | CodeCode Available | 1 |
| On the Connection between Local Attention and Dynamic Depth-wise Convolution | Jun 8, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Dense Prediction Transformer for Scale Estimation in Monocular Visual Odometry | Oct 4, 2022 | Autonomous VehiclesMonocular Visual Odometry | CodeCode Available | 1 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting | Aug 16, 2023 | Graph Neural NetworkGraph Regression | CodeCode Available | 1 |
| Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark | May 6, 2021 | Crowd Countingobject-detection | CodeCode Available | 1 |
| Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding | Nov 12, 2023 | ObjectPosition | CodeCode Available | 1 |
| 3D Feature Tracking via Event Camera | Jan 1, 2024 | Motion CompensationPatch Matching | CodeCode Available | 1 |
| A Deep Recurrent Survival Model for Unbiased Ranking | Apr 30, 2020 | Information Retrievalmodel | CodeCode Available | 1 |
| CoMoGAN: continuous model-guided image-to-image translation | Mar 11, 2021 | Image-to-Image TranslationPosition | CodeCode Available | 1 |
| Conditional Positional Encodings for Vision Transformers | Feb 22, 2021 | AutoMLClassification | CodeCode Available | 1 |
| A differentiable short-time Fourier transform with respect to the window length | Aug 23, 2022 | Position | CodeCode Available | 1 |
| DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions | Sep 7, 2023 | PositionSpatial Reasoning | CodeCode Available | 1 |
| ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices | Jun 4, 2025 | Position | CodeCode Available | 1 |