| Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training | Nov 15, 2023 | Passage Retrieval, Position | Code Available | 2 | 5 |
| Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | May 10, 2023 | Autonomous Driving, Bench2Drive | Code Available | 2 | 5 |
| Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Feb 28, 2024 | 3D Pose Estimation, Egocentric Pose Estimation | Code Available | 1 | 5 |
| Deep Domain Confusion: Maximizing for Domain Invariance | Dec 10, 2014 | Domain Adaptation, Model Selection | Code Available | 1 | 5 |
| DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function | Jan 2, 2020 | Position | Code Available | 1 | 5 |
| DeepBall: Deep Neural-Network Ball Detector | Feb 19, 2019 | General Classification, Object | Code Available | 1 | 5 |
| 3D Feature Tracking via Event Camera | Jan 1, 2024 | Motion Compensation, Patch Matching | Code Available | 1 | 5 |
| Deep Deformable 3D Caricatures with Learned Shape Control | Jul 29, 2022 | Caricature, Position | Code Available | 1 | 5 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2k, Position | Code Available | 1 | 5 |
| A Case for Rejection in Low Resource ML Deployment | Aug 12, 2022 | Diversity, Position | Code Available | 1 | 5 |
| DALNet: A Rail Detection Network Based on Dynamic Anchor Line | Aug 22, 2023 | Diversity, Lane Detection | Code Available | 1 | 5 |
| Asynchronous Trajectory Matching-Based Multimodal Maritime Data Fusion for Vessel Traffic Surveillance in Inland Waterways | Feb 22, 2023 | Position, Vessel Detection | Code Available | 1 | 5 |
| A Transformer-based Approach for Source Code Summarization | May 1, 2020 | Code Summarization, Position | Code Available | 1 | 5 |
| Audio-Conditioned U-Net for Position Estimation in Full Sheet Images | Oct 16, 2019 | Multimodal Deep Learning, Position | Code Available | 1 | 5 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | Object, Position | Code Available | 1 | 5 |
| Deep Momentum Multi-Marginal Schrödinger Bridge | Mar 3, 2023 | Position | Code Available | 1 | 5 |
| A Skull-Adaptive Framework for AI-Based 3D Transcranial Focused Ultrasound Simulation | May 19, 2025 | Position | Code Available | 1 | 5 |
| Assigning personality/identity to a chatting machine for coherent conversation generation | Jun 9, 2017 | Chatbot, Decoder | Code Available | 1 | 5 |
| ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation | Dec 11, 2023 | Instance Segmentation, Position | Code Available | 1 | 5 |
| ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos | Oct 21, 2024 | 3D Human Pose Estimation, Disentanglement | Code Available | 1 | 5 |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Jul 31, 2021 | image-classification, Image Classification | Code Available | 1 | 5 |
| Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings | Dec 16, 2024 | Disaster Response, geo-localization | Code Available | 1 | 5 |
| Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count | Oct 21, 2024 | Position | Code Available | 1 | 5 |
| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image Generation, Position | Code Available | 1 | 5 |
| Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Sep 3, 2018 | Face Hallucination, Hallucination | Code Available | 1 | 5 |