| Conditional Positional Encodings for Vision Transformers | Feb 22, 2021 | AutoMLClassification | CodeCode Available | 1 | 5 |
| Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks | Jan 6, 2022 | Audio ClassificationClassification | CodeCode Available | 1 | 5 |
| Unified Learning Approach for Egocentric Hand Gesture Recognition and Fingertip Detection | Jan 6, 2021 | Fingertip DetectionGesture Recognition | CodeCode Available | 1 | 5 |
| DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Dec 27, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 1 | 5 |
| EllSeg: An Ellipse Segmentation Framework for Robust Gaze Tracking | Jul 19, 2020 | PositionSegmentation | CodeCode Available | 1 | 5 |
| Fast Risk Assessment for Autonomous Vehicles Using Learned Models of Agent Futures | May 27, 2020 | Autonomous VehiclesPosition | CodeCode Available | 1 | 5 |
| Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Feb 28, 2024 | 3D Pose EstimationEgocentric Pose Estimation | CodeCode Available | 1 | 5 |
| Differentiable Physics Simulations with Contacts: Do They Have Correct Gradients w.r.t. Position, Velocity and Control? | Jul 8, 2022 | Position | CodeCode Available | 1 | 5 |
| A Transformer-based Approach for Source Code Summarization | May 1, 2020 | Code SummarizationPosition | CodeCode Available | 1 | 5 |
| Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark | May 6, 2021 | Crowd Countingobject-detection | CodeCode Available | 1 | 5 |
| Differentiable short-time Fourier transform with respect to the hop length | Jul 26, 2023 | Position | CodeCode Available | 1 | 5 |
| Asynchronous Trajectory Matching-Based Multimodal Maritime Data Fusion for Vessel Traffic Surveillance in Inland Waterways | Feb 22, 2023 | PositionVessel Detection | CodeCode Available | 1 | 5 |
| Depth Based Semantic Scene Completion with Position Importance Aware Loss | Jan 29, 2020 | 3D Semantic SegmentationPosition | CodeCode Available | 1 | 5 |
| Depth Estimation From Indoor Panoramas With Neural Scene Representation | Jan 1, 2023 | Depth EstimationPosition | CodeCode Available | 1 | 5 |
| On the Connection between Local Attention and Dynamic Depth-wise Convolution | Jun 8, 2021 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| Delta Hedging Liquidity Positions on Automated Market Makers | Aug 4, 2022 | Position | CodeCode Available | 1 | 5 |
| Dense Prediction Transformer for Scale Estimation in Monocular Visual Odometry | Oct 4, 2022 | Autonomous VehiclesMonocular Visual Odometry | CodeCode Available | 1 | 5 |
| DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting | Aug 16, 2023 | Graph Neural NetworkGraph Regression | CodeCode Available | 1 | 5 |
| Diffusion Action Segmentation | Mar 31, 2023 | Action SegmentationDenoising | CodeCode Available | 1 | 5 |
| ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation | Dec 11, 2023 | Instance SegmentationPosition | CodeCode Available | 1 | 5 |
| Deep Momentum Multi-Marginal Schrödinger Bridge | Mar 3, 2023 | Position | CodeCode Available | 1 | 5 |
| A Skull-Adaptive Framework for AI-Based 3D Transcranial Focused Ultrasound Simulation | May 19, 2025 | Position | CodeCode Available | 1 | 5 |
| ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos | Oct 21, 2024 | 3D Human Pose EstimationDisentanglement | CodeCode Available | 1 | 5 |
| DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function | Jan 2, 2020 | Position | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes | Jan 19, 2021 | Deep Reinforcement LearningPosition | CodeCode Available | 1 | 5 |