| Brain over Brawn: Using a Stereo Camera to Detect, Track, and Intercept a Faster UAV by Reconstructing the Intruder's Trajectory | Jul 2, 2021 | Position | Code Available | 1 | 5 |
| Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers | May 25, 2022 | Federated Learning, Position | Code Available | 1 | 5 |
| A differentiable short-time Fourier transform with respect to the window length | Aug 23, 2022 | Position | Code Available | 1 | 5 |
| Dynamic Local Feature Aggregation for Learning on Point Clouds | Jan 7, 2023 | Point Cloud Classification, Position | Code Available | 1 | 5 |
| ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation | Dec 11, 2023 | Instance Segmentation, Position | Code Available | 1 | 5 |
| ADIFF: Explaining audio difference using natural language | Feb 6, 2025 | AudioCaps, Audio captioning | Code Available | 1 | 5 |
| Deep Deformable 3D Caricatures with Learned Shape Control | Jul 29, 2022 | Caricature, Position | Code Available | 1 | 5 |
| Assigning personality/identity to a chatting machine for coherent conversation generation | Jun 9, 2017 | Chatbot, Decoder | Code Available | 1 | 5 |
| Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration | Mar 4, 2021 | 3D Hand Pose Estimation, Position | Code Available | 1 | 5 |
| Deep Domain Confusion: Maximizing for Domain Invariance | Dec 10, 2014 | Domain Adaptation, Model Selection | Code Available | 1 | 5 |