| DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions | Sep 7, 2023 | PositionSpatial Reasoning | CodeCode Available | 1 |
| DSSE: a drone swarm search environment | Jul 12, 2023 | Positionreinforcement-learning | CodeCode Available | 1 |
| A differentiable short-time Fourier transform with respect to the window length | Aug 23, 2022 | Position | CodeCode Available | 1 |
| Dynamic Local Feature Aggregation for Learning on Point Clouds | Jan 7, 2023 | Point Cloud ClassificationPosition | CodeCode Available | 1 |
| ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes | Jan 19, 2022 | 3D Canonicalization3D Geometry Perception | CodeCode Available | 1 |
| ADIFF: Explaining audio difference using natural language | Feb 6, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 |
| Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Sep 3, 2018 | Face HallucinationHallucination | CodeCode Available | 1 |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Jul 31, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| On the Connection between Local Attention and Dynamic Depth-wise Convolution | Jun 8, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Collect-and-Distribute Transformer for 3D Point Cloud Analysis | Jun 2, 2023 | Point Cloud ClassificationPosition | CodeCode Available | 1 |