| Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking | Aug 14, 2023 | Position, Visual Tracking | Code Available | 1 |
| V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection | Aug 8, 2023 | 3D Object Detection, Decoder | Code Available | 1 |
| Point Anywhere: Directed Object Estimation from Omnidirectional Images | Aug 2, 2023 | Object, object-detection | Code Available | 1 |
| Advancing Beyond Identification: Multi-bit Watermark for Large Language Models | Aug 1, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Differentiable short-time Fourier transform with respect to the hop length | Jul 26, 2023 | Position | Code Available | 1 |
| Latent-OFER: Detect, Mask, and Reconstruct with Latent Vectors for Occluded Facial Expression Recognition | Jul 21, 2023 | Facial Expression Recognition, Facial Expression Recognition (FER) | Code Available | 1 |
| DSSE: a drone swarm search environment | Jul 12, 2023 | Position, reinforcement-learning | Code Available | 1 |
| 2-D SSM: A General Spatial Layer for Visual Transformers | Jun 11, 2023 | Inductive Bias, Position | Code Available | 1 |
| Everybody Compose: Deep Beats To Music | Jun 9, 2023 | Position | Code Available | 1 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | Object, Position | Code Available | 1 |