| V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding | Dec 12, 2024 | Position | CodeCode Available | 2 |
| Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Nov 6, 2024 | 3DGSNeRF | CodeCode Available | 2 |
| TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention | Oct 7, 2024 | Position | CodeCode Available | 2 |
| 1st Place Solution of Multiview Egocentric Hand Tracking Challenge ECCV2024 | Sep 28, 2024 | Position | CodeCode Available | 2 |
| PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders | Aug 16, 2024 | 3D Object Classification3D Point Cloud Classification | CodeCode Available | 2 |
| OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Jul 15, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 |
| PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration | Jul 14, 2024 | Inductive BiasPoint Cloud Registration | CodeCode Available | 2 |
| Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training | Jul 12, 2024 | Position | CodeCode Available | 2 |
| PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer | Jul 10, 2024 | DecoderHandwritten Mathmatical Expression Recognition | CodeCode Available | 2 |
| PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance | Jun 13, 2024 | Motion GenerationPosition | CodeCode Available | 2 |