| 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data | Aug 7, 2024 | 16k, 2k | Code Available | 3 |
| MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction | Feb 17, 2025 | 2k, Autonomous Driving | Code Available | 3 |
| AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Feb 12, 2024 | 2k, Automatic Speech Recognition | Code Available | 2 |
| Linear Attention Sequence Parallelism | Apr 3, 2024 | 2k | Code Available | 2 |
| High-fidelity 3D Human Digitization from Single 2K Resolution Images | Mar 27, 2023 | 2k, 3D Human Reconstruction | Code Available | 2 |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Feb 21, 2023 | 2k, 8k | Code Available | 2 |
| 360MonoDepth: High-Resolution 360° Monocular Depth Estimation | Jan 1, 2022 | 2k, Depth Estimation | Code Available | 2 |
| HHAvatar: Gaussian Head Avatar with Dynamic Hairs | Dec 5, 2023 | 2k | Code Available | 2 |
| GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis | Dec 4, 2023 | 2k, Depth Estimation | Code Available | 2 |
| FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset | Mar 26, 2022 | 2k, 3D Face Reconstruction | Code Available | 2 |