| ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters | Feb 6, 2025 | DecoderLanguage Modeling | CodeCode Available | 0 |
| RWKV-UI: UI Understanding with Enhanced Perception and Reasoning | Feb 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Ola: Pushing the Frontiers of Omni-Modal Language Model | Feb 6, 2025 | cross-modal alignmentLanguage Modeling | CodeCode Available | 3 |
| ADIFF: Explaining audio difference using natural language | Feb 6, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 |
| DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation | Feb 6, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Multi-agent Architecture Search via Agentic Supernet | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Great Models Think Alike and this Undermines AI Oversight | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| WaferLLM: Large Language Model Inference at Wafer Scale | Feb 6, 2025 | GPULanguage Modeling | CodeCode Available | 2 |
| Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Feb 5, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 | Feb 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |