| AutoVerus: Automated Proof Generation for Rust Code | Sep 19, 2024 | Code Generation, Language Modeling | Code Available | 2 |
| Generalized Interpolating Discrete Diffusion | Mar 6, 2025 | Language Modeling | Code Available | 2 |
| A Generalist Agent | May 12, 2022 | Language Modeling | Code Available | 2 |
| Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts | Feb 12, 2024 | Continual Pretraining, GSM8K | Code Available | 2 |
| Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Jun 3, 2024 | Audio Generation, In-Context Learning | Code Available | 2 |
| ExpertPrompting: Instructing Large Language Models to be Distinguished Experts | May 24, 2023 | In-Context Learning, Instruction Following | Code Available | 2 |
| Training Diffusion Models with Reinforcement Learning | May 22, 2023 | Decision Making, Denoising | Code Available | 2 |
| GIT: A Generative Image-to-text Transformer for Vision and Language | May 27, 2022 | Decoder, Image Captioning | Code Available | 2 |
| AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM | Mar 6, 2025 | Anomaly Detection, Language Modeling | Code Available | 2 |
| GPT-Driver: Learning to Drive with GPT | Oct 2, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |
| Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities | Mar 6, 2025 | Language Modeling | Code Available | 2 |
| G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | May 19, 2025 | Language Modeling | Code Available | 2 |
| From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples | Apr 11, 2024 | Language Modeling | Code Available | 2 |
| TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Feb 20, 2025 | Benchmarking, Code Generation | Code Available | 2 |
| TrustRAG: Enhancing Robustness and Trustworthiness in RAG | Jan 1, 2025 | Language Modeling | Code Available | 2 |
| Frontiers in Intelligent Colonoscopy | Oct 22, 2024 | Image Captioning | Code Available | 2 |
| A Touch, Vision, and Language Dataset for Multimodal Alignment | Feb 20, 2024 | Language Modeling | Code Available | 2 |
| A Training-free LLM-based Approach to General Chinese Character Error Correction | Feb 21, 2025 | Language Modeling | Code Available | 2 |
| Forgetting Transformer: Softmax Attention with a Forget Gate | Mar 3, 2025 | Language Modeling | Code Available | 2 |
| Formal Mathematics Statement Curriculum Learning | Feb 3, 2022 | Automated Theorem Proving, Language Modeling | Code Available | 2 |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question Answering, Instruction Following | Code Available | 2 |
| A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models | Jul 24, 2023 | Image Generation, Image-text matching | Code Available | 2 |
| A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference | Oct 18, 2024 | Language Modeling | Code Available | 2 |
| FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets | Jul 20, 2023 | Instruction Following, Language Model Evaluation | Code Available | 2 |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | May 8, 2024 | Language Modeling | Code Available | 2 |