| AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Feb 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Phylogenetic Approach to Genomic Language Modeling | Mar 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Llemma: An Open Language Model For Mathematics | Oct 16, 2023 | Arithmetic ReasoningAutomated Theorem Proving | CodeCode Available | 3 |
| CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents | Jul 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Cramming: Training a Language Model on a Single GPU in One Day | Dec 28, 2022 | GPULanguage Modeling | CodeCode Available | 3 |
| Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders | Oct 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Diffusion Language Models Are Versatile Protein Learners | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model | Jan 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model | Jan 4, 2024 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 3 |
| ContextCite: Attributing Model Generation to Context | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |