| WaferLLM: Large Language Model Inference at Wafer Scale | Feb 6, 2025 | GPU, Language Modeling | Code Available | 2 |
| ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Feb 6, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation | Feb 5, 2025 | Benchmarking, Large Language Model | Code Available | 2 |
| Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code Generation, Language Modeling | Code Available | 2 |
| Reviving The Classics: Active Reward Modeling in Large Language Model Alignment | Feb 4, 2025 | Computational Efficiency, Experimental Design | Code Available | 2 |
| MetaOpenFOAM 2.0: Large Language Model Driven Chain of Thought for Automating CFD Simulation and Post-Processing | Feb 1, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model | Jan 28, 2025 | Benchmarking, Language Modeling | Code Available | 2 |
| Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph | Jan 24, 2025 | Community Detection, Hallucination | Code Available | 2 |
| OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting | Jan 23, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design | Jan 15, 2025 | Combinatorial Optimization, Language Modeling | Code Available | 2 |