| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | CodeCode Available | 1 |
| LLMs Working in Harmony: A Survey on the Technological Aspects of Building Effective LLM-Based Multi Agent Systems | Mar 13, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| NeurIPS 2023 LLM Efficiency Fine-tuning Competition | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More | Mar 13, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Mar 13, 2025 | DiversityLanguage Modeling | CodeCode Available | 2 |
| MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation | Mar 13, 2025 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Mar 13, 2025 | Image GenerationLanguage Modeling | CodeCode Available | 3 |
| MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis | Mar 13, 2025 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TacticExpert: Spatial-Temporal Graph Language Model for Basketball Tactics | Mar 13, 2025 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| Hybrid Agents for Image Restoration | Mar 13, 2025 | Image RestorationIn-Context Learning | —Unverified | 0 |
| PRISM: Preference Refinement via Implicit Scene Modeling for 3D Vision-Language Preference-Based Reinforcement Learning | Mar 13, 2025 | Autonomous NavigationDecision Making | —Unverified | 0 |
| Tempest: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Mar 13, 2025 | AI AgentLanguage Modeling | CodeCode Available | 2 |
| Toward a method for LLM-enabled Indoor Navigation | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Knowledge Graphs and LLMs for Context-Aware Messaging | Mar 12, 2025 | Entity LinkingEvent Detection | —Unverified | 0 |
| Medical Large Language Model Benchmarks Should Prioritize Construct Validity | Mar 12, 2025 | Clinical KnowledgeLanguage Modeling | —Unverified | 0 |
| PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why LLMs Cannot Think and How to Fix It | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Token Weighting for Long-Range Language Modeling | Mar 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability | Mar 12, 2025 | DisentanglementLanguage Modeling | —Unverified | 0 |
| Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | Mar 12, 2025 | DenoisingLanguage Modeling | CodeCode Available | 4 |