| Generative Modeling for Mathematical Discovery | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LLMs Working in Harmony: A Survey on the Technological Aspects of Building Effective LLM-Based Multi Agent Systems | Mar 13, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| NeurIPS 2023 LLM Efficiency Fine-tuning Competition | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TacticExpert: Spatial-Temporal Graph Language Model for Basketball Tactics | Mar 13, 2025 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Mar 13, 2025 | DiversityLanguage Modeling | CodeCode Available | 2 |
| Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More | Mar 13, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation | Mar 13, 2025 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Mar 13, 2025 | Image GenerationLanguage Modeling | CodeCode Available | 3 |
| MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis | Mar 13, 2025 | DescriptiveLanguage Modeling | —Unverified | 0 |
| PRISM: Preference Refinement via Implicit Scene Modeling for 3D Vision-Language Preference-Based Reinforcement Learning | Mar 13, 2025 | Autonomous NavigationDecision Making | —Unverified | 0 |
| Hybrid Agents for Image Restoration | Mar 13, 2025 | Image RestorationIn-Context Learning | —Unverified | 0 |
| SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Mar 13, 2025 | AI AgentLanguage Modeling | CodeCode Available | 2 |
| Tempest: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search | Mar 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Toward a method for LLM-enabled Indoor Navigation | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Knowledge Graphs and LLMs for Context-Aware Messaging | Mar 12, 2025 | Entity LinkingEvent Detection | —Unverified | 0 |
| Medical Large Language Model Benchmarks Should Prioritize Construct Validity | Mar 12, 2025 | Clinical KnowledgeLanguage Modeling | —Unverified | 0 |
| Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation | Mar 12, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Global Position Aware Group Choreography using Large Language Model | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Token Weighting for Long-Range Language Modeling | Mar 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Language-Enhanced Representation Learning for Single-Cell Transcriptomics | Mar 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | Mar 12, 2025 | DenoisingLanguage Modeling | CodeCode Available | 4 |
| PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reinforcement Learning is all You Need | Mar 12, 2025 | AllLanguage Modeling | —Unverified | 0 |
| Why LLMs Cannot Think and How to Fix It | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BAMBI: Developing Baby Language Models for Italian | Mar 12, 2025 | Language AcquisitionLanguage Modeling | —Unverified | 0 |
| SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability | Mar 12, 2025 | DisentanglementLanguage Modeling | —Unverified | 0 |
| NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model | Mar 12, 2025 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge | Mar 12, 2025 | CPUGPU | —Unverified | 0 |
| Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity | Mar 12, 2025 | Deep LearningFew-Shot Learning | —Unverified | 0 |
| SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment | Mar 12, 2025 | Autonomous DrivingBench2Drive | CodeCode Available | 3 |
| Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents | Mar 11, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| D3PO: Preference-Based Alignment of Discrete Diffusion Models | Mar 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method | Mar 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding the Quality-Diversity Trade-off in Diffusion Language Models | Mar 11, 2025 | DiversityLanguage Modeling | CodeCode Available | 0 |
| Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback | Mar 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LongProLIP: A Probabilistic Vision-Language Model with Long Context Text | Mar 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Training Plug-n-Play Knowledge Modules with Deep Context Distillation | Mar 11, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models | Mar 11, 2025 | Decision Makingglobal-optimization | —Unverified | 0 |
| Position-Aware Depth Decay Decoding (D^3): Boosting Large Language Model Inference Efficiency | Mar 11, 2025 | GSM8KLanguage Modeling | —Unverified | 0 |
| Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations | Mar 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OASIS: Order-Augmented Strategy for Improved Code Search | Mar 11, 2025 | Code SearchLanguage Modeling | —Unverified | 0 |
| Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study | Mar 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Mar 11, 2025 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| Mellow: a small audio language model for reasoning | Mar 11, 2025 | Audio captioningLanguage Modeling | CodeCode Available | 2 |
| Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model Adaptation | Mar 11, 2025 | Domain GeneralizationLanguage Modeling | CodeCode Available | 0 |