SOTAVerified

Language Modeling

Papers

Showing 51-100 of 14182 papers

| Title | Status | Hype |
|---|---|---|
| Prompt-Guided Turn-Taking Prediction | | 0 |
| World-aware Planning Narratives Enhance Large Vision-Language Model Planner | | 0 |
| V2X-REALM: Vision-Language Model-Based Robust End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling | | 0 |
| Large Language Model-Driven Code Compliance Checking in Building Information Modeling | | 0 |
| GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Code | 1 |
| Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content | | 0 |
| OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling | Code | 2 |
| Towards Community-Driven Agents for Machine Learning Engineering | Code | 0 |
| SEED: A Structural Encoder for Embedding-Driven Decoding in Time Series Prediction with LLMs | | 0 |
| A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error Detection | Code | 0 |
| Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios | | 0 |
| Automatic Demonstration Selection for LLM-based Tabular Data Classification | | 0 |
| Enterprise Large Language Model Evaluation Benchmark | | 0 |
| Language Modeling by Language Models | Code | 2 |
| AALC: Large Language Model Efficient Reasoning via Adaptive Accuracy-Length Control | Code | 0 |
| Narrative Shift Detection: A Hybrid Approach of Dynamic Topic Models and Large Language Models | Code | 0 |
| PARALLELPROMPT: Extracting Parallelism from Large Language Model Queries | | 0 |
| A Large Language Model-based Multi-Agent Framework for Analog Circuits' Sizing Relationships Extraction | | 0 |
| GradualDiff-Fed: A Federated Learning Specialized Framework for Large Language Model | | 0 |
| AdapThink: Adaptive Thinking Preferences for Reasoning Language Model | | 0 |
| Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection | | 0 |
| ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Code | 3 |
| Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster | Code | 2 |
| Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms | | 0 |
| Research on Model Parallelism and Data Parallelism Optimization Methods in Large Language Model-Based Recommendation Systems | | 0 |
| Reflective Verbal Reward Design for Pluralistic Alignment | | 0 |
| Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning? | | 0 |
| Challenges in Grounding Language in the Real World | | 0 |
| Computational Approaches to Understanding Large Language Model Impact on Writing and Information Ecosystems | | 0 |
| LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization | | 0 |
| LLMs in Coding and their Impact on the Commercial Software Engineering Landscape | | 0 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Code | 1 |
| Watermarking Autoregressive Image Generation | Code | 2 |
| From RAG to Agentic: Validating Islamic-Medicine Responses with LLM Agents | | 0 |
| Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks | | 0 |
| RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments | Code | 0 |
| Show-o2: Improved Native Unified Multimodal Models | Code | 5 |
| Finance Language Model Evaluation (FLaME) | | 0 |
| BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models | Code | 2 |
| Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition | | 0 |
| Hypothesis Testing for Quantifying LLM-Human Misalignment in Multiple Choice Settings | | 0 |
| Lightweight Relevance Grader in RAG | Code | 0 |
| Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning | | 0 |
| From Bytes to Ideas: Language Modeling with Autoregressive U-Nets | Code | 7 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Code | 0 |
| ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | | 0 |
| DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion | | 0 |
| Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees | | 0 |
| Interpreting Biomedical VLMs on High-Imbalance Out-of-Distributions: An Insight into BiomedCLIP on Radiology | Code | 0 |
| RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Code | 1 |

No leaderboard results yet.