SOTAVerified

Language Modeling

Papers

Showing 11011125 of 14182 papers

TitleStatusHype
Walk the Talk? Measuring the Faithfulness of Large Language Model ExplanationsCode1
Learning to Attribute with AttentionCode1
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical ImagingCode1
Fine-tuning a Large Language Model for Automating Computational Fluid Dynamics SimulationsCode1
Parameterized Synthetic Text Generation with SimpleStoriesCode1
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language ModelsCode1
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM CollaborationCode1
Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source)Code1
CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial OptimizationCode1
MSL: Not All Tokens Are What You Need for Tuning LLM as a RecommenderCode1
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token PredictionCode1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-GenerationCode1
Distillation and Refinement of Reasoning in Small Language Models for Document Re-rankingCode1
SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image UnderstandingCode1
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security InspectionCode1
MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple GranularitiesCode1
IPA-CHILDES & G2P+: Feature-Rich Resources for Cross-Lingual Phonology and Phonemic Language ModelingCode1
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language ModelCode1
STPNet: Scale-aware Text Prompt Network for Medical Image SegmentationCode1
TiC-LM: A Web-Scale Benchmark for Time-Continual LLM PretrainingCode1
Representation Bending for Large Language Model SafetyCode1
Rethinking Key-Value Cache Compression Techniques for Large Language Model ServingCode1
CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy RewardCode1
Whisper-LM: Improving ASR Models with Language Models for Low-Resource LanguagesCode1
Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense RetrievalCode1
Show:102550
← PrevPage 45 of 568Next →

No leaderboard results yet.