SOTAVerified

Language Modeling

Papers

Showing 551–600 of 14182 papers

Title | Status | Hype
Large Language Model with Region-guided Referring and Grounding for CT Report Generation | Code | 2
Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks | Code | 2
ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Code | 2
RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts | Code | 2
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Code | 2
MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Code | 2
BianCang: A Traditional Chinese Medicine Large Language Model | Code | 2
GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding | Code | 2
SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning | Code | 2
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Code | 2
Tucano: Advancing Neural Text Generation for Portuguese | Code | 2
TIPO: Text to Image with Text Presampling for Prompt Optimization | Code | 2
The Super Weight in Large Language Models | Code | 2
Concept Bottleneck Language Models For protein design | Code | 2
End-to-End Navigation with Vision Language Models: Transforming Spatial Reasoning into Question-Answering | Code | 2
PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training | Code | 2
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization | Code | 2
RAGViz: Diagnose and Visualize Retrieval-Augmented Generation | Code | 2
GPT or BERT: why not both? | Code | 2
Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs | Code | 2
What is Wrong with Perplexity for Long-context Language Modeling? | Code | 2
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance | Code | 2
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench | Code | 2
Retrieval-Enhanced Mutation Mastery: Augmenting Zero-Shot Prediction of Protein Language Model | Code | 2
MiniPLM: Knowledge Distillation for Pre-Training Language Models | Code | 2
Frontiers in Intelligent Colonoscopy | Code | 2
PAPILLON: Privacy Preservation from Internet-based and Local Language Model Ensembles | Code | 2
Improve Vision Language Model Chain-of-thought Reasoning | Code | 2
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style | Code | 2
A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference | Code | 2
Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning | Code | 2
On the Role of Attention Heads in Large Language Model Safety | Code | 2
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation | Code | 2
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation | Code | 2
Process Reward Model with Q-Value Rankings | Code | 2
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization | Code | 2
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text | Code | 2
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs | Code | 2
Q-VLM: Post-training Quantization for Large Vision-Language Models | Code | 2
OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling | Code | 2
Sylber: Syllabic Embedding Representation of Speech from Raw Audio | Code | 2
Towards Interpreting Visual Information Processing in Vision-Language Models | Code | 2
BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation | Code | 2
Think While You Generate: Discrete Diffusion with Planned Denoising | Code | 2
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling | Code | 2
Differential Transformer | Code | 2
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality | Code | 2
TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens | Code | 2
GenSim: A General Social Simulation Platform with Large Language Model based Agents | Code | 2
SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Code | 2
Page 12 of 284
