SOTAVerified

Language Modeling

Papers

Showing 1401–1450 of 14182 papers

| Title | Status | Hype |
|---|---|---|
| Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training | | 0 |
| M-LLM Based Video Frame Selection for Efficient Video Understanding | | 0 |
| Collaborative Stance Detection via Small-Large Language Model Consistency Verification | Code | 0 |
| AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs | Code | 3 |
| DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models | | 0 |
| Do Sparse Autoencoders Generalize? A Case Study of Answerability | | 0 |
| GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration | | 0 |
| Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models | Code | 1 |
| ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model | | 0 |
| SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model | Code | 1 |
| Conformal Linguistic Calibration: Trading-off between Factuality and Specificity | | 0 |
| I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning | | 0 |
| TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency | Code | 0 |
| Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision | | 0 |
| Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions | | 0 |
| On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | | 0 |
| AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms | Code | 2 |
| Kanana: Compute-efficient Bilingual Language Models | | 0 |
| ANPMI: Assessing the True Comprehension Capabilities of LLMs for Multiple Choice Questions | | 0 |
| A City of Millions: Mapping Literary Social Networks At Scale | Code | 0 |
| The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training | | 0 |
| Revealing Treatment Non-Adherence Bias in Clinical Machine Learning Using Large Language Models | | 0 |
| Improving Representation Learning of Complex Critical Care Data with ICU-BERT | | 0 |
| Evaluating Gender Bias in German Machine Translation | Code | 0 |
| Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems | | 0 |
| VALUE: Value-Aware Large Language Model for Query Rewriting via Weighted Trie in Sponsored Search | | 0 |
| from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors | | 0 |
| MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning | | 0 |
| LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation | | 0 |
| Independent Mobility GPT (IDM-GPT): A Self-Supervised Multi-Agent Large Language Model Framework for Customized Traffic Mobility Analysis Using Machine Learning Models | | 0 |
| AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages | | 0 |
| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Code | 2 |
| Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training | | 0 |
| Can LLMs Explain Themselves Counterfactually? | | 0 |
| AMPO: Active Multi-Preference Optimization | | 0 |
| Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization | Code | 0 |
| PyEvalAI: AI-assisted evaluation of Jupyter Notebooks for immediate personalized feedback | | 0 |
| Rank1: Test-Time Compute for Reranking in Information Retrieval | Code | 2 |
| TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning | Code | 0 |
| olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models | Code | 11 |
| SPECTRE: An FFT-Based Efficient Drop-In Replacement to Self-Attention for Long Contexts | Code | 2 |
| Broadening Discovery through Structural Models: Multimodal Combination of Local and Structural Properties for Predicting Chemical Features | | 0 |
| A Combinatorial Identities Benchmark for Theorem Proving via Automated Theorem Generation | | 0 |
| Large Language Model Driven Agents for Simulating Echo Chamber Formation | | 0 |
| Iterative Counterfactual Data Augmentation | Code | 0 |
| NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Code | 5 |
| Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought | Code | 1 |
| Inverse Materials Design by Large Language Model-Assisted Generative Framework | Code | 1 |
| Improving Interactive Diagnostic Ability of a Large Language Model Agent Through Clinical Experience Learning | | 0 |
| Knowledge Distillation with Training Wheels | | 0 |

No leaderboard results yet.