SOTAVerified

Language Modeling

Papers

Showing 14011425 of 14182 papers

TitleStatusHype
GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration0
Playing Pokémon Red via Deep Reinforcement LearningCode1
KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model0
From Retrieval to Generation: Comparing Different Approaches0
M-LLM Based Video Frame Selection for Efficient Video Understanding0
Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training0
Do Sparse Autoencoders Generalize? A Case Study of Answerability0
Collaborative Stance Detection via Small-Large Language Model Consistency VerificationCode0
ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model0
SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language ModelCode1
Conformal Linguistic Calibration: Trading-off between Factuality and Specificity0
I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning0
TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data ConsistencyCode0
Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision0
Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions0
On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation0
AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web PlatformsCode2
ANPMI: Assessing the True Comprehension Capabilities of LLMs for Multiple Choice Questions0
A City of Millions: Mapping Literary Social Networks At ScaleCode0
The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training0
Improving Representation Learning of Complex Critical Care Data with ICU-BERT0
Kanana: Compute-efficient Bilingual Language Models0
Revealing Treatment Non-Adherence Bias in Clinical Machine Learning Using Large Language Models0
Evaluating Gender Bias in German Machine TranslationCode0
Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems0
Show:102550
← PrevPage 57 of 568Next →

No leaderboard results yet.