SOTAVerified

Language Modeling

Papers

Showing 5451–5500 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers | | 0 |
| A Deep Dive Into Large Language Model Code Generation Mistakes: What and Why? | | 0 |
| Privacy Leakage Overshadowed by Views of AI: A Study on Human Oversight of Privacy in Language Model Agent | | 0 |
| Interacting Large Language Model Agents. Interpretable Models and Social Learning | | 0 |
| Can Multimodal Large Language Model Think Analogically? | | 0 |
| Can Large Language Model Predict Employee Attrition? | | 0 |
| A Mechanistic Explanatory Strategy for XAI | | 0 |
| PRIMO: Progressive Induction for Multi-hop Open Rule Generation | | 0 |
| Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks | | 0 |
| Unified Generative and Discriminative Training for Multi-modal Large Language Models | | 0 |
| RadFlag: A Black-Box Hallucination Detection Method for Medical Vision Language Models | | 0 |
| Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers | Code | 0 |
| ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents | | 0 |
| Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback | | 0 |
| LLM-KT: A Versatile Framework for Knowledge Transfer from Large Language Models to Collaborative Filtering | | 0 |
| Improving Few-Shot Cross-Domain Named Entity Recognition by Instruction Tuning a Word-Embedding based Retrieval Augmented Large Language Model | | 0 |
| Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations | | 0 |
| Leveraging Large Language Models for Code-Mixed Data Augmentation in Sentiment Analysis | Code | 0 |
| DEREC-SIMPRO: unlock Language Model benefits to advance Synthesis in Data Clean Room | | 0 |
| ALISE: Accelerating Large Language Model Serving with Speculative Scheduling | | 0 |
| From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents | | 0 |
| Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning | | 0 |
| EchoNarrator: Generating natural text explanations for ejection fraction predictions | Code | 0 |
| Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning | | 0 |
| Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking | | 0 |
| Representative Social Choice: From Learning Theory to AI Alignment | | 0 |
| The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge | | 0 |
| Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach | | 0 |
| π_0: A Vision-Language-Action Flow Model for General Robot Control | | 0 |
| Matchmaker: Self-Improving Large Language Model Programs for Schema Matching | | 0 |
| MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees | | 0 |
| Morphological Typology in BPE Subword Productivity and Language Modeling | | 0 |
| Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts | | 0 |
| Neural spell-checker: Beyond words with synthetic data generation | Code | 0 |
| Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation | | 0 |
| VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning | | 0 |
| Teaching a Language Model to Distinguish Between Similar Details using a Small Adversarial Training Set | | 0 |
| Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization | | 0 |
| Smaller Large Language Models Can Do Moral Self-Correction | | 0 |
| Toward Understanding In-context vs. In-weight Learning | | 0 |
| COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences | Code | 0 |
| A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction | | 0 |
| Learning and Transferring Sparse Contextual Bigrams with Linear Transformers | | 0 |
| All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling | | 0 |
| Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model | | 0 |
| Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration | | 0 |
| Beyond Ontology in Dialogue State Tracking for Goal-Oriented Chatbot | Code | 0 |
| A Theoretical Perspective for Speculative Decoding Algorithm | | 0 |
| Dynamic Information Sub-Selection for Decision Support | | 0 |
| A Hierarchical Language Model For Interpretable Graph Reasoning | | 0 |

No leaderboard results yet.