SOTAVerified

Language Modeling

Papers

Showing 2801–2850 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs | Code | 2 |
| ALISE: Accelerating Large Language Model Serving with Speculative Scheduling | | 0 |
| From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents | | 0 |
| The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge | | 0 |
| A Theoretical Perspective for Speculative Decoding Algorithm | | 0 |
| Dynamic Information Sub-Selection for Decision Support | | 0 |
| Neural spell-checker: Beyond words with synthetic data generation | Code | 0 |
| Learning and Transferring Sparse Contextual Bigrams with Linear Transformers | | 0 |
| Smaller Large Language Models Can Do Moral Self-Correction | | 0 |
| Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning | Code | 1 |
| Beyond Ontology in Dialogue State Tracking for Goal-Oriented Chatbot | Code | 0 |
| Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation | | 0 |
| A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction | | 0 |
| Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration | | 0 |
| MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering | Code | 4 |
| All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling | | 0 |
| Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model | | 0 |
| COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences | Code | 0 |
| Toward Understanding In-context vs. In-weight Learning | | 0 |
| Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback | Code | 1 |
| Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization | | 0 |
| Teaching a Language Model to Distinguish Between Similar Details using a Small Adversarial Training Set | | 0 |
| VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning | | 0 |
| Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers | Code | 1 |
| Multimodal Quantum Natural Language Processing: A Novel Framework for using Quantum Methods to Analyse Real Data | Code | 0 |
| CurateGPT: A flexible language-model assisted biocuration tool | | 0 |
| Anticipating Future with Large Language Model for Simultaneous Machine Translation | | 0 |
| Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance | Code | 2 |
| VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration | | 0 |
| Rethinking Code Refinement: Learning to Judge Code Efficiency | Code | 0 |
| Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents | | 0 |
| PerSRV: Personalized Sticker Retrieval with Vision-Language Model | Code | 0 |
| Discrete Modeling via Boundary Conditional Diffusion Processes | | 0 |
| A Hierarchical Language Model For Interpretable Graph Reasoning | | 0 |
| Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench | Code | 2 |
| Improving In-Context Learning with Small Language Model Ensembles | Code | 0 |
| Reliable Semantic Understanding for Real World Zero-shot Object Goal Navigation | | 0 |
| MARCO: Multi-Agent Real-time Chat Orchestration | | 0 |
| Democratizing Reward Design for Personal and Representative Value-Alignment | | 0 |
| f-PO: Generalizing Preference Optimization with f-divergence Minimization | Code | 1 |
| SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types | Code | 1 |
| Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting | Code | 0 |
| Abrupt Learning in Transformers: A Case Study on Matrix Completion | | 0 |
| MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding | | 0 |
| Learning and Unlearning of Fabricated Knowledge in Language Models | | 0 |
| FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation | | 0 |
| From melodic note sequences to pitches using word2vec | | 0 |
| Are VLMs Really Blind | Code | 0 |
| An Actor-Critic Approach to Boosting Text-to-SQL Large Language Model | | 0 |
| Energy-Based Diffusion Language Models for Text Generation | | 0 |
Page 57 of 284

No leaderboard results yet.