SOTAVerified

Language Modeling

Papers

Showing 4501-4550 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner | Code | 1 |
| Large Scale Transfer Learning for Tabular Data via Language Modeling | Code | 2 |
| DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence | Code | 9 |
| UniGLM: Training One Unified Language Model for Text-Attributed Graph Embedding | Code | 1 |
| What Kinds of Tokens Benefit from Distant Text? An Analysis on Long Context Language Modeling | | 0 |
| Mitigating Large Language Model Hallucination with Faithful Finetuning | | 0 |
| SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model | Code | 1 |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Code | 14 |
| SLEGO: A Collaborative Data Analytics System with LLM Recommender for Diverse Users | | 0 |
| Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression | | 0 |
| Adversarial Style Augmentation via Large Language Model for Robust Fake News Detection | Code | 0 |
| RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Code | 0 |
| SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations | Code | 0 |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Code | 1 |
| AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Code | 3 |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Code | 2 |
| CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Code | 0 |
| VideoLLM-online: Online Video Large Language Model for Streaming Video | | 0 |
| Language Modeling with Editable External Knowledge | Code | 1 |
| Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language Models | Code | 0 |
| HARE: HumAn pRiors, a key to small language model Efficiency | | 0 |
| Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement | Code | 2 |
| Generative Visual Instruction Tuning | Code | 0 |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Code | 2 |
| STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft | | 0 |
| Promises, Outlooks and Challenges of Diffusion Language Modeling | | 0 |
| Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels | | 0 |
| CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics | Code | 0 |
| RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning | Code | 0 |
| Avoiding Copyright Infringement via Large Language Model Unlearning | Code | 0 |
| Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens | | 0 |
| Logit Separability-Driven Samples and Multiple Class-Related Words Selection for Advancing In-Context Learning | Code | 0 |
| Large Language Models for Dysfluency Detection in Stuttered Speech | | 0 |
| ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation | Code | 0 |
| Optimization of Armv9 architecture general large language model inference performance based on Llama.cpp | Code | 0 |
| VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It | | 0 |
| CancerLLM: A Large Language Model in Cancer Domain | | 0 |
| Reactor Mk.1 performances: MMLU, HumanEval and BBH test results | | 0 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | | 0 |
| MALLM-GAN: Multi-Agent Large Language Model as Generative Adversarial Network for Synthesizing Tabular Data | | 0 |
| Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition | Code | 1 |
| Large Language Model Enhanced Clustering for News Event Detection | | 0 |
| CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training | Code | 1 |
| PARSE-Ego4D: Personal Action Recommendation Suggestions for Egocentric Videos | | 0 |
| A Probability–Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | | 0 |
| Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment | | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | | 0 |
| Large language model validity via enhanced conformal prediction methods | Code | 1 |
| OpenECAD: An Efficient Visual Language Model for Editable 3D-CAD Design | | 0 |
| Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Code | 4 |
Page 91 of 284

No leaderboard results yet.