SOTAVerified

Language Modeling

Papers

Showing 4251–4300 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Narrow Transformer: StarCoder-Based Java-LM For Desktop |  | 0 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Code | 1 |
| MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization |  | 0 |
| RDBE: Reasoning Distillation-Based Evaluation Enhances Automatic Essay Scoring |  | 0 |
| Large Language Model Agents for Improving Engagement with Behavior Change Interventions: Application to Digital Mindfulness |  | 0 |
| LLMcap: Large Language Model for Unsupervised PCAP Failure Detection |  | 0 |
| Towards Federated RLHF with Aggregated Client Preference for LLMs |  | 0 |
| Learning to Reduce: Towards Improving Performance of Large Language Models on Structured Data |  | 0 |
| MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models |  | 0 |
| CogErgLLM: Exploring Large Language Model Systems Design Perspective Using Cognitive Ergonomics |  | 0 |
| Supporting Cross-language Cross-project Bug Localization Using Pre-trained Language Models |  | 0 |
| Efficient Training of Language Models with Compact and Consistent Next Token Distributions |  | 0 |
| Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective |  | 0 |
| SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning |  | 0 |
| InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output |  | 0 |
| Raw Text is All you Need: Knowledge-intensive Multi-turn Instruction Tuning for Large Language Model |  | 0 |
| Lightweight Large Language Model for Medication Enquiry: Med-Pal |  | 0 |
| Assessing the Effectiveness of GPT-4o in Climate Change Evidence Synthesis and Systematic Assessments: Preliminary Insights |  | 0 |
| SeqMate: A Novel Large Language Model Pipeline for Automating RNA Sequencing |  | 0 |
| LLMs Plagiarize: Ensuring Responsible Sourcing of Large Language Model Training Data Through Knowledge Graph Comparison |  | 0 |
| Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model |  | 0 |
| Language Model Alignment in Multilingual Trolley Problems | Code | 1 |
| An End-to-End Speech Summarization Using Large Language Model |  | 0 |
| Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior | Code | 0 |
| Accompanied Singing Voice Synthesis with Fully Text-controlled Melody |  | 0 |
| Is Your Large Language Model Knowledgeable or a Choices-Only Cheater? | Code | 0 |
| Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Code | 0 |
| PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning |  | 0 |
| Why do LLaVA Vision-Language Models Reply to Images in English? |  | 0 |
| GPTCast: a weather language model for precipitation nowcasting | Code | 1 |
| Multi-Modal Video Dialog State Tracking in the Wild |  | 0 |
| Synthetic Multimodal Question Generation |  | 0 |
| A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding | Code | 2 |
| Fake News Detection and Manipulation Reasoning via Large Vision-Language Models |  | 0 |
| AutoFlow: Automated Workflow Generation for Large Language Model Agents | Code | 2 |
| Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything |  | 0 |
| SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model | Code | 1 |
| An Empirical Comparison of Generative Approaches for Product Attribute-Value Identification | Code | 0 |
| Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Code | 0 |
| FoldGPT: Simple and Effective Large Language Model Compression Scheme |  | 0 |
| Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving |  | 0 |
| First Place Solution of 2023 Global Artificial Intelligence Technology Innovation Competition Track 1 |  | 0 |
| Large Language Model Enhanced Knowledge Representation Learning: A Survey |  | 0 |
| Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation | Code | 0 |
| Optimization of Retrieval-Augmented Generation Context with Outlier Detection |  | 0 |
| Tree Search for Language Model Agents | Code | 3 |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Code | 0 |
| Memory^3: Language Modeling with Explicit Memory |  | 0 |
| CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents | Code | 3 |
| RegMix: Data Mixture as Regression for Language Model Pre-training | Code | 2 |
Page 86 of 284

No leaderboard results yet.