SOTAVerified

Language Modeling

Papers

Showing 70017050 of 14182 papers

TitleStatusHype
Large Language Models as Generalizable Policies for Embodied Tasks0
Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike WaysCode0
Content-based Controls For Music Large Language ModelingCode1
CompeteAI: Understanding the Competition Dynamics in Large Language Model-based AgentsCode1
LightLM: A Lightweight Deep and Narrow Language Model for Generative RecommendationCode1
PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream ApplicationsCode1
RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUsCode1
BOOST: Harnessing Black-Box Control to Boost Commonsense in LMs' Generation0
Unraveling Feature Extraction Mechanisms in Neural NetworksCode0
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model PretrainingCode0
math-PVS: A Large Language Model Framework to Map Scientific Publications to PVS Theories0
Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data AugmentationCode0
URL-BERT: Training Webpage Representations via Social Media Engagements0
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text GenerationCode0
Controlled Decoding from Language Models0
Transformer-based Live Update Generation for Soccer Matches from Microblog Posts0
Conditionally Combining Robot Skills using Large Language ModelsCode0
General Point Model with Autoencoding and Autoregressive0
Zephyr: Direct Distillation of LM AlignmentCode5
RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models0
Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training0
Discrete Diffusion Modeling by Estimating the Ratios of the Data DistributionCode2
XFEVER: Exploring Fact Verification across LanguagesCode0
SkyMath: Technical ReportCode3
Multiple Key-value Strategy in Recommendation Systems Incorporating Large Language Model0
FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning0
Faithful Path Language Modeling for Explainable Recommendation over Knowledge Graph0
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding0
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge RecoveryCode1
BLP-2023 Task 2: Sentiment Analysis0
Locally Differentially Private Document Generation Using Zero Shot PromptingCode0
FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering0
TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction0
Prevalence and prevention of large language model use in crowd work0
PromptInfuser: How Tightly Coupling AI and UI Design Impacts Designers' Workflows0
Vision-Language Pseudo-Labels for Single-Positive Multi-Label LearningCode1
Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection0
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression0
DALE: Generative Data Augmentation for Low-Resource Legal NLPCode1
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language ModelCode0
AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizingCode1
Rosetta Stone at KSAA-RD Shared Task: A Hop From Language Modeling To Word--Definition Alignment0
A Language Model with Limited Memory Capacity Captures Interference in Human Sentence Processing0
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific LiteratureCode1
Facilitating Self-Guided Mental Health Interventions Through Human-Language Model Interaction: A Case Study of Cognitive Restructuring0
TRAMS: Training-free Memory Selection for Long-range Language ModelingCode1
MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications0
Unnatural language processing: How do language models handle machine-generated prompts?0
WebWISE: Web Interface Control and Sequential Exploration with Large Language Models0
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity0
Show:102550
← PrevPage 141 of 284Next →

No leaderboard results yet.