SOTAVerified

Language Modeling

Papers

Showing 1551–1600 of 14182 papers

Title | Status | Hype
Robust Optimization in Protein Fitness Landscapes Using Reinforcement Learning in Latent Space | Code | 1
Enhancing Vision-Language Model with Unmasked Token Alignment | Code | 1
Language Generation with Strictly Proper Scoring Rules | Code | 1
Detection-Correction Structure via General Language Model for Grammatical Error Correction | Code | 1
Learning diverse attacks on large language models for robust red-teaming and safety tuning | Code | 1
LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | Code | 1
SLMRec: Distilling Large Language Models into Small for Sequential Recommendation | Code | 1
Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass | Code | 1
4-bit Shampoo for Memory-Efficient Network Training | Code | 1
Advanced Language Model-based Translator for English-Vietnamese Translation | Code | 1
Interesting Scientific Idea Generation using Knowledge Graphs and LLMs: Evaluations with 100 Research Group Leaders | Code | 1
DeeperImpact: Optimizing Sparse Learned Index Structures | Code | 1
Video Enriched Retrieval Augmented Generation Using Aligned Video Captions | Code | 1
gzip Predicts Data-dependent Scaling Laws | Code | 1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | Code | 1
Evolutionary Large Language Model for Automated Feature Transformation | Code | 1
Finetuning Large Language Model for Personalized Ranking | Code | 1
Sparse Matrix in Large Language Model Fine-tuning | Code | 1
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users | Code | 1
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference | Code | 1
From Text to Pixel: Advancing Long-Context Understanding in MLLMs | Code | 1
Annotation-Efficient Preference Optimization for Language Model Alignment | Code | 1
RecGPT: Generative Pre-training for Text-based Recommendation | Code | 1
Token-wise Influential Training Data Retrieval for Large Language Models | Code | 1
LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions | Code | 1
RDRec: Rationale Distillation for LLM-based Recommendation | Code | 1
Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees | Code | 1
Spectral Editing of Activations for Large Language Model Alignment | Code | 1
Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring | Code | 1
Differentiable Model Scaling using Differentiable Topk | Code | 1
Value Augmented Sampling for Language Model Alignment and Personalization | Code | 1
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | Code | 1
LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | Code | 1
Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models | Code | 1
EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROAD | Code | 1
Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3 | Code | 1
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models | Code | 1
BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine | Code | 1
Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | Code | 1
GUing: A Mobile GUI Search Engine using a Vision-Language Model | Code | 1
Markovian Transformers for Informative Language Modeling | Code | 1
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Code | 1
Ranked List Truncation for Large Language Model-based Re-Ranking | Code | 1
Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents | Code | 1
Nyonic Technical Report | Code | 1
Step Differences in Instructional Video | Code | 1
CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies | Code | 1
Setting up the Data Printer with Improved English to Ukrainian Machine Translation | Code | 1
Multi-Head Mixture-of-Experts | Code | 1
Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Code | 1
Page 32 of 284

No leaderboard results yet.