SOTAVerified

Language Modeling

Papers

Showing 81018150 of 14182 papers

TitleStatusHype
Activity Sparsity Complements Weight Sparsity for Efficient RNN Inference0
Actor-Critic based Online Data Mixing For Language Model Pre-Training0
ACtuAL: Actor-Critic Under Adversarial Learning0
AdaBelief Optimizer: Adapting Stepsizes by theBelief in Observed Gradients0
AdaGC: Improving Training Stability for Large Language Model Pretraining0
ADALog: Adaptive Unsupervised Anomaly detection in Logs with Self-attention Masked Language Model0
ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations0
Adam^+: A Stochastic Method with Adaptive Variance Reduction0
AdaPrompt: Adaptive Model Training for Prompt-based NLP0
Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax0
Adaptable Multi-Domain Language Model for Transformer ASR0
Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition0
Adaptation of Deep Bidirectional Transformers for Afrikaans Language0
Adapted Multimodal BERT with Layer-wise Fusion for Sentiment Analysis0
Adapter Pruning using Tropical Characterization0
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models0
Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer0
AdapThink: Adaptive Thinking Preferences for Reasoning Language Model0
Adapt in Contexts: Retrieval-Augmented Domain Adaptation via In-Context Learning0
Space-LLaVA: a Vision-Language Model Adapted to Extraterrestrial Applications0
Adapting and evaluating a deep learning language model for clinical why-question answering0
Adapting BERT to Implicit Discourse Relation Classification with a Focus on Discourse Connectives0
Adapting BigScience Multilingual Model to Unseen Languages0
Adapting Decoder-Based Language Models for Diverse Encoder Downstream Tasks0
Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval0
Adapting Event Extractors to Medical Data: Bridging the Covariate Shift0
Adapting Large Language Models for Character-based Augmentative and Alternative Communication0
Adapting Large Language Models to Domains via Reading Comprehension0
Adapting Mental Health Prediction Tasks for Cross-lingual Learning via Meta-Training and In-context Learning with Large Language Model0
Adapting Open Domain Fact Extraction and Verification to COVID-FACT through In-Domain Language Modeling0
Adaptive Decoding via Latent Preference Optimization0
Adaptive Differential Privacy for Language Model Training0
Adaptive Discounting of Implicit Language Models in RNN-Transducers0
Adaptive Draft-Verification for Efficient Large Language Model Decoding0
Adaptively profiling models with task elicitation0
Adaptive Mixture of Low-Rank Factorizations for Compact Neural Modeling0
Adaptive Multi-Corpora Language Model Training for Speech Recognition0
Adaptive Multi-view Rule Discovery for Weakly-Supervised Compatible Products Prediction0
Adaptive Noise Injection: A Structure-Expanding Regularization for RNN0
Adaptive Optimization for Enhanced Efficiency in Large-Scale Language Model Training0
Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations0
Adaptive Reasoning and Acting in Medical Language Agents0
Adaptive Semantic Prompt Caching with VectorQ0
Adaptive Semiparametric Language Models0
Adaptive Testing and Debugging of NLP Models0
Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training0
adaQN: An Adaptive Quasi-Newton Algorithm for Training RNNs0
AdaServe: Accelerating Multi-SLO LLM Serving with SLO-Customized Speculative Decoding0
A Data Efficient End-To-End Spoken Language Understanding Architecture0
A Dataset and Benchmarks for Multimedia Social Analysis0
Show:102550
← PrevPage 163 of 284Next →

No leaderboard results yet.