SOTAVerified

Language Modeling

Papers

Showing 1250112550 of 14182 papers

TitleStatusHype
One-vs-Each Approximation to Softmax for Scalable Estimation of Probabilities0
On Fairness of Unified Multimodal Large Language Model for Image Generation0
On Improving Informativity and Grammaticality for Multi-Sentence Compression0
On Language Model Integration for RNN Transducer based Speech Recognition0
On Languaging a Simulation Engine0
On Learning Better Embeddings from Chinese Clinical Records: Study on Combining In-Domain and Out-Domain Data0
On Learning Universal Representations Across Languages0
Online Infix Probability Computation for Probabilistic Finite Automata0
On Mechanistic Circuits for Extractive Question-Answering0
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer0
On Modeling Sense Relatedness in Multi-prototype Word Embedding0
On Modular Training of Neural Acoustics-to-Word Model for LVCSR0
On Multilingual Encoder Language Model Compression for Low-Resource Languages0
On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots0
On Randomized Classification Layers and Their Implications in Natural Language Generation0
On Reducing Repetition in Abstractive Summarization0
On Retrieval Augmentation and the Limitations of Language Model Training0
On Robustness of Prompt-based Semantic Parsing with Large Pre-trained Language Model: An Empirical Study on Codex0
On Sampling-Based Training Criteria for Neural Language Modeling0
On Scaling Up a Multilingual Vision and Language Model0
On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research0
ONSEP: A Novel Online Neural-Symbolic Framework for Event Prediction Based on Large Language Model0
On Speculative Decoding for Multimodal Large Language Models0
On Speeding Up Language Model Evaluation0
Making Large Language Models Better Reasoners with Step-Aware Verifier0
Towards Federated RLHF with Aggregated Client Preference for LLMs0
On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation0
On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation0
On the Computational Inefficiency of Large Batch Sizes for Stochastic Gradient Descent0
On the Computational Modelling of Michif Verbal Morphology0
On the Discussion of Large Language Models: Symmetry of Agents and Interplay with Prompts0
On the Effectiveness of Acoustic BPE in Decoder-Only TTS0
On the Effectiveness of Adapter-based Tuning for Pretrained Language Model Adaptation0
On the Effectiveness of Neural Text Generation based Data Augmentation for Recognition of Morphologically Rich Speech0
On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model0
On the Effect of Word Order on Cross-lingual Sentiment Analysis0
On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models0
On the Exploration of English to Urdu Machine Translation0
On-the-fly Text Retrieval for End-to-End ASR Adaptation0
On the importance of Data Scale in Pretraining Arabic Language Models0
On the importance of pre-training data volume for compact language models0
On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation0
On the Influence of Masking Policies in Intermediate Pre-training0
On the Limitations of Steering in Language Model Alignment0
On the limit of English conversational speech recognition0
On the Limits of Language Generation: Trade-Offs Between Hallucination and Mode Collapse0
On the Multilingual Capabilities of Very Large-Scale English Language Models0
On the Origins of Linear Representations in Large Language Models0
On the Planning, Search, and Memorization Capabilities of Large Language Models0
On the Power of Convolution Augmented Transformer0
Show:102550
← PrevPage 251 of 284Next →

No leaderboard results yet.