SOTAVerified

Language Modeling

Papers

Showing 6101–6150 of 14182 papers

Title | Status | Hype
What do Language Model Probabilities Represent? From Distribution Estimation to Response Prediction | | 0
What do Language Representations Really Represent? | | 0
What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis | | 0
What do RNN Language Models Learn about Filler–Gap Dependencies? | | 0
Understanding Language Model Circuits through Knowledge Editing | | 0
What goes into a word: generating image descriptions with top-down spatial knowledge | | 0
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models | | 0
WHAT-IF: Exploring Branching Narratives by Meta-Prompting Large Language Models | | 0
What is not where: the challenge of integrating spatial representations into deep learning architectures | | 0
What Kind of Language Is Hard to Language-Model? | | 0
What Kinds of Tokens Benefit from Distant Text? An Analysis on Long Context Language Modeling | | 0
What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages | | 0
What represents "style" in authorship attribution? | | 0
What Should Baby Models Read? Exploring Sample-Efficient Data Composition on Model Performance | | 0
What's in your Head? Emergent Behaviour in Multi-Task Transformer Models | | 0
What Syntactic Structures block Dependencies in RNN Language Models? | | 0
What the [MASK]? Making Sense of Language-Specific BERT Models | | 0
What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement | | 0
When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 | | 0
When and why are log-linear models self-normalizing? | | 0
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing | | 0
When does MAML Work the Best? An Empirical Study on Model-Agnostic Meta-Learning in NLP Applications | | 0
When Does Syntax Mediate Neural Language Model Performance? Evidence from Dropout Probes | | 0
WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain | | 0
When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment | | 0
When Large Language Model Meets Optimization | | 0
Mapping Biomedical Ontology Terms to IDs: Effect of Domain Prevalence on Prediction Accuracy | | 0
SOEN-101: Code Generation by Emulating Software Process Models Using Large Language Model Agents | | 0
When More is not Necessary Better: Multilingual Auxiliary Tasks for Zero-Shot Cross-Lingual Transfer of Hate Speech Detection Models | | 0
When Persuasion Overrides Truth in Multi-Agent LLM Debates: Introducing a Confidence-Weighted Persuasion Override Rate (CW-POR) | | 0
When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications? | | 0
When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks | | 0
When Text Embedding Meets Large Language Model: A Comprehensive Survey | | 0
Where exactly does contextualization in a PLM happen? | | 0
Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation | | 0
Which side are you on? Insider-Outsider classification in conspiracy-theoretic social media | | 0
Which techniques does your application use?: An information extraction framework for scientific articles | | 0
Whisper-GPT: A Hybrid Representation Audio Large Language Model | | 0
WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction | | 0
Who Brings the Frisbee: Probing Hidden Hallucination Factors in Large Vision-Language Model via Causality Analysis | | 0
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection | | 0
Who's to say what's funny? A computer using Language Models and Deep Learning, That's Who! | | 0
Who Writes the Review, Human or AI? | | 0
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore | | 0
Why Are Positional Encodings Nonessential for Deep Autoregressive Transformers? Revisiting a Petroglyph | | 0
Why do LLaVA Vision-Language Models Reply to Images in English? | | 0
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck | | 0
Why Gradients Rapidly Increase Near the End of Training | | 0
Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation | | 0
Why LLMs Cannot Think and How to Fix It | | 0

No leaderboard results yet.