| What do Language Model Probabilities Represent? From Distribution Estimation to Response Prediction | May 4, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| What do Language Representations Really Represent? | Jan 9, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis | Dec 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What do RNN Language Models Learn about Filler--Gap Dependencies? | Nov 1, 2018 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding Language Model Circuits through Knowledge Editing | Jun 25, 2024 | knowledge editingLanguage Modeling | —Unverified | 0 |
| What goes into a word: generating image descriptions with top-down spatial knowledge | Oct 1, 2019 | DecoderLanguage Modeling | —Unverified | 0 |
| What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models | Apr 6, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| WHAT-IF: Exploring Branching Narratives by Meta-Prompting Large Language Models | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What is not where: the challenge of integrating spatial representations into deep learning architectures | Jul 21, 2018 | Caption GenerationDeep Learning | —Unverified | 0 |
| What Kind of Language Is Hard to Language-Model? | Jun 11, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What Kinds of Tokens Benefit from Distant Text? An Analysis on Long Context Language Modeling | Jun 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages | Jun 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What represents ``style'' in authorship attribution? | Aug 1, 2018 | Authorship AttributionLanguage Modeling | —Unverified | 0 |
| What Should Baby Models Read? Exploring Sample-Efficient Data Composition on Model Performance | Nov 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What's in your Head? Emergent Behaviour in Multi-Task Transformer Models | Apr 13, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What Syntactic Structures block Dependencies in RNN Language Models? | May 24, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What the [MASK]? Making Sense of Language-Specific BERT Models | Mar 5, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement | Feb 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 | Oct 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When and why are log-linear models self-normalizing? | May 1, 2015 | Computational EfficiencyGeneralization Bounds | —Unverified | 0 |
| When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing | Feb 17, 2022 | ClassificationCPU | —Unverified | 0 |
| When does MAML Work the Best? An Empirical Study on Model-Agnostic Meta-Learning in NLP Applications | May 24, 2020 | Few-Shot Text ClassificationLanguage Modeling | —Unverified | 0 |
| When Does Syntax Mediate Neural Language Model Performance? Evidence from Dropout Probes | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain | Oct 31, 2022 | FLUELanguage Modeling | —Unverified | 0 |
| When Large Language Model Agents Meet 6G Networks: Perception, Grounding, and Alignment | Jan 15, 2024 | Integrated sensing and communicationLanguage Modeling | —Unverified | 0 |
| When Large Language Model Meets Optimization | May 16, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Mapping Biomedical Ontology Terms to IDs: Effect of Domain Prevalence on Prediction Accuracy | Sep 11, 2024 | Feature ImportanceLanguage Modeling | —Unverified | 0 |
| SOEN-101: Code Generation by Emulating Software Process Models Using Large Language Model Agents | Mar 23, 2024 | Code GenerationHumanEval | —Unverified | 0 |
| When More is not Necessary Better: Multilingual Auxiliary Tasks for Zero-Shot Cross-Lingual Transfer of Hate Speech Detection Models | Jan 16, 2022 | Cross-Lingual TransferHate Speech Detection | —Unverified | 0 |
| When Persuasion Overrides Truth in Multi-Agent LLM Debates: Introducing a Confidence-Weighted Persuasion Override Rate (CW-POR) | Apr 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications? | Aug 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks | Apr 2, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| When Text Embedding Meets Large Language Model: A Comprehensive Survey | Dec 12, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Where exactly does contextualization in a PLM happen? | Dec 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation | Oct 22, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Which side are you on? Insider-Outsider classification in conspiracy-theoretic social media | Mar 8, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Which techniques does your application use?: An information extraction framework for scientific articles | Aug 23, 2016 | ArticlesLanguage Modeling | —Unverified | 0 |
| Whisper-GPT: A Hybrid Representation Audio Large Language Model | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction | Jun 6, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |
| Who Brings the Frisbee: Probing Hidden Hallucination Factors in Large Vision-Language Model via Causality Analysis | Dec 4, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection | Jan 25, 2022 | ArticlesLanguage Modeling | —Unverified | 0 |
| Who's to say what's funny? A computer using Language Models and Deep Learning, That's Who! | May 29, 2017 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Who Writes the Review, Human or AI? | May 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore | May 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why Are Positional Encodings Nonessential for Deep Autoregressive Transformers? Revisiting a Petroglyph | Dec 31, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why do LLaVA Vision-Language Models Reply to Images in English? | Jul 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck | Apr 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why Gradients Rapidly Increase Near the End of Training | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation | May 19, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Why LLMs Cannot Think and How to Fix It | Mar 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |