| Optimizing Singular Spectrum for Large Language Model Compression | Feb 20, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Optimizing Token Usage on Large Language Model Conversations Using the Design Structure Matrix | Oct 1, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Optimizing Vision-Language Interactions Through Decoder-Only Models | Dec 14, 2024 | Decoder, Image Captioning | —Unverified | 0 | 0 |
| Oracle-Checker Scheme for Evaluating a Generative Large Language Model | May 6, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| OrcaLoca: An LLM Agent Framework for Software Issue Localization | Feb 1, 2025 | Code Search, Language Modeling | —Unverified | 0 | 0 |
| Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Feb 28, 2024 | Computational Efficiency, image-classification | —Unverified | 0 | 0 |
| Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training | Mar 31, 2025 | GPU, Language Modeling | —Unverified | 0 | 0 |
| Order-agnostic Identifier for Large Language Model-based Generative Recommendation | Feb 15, 2025 | Collaborative Filtering, Language Modeling | —Unverified | 0 | 0 |
| Order Independence With Finetuning | Mar 30, 2025 | ARC, Language Modeling | —Unverified | 0 | 0 |
| Order Matters in the Presence of Dataset Imbalance for Multilingual Learning | Dec 11, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications | May 14, 2025 | Hallucination, Language Modeling | —Unverified | 0 | 0 |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | May 19, 2025 | Graph Generation, Knowledge Distillation | —Unverified | 0 | 0 |
| OrthoDoc: Multimodal Large Language Model for Assisting Diagnosis in Computed Tomography | Aug 30, 2024 | Computed Tomography (CT), Diagnostic | —Unverified | 0 | 0 |
| Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads | Nov 28, 2024 | GPU, Language Modeling | —Unverified | 0 | 0 |
| OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst | Jun 14, 2024 | Image Captioning, Language Modeling | —Unverified | 0 | 0 |
| Outlier dimensions favor frequent tokens in language models | Mar 27, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Out-of-Distribution Detection Using Peer-Class Generated by Large Language Model | Mar 20, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| OVEL: Large Language Model as Memory Manager for Online Video Entity Linking | Mar 3, 2024 | Entity Linking, Language Modeling | —Unverified | 0 | 0 |
| Overcoming Domain Mismatch in Low Resource Sequence-to-Sequence ASR Models using Hybrid Generated Pseudotranscripts | Jun 14, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Overcoming linguistic barriers in code assistants: creating a QLoRA adapter to improve support for Russian-language code writing instructions | Sep 14, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling | Mar 24, 2025 | Continual Pretraining, Language Modeling | —Unverified | 0 | 0 |
| Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling | Jan 28, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| "Ownership, Not Just Happy Talk": Co-Designing a Participatory Large Language Model for Journalism | Jan 28, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning | Apr 4, 2024 | Descriptive, Diversity | —Unverified | 0 | 0 |
| π_0: A Vision-Language-Action Flow Model for General Robot Control | Oct 31, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| P^3LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training | Oct 22, 2022 | Conversational Question Answering, Decoder | —Unverified | 0 | 0 |
| PACE: Improving Prompt with Actor-Critic Editing for Large Language Model | Aug 19, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| PADriver: Towards Personalized Autonomous Driving | May 8, 2025 | Autonomous Driving, Language Modeling | —Unverified | 0 | 0 |
| PaECTER: Patent-level Representation Learning using Citation-informed Transformers | Feb 29, 2024 | Citation Prediction, Language Modeling | —Unverified | 0 | 0 |
| Pagination: It's what you say, not how long it takes to say it | Apr 11, 2014 | Articles, Language Modeling | —Unverified | 0 | 0 |
| PairConnect: A Compute-Efficient MLP Alternative to Attention | Jun 15, 2021 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| PaliGemma: A versatile 3B VLM for transfer | Jul 10, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter | Feb 16, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation | Nov 1, 2020 | Abstractive Text Summarization, Conversational Response Generation | —Unverified | 0 | 0 |
| Paloma: A Benchmark for Evaluating Language Model Fit | Dec 16, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Palomino-Ochoa at SemEval-2020 Task 9: Robust System based on Transformer for Code-Mixed Sentiment Classification | Nov 18, 2020 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge | Apr 17, 2025 | Knowledge Graphs, Language Modeling | —Unverified | 0 | 0 |
| PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing | Mar 17, 2025 | Denoising, Language Modeling | —Unverified | 0 | 0 |
| PanGu-π: Enhancing Language Model Architectures via Nonlinearity Compensation | Dec 27, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing | Mar 20, 2023 | Code Generation, Language Modeling | —Unverified | 0 | 0 |
| PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System | Feb 21, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Paradigm Shift in Sustainability Disclosure Analysis: Empowering Stakeholders with CHATREPORT, a Language Model-Based Tool | Jun 27, 2023 | Benchmarking, Language Modeling | —Unverified | 0 | 0 |
| Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue | Dec 23, 2023 | Attribute, Language Modeling | —Unverified | 0 | 0 |
| Parallel Corpus Augmentation using Masked Language Models | Oct 4, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Parallel Corpus Filtering via Pre-trained Language Models | May 13, 2020 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration | Apr 18, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Parallelizing Linear Transformers with the Delta Rule over Sequence Length | Jun 10, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| PARALLELPROMPT: Extracting Parallelism from Large Language Model Queries | Jun 23, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| ParallelSpec: Parallel Drafter for Efficient Speculative Decoding | Oct 8, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| PARAMANU-GANITA: Language Model with Mathematical Capabilities | Apr 22, 2024 | Domain Adaptation, GSM8K | —Unverified | 0 | 0 |