| PROPS: Probabilistic personalization of black-box sequence models | Mar 5, 2019 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval | Oct 20, 2020 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Skim-Attention: Learning to Focus via Document Layout | Sep 2, 2021 | document understandingLanguage Modeling | CodeCode Available | 0 |
| PropMEND: Hypernetworks for Knowledge Propagation in LLMs | Jun 10, 2025 | knowledge editingLanguage Modeling | CodeCode Available | 0 |
| Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access | Jan 18, 2024 | Constituency ParsingLanguage Modeling | CodeCode Available | 0 |
| On the Reliability of Large Language Models to Misinformed and Demographically-Informed Prompts | Oct 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Sparse Sinkhorn Attention | Feb 26, 2020 | Document ClassificationImage Generation | CodeCode Available | 0 |
| SJ_AJ@DravidianLangTech-EACL2021: Task-Adaptive Pre-Training of Multilingual BERT models for Offensive Language Identification | Feb 1, 2021 | Language IdentificationLanguage Modeling | CodeCode Available | 0 |
| Topology-aware Preemptive Scheduling for Co-located LLM Workloads | Nov 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SituationalLLM: Proactive language models with scene awareness for dynamic, contextual task guidance | Jun 19, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens | Aug 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription | Jun 27, 2012 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Feb 7, 2022 | Image Captioningimage-classification | CodeCode Available | 0 |
| SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence | May 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World | Mar 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Single Headed Attention RNN: Stop Thinking With Your Head | Nov 26, 2019 | GPUHyperparameter Optimization | CodeCode Available | 0 |
| Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models | Aug 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| To Tell The Truth: Language of Deception and Language Models | Nov 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| To Tune or Not To Tune? How About the Best of Both Worlds? | Jul 9, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| On the Relationship between Truth and Political Bias in Language Models | Sep 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Simple Unsupervised Summarization by Contextual Matching | Jul 31, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Simple Fusion: Return of the Language Model | Sep 1, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What makes a language easy to deep-learn? Deep neural networks and humans similarly benefit from compositional structure | Feb 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis | Dec 26, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Speaker attribution in German parliamentary debates with QLoRA-adapted large language models | Sep 18, 2023 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| TourSynbio-Search: A Large Language Model Driven Agent Framework for Unified Search Method for Protein Engineering | Nov 9, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| On the Proper Treatment of Tokenization in Psycholinguistics | Oct 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models | Nov 10, 2014 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Sig2text, a Vision-language model for Non-cooperative Radar Signal Parsing | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind | May 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Vision-Language In-Context Learning Driven Few-Shot Visual Inspection Model | Feb 13, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| On the Multilingual Capabilities of Very Large-Scale English Language Models | Aug 30, 2021 | Extractive Question-AnsweringFew-Shot Learning | CodeCode Available | 0 |
| Show and Guide: Instructional-Plan Grounded Vision and Language Model | Sep 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Modeling Fine-grained Information via Knowledge-aware Hierarchical Graph for Zero-shot Entity Retrieval | Nov 20, 2022 | Entity RetrievalGraph Attention | CodeCode Available | 0 |
| Toward a Thermodynamics of Meaning | Sep 24, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Jul 29, 2024 | DenoisingDiversity | CodeCode Available | 0 |
| SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding | Jul 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Specious Sites: Tracking the Spread and Sway of Spurious News Stories at Scale | Aug 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SpecNFS: A Challenge Dataset Towards Extracting Formal Models from Natural Language Specifications | Jun 1, 2022 | Dependency ParsingDomain Adaptation | CodeCode Available | 0 |
| UnihanLM: Coarse-to-Fine Chinese-Japanese Language Model Pretraining with the Unihan Database | Dec 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| On the Limitations of Sociodemographic Adaptation with Transformers | Aug 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Shifting Mean Activation Towards Zero with Bipolar Activation Functions | Sep 12, 2017 | General ClassificationLanguage Modeling | CodeCode Available | 0 |
| UNIMELB at SemEval-2016 Tasks 4A and 4B: An Ensemble of Neural Networks and a Word2Vec Based Model for Sentiment Classification | Jun 1, 2016 | Document ClassificationLanguage Modeling | CodeCode Available | 0 |
| On the Generalization Ability of Retrieval-Enhanced Transformers | Feb 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Shifting from endangerment to rebirth in the Artificial Intelligence Age: An Ensemble Machine Learning Approach for Hawrami Text Classification | Sep 25, 2024 | ArticlesClassification | CodeCode Available | 0 |
| Modeling Complex Event Scenarios via Simple Entity-focused Questions | Feb 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SharpZO: Hybrid Sharpness-Aware Vision Language Model Prompt Tuning via Forward-Only Passes | Jun 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PrOnto: Language Model Evaluations for 859 Languages | May 22, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 0 |
| Prompt Tuning or Fine-Tuning - Investigating Relational Knowledge in Pre-Trained Language Models | Jun 22, 2021 | fill-maskFill Mask | CodeCode Available | 0 |
| Prompt-Time Ontology-Driven Symbolic Knowledge Capture with Large Language Models | May 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |