| Uncovering Intermediate Variables in Transformers using Circuit Probing | Nov 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Neural-Hidden-CRF: A Robust Weakly-Supervised Sequence Labeler | Sep 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments | Jun 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models | Jul 31, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| You can't pick your neighbors, or can you? When and how to rely on retrieval in the kNN-LM | Oct 28, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks | Oct 22, 2018 | Constituency Grammar InductionInductive Bias | CodeCode Available | 0 |
| Neural Generation for Czech: Data and Baselines | Oct 11, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Rare Words: A Major Problem for Contextualized Embeddings And How to Fix it by Attentive Mimicking | Apr 14, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Zero-Shot Cross-Lingual Transfer in Legal Domain Using Transformer Models | Nov 28, 2021 | ClassificationCross-Lingual Transfer | CodeCode Available | 0 |
| You Don't Know My Favorite Color: Preventing Dialogue Representations from Revealing Speakers' Private Personas | Apr 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Understanding Architectures Learnt by Cell-based Neural Architecture Search | Sep 20, 2019 | image-classificationImage Classification | CodeCode Available | 0 |
| Video (language) modeling: a baseline for generative models of natural videos | Dec 20, 2014 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Word Ordering Without Syntax | Apr 28, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study | Dec 19, 2024 | AstronomyDomain Adaptation | CodeCode Available | 0 |
| Neural Authorship Attribution: Stylometric Analysis on Large Language Models | Aug 14, 2023 | Authorship AttributionLanguage Modeling | CodeCode Available | 0 |
| Oracle performance for visual captioning | Nov 14, 2015 | Image CaptioningLanguage Modeling | CodeCode Available | 0 |
| Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting | Jun 14, 2024 | Dialogue GenerationForm | CodeCode Available | 0 |
| Rank-K: Test-Time Reasoning for Listwise Reranking | May 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat | Nov 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks | Oct 18, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Neural Architecture Search with Reinforcement Learning | Nov 5, 2016 | Image ClassificationLanguage Modeling | CodeCode Available | 0 |
| Understanding Domain Learning in Language Models Through Subpopulation Analysis | Oct 22, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What Does BERT Look At? An Analysis of BERT's Attention | Jun 11, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Traitors: Deception and Trust in Multi-Agent Language Model Simulations | May 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Understanding Hidden Computations in Chain-of-Thought Reasoning | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Investigation of Language Model Interpretability via Sentence Editing | Nov 28, 2020 | General ClassificationLanguage Modeling | CodeCode Available | 0 |
| You Don’t Know My Favorite Color: Preventing Dialogue Representations from Revealing Speakers’ Private Personas | Jul 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling | Dec 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain | Jul 16, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| The Unreasonable Effectiveness of Transformer Language Models in Grammatical Error Correction | Jun 4, 2019 | Grammatical Error CorrectionLanguage Modeling | CodeCode Available | 0 |
| Understanding Language Modeling Paradigm Adaptations in Recommender Systems: Lessons Learned and Open Challenges | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Neural Architecture Optimization | Aug 22, 2018 | DecoderEvolutionary Algorithms | CodeCode Available | 0 |
| Neural Academic Paper Generation | Dec 2, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Ranking Manipulation for Conversational Search Engines | Jun 5, 2024 | Conversational SearchLanguage Modeling | CodeCode Available | 0 |
| Randomized Geometric Algebra Methods for Convex Neural Networks | Jun 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RALLRec+: Retrieval Augmented Large Language Model Recommendation with Reasoning | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The World of an Octopus: How Reporting Bias Influences a Language Model’s Perception of Color | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RAFT: Adapting Language Model to Domain Specific RAG | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Z-loss: a shift and scale invariant classification loss belonging to the Spherical Family | Apr 29, 2016 | General ClassificationLanguage Modeling | CodeCode Available | 0 |
| video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models | Jun 22, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| Thieves on Sesame Street! Model Extraction of BERT-based APIs | Oct 27, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Optimizing Retrieval-augmented Reader Models via Token Elimination | Oct 20, 2023 | Answer GenerationDecoder | CodeCode Available | 0 |
| Zero-shot Translation of Attention Patterns in VQA Models to Natural Language | Nov 8, 2023 | Image CaptioningLanguage Modeling | CodeCode Available | 0 |
| Network Traffic Anomaly Detection Using Recurrent Neural Networks | Mar 28, 2018 | Anomaly DetectionLanguage Modeling | CodeCode Available | 0 |
| Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance | Jan 21, 2025 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting | Oct 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Understanding Stragglers in Large Model Training Using What-if Analysis | May 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Word Sense Induction with Neural biLM and Symmetric Patterns | Aug 26, 2018 | ClusteringLanguage Modeling | CodeCode Available | 0 |
| Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |