| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Mark My Words: Analyzing and Evaluating Language Model Watermarks | Dec 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Mass-Producing Failures of Multimodal Systems with Language Models | Jun 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment | Jun 11, 2021 | DenoisingLanguage Modeling | CodeCode Available | 1 | 5 |
| Improving Seq2Seq Grammatical Error Correction via Decoding Interventions | Oct 23, 2023 | DecoderGrammatical Error Correction | CodeCode Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach | Sep 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 1 | 5 |
| Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change | Oct 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation RecommendationCoreference Resolution | CodeCode Available | 1 | 5 |
| MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning | Dec 9, 2021 | In-Context LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| A Practical Deep Learning-Based Acoustic Side Channel Attack on Keyboards | Aug 2, 2023 | Deep LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| Making AI Less "Thirsty": Uncovering and Addressing the Secret Water Footprint of AI Models | Apr 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations | Jun 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image CaptioningImage Description | CodeCode Available | 1 | 5 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Nov 2, 2022 | Automatic Speech Recognition (ASR)Language Modeling | CodeCode Available | 1 | 5 |
| Making Language Models Better Tool Learners with Execution Feedback | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Dec 6, 2024 | counterfactualLanguage Model Evaluation | CodeCode Available | 1 | 5 |
| Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models | Oct 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| Approaching Deep Learning through the Spectral Dynamics of Weights | Aug 21, 2024 | Deep Learningimage-classification | CodeCode Available | 1 | 5 |
| DARTS: Differentiable Architecture Search | Jun 24, 2018 | General Classificationimage-classification | CodeCode Available | 1 | 5 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data AugmentationDecoder | CodeCode Available | 1 | 5 |
| VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups | Jun 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring | May 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Incorporating External POS Tagger for Punctuation Restoration | Jun 12, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Sep 25, 2024 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 | 5 |
| Stabilizing Transformers for Reinforcement Learning | Oct 13, 2019 | General Reinforcement LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Nov 14, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 | 5 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Jun 6, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 | 5 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Jun 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization | Sep 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Oct 22, 2023 | DiagnosticLanguage Modeling | CodeCode Available | 1 | 5 |
| Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | May 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| -former: Infinite Memory Transformer | May 1, 2022 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysisdocument understanding | CodeCode Available | 1 | 5 |
| CycleFormer : TSP Solver Based on Language Modeling | May 30, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| AutoScrum: Automating Project Planning Using Large Language Models | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model | Mar 4, 2025 | es-enLanguage Modeling | CodeCode Available | 1 | 5 |
| InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation | Dec 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models | Oct 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought | Feb 25, 2025 | Emotion RecognitionLanguage Modeling | CodeCode Available | 1 | 5 |
| InforMask: Unsupervised Informative Masking for Language Model Pretraining | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| RealFormer: Transformer Likes Residual Attention | Dec 21, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Feb 17, 2025 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 | 5 |