| Deciphering antibody affinity maturation with language models and weakly supervised learning | Dec 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | DiversityImage Captioning | CodeCode Available | 1 | 5 |
| Debiasing Methods in Natural Language Understanding Make Bias More Accessible | Sep 9, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LongKey: Keyphrase Extraction for Long Documents | Nov 26, 2024 | Keyphrase ExtractionLanguage Modeling | CodeCode Available | 1 | 5 |
| Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model | Dec 18, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Long Expressive Memory for Sequence Modeling | Oct 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Copy Is All You Need | Jul 13, 2023 | AllDomain Adaptation | CodeCode Available | 1 | 5 |
| Dealing with Typos for BERT-based Passage Retrieval and Ranking | Aug 27, 2021 | Information RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 | 5 |
| Data-to-Text Generation with Iterative Text Editing | Nov 3, 2020 | Data-to-Text GenerationDomain Adaptation | CodeCode Available | 1 | 5 |
| How Language Model Hallucinations Can Snowball | May 22, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| How far is Language Model from 100% Few-shot Named Entity Recognition in Medical Domain | Jul 1, 2023 | few-shot-nerFew-shot NER | CodeCode Available | 1 | 5 |
| Scene Transformer: A unified architecture for predicting multiple agent trajectories | Jun 15, 2021 | Autonomous DrivingLanguage Modeling | CodeCode Available | 1 | 5 |
| DeepInception: Hypnotize Large Language Model to Be Jailbreaker | Nov 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| How Much Knowledge Can You Pack Into the Parameters of a Language Model? | Feb 10, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers | Oct 29, 2024 | Drug DesignLanguage Modeling | CodeCode Available | 1 | 5 |
| MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation | May 12, 2021 | Adversarial TextData Augmentation | CodeCode Available | 1 | 5 |
| Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation? | Feb 17, 2025 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 | 5 |
| ScriptWorld: Text Based Environment For Learning Procedural Knowledge | Jul 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LogGPT: Log Anomaly Detection via GPT | Sep 25, 2023 | Anomaly DetectionLanguage Modeling | CodeCode Available | 1 | 5 |
| How to Fine-Tune BERT for Text Classification? | May 14, 2019 | General ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| Data Efficient Masked Language Modeling for Vision and Language | Sep 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Logical Fallacy Detection | Feb 28, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 | 5 |
| How well can a large language model explain business processes as perceived by users? | Jan 23, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| Logic.py: Bridging the Gap between LLMs and Constraint Solvers | Feb 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19 | Jun 19, 2020 | ChatbotLanguage Modeling | CodeCode Available | 1 | 5 |
| Localized Vision-Language Matching for Open-vocabulary Object Detection | May 12, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech | Jul 19, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Word Embeddings Are Steers for Language Models | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Localizing Paragraph Memorization in Language Models | Mar 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices | Oct 2, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| AMR Parsing via Graph-Sequence Iterative Inference | Apr 12, 2020 | AMR ParsingLanguage Modeling | CodeCode Available | 1 | 5 |
| Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models | Sep 20, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| Human Language Modeling | May 10, 2022 | Age EstimationLanguage Modeling | CodeCode Available | 1 | 5 |
| LOGO -- Long cOntext aliGnment via efficient preference Optimization | Oct 24, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| Can Large Language Models Write Parallel Code? | Jan 23, 2024 | Code CompletionCode Generation | CodeCode Available | 1 | 5 |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Dec 6, 2024 | counterfactualLanguage Model Evaluation | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |
| HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation | Dec 17, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Jul 11, 2024 | EEGLanguage Modeling | CodeCode Available | 1 | 5 |
| Automatic Model Selection with Large Language Models for Reasoning | May 23, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detectioncounterfactual | CodeCode Available | 1 | 5 |
| DARTS: Differentiable Architecture Search | Jun 24, 2018 | General Classificationimage-classification | CodeCode Available | 1 | 5 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |