| A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis | Apr 11, 2022 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 | 5 |
| FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Aug 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InstructDET: Diversifying Referring Object Detection with Generalized Instructions | Oct 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Feb 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing | Jul 16, 2025 | Domain GeneralizationFace Anti-Spoofing | CodeCode Available | 1 | 5 |
| -former: Infinite Memory Transformer | May 1, 2022 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| Injecting Numerical Reasoning Skills into Language Models | Apr 9, 2020 | Data AugmentationDecoder | CodeCode Available | 1 | 5 |
| CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Mar 19, 2023 | DecoderIntent Detection | CodeCode Available | 1 | 5 |
| Offline Handwritten Chinese Text Recognition with Convolutional Neural Networks | Jun 28, 2020 | Handwritten Chinese Text RecognitionLanguage Modeling | CodeCode Available | 1 | 5 |
| -former: Infinite Memory Transformer | Sep 1, 2021 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model | Mar 20, 2024 | Drug DiscoveryKnowledge Distillation | CodeCode Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 1 | 5 |
| GateLoop: Fully Data-Controlled Linear Recurrence for Sequence Modeling | Nov 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation | Dec 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation RecommendationCoreference Resolution | CodeCode Available | 1 | 5 |
| 2SSP: A Two-Stage Framework for Structured Pruning of LLMs | Jan 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InforMask: Unsupervised Informative Masking for Language Model Pretraining | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese | Oct 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model | Jul 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling | Oct 14, 2022 | BenchmarkingLanguage Modeling | CodeCode Available | 1 | 5 |
| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings | Oct 8, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| Cached Transformers: Improving Transformers with Differentiable Memory Cache | Dec 20, 2023 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| AcTune: Uncertainty-aware Active Self-Training for Semi-Supervised Active Learning with Pretrained Language Models | Dec 16, 2021 | Active LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model | Mar 4, 2025 | es-enLanguage Modeling | CodeCode Available | 1 | 5 |
| MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models | Oct 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| RealFormer: Transformer Likes Residual Attention | Dec 21, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Generated Knowledge Prompting for Commonsense Reasoning | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding | Feb 19, 2024 | HumanEvalLanguage Modeling | CodeCode Available | 1 | 5 |
| On Measuring Social Biases in Prompt-Based Multi-Task Learning | May 23, 2022 | FormLanguage Modeling | CodeCode Available | 1 | 5 |
| CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment | Dec 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability | Mar 12, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| Generating Label Cohesive and Well-Formed Adversarial Claims | Sep 17, 2020 | Fact CheckingLanguage Modeling | CodeCode Available | 1 | 5 |
| Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus | Jan 27, 2023 | Language AcquisitionLanguage Modeling | CodeCode Available | 1 | 5 |
| On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies | Apr 12, 2021 | Inductive BiasLanguage Modeling | CodeCode Available | 1 | 5 |
| IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization | Sep 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| On the Sentence Embeddings from Pre-trained Language Models | Nov 2, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Generating Sentences from a Continuous Space | Nov 19, 2015 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning | Oct 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image CaptioningImage Description | CodeCode Available | 1 | 5 |
| Mask-Predict: Parallel Decoding of Conditional Masked Language Models | Apr 19, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CTRL: A Conditional Transformer Language Model for Controllable Generation | Sep 11, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |