| IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities | Aug 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | Diversity, Language Modeling | Code Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image Captioning, Image Description | Code Available | 1 | 5 |
| Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Jan 6, 2025 | Language Model Evaluation, Language Modeling | Code Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs | Feb 11, 2024 | Inductive Bias, Language Modeling | Code Available | 1 | 5 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | Nov 15, 2021 | image-classification, Image Classification | Code Available | 1 | 5 |
| DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining | May 20, 2023 | Extractive Summarization, Knowledge Distillation | Code Available | 1 | 5 |
| Balanced Data Sampling for Language Model Training with Clustering | Feb 22, 2024 | Clustering, Language Modeling | Code Available | 1 | 5 |
| Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation | Oct 21, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Discovering Autoregressive Orderings with Variational Inference | Jan 1, 2021 | Code Generation, Image Captioning | Code Available | 1 | 5 |
| Addressing Some Limitations of Transformers with Feedback Memory | Feb 21, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Improved training of end-to-end attention models for speech recognition | May 8, 2018 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detection, counterfactual | Code Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Hybrid Ranking Network for Text-to-SQL | Aug 11, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Discrete Flows: Invertible Generative Models of Discrete Data | May 24, 2019 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models | Sep 23, 2023 | Code Completion, Hallucination | Code Available | 1 | 5 |
| BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Jan 1, 2021 | Document Classification, Language Modeling | Code Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactual, Data Augmentation | Code Available | 1 | 5 |
| Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech | Jul 19, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models | Oct 15, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla | May 23, 2022 | Conditional Text Generation, Dialogue Generation | Code Available | 1 | 5 |
| Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution | Jun 3, 2021 | Abstractive Text Summarization, Decoder | Code Available | 1 | 5 |
| Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training | Sep 10, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease Prediction, Language Modeling | Code Available | 1 | 5 |
| Human Language Modeling | May 10, 2022 | Age Estimation, Language Modeling | Code Available | 1 | 5 |
| Hydra: A System for Large Multi-Model Deep Learning | Oct 16, 2021 | Deep Learning, GPU | Code Available | 1 | 5 |
| Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | May 1, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| DocSCAN: Unsupervised Text Classification via Learning from Neighbors | May 9, 2021 | Classification, Clustering | Code Available | 1 | 5 |
| Distilling Linguistic Context for Language Model Compression | Sep 17, 2021 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| Distilling the Knowledge of BERT for Sequence-to-Sequence ASR | Aug 9, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative Filtering, Language Modeling | Code Available | 1 | 5 |
| HPT: Hierarchy-aware Prompt Tuning for Hierarchical Text Classification | Apr 28, 2022 | Classification, Language Modeling | Code Available | 1 | 5 |
| Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Batch Prompting: Efficient Inference with Large Language Model APIs | Jan 19, 2023 | Arithmetic Reasoning, In-Context Learning | Code Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RL, Language Modeling | Code Available | 1 | 5 |
| Distributed Deep Learning in Open Collaborations | Jun 18, 2021 | Deep Learning, Language Modeling | Code Available | 1 | 5 |
| Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference | May 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Knowledge-enhanced Visual-Language Pretraining for Computational Pathology | Apr 15, 2024 | Cross-Modal Retrieval, Language Modeling | Code Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text Summarization, Language Modeling | Code Available | 1 | 5 |
| Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences | Dec 10, 2024 | Bayesian Optimization, Language Modeling | Code Available | 1 | 5 |
| DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | May 29, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| Knowledge Graph Generation From Text | Nov 18, 2022 | Graph Generation, Joint Entity and Relation Extraction | Code Available | 1 | 5 |
| How well can a large language model explain business processes as perceived by users? | Jan 23, 2024 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| Copy Is All You Need | Jul 13, 2023 | All, Domain Adaptation | Code Available | 1 | 5 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision Making, Language Modeling | Code Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | Blocking, Language Modeling | Code Available | 1 | 5 |