| Efficient Hierarchical Domain Adaptation for Pretrained Language Models | Dec 16, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| AgroGPT: Efficient Agricultural Vision-Language Model with Expert Tuning | Oct 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Mar 23, 2020 | GPULanguage Modeling | CodeCode Available | 1 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation RecommendationCoreference Resolution | CodeCode Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Nov 1, 2021 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |
| Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images | May 31, 2021 | Few-Shot LearningImage Classification | CodeCode Available | 1 |
| Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers | Oct 29, 2024 | Drug DesignLanguage Modeling | CodeCode Available | 1 |
| Effective Sequence-to-Sequence Dialogue State Tracking | Aug 31, 2021 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 |
| Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization | Jan 22, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree forCommodity News Event Extraction | Sep 27, 2021 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |
| Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition | Aug 31, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering | May 2, 2020 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| ArcGPT: A Large Language Model Tailored for Real-world Archival Applications | Jul 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Nov 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery | Oct 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Efficient Content-Based Sparse Attention with Routing Transformers | Mar 12, 2020 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| LXMERT: Learning Cross-Modality Encoder Representations from Transformers | Aug 20, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT | Jun 29, 2023 | Automatic Lyrics TranscriptionLanguage Modeling | CodeCode Available | 1 |
| Cascaded Head-colliding Attention | May 31, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model | Mar 11, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Oct 16, 2021 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Nov 14, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Effective Attention Sheds Light On Interpretability | May 18, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROAD | May 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training | Sep 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning | Dec 9, 2021 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling | Dec 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Batching for Recurrent Neural Network Grammars | May 31, 2021 | GPULanguage Modeling | CodeCode Available | 1 |
| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | ClusteringHallucination | CodeCode Available | 1 |
| ELI5: Long Form Question Answering | Jul 22, 2019 | FormLanguage Modeling | CodeCode Available | 1 |
| DziriBERT: a Pre-trained Language Model for the Algerian Dialect | Sep 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Construction Repetition Reduces Information Rate in Dialogue | Oct 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AraGPT2: Pre-Trained Transformer for Arabic Language Generation | Dec 31, 2020 | ArticlesLanguage Modeling | CodeCode Available | 1 |
| Massive Editing for Large Language Models via Meta Learning | Nov 8, 2023 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| Content-based Controls For Music Large Language Modeling | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Content Planning for Neural Story Generation with Aristotelian Rescoring | Sep 21, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs | May 1, 2022 | Constituency Grammar InductionLanguage Modeling | CodeCode Available | 1 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image CaptioningImage Description | CodeCode Available | 1 |
| Materials Informatics Transformer: A Language Model for Interpretable Materials Properties Prediction | Aug 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education | Jun 2, 2021 | Knowledge TracingLanguage Modeling | CodeCode Available | 1 |
| MatSciBERT: A Materials Domain Language Model for Text Mining and Information Extraction | Sep 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding | Dec 31, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dynamic Grained Encoder for Vision Transformers | Jan 10, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation | Dec 17, 2024 | Contrastive LearningImage Segmentation | CodeCode Available | 1 |
| Contextual Information and Commonsense Based Prompt for Emotion Recognition in Conversation | Jul 27, 2022 | Emotion RecognitionEmotion Recognition in Conversation | CodeCode Available | 1 |
| Dynamic Contextualized Word Embeddings | Oct 23, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |