| Efficient Online Data Mixing For Language Model Pre-Training | Dec 5, 2023 | Language Modeling | Code Available | 1 |
| Efficient OCR for Building a Diverse Digital History | Apr 5, 2023 | Diversity, Image Retrieval | Code Available | 1 |
| GUing: A Mobile GUI Search Engine using a Vision-Language Model | Apr 30, 2024 | Language Modeling | Code Available | 1 |
| CLIP2Video: Mastering Video-Text Retrieval via Image CLIP | Jun 21, 2021 | Language Modeling | Code Available | 1 |
| Reinforcement Learning Friendly Vision-Language Model for Minecraft | Mar 19, 2023 | Contrastive Learning, Language Modeling | Code Available | 1 |
| CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | Decoder, Language Modeling | Code Available | 1 |
| KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction | Apr 15, 2021 | Dialog Relation Extraction, Language Modeling | Code Available | 1 |
| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Mar 23, 2020 | GPU, Language Modeling | Code Available | 1 |
| Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer | May 6, 2021 | Data Augmentation, Decoder | Code Available | 1 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Oct 13, 2024 | Language Modeling | Code Available | 1 |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Oct 1, 2024 | Continual Learning, Language Modeling | Code Available | 1 |
| Efficient Long Sequence Modeling via State Space Augmented Transformer | Dec 15, 2022 | Computational Efficiency, Decoder | Code Available | 1 |
| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | Clustering, Hallucination | Code Available | 1 |
| Efficient Content-Based Sparse Attention with Routing Transformers | Mar 12, 2020 | Image Generation, Language Modeling | Code Available | 1 |
| CLIP the Gap: A Single Domain Generalization Approach for Object Detection | Jan 13, 2023 | Domain Generalization, image-classification | Code Available | 1 |
| Efficient Hierarchical Domain Adaptation for Pretrained Language Models | Dec 16, 2021 | Domain Adaptation, Language Modeling | Code Available | 1 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language Modeling | Code Available | 1 |
| Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables | Feb 20, 2024 | Fact Checking, Graph Neural Network | Code Available | 1 |
| Efficiently Modeling Long Sequences with Structured State Spaces | Oct 31, 2021 | Data Augmentation, Language Modeling | Code Available | 1 |
| Hexatagging: Projective Dependency Parsing as Tagging | Jun 8, 2023 | Computational Efficiency, Dependency Parsing | Code Available | 1 |
| Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling | Oct 2, 2024 | Language Modeling | Code Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Nov 1, 2021 | Event Extraction, Language Modeling | Code Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Sep 27, 2021 | Event Extraction, Language Modeling | Code Available | 1 |
| High-Dimension Human Value Representation in Large Language Models | Apr 11, 2024 | Language Modeling | Code Available | 1 |
| Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images | May 31, 2021 | Few-Shot Learning, Image Classification | Code Available | 1 |