| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer | May 6, 2021 | Data AugmentationDecoder | CodeCode Available | 1 |
| Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging | Apr 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CLIP2Video: Mastering Video-Text Retrieval via Image CLIP | Jun 21, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Reinforcement Learning Friendly Vision-Language Model for Minecraft | Mar 19, 2023 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction | Apr 15, 2021 | Dialog Relation ExtractionLanguage Modeling | CodeCode Available | 1 |
| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Mar 23, 2020 | GPULanguage Modeling | CodeCode Available | 1 |
| Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Oct 6, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing | Oct 25, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims | Oct 16, 2024 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training | May 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking | Dec 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness? | Nov 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EGFI: Drug-Drug Interaction Extraction and Generation with Fusion of Enriched Entity and Sentence Information | Jan 25, 2021 | ClassificationDrug–drug Interaction Extraction | CodeCode Available | 1 |
| ELI5: Long Form Question Answering | Jul 22, 2019 | FormLanguage Modeling | CodeCode Available | 1 |
| Empower Entity Set Expansion via Language Model Probing | Apr 29, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Efficient Nearest Neighbor Language Models | Sep 9, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| CogBench: a large language model walks into a psychology lab | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| hmBERT: Historical Multilingual Language Models for Named Entity Recognition | May 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation | Mar 22, 2022 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| A Study of Generative Large Language Model for Medical Research and Healthcare | May 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Apr 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Efficiently Modeling Long Sequences with Structured State Spaces | Oct 31, 2021 | Data AugmentationLanguage Modeling | CodeCode Available | 1 |
| Clover: Towards A Unified Video-Language Alignment and Fusion Model | Jul 16, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| How Much Knowledge Can You Pack Into the Parameters of a Language Model? | Feb 10, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| How multilingual is Multilingual BERT? | Jun 4, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Efficient OCR for Building a Diverse Digital History | Apr 5, 2023 | DiversityImage Retrieval | CodeCode Available | 1 |
| Efficient Hierarchical Domain Adaptation for Pretrained Language Models | Dec 16, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | ClusteringHallucination | CodeCode Available | 1 |
| Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition | Aug 31, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities | Nov 30, 2023 | Audio ClassificationFew-Shot Audio Classification | CodeCode Available | 1 |
| Efficient Content-Based Sparse Attention with Routing Transformers | Mar 12, 2020 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Efficient Long Sequence Modeling via State Space Augmented Transformer | Dec 15, 2022 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| Efficient Online Data Mixing For Language Model Pre-Training | Dec 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hydra: A System for Large Multi-Model Deep Learning | Oct 16, 2021 | Deep LearningGPU | CodeCode Available | 1 |
| Effective Sequence-to-Sequence Dialogue State Tracking | Aug 31, 2021 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 |
| CFGPT: Chinese Financial Assistant with Large Language Model | Sep 19, 2023 | Decision MakingFinancial Analysis | CodeCode Available | 1 |
| Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Jul 11, 2024 | EEGLanguage Modeling | CodeCode Available | 1 |
| Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation | Aug 20, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities | Aug 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree forCommodity News Event Extraction | Sep 27, 2021 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |
| IDAS: Intent Discovery with Abstractive Summarization | May 31, 2023 | Abstractive Text SummarizationDescriptive | CodeCode Available | 1 |
| Salmon: A Suite for Acoustic Language Model Evaluation | Sep 11, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models | May 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Aug 19, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Nov 19, 2022 | BlockingLanguage Modeling | CodeCode Available | 1 |
| CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model | Nov 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Nov 1, 2021 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |