| RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | May 4, 2023 | Information RetrievalOpen-Domain Question Answering | CodeCode Available | 2 | 5 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | DecoderInformation Retrieval | CodeCode Available | 2 | 5 |
| Segment and Caption Anything | Dec 1, 2023 | Caption Generationobject-detection | CodeCode Available | 2 | 5 |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Sep 26, 2019 | Common Sense ReasoningGPU | CodeCode Available | 2 | 5 |
| Exploring Human-Like Translation Strategy with Large Language Models | May 6, 2023 | HallucinationMachine Translation | CodeCode Available | 2 | 5 |
| Active Retrieval Augmented Generation | May 11, 2023 | RetrievalRetrieval-augmented Generation | CodeCode Available | 2 | 5 |
| SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection | May 16, 2024 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| SimCSE: Simple Contrastive Learning of Sentence Embeddings | Apr 18, 2021 | Contrastive LearningData Augmentation | CodeCode Available | 2 | 5 |
| Improving CLIP Training with Language Rewrites | May 31, 2023 | In-Context LearningSentence | CodeCode Available | 2 | 5 |
| TEACH: Temporal Action Composition for 3D Humans | Sep 9, 2022 | Motion SynthesisSentence | CodeCode Available | 2 | 5 |
| Thought Anchors: Which LLM Reasoning Steps Matter? | Jun 23, 2025 | counterfactualSentence | CodeCode Available | 2 | 5 |
| Toward Controlled Generation of Text | Mar 2, 2017 | AttributeSentence | CodeCode Available | 2 | 5 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video SegmentationObject | CodeCode Available | 2 | 5 |
| DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory | Oct 10, 2024 | Document TranslationMachine Translation | CodeCode Available | 2 | 5 |
| BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings | Nov 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling | Jul 16, 2023 | DiagnosticLanguage Modelling | CodeCode Available | 2 | 5 |
| Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding | Nov 15, 2023 | Highlight DetectionMoment Retrieval | CodeCode Available | 2 | 5 |
| Compositional Visual Generation with Composable Diffusion Models | Jun 3, 2022 | Sentence | CodeCode Available | 2 | 5 |
| Compositional Entailment Learning for Hyperbolic Vision-Language Models | Oct 9, 2024 | Language ModellingRepresentation Learning | CodeCode Available | 2 | 5 |
| Comprehending and Ordering Semantics for Image Captioning | Jun 14, 2022 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 2 | 5 |
| Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation | Apr 4, 2024 | Contrastive LearningReferring Expression | CodeCode Available | 2 | 5 |
| DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement | Jun 18, 2025 | Graph GenerationHallucination | CodeCode Available | 2 | 5 |
| BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs | Jul 17, 2023 | Instruction FollowingSentence | CodeCode Available | 2 | 5 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | SentenceSpeech-to-Speech Translation | CodeCode Available | 2 | 5 |
| AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction | Sep 3, 2024 | RelationRelation Extraction | CodeCode Available | 2 | 5 |
| Deduplicating Training Data Makes Language Models Better | Jul 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing | Feb 21, 2022 | Few-Shot LearningSentence | CodeCode Available | 2 | 5 |
| DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings | Apr 21, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 | 5 |
| MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Jul 2, 2023 | Biomedical Information RetrievalContrastive Learning | CodeCode Available | 2 | 5 |
| DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive LearningImage-text Retrieval | CodeCode Available | 2 | 5 |
| Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Jan 13, 2025 | In-Context LearningRAG | CodeCode Available | 2 | 5 |
| BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric | Dec 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 | 5 |
| CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers | Oct 1, 2022 | Grammatical Error CorrectionSentence | CodeCode Available | 2 | 5 |
| Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Mar 13, 2024 | Constituency Grammar InductionLanguage Modeling | CodeCode Available | 2 | 5 |
| Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | Oct 4, 2024 | Dense Video CaptioningSentence | CodeCode Available | 2 | 5 |
| AutoRE: Document-Level Relation Extraction with Large Language Models | Mar 21, 2024 | Document-level Relation ExtractionRelation | CodeCode Available | 2 | 5 |
| "I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset | May 18, 2022 | Sentence | CodeCode Available | 2 | 5 |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Oct 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations | Feb 20, 2024 | Sentence | CodeCode Available | 2 | 5 |
| Abstractive Summarization of Spoken andWritten Instructions with BERT | Aug 21, 2020 | Abstractive Text SummarizationArticles | CodeCode Available | 2 | 5 |
| Learning representations of learning representations | Apr 12, 2024 | Sentence | CodeCode Available | 2 | 5 |
| ANAH: Analytical Annotation of Hallucinations in Large Language Models | May 30, 2024 | Generative Question AnsweringHallucination | CodeCode Available | 2 | 5 |
| ARAGOG: Advanced RAG Output Grading | Apr 1, 2024 | Document EmbeddingLanguage Modeling | CodeCode Available | 2 | 5 |
| beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems | Sep 16, 2024 | Collaborative FilteringRecommendation Systems | CodeCode Available | 2 | 5 |
| MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction | Apr 23, 2022 | Grammatical Error CorrectionSentence | CodeCode Available | 2 | 5 |
| One Thousand and One Pairs: A "novel" challenge for long-context language models | Jun 24, 2024 | RetrievalSentence | CodeCode Available | 2 | 5 |
| PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Aug 30, 2019 | Paraphrase IdentificationSentence | CodeCode Available | 2 | 5 |
| CLUE: A Chinese Language Understanding Evaluation Benchmark | Apr 13, 2020 | General ClassificationMachine Reading Comprehension | CodeCode Available | 2 | 5 |
| DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models | Mar 15, 2024 | RAGRetrieval | CodeCode Available | 2 | 5 |