| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Feb 5, 2024 | In-Context LearningMetric Learning | CodeCode Available | 2 |
| SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks | Jan 31, 2024 | Sentence | CodeCode Available | 2 |
| Aligning and Prompting Everything All at Once for Universal Visual Perception | Dec 4, 2023 | AllObject | CodeCode Available | 2 |
| Segment and Caption Anything | Dec 1, 2023 | Caption Generationobject-detection | CodeCode Available | 2 |
| Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding | Nov 15, 2023 | Highlight DetectionMoment Retrieval | CodeCode Available | 2 |
| BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings | Nov 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A" | Sep 21, 2023 | Data AugmentationSentence | CodeCode Available | 2 |
| SONAR: Sentence-Level Multimodal and Language-Agnostic Representations | Aug 22, 2023 | DecoderMachine Translation | CodeCode Available | 2 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video SegmentationObject | CodeCode Available | 2 |
| BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs | Jul 17, 2023 | Instruction FollowingSentence | CodeCode Available | 2 |
| Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling | Jul 16, 2023 | DiagnosticLanguage Modelling | CodeCode Available | 2 |
| MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Jul 2, 2023 | Biomedical Information RetrievalContrastive Learning | CodeCode Available | 2 |
| SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning | Jun 21, 2023 | SentenceText Generation | CodeCode Available | 2 |
| Fine-Grained Human Feedback Gives Better Rewards for Language Model Training | Jun 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Improving CLIP Training with Language Rewrites | May 31, 2023 | In-Context LearningSentence | CodeCode Available | 2 |
| IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages | May 25, 2023 | AllMachine Translation | CodeCode Available | 2 |
| HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation | May 19, 2023 | HallucinationMachine Translation | CodeCode Available | 2 |
| Active Retrieval Augmented Generation | May 11, 2023 | RetrievalRetrieval-augmented Generation | CodeCode Available | 2 |
| Exploring Human-Like Translation Strategy with Large Language Models | May 6, 2023 | HallucinationMachine Translation | CodeCode Available | 2 |
| RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | May 4, 2023 | Information RetrievalOpen-Domain Question Answering | CodeCode Available | 2 |
| SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models | Mar 15, 2023 | Fact CheckingHallucination | CodeCode Available | 2 |
| Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning | Feb 27, 2023 | Dense Video CaptioningLanguage Modeling | CodeCode Available | 2 |
| Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine | Jan 20, 2023 | Machine TranslationSentence | CodeCode Available | 2 |
| BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric | Dec 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Semantic-Conditional Diffusion Networks for Image Captioning | Dec 6, 2022 | Cross-Modal RetrievalDecoder | CodeCode Available | 2 |
| RetroMAE v2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | Nov 16, 2022 | Dimensionality ReductionInformation Retrieval | CodeCode Available | 2 |
| Towards Realistic Low-resource Relation Extraction: A Benchmark with Empirical Baseline Study | Oct 19, 2022 | Data AugmentationRelation | CodeCode Available | 2 |
| CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers | Oct 1, 2022 | Grammatical Error CorrectionSentence | CodeCode Available | 2 |
| TEACH: Temporal Action Composition for 3D Humans | Sep 9, 2022 | Motion SynthesisSentence | CodeCode Available | 2 |
| Comprehending and Ordering Semantics for Image Captioning | Jun 14, 2022 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 2 |
| Compositional Visual Generation with Composable Diffusion Models | Jun 3, 2022 | Sentence | CodeCode Available | 2 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | DecoderInformation Retrieval | CodeCode Available | 2 |
| "I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset | May 18, 2022 | Sentence | CodeCode Available | 2 |
| NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality | May 9, 2022 | SentenceSpeech Synthesis | CodeCode Available | 2 |
| MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction | Apr 23, 2022 | Grammatical Error CorrectionSentence | CodeCode Available | 2 |
| Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval | Apr 21, 2022 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 2 |
| DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings | Apr 21, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing | Feb 21, 2022 | Few-Shot LearningSentence | CodeCode Available | 2 |
| SGPT: GPT Sentence Embeddings for Semantic Search | Feb 17, 2022 | Argument RetrievalBiomedical Information Retrieval | CodeCode Available | 2 |
| PromptBERT: Improving BERT Sentence Embeddings with Prompts | Jan 12, 2022 | Contrastive LearningDenoising | CodeCode Available | 2 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | SentenceSpeech-to-Speech Translation | CodeCode Available | 2 |
| Deduplicating Training Data Makes Language Models Better | Jul 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SimCSE: Simple Contrastive Learning of Sentence Embeddings | Apr 18, 2021 | Contrastive LearningData Augmentation | CodeCode Available | 2 |
| Pretrained Transformers for Text Ranking: BERT and Beyond | Oct 13, 2020 | Information RetrievalReranking | CodeCode Available | 2 |
| Abstractive Summarization of Spoken andWritten Instructions with BERT | Aug 21, 2020 | Abstractive Text SummarizationArticles | CodeCode Available | 2 |
| Reevaluating Adversarial Examples in Natural Language | Apr 25, 2020 | Sentence | CodeCode Available | 2 |
| MPNet: Masked and Permuted Pre-training for Language Understanding | Apr 20, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CLUE: A Chinese Language Understanding Evaluation Benchmark | Apr 13, 2020 | General ClassificationMachine Reading Comprehension | CodeCode Available | 2 |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Sep 26, 2019 | Common Sense ReasoningGPU | CodeCode Available | 2 |
| PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification | Aug 30, 2019 | Paraphrase IdentificationSentence | CodeCode Available | 2 |