| Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation | Apr 21, 2020 | Knowledge Distillation, Sentence | Code Available | 1 |
| BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm | Dec 11, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements | Feb 10, 2025 | Sentence | Code Available | 1 |
| MasakhaNEWS: News Topic Classification for African languages | Apr 19, 2023 | Classification, Few-Shot Learning | Code Available | 1 |
| Mask Grounding for Referring Image Segmentation | Dec 19, 2023 | cross-modal alignment, Image Segmentation | Code Available | 1 |
| MaskLID: Code-Switching Language Identification through Iterative Masking | Jun 10, 2024 | Language Identification, Sentence | Code Available | 1 |
| Approximate Attributions for Off-the-Shelf Siamese Transformers | Feb 5, 2024 | Negation, Sentence | Code Available | 1 |
| Balancing Diversity and Risk in LLM Sampling: How to Select Your Method and Parameter for Open-Ended Text Generation | Aug 24, 2024 | Diversity, Sentence | Code Available | 1 |
| MatchXML: An Efficient Text-label Matching Framework for Extreme Multi-label Text Classification | Aug 25, 2023 | Multi Label Text Classification, Multi-Label Text Classification | Code Available | 1 |
| Mathematical Foundations for a Compositional Distributional Model of Meaning | Mar 23, 2010 | Sentence | Code Available | 1 |
| Learning to Generate Grounded Visual Captions without Localization Supervision | Jun 1, 2019 | Image Captioning, Language Modelling | Code Available | 1 |
| Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment | May 4, 2022 | Sentence, Text to SQL | Code Available | 1 |
| Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals | May 1, 2022 | Sentence, Sentence Completion | Code Available | 1 |
| MedSTS: A Resource for Clinical Semantic Textual Similarity | Aug 28, 2018 | Decision Making, Semantic Similarity | Code Available | 1 |
| MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER | Aug 31, 2021 | Cross-Lingual NER, Data Augmentation | Code Available | 1 |
| MemCap: Memorizing Style Knowledge for Image Captioning | Apr 3, 2020 | Image Captioning, Language Modeling | Code Available | 1 |
| Mere Contrastive Learning for Cross-Domain Sentiment Analysis | Aug 18, 2022 | Contrastive Learning, Sentence | Code Available | 1 |
| MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding | Mar 11, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| AREDSUM: Adaptive Redundancy-Aware Iterative Sentence Ranking for Extractive Document Summarization | Apr 13, 2020 | Diversity, Document Summarization | Code Available | 1 |
| MEXMA: Token-level objectives improve sentence representations | Sep 19, 2024 | Sentence | Code Available | 1 |
| BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos | Nov 30, 2023 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 1 |
| Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer | Oct 14, 2021 | Adversarial Attack, Backdoor Attack | Code Available | 1 |
| Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences | Sep 24, 2020 | Data Augmentation, Domain Adaptation | Code Available | 1 |
| A Structured Self-attentive Sentence Embedding | Mar 9, 2017 | Author Profiling, General Classification | Code Available | 1 |
| Mitigating Object Hallucinations via Sentence-Level Early Intervention | Jul 16, 2025 | Hallucination, MM-Vet | Code Available | 1 |
| MLBiNet: A Cross-Sentence Collective Event Detection Network | May 20, 2021 | Decoder, Event Detection | Code Available | 1 |
| Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT | Jul 9, 2021 | Benchmarking, Document Classification | Code Available | 1 |
| Modelling Context and Syntactical Features for Aspect-based Sentiment Analysis | Jul 1, 2020 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 1 |
| MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation | Jun 9, 2024 | Diversity, Sentence | Code Available | 1 |
| Proposition-Level Clustering for Multi-Document Summarization | Dec 16, 2021 | Clustering, Document Summarization | Code Available | 1 |
| BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation | May 23, 2023 | Contrastive Learning, Machine Translation | Code Available | 1 |
| MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs | Apr 18, 2021 | Abstractive Text Summarization, Machine Translation | Code Available | 1 |
| A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19 | Jun 19, 2020 | Chatbot, Language Modeling | Code Available | 1 |
| MTet: Multi-domain Translation for English and Vietnamese | Oct 11, 2022 | Machine Translation | Code Available | 1 |
| Mukayese: Turkish NLP Strikes Back | Mar 2, 2022 | Benchmarking, Language Modeling | Code Available | 1 |
| MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting | Jul 1, 2021 | Sentence, text-classification | Code Available | 1 |
| Multi-Granularity Guided Fusion-in-Decoder | Apr 3, 2024 | Decoder, Multi-Task Learning | Code Available | 1 |
| Multi-granularity Textual Adversarial Attack with Behavior Cloning | Sep 9, 2021 | Adversarial Attack, Sentence | Code Available | 1 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive Learning, Extractive Summarization | Code Available | 1 |
| Multi-Label Text Classification using Attention-based Graph Neural Network | Mar 22, 2020 | Classification, General Classification | Code Available | 1 |
| Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities | Jun 22, 2022 | Abstractive Text Summarization, Document Summarization | Code Available | 1 |
| Multilingual and code-switching ASR challenges for low resource Indian languages | Apr 1, 2021 | Automatic Speech Recognition (ASR), Sentence | Code Available | 1 |
| MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases | May 1, 2020 | Parallel Corpus Mining, Sentence | Code Available | 1 |
| Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation | Jan 18, 2024 | Sentence, speech-recognition | Code Available | 1 |
| Arabisc: Context-Sensitive Neural Spelling Checker | Dec 1, 2020 | Language Modelling, Sentence | Code Available | 1 |
| Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos | Oct 12, 2021 | Semantic correspondence, Semantic Similarity | Code Available | 1 |
| Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings | Oct 23, 2022 | Acoustic Unit Discovery, Contrastive Learning | Code Available | 1 |
| A large annotated corpus for learning natural language inference | Aug 21, 2015 | Image Captioning, Natural Language Inference | Code Available | 1 |
| A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension | May 5, 2023 | Reading Comprehension, Retrieval | Code Available | 1 |
| AutoMeTS: The Autocomplete for Medical Text Simplification | Oct 20, 2020 | Sentence, Text Simplification | Code Available | 1 |