| Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers | Jan 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding | Jun 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Dealing with Typos for BERT-based Passage Retrieval and Ranking | Aug 27, 2021 | Information RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset | Mar 26, 2024 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 | 5 |
| Decoding Speculative Decoding | Feb 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Memory-Based Model Editing at Scale | Jun 13, 2022 | counterfactualDialogue Generation | CodeCode Available | 1 | 5 |
| MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER | Aug 31, 2021 | Cross-Lingual NERData Augmentation | CodeCode Available | 1 | 5 |
| Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks | May 1, 2022 | Joint Entity and Relation ExtractionLanguage Modeling | CodeCode Available | 1 | 5 |
| MemCap: Memorizing Style Knowledge for Image Captioning | Apr 3, 2020 | Image CaptioningLanguage Modeling | CodeCode Available | 1 | 5 |
| The birth of Romanian BERT | Sep 18, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Apr 22, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Oct 22, 2023 | DiagnosticLanguage Modeling | CodeCode Available | 1 | 5 |
| A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | May 22, 2025 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 1 | 5 |
| CycleFormer : TSP Solver Based on Language Modeling | May 30, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Feb 9, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| The Curious Case of Neural Text Degeneration | Apr 22, 2019 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Data-to-Text Generation with Iterative Text Editing | Nov 3, 2020 | Data-to-Text GenerationDomain Adaptation | CodeCode Available | 1 | 5 |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Jan 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Camoscio: an Italian Instruction-tuned LLaMA | Jul 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Data Efficient Masked Language Modeling for Vision and Language | Sep 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledge | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Keep CALM and Explore: Language Models for Action Generation in Text-based Games | Oct 6, 2020 | Action GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework | Aug 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data AugmentationDecoder | CodeCode Available | 1 | 5 |
| EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROAD | May 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation | May 20, 2022 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Jul 1, 2024 | AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0)Fact Checking | CodeCode Available | 1 | 5 |
| MemeSem:A Multi-modal Framework for Sentimental Analysis of Meme via Transfer Learning | Jun 12, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Meta-Adapter: An Online Few-shot Learner for Vision-Language Model | Nov 7, 2023 | Few-Shot Learningimage-classification | CodeCode Available | 1 | 5 |
| MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model | Oct 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysisdocument understanding | CodeCode Available | 1 | 5 |
| Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes | Sep 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MaxUp: A Simple Way to Improve Generalization of Neural Network Training | Feb 20, 2020 | Few-Shot Image ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| Maybe Deep Neural Networks are the Best Choice for Modeling Source Code | Mar 13, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Calibrating LLM-Based Evaluator | Sep 23, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8kLanguage Modeling | CodeCode Available | 0 | 5 |
| Agentic Society: Merging skeleton from real world and texture from Large Language Model | Sep 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| CAiRE: An Empathetic Neural Chatbot | Jul 28, 2019 | ChatbotEmpathetic Response Generation | CodeCode Available | 0 | 5 |
| Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research | Feb 7, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 | 5 |
| Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning | May 10, 2021 | Image Paragraph CaptioningLanguage Modeling | CodeCode Available | 0 | 5 |
| MDAPT: Multilingual Domain Adaptive Pretraining in a Single Model | Sep 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain | Jul 16, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 0 | 5 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | DecoderInformation Retrieval | CodeCode Available | 0 | 5 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked Language Models are Good Heterogeneous Graph Generalizers | Jun 6, 2025 | Graph LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| MarSan at SemEval-2022 Task 11: Multilingual complex named entity recognition using T5 and transformer encoder | Jul 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Masked Generative Story Transformer with Character Guidance and Caption Augmentation | Mar 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |