| A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications | Mar 10, 2025 | Continual LearningMeta-Learning | CodeCode Available | 9 |
| Arcee's MergeKit: A Toolkit for Merging Large Language Models | Mar 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond | Jan 19, 2025 | Deep LearningMulti-Task Learning | CodeCode Available | 7 |
| VITA: Towards Open-Source Interactive Omni Multimodal LLM | Aug 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning | Oct 14, 2023 | Image ClassificationImage Description | CodeCode Available | 7 |
| StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning | Jun 5, 2024 | Automatic Speech Recognition (ASR)de-en | CodeCode Available | 5 |
| MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Apr 13, 2024 | DiversityLanguage Modeling | CodeCode Available | 5 |
| YOLOR-Based Multi-Task Learning | Sep 29, 2023 | Image CaptioningInstance Segmentation | CodeCode Available | 5 |
| CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models | Oct 9, 2024 | Multi-Task Learning | CodeCode Available | 4 |
| Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities | Aug 14, 2024 | Continual LearningFew-Shot Learning | CodeCode Available | 4 |
| DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks | May 7, 2024 | BinarizationDeblurring | CodeCode Available | 4 |
| InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning | Feb 9, 2024 | Data AugmentationGSM8K | CodeCode Available | 4 |
| Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks | Nov 17, 2022 | DecoderLanguage Modelling | CodeCode Available | 4 |
| Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective | Feb 2, 2025 | Multi-Task Learning | CodeCode Available | 3 |
| DARWIN 1.5: Large Language Models as Materials Science Adapted Learners | Dec 16, 2024 | Large Language ModelMulti-Task Learning | CodeCode Available | 3 |
| YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation | Jul 5, 2024 | Drum TranscriptionDrum Transcription in Music (DTM) | CodeCode Available | 3 |
| MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts | May 2, 2024 | Combinatorial OptimizationMixture-of-Experts | CodeCode Available | 3 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense ReasoningGPU | CodeCode Available | 3 |
| UCF: Uncovering Common Features for Generalizable Deepfake Detection | Apr 27, 2023 | Binary ClassificationDecoder | CodeCode Available | 3 |
| Relational Multi-Task Learning: Modeling Relations between Data and Tasks | Mar 14, 2023 | Multi-Task LearningTransfer Learning | CodeCode Available | 3 |
| Zero-shot Entity Linking with Less Data | Jul 1, 2022 | Entity LinkingMulti-Task Learning | CodeCode Available | 3 |
| PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images | Jun 2, 2022 | 3D Lane Detection3D Object Detection | CodeCode Available | 3 |
| Scientific Machine Learning through Physics-Informed Neural Networks: Where we are and What's next | Jan 14, 2022 | Multi-Task Learning | CodeCode Available | 3 |
| Language Models are Few-Shot Learners | May 28, 2020 | answerability predictionArticles | CodeCode Available | 3 |
| Ludwig: a type-based declarative deep learning toolbox | Sep 17, 2019 | DecoderDeep Learning | CodeCode Available | 3 |
| ERNIE 2.0: A Continual Pre-training Framework for Language Understanding | Jul 29, 2019 | Chinese Named Entity RecognitionChinese Reading Comprehension | CodeCode Available | 3 |
| SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning | Jun 18, 2025 | Caption GenerationDescriptive | CodeCode Available | 2 |
| Fast and Accurate Blind Flexible Docking | Feb 20, 2025 | Blind DockingComputational Efficiency | CodeCode Available | 2 |
| Joint Perception and Prediction for Autonomous Driving: A Survey | Dec 18, 2024 | Autonomous Drivingmotion prediction | CodeCode Available | 2 |
| Diffusion-based Visual Anagram as Multi-task Learning | Dec 3, 2024 | DenoisingMulti-Task Learning | CodeCode Available | 2 |
| GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding | Nov 16, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction | Oct 11, 2024 | Multi-Task Learning | CodeCode Available | 2 |
| Tissue Concepts: supervised foundation models in computational pathology | Sep 5, 2024 | DiagnosticMulti-Task Learning | CodeCode Available | 2 |
| LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorch | Sep 4, 2024 | Evolutionary AlgorithmsFairness | CodeCode Available | 2 |
| NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals | Aug 27, 2024 | EEGElectroencephalogram (EEG) | CodeCode Available | 2 |
| Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Aug 20, 2024 | Multi-agent Reinforcement LearningMulti-Task Learning | CodeCode Available | 2 |
| RouteFinder: Towards Foundation Models for Vehicle Routing Problems | Jun 21, 2024 | AttributeMulti-Task Learning | CodeCode Available | 2 |
| Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning | Jun 6, 2024 | Multi-Task LearningVulnerability Detection | CodeCode Available | 2 |
| Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention | May 10, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras | Apr 29, 2024 | Multi-Task LearningPrognosis | CodeCode Available | 2 |
| OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search | Apr 25, 2024 | Entity EmbeddingsImage Captioning | CodeCode Available | 2 |
| EGTR: Extracting Graph from Transformer for Scene Graph Generation | Apr 2, 2024 | Graph GenerationMulti-Task Learning | CodeCode Available | 2 |
| MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning | Mar 29, 2024 | Multi-Task Learningparameter-efficient fine-tuning | CodeCode Available | 2 |
| Volumetric Environment Representation for Vision-Language Navigation | Mar 21, 2024 | 3D geometryMulti-Task Learning | CodeCode Available | 2 |
| Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Mar 14, 2024 | DenoisingMixture-of-Experts | CodeCode Available | 2 |
| One Train for Two Tasks: An Encrypted Traffic Classification Framework Using Supervised Contrastive Learning | Feb 12, 2024 | ClassificationContrastive Learning | CodeCode Available | 2 |
| Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives | Feb 5, 2024 | Continual LearningMulti-Task Learning | CodeCode Available | 2 |
| LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection | Jan 24, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 2 |
| MTLoRA: Low-Rank Adaptation Approach for Efficient Multi-Task Learning | Jan 1, 2024 | Multi-Task Learningparameter-efficient fine-tuning | CodeCode Available | 2 |
| LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin | Dec 15, 2023 | Language ModellingMixture-of-Experts | CodeCode Available | 2 |