| A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications | Mar 10, 2025 | Continual Learning, Meta-Learning | Code Available | 9 |
| Arcee's MergeKit: A Toolkit for Merging Large Language Models | Mar 20, 2024 | Language Modeling, Language Modelling | Code Available | 9 |
| Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond | Jan 19, 2025 | Deep Learning, Multi-Task Learning | Code Available | 7 |
| VITA: Towards Open-Source Interactive Omni Multimodal LLM | Aug 9, 2024 | Language Modeling, Language Modelling | Code Available | 7 |
| MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning | Oct 14, 2023 | Image Classification, Image Description | Code Available | 7 |
| MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Apr 13, 2024 | Diversity, Language Modeling | Code Available | 5 |
| StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning | Jun 5, 2024 | Automatic Speech Recognition (ASR), de-en | Code Available | 5 |
| YOLOR-Based Multi-Task Learning | Sep 29, 2023 | Image Captioning, Instance Segmentation | Code Available | 5 |
| Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities | Aug 14, 2024 | Continual Learning, Few-Shot Learning | Code Available | 4 |
| CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models | Oct 9, 2024 | Multi-Task Learning | Code Available | 4 |
| Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks | Nov 17, 2022 | Decoder, Language Modelling | Code Available | 4 |
| InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning | Feb 9, 2024 | Data Augmentation, GSM8K | Code Available | 4 |
| DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks | May 7, 2024 | Binarization, Deblurring | Code Available | 4 |
| MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts | May 2, 2024 | Combinatorial Optimization, Mixture-of-Experts | Code Available | 3 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense Reasoning, GPU | Code Available | 3 |
| Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective | Feb 2, 2025 | Multi-Task Learning | Code Available | 3 |
| Ludwig: a type-based declarative deep learning toolbox | Sep 17, 2019 | Decoder, Deep Learning | Code Available | 3 |
| ERNIE 2.0: A Continual Pre-training Framework for Language Understanding | Jul 29, 2019 | Chinese Named Entity Recognition, Chinese Reading Comprehension | Code Available | 3 |
| YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation | Jul 5, 2024 | Drum Transcription, Drum Transcription in Music (DTM) | Code Available | 3 |
| Language Models are Few-Shot Learners | May 28, 2020 | answerability prediction, Articles | Code Available | 3 |
| Zero-shot Entity Linking with Less Data | Jul 1, 2022 | Entity Linking, Multi-Task Learning | Code Available | 3 |
| Scientific Machine Learning through Physics-Informed Neural Networks: Where we are and What's next | Jan 14, 2022 | Multi-Task Learning | Code Available | 3 |
| PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images | Jun 2, 2022 | 3D Lane Detection, 3D Object Detection | Code Available | 3 |
| DARWIN 1.5: Large Language Models as Materials Science Adapted Learners | Dec 16, 2024 | Large Language Model, Multi-Task Learning | Code Available | 3 |
| Relational Multi-Task Learning: Modeling Relations between Data and Tasks | Mar 14, 2023 | Multi-Task Learning, Transfer Learning | Code Available | 3 |
| UCF: Uncovering Common Features for Generalizable Deepfake Detection | Apr 27, 2023 | Binary Classification, Decoder | Code Available | 3 |
| Multi-Task Learning as Multi-Objective Optimization | Oct 10, 2018 | Depth Estimation, General Classification | Code Available | 2 |
| MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning | Mar 29, 2024 | Multi-Task Learning, parameter-efficient fine-tuning | Code Available | 2 |
| MTLoRA: Low-Rank Adaptation Approach for Efficient Multi-Task Learning | Jan 1, 2024 | Multi-Task Learning, parameter-efficient fine-tuning | Code Available | 2 |
| Multi-Task Learning as a Bargaining Game | Feb 2, 2022 | Multi-Task Learning | Code Available | 2 |
| Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention | May 10, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning | Nov 4, 2023 | Multi-Task Learning | Code Available | 2 |
| MAGVIT: Masked Generative Video Transformer | Dec 10, 2022 | Multi-Task Learning, Text-to-Video Generation | Code Available | 2 |
| Measuring Massive Multitask Language Understanding | Sep 7, 2020 | Elementary Mathematics, Multi-task Language Understanding | Code Available | 2 |
| Modeling the Sequential Dependence among Audience Multi-step Conversions with Multi-task Learning in Targeted Display Advertising | May 18, 2021 | Multi-Task Learning | Code Available | 2 |
| NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals | Aug 27, 2024 | EEG, Electroencephalogram (EEG) | Code Available | 2 |
| LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection | Jan 24, 2024 | DeepFake Detection, Face Swapping | Code Available | 2 |
| In-BoXBART: Get Instructions into Biomedical Multi-Task Learning | Apr 15, 2022 | Few-Shot Learning, Multi-Task Learning | Code Available | 2 |
| InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding | Jun 8, 2023 | Decoder, Multi-Task Learning | Code Available | 2 |
| Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Aug 20, 2024 | Multi-agent Reinforcement Learning, Multi-Task Learning | Code Available | 2 |
| A Multi-task Learning Model for Chinese-oriented Aspect Polarity Classification and Aspect Term Extraction | Dec 17, 2019 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 2 |
| GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding | Nov 16, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding | Nov 4, 2020 | Multi-Task Learning, Scene Understanding | Code Available | 2 |
| LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Oct 7, 2023 | Audio captioning, Automatic Speech Recognition | Code Available | 2 |
| Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation | Apr 3, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Joint Perception and Prediction for Autonomous Driving: A Survey | Dec 18, 2024 | Autonomous Driving, motion prediction | Code Available | 2 |
| EGTR: Extracting Graph from Transformer for Scene Graph Generation | Apr 2, 2024 | Graph Generation, Multi-Task Learning | Code Available | 2 |
| ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning | Nov 22, 2021 | Denoising, Multi-Task Learning | Code Available | 2 |
| Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning | Oct 24, 2019 | Meta-Learning, Meta Reinforcement Learning | Code Available | 2 |
| Diffusion-based Visual Anagram as Multi-task Learning | Dec 3, 2024 | Denoising, Multi-Task Learning | Code Available | 2 |