SOTAVerified

Multi-Task Learning

Multi-task learning aims to learn multiple different tasks simultaneously while maximizing performance on one or all of the tasks.

(Image credit: Cross-stitch Networks for Multi-task Learning)
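
The definition above can be made concrete with a toy example. The sketch below is illustrative only and is not taken from any paper listed on this page: it assumes two regression tasks that share one parameter and trains them jointly by gradient descent on a weighted sum of the per-task losses. The function name `train_two_tasks`, the toy data, and the weights `weight_a` and `weight_b` are all hypothetical.

```python
# Minimal multi-task sketch (assumed toy setup, pure Python).
# A shared parameter w feeds two task-specific heads a and b:
#   task A prediction: a * (w * x), target  2 * x
#   task B prediction: b * (w * x), target -3 * x

def train_two_tasks(steps=2000, lr=0.01, weight_a=0.5, weight_b=0.5):
    xs = [0.5, 1.0, 1.5, 2.0]
    targets_a = [2.0 * x for x in xs]
    targets_b = [-3.0 * x for x in xs]
    n = len(xs)

    w, a, b = 1.0, 0.5, 0.5  # shared parameter, head A, head B

    for _ in range(steps):
        grad_w = grad_a = grad_b = 0.0
        for x, ta, tb in zip(xs, targets_a, targets_b):
            h = w * x                 # shared representation
            err_a = a * h - ta        # task A residual
            err_b = b * h - tb        # task B residual
            # Gradients of the weighted sum of squared errors.
            grad_a += weight_a * 2.0 * err_a * h
            grad_b += weight_b * 2.0 * err_b * h
            # The shared parameter receives gradient from both tasks.
            grad_w += (weight_a * 2.0 * err_a * a + weight_b * 2.0 * err_b * b) * x
        w -= lr * grad_w / n
        a -= lr * grad_a / n
        b -= lr * grad_b / n

    # Report the unweighted mean squared error of each task.
    loss_a = sum((a * w * x - t) ** 2 for x, t in zip(xs, targets_a)) / n
    loss_b = sum((b * w * x - t) ** 2 for x, t in zip(xs, targets_b)) / n
    return loss_a, loss_b


if __name__ == "__main__":
    print(train_two_tasks())
```

Raising `weight_a` relative to `weight_b` shifts the shared update toward task A, which is one simple way to favor performance on a single task rather than on all of them.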

Papers

Showing 151-200 of 3687 papers

Title | Status | Hype
Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities | Code | 1
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception | Code | 1
Compressed Context Memory For Online Language Model Interaction | Code | 1
Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities | Code | 1
AV-RIR: Audio-Visual Room Impulse Response Estimation | Code | 1
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution | Code | 1
FedHCA^2: Towards Hetero-Client Federated Multi-Task Learning | Code | 1
Overcoming Data Scarcity in Biomedical Imaging with a Foundational Multi-Task Model | Code | 1
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks | Code | 1
APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation | Code | 1
GaitFormer: Learning Gait Representations with Noisy Multi-Task Learning | Code | 1
When MOE Meets LLMs: Parameter Efficient Fine-tuning for Multi-task Medical Applications | Code | 1
HEProto: A Hierarchical Enhancing ProtoNet based on Multi-Task Learning for Few-shot Named Entity Recognition | Code | 1
LeTFuser: Light-weight End-to-end Transformer-Based Sensor Fusion for Autonomous Driving with Multi-Task Learning | Code | 1
Denoising Task Routing for Diffusion Models | Code | 1
KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services | Code | 1
AdaMerging: Adaptive Model Merging for Multi-Task Learning | Code | 1
Multi-task Learning with 3D-Aware Regularization | Code | 1
PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation | Code | 1
BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation | Code | 1
A Multi-Head Ensemble Multi-Task Learning Approach for Dynamical Computation Offloading | Code | 1
Multi-Modal Multi-Task (3MT) Road Segmentation | Code | 1
OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes | Code | 1
Multi-Objective Optimization for Sparse Deep Multi-Task Learning | Code | 1
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis | Code | 1
PEvoLM: Protein Sequence Evolutionary Information Language Model | Code | 1
STEM: Unleashing the Power of Embeddings for Multi-task Recommendation | Code | 1
FINER: Enhancing State-of-the-art Classifiers with Feature Attribution to Facilitate Security Analysis | Code | 1
Parallel Knowledge Enhancement based Framework for Multi-behavior Recommendation | Code | 1
Improvable Gap Balancing for Multi-Task Learning | Code | 1
Prompt Guided Transformer for Multi-Task Dense Prediction | Code | 1
Longitudinal Data and a Semantic Similarity Reward for Chest X-Ray Report Generation | Code | 1
TransNuSeg: A Lightweight Multi-Task Transformer for Nuclei Segmentation | Code | 1
Noise-aware Speech Enhancement using Diffusion Probabilistic Model | Code | 1
Hyperspherical Embedding for Point Cloud Completion | Code | 1
Precursor-of-Anomaly Detection for Irregular Time Series | Code | 1
BioREx: Improving Biomedical Relation Extraction by Leveraging Heterogeneous Datasets | Code | 1
Multi-task Learning for Radar Signal Characterisation | Code | 1
MOFI: Learning Image Representations from Noisy Entity Annotated Images | Code | 1
Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning | Code | 1
Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes | Code | 1
Learning to Relate to Previous Turns in Conversational Search | Code | 1
Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation | Code | 1
Sonicverse: A Multisensory Simulation Platform for Embodied Household Agents that See and Hear | Code | 1
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts | Code | 1
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | Code | 1
Multi-task Hierarchical Adversarial Inverse Reinforcement Learning | Code | 1
AdaMSS: Adaptive Multi-Modality Segmentation-to-Survival Learning for Survival Outcome Prediction from PET/CT Images | Code | 1
Understanding and Bridging the Modality Gap for Speech Translation | Code | 1
Zenseact Open Dataset: A large-scale and diverse multimodal dataset for autonomous driving | Code | 1
Page 4 of 74

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PCGrad | ∆m% | 125.7 | - | Unverified
2 | CAGrad | ∆m% | 112.8 | - | Unverified
3 | IMTL-G | ∆m% | 77.2 | - | Unverified
4 | Nash-MTL | ∆m% | 62 | - | Unverified
5 | BayesAgg-MTL | ∆m% | 53.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SwinMTL | mIoU | 76.41 | - | Unverified
2 | Nash-MTL | mIoU | 75.41 | - | Unverified
3 | MultiObjectiveOptimization | mIoU | 66.63 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SwinMTL | Mean IoU | 58.14 | - | Unverified
2 | Nash-MTL | Mean IoU | 40.13 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Gumbel-Matrix Routing | Average Accuracy | 93.52 | - | Unverified
2 | Mixture-of-Experts | Average Accuracy | 92.19 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MGDA-UB | Error | 8.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BayesAgg-MTL | delta_m | -2.23 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LETR | FH | 83.3 | - | Unverified