| Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities | Dec 14, 2023 | Autonomous NavigationMulti-Task Learning | CodeCode Available | 1 |
| You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception | Dec 9, 2023 | AttributeHuman Instance Segmentation | CodeCode Available | 1 |
| Compressed Context Memory For Online Language Model Interaction | Dec 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities | Nov 30, 2023 | Audio ClassificationFew-Shot Audio Classification | CodeCode Available | 1 |
| AV-RIR: Audio-Visual Room Impulse Response Estimation | Nov 30, 2023 | Multi-Task LearningRoom Impulse Response (RIR) | CodeCode Available | 1 |
| PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution | Nov 29, 2023 | Image Super-ResolutionMulti-Task Learning | CodeCode Available | 1 |
| FedHCA^2: Towards Hetero-Client Federated Multi-Task Learning | Nov 22, 2023 | DecoderFederated Learning | CodeCode Available | 1 |
| Overcoming Data Scarcity in Biomedical Imaging with a Foundational Multi-Task Model | Nov 16, 2023 | Multi-Task Learningobject-detection | CodeCode Available | 1 |
| Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks | Nov 10, 2023 | DiversityMulti-Task Learning | CodeCode Available | 1 |
| APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation | Nov 6, 2023 | Graph LearningMulti-Task Learning | CodeCode Available | 1 |
| GaitFormer: Learning Gait Representations with Noisy Multi-Task Learning | Oct 30, 2023 | AttributeMulti-Task Learning | CodeCode Available | 1 |
| When MOE Meets LLMs: Parameter Efficient Fine-tuning for Multi-task Medical Applications | Oct 21, 2023 | Multi-Task Learningparameter-efficient fine-tuning | CodeCode Available | 1 |
| HEProto: A Hierarchical Enhancing ProtoNet based on Multi-Task Learning for Few-shot Named Entity Recognition | Oct 21, 2023 | Contrastive LearningFew-shot NER | CodeCode Available | 1 |
| LeTFuser: Light-weight End-to-end Transformer-Based Sensor Fusion for Autonomous Driving with Multi-Task Learning | Oct 19, 2023 | Autonomous DrivingImitation Learning | CodeCode Available | 1 |
| Denoising Task Routing for Diffusion Models | Oct 11, 2023 | DenoisingMulti-Task Learning | CodeCode Available | 1 |
| KoMultiText: Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services | Oct 6, 2023 | Hate Speech DetectionMulti-Task Learning | CodeCode Available | 1 |
| AdaMerging: Adaptive Model Merging for Multi-Task Learning | Oct 4, 2023 | modelMulti-Task Learning | CodeCode Available | 1 |
| Multi-task Learning with 3D-Aware Regularization | Oct 2, 2023 | Depth EstimationMulti-Task Learning | CodeCode Available | 1 |
| PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation | Sep 27, 2023 | Multi-Task LearningRobot Manipulation | CodeCode Available | 1 |
| BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation | Sep 25, 2023 | DisentanglementKeypoint Estimation | CodeCode Available | 1 |
| A Multi-Head Ensemble Multi-Task Learning Approach for Dynamical Computation Offloading | Sep 2, 2023 | Edge-computingMulti-Task Learning | CodeCode Available | 1 |
| Multi-Modal Multi-Task (3MT) Road Segmentation | Aug 23, 2023 | Multi-Task LearningRoad Segmentation | CodeCode Available | 1 |
| OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes | Aug 23, 2023 | Multi-Task LearningVisual Localization | CodeCode Available | 1 |
| Multi-Objective Optimization for Sparse Deep Multi-Task Learning | Aug 23, 2023 | Multi-Task Learning | CodeCode Available | 1 |
| Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis | Aug 16, 2023 | Image Generationmultimodal generation | CodeCode Available | 1 |
| PEvoLM: Protein Sequence Evolutionary Information Language Model | Aug 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| STEM: Unleashing the Power of Embeddings for Multi-task Recommendation | Aug 16, 2023 | Multi-Task LearningRecommendation Systems | CodeCode Available | 1 |
| FINER: Enhancing State-of-the-art Classifiers with Feature Attribution to Facilitate Security Analysis | Aug 10, 2023 | Malware AnalysisMulti-Task Learning | CodeCode Available | 1 |
| Parallel Knowledge Enhancement based Framework for Multi-behavior Recommendation | Aug 9, 2023 | Multi-Task LearningPrediction | CodeCode Available | 1 |
| Improvable Gap Balancing for Multi-Task Learning | Jul 28, 2023 | Deep Reinforcement LearningMulti-Task Learning | CodeCode Available | 1 |
| Prompt Guided Transformer for Multi-Task Dense Prediction | Jul 28, 2023 | Boundary DetectionDecoder | CodeCode Available | 1 |
| Longitudinal Data and a Semantic Similarity Reward for Chest X-Ray Report Generation | Jul 19, 2023 | DiagnosticFace Model | CodeCode Available | 1 |
| TransNuSeg: A Lightweight Multi-Task Transformer for Nuclei Segmentation | Jul 16, 2023 | Multi-Task LearningSegmentation | CodeCode Available | 1 |
| Noise-aware Speech Enhancement using Diffusion Probabilistic Model | Jul 16, 2023 | Denoisingmodel | CodeCode Available | 1 |
| Hyperspherical Embedding for Point Cloud Completion | Jul 11, 2023 | DecoderMulti-Task Learning | CodeCode Available | 1 |
| Precursor-of-Anomaly Detection for Irregular Time Series | Jun 27, 2023 | Anomaly DetectionIrregular Time Series | CodeCode Available | 1 |
| BioREx: Improving Biomedical Relation Extraction by Leveraging Heterogeneous Datasets | Jun 19, 2023 | graph constructionMulti-Task Learning | CodeCode Available | 1 |
| Multi-task Learning for Radar Signal Characterisation | Jun 19, 2023 | ClassificationManagement | CodeCode Available | 1 |
| MOFI: Learning Image Representations from Noisy Entity Annotated Images | Jun 13, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning | Jun 8, 2023 | Multi-Task Learning | CodeCode Available | 1 |
| Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes | Jun 7, 2023 | AttributeCross-Lingual Transfer | CodeCode Available | 1 |
| Learning to Relate to Previous Turns in Conversational Search | Jun 5, 2023 | Conversational SearchMulti-Task Learning | CodeCode Available | 1 |
| Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation | Jun 5, 2023 | Instance SegmentationMulti-Task Learning | CodeCode Available | 1 |
| Sonicverse: A Multisensory Simulation Platform for Embodied Household Agents that See and Hear | Jun 1, 2023 | Multi-Task LearningVisual Navigation | CodeCode Available | 1 |
| Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts | May 30, 2023 | CPUGPU | CodeCode Available | 1 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Multi-task Hierarchical Adversarial Inverse Reinforcement Learning | May 22, 2023 | Imitation LearningMulti-Task Learning | CodeCode Available | 1 |
| AdaMSS: Adaptive Multi-Modality Segmentation-to-Survival Learning for Survival Outcome Prediction from PET/CT Images | May 17, 2023 | Multi-Task LearningPrediction | CodeCode Available | 1 |
| Understanding and Bridging the Modality Gap for Speech Translation | May 15, 2023 | Machine TranslationMulti-Task Learning | CodeCode Available | 1 |
| Zenseact Open Dataset: A large-scale and diverse multimodal dataset for autonomous driving | May 3, 2023 | Autonomous DrivingDiversity | CodeCode Available | 1 |