
Mixture-of-Experts

Papers

Showing 201-225 of 1312 papers

Title | Status | Hype
--- | --- | ---
Norface: Improving Facial Expression Analysis by Identity Normalization | Code | 1
Swin SMT: Global Sequential Modeling in 3D Medical Image Segmentation | Code | 1
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs | Code | 1
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model | Code | 1
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models | Code | 1
Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts | Code | 1
MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts | Code | 1
Towards Efficient Pareto Set Approximation via Mixture of Experts Based Model Fusion | Code | 1
DeepUnifiedMom: Unified Time-series Momentum Portfolio Construction via Multi-Task Learning with Multi-Gate Mixture of Experts | Code | 1
Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark | Code | 1
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Code | 1
Enhancing Fast Feed Forward Networks with Load Balancing and a Master Leaf Node | Code | 1
Graph Sparsification via Mixture of Graphs | Code | 1
Mixture of Experts Meets Prompt-Based Continual Learning | Code | 1
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast | Code | 1
DirectMultiStep: Direct Route Generation for Multi-Step Retrosynthesis | Code | 1
MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models | Code | 1
M^4oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts | Code | 1
EWMoE: An effective model for global weather forecasting with mixture-of-experts | Code | 1
Revisiting RGBT Tracking Benchmarks from the Perspective of Modality Validity: A New Benchmark, Problem, and Method | Code | 1
M3oE: Multi-Domain Multi-Task Mixture-of-Experts Recommendation Framework | Code | 1
Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing | Code | 1
Large Multi-modality Model Assisted AI-Generated Image Quality Assessment | Code | 1
Multi-Head Mixture-of-Experts | Code | 1
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts | Code | 1
Page 9 of 53