Mixture-of-Experts

Papers


Showing 126–150 of 1312 papers

Title	Date	Tasks	Status	Hype	Score
CNMBERT: A Model for Converting Hanyu Pinyin Abbreviations to Chinese Characters	Nov 18, 2024	Fill Mask	Code Available	2	5
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling	Sep 28, 2024	Image Classification	Code Available	2	5
MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving	Sep 11, 2024	Autonomous Driving, Feature Engineering	Code Available	2	5
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More	Oct 8, 2024	Mixture-of-Experts, Quantization	Code Available	2	5
MDFEND: Multi-domain Fake News Detection	Jan 4, 2022	Fake News Detection, Mixture-of-Experts	Code Available	2	5
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models	Nov 1, 2024	Mixture-of-Experts	Code Available	2	5
Few-Shot and Continual Learning with Attentive Independent Mechanisms	Jul 29, 2021	Continual Learning, Few-Shot Learning	Code Available	1	5
FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing	Dec 22, 2023	Mixture-of-Experts, Motion Generation	Code Available	1	5
M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis	Jul 24, 2024	Mixture-of-Experts, Multiple Instance Learning	Code Available	1	5
M^4oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts	May 15, 2024	Image Segmentation, Mixture-of-Experts	Code Available	1	5
M3-Jepa: Multimodal Alignment via Multi-directional MoE based on the JEPA framework	Sep 9, 2024	Computational Efficiency, Cross-Modal Retrieval	Code Available	1	5
M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework	Apr 29, 2024	AutoML, Mixture-of-Experts	Code Available	1	5
Specialized federated learning using a mixture of experts	Oct 5, 2020	Federated Learning, Mixture-of-Experts	Code Available	1	5
FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models	May 26, 2025	Mixture-of-Experts	Code Available	1	5
M^3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design	Oct 26, 2022	Mixture-of-Experts, Multi-Task Learning	Code Available	1	5
Making Neural Networks Interpretable with Attribution: Application to Implicit Signals Prediction	Aug 26, 2020	Interpretable Machine Learning, Mixture-of-Experts	Code Available	1	5
Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference	Jan 16, 2024	GPU, Mixture-of-Experts	Code Available	1	5
LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset	Oct 21, 2024	Image Dehazing, Mamba	Code Available	1	5
LOLA -- An Open-Source Massively Multilingual Large Language Model	Sep 17, 2024	Diversity, Language Modeling	Code Available	1	5
AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality	Oct 14, 2024	Mixture-of-Experts, parameter-efficient fine-tuning	Code Available	1	5
A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction	Sep 26, 2024	Mixture-of-Experts, Prediction	Code Available	1	5
LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models	Sep 25, 2023	GPU, Mixture-of-Experts	Code Available	1	5
Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation	Apr 3, 2023	Mixture-of-Experts, Transfer Learning	Code Available	1	5
EWMoE: An effective model for global weather forecasting with mixture-of-experts	May 9, 2024	Mixture-of-Experts, Weather Forecasting	Code Available	1	5
Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark	Jun 12, 2024	Benchmarking, Mixture-of-Experts	Code Available	1	5

