SOTAVerified

Mixture-of-Experts

Papers

Showing 751–800 of 1312 papers

Title | Hype
Towards Vision Mixture of Experts for Wildlife Monitoring on the Edge | 0
Training-efficient density quantum machine learning | 0
Training of Neural Networks with Uncertain Data: A Mixture of Experts Approach | 0
TrajMoE: Spatially-Aware Mixture of Experts for Unified Human Mobility Modeling | 0
Transformer Layer Injection: A Novel Approach for Efficient Upscaling of Large Language Models | 0
Tree-gated Deep Mixture-of-Experts For Pose-robust Face Alignment | 0
Trend Filtered Mixture of Experts for Automated Gating of High-Frequency Flow Cytometry Data | 0
Towards Incremental Learning in Large Language Models: A Critical Review | 0
True Zero-Shot Inference of Dynamical Systems Preserving Long-Term Statistics | 0
TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster | 0
Tuning of Mixture-of-Experts Mixed-Precision Neural Networks | 0
Turn Waste into Worth: Rectifying Top-k Router of MoE | 0
Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training | 0
Two Is Better Than One: Rotations Scale LoRAs | 0
U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF | 0
UGG-ReID: Uncertainty-Guided Graph Model for Multi-Modal Object Re-Identification | 0
Fast Deep Mixtures of Gaussian Process Experts | 0
Ultra-Sparse Memory Network | 0
UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition | 0
UMoE: Unifying Attention and FFN with Shared Experts | 0
Unbiased Gradient Estimation with Balanced Assignments for Mixtures of Experts | 0
Uncertainty-Aware Driver Trajectory Prediction at Urban Intersections | 0
Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving | 0
Understanding Expert Structures on Minimax Parameter Estimation in Contaminated Mixture of Experts | 0
UniAdapt: A Universal Adapter for Knowledge Calibration | 0
UNIALIGN: Scaling Multimodal Alignment within One Unified Model | 0
UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook | 0
UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations | 0
Unified Modeling of Multi-Domain Multi-Device ASR Systems | 0
Unify and Anchor: A Context-Aware Transformer for Cross-Domain Time Series Forecasting | 0
Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion | 0
UniPaint: Unified Space-time Video Inpainting via Mixture-of-Experts | 0
UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models | 0
UniUIR: Considering Underwater Image Restoration as An All-in-One Learner | 0
Unraveling the Localized Latents: Learning Stratified Manifold Structures in LLM Embedding Space with Sparse Mixture-of-Experts | 0
Unveiling Hidden Collaboration within Mixture-of-Experts in Large Language Models | 0
UOE: Unlearning One Expert Is Enough For Mixture-of-experts LLMs | 0
Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging | 0
Upcycling Large Language Models into Mixture of Experts | 0
Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC | 0
Utility-Driven Speculative Decoding for Mixture-of-Experts | 0
Vanilla Transformers are Transfer Capability Teachers | 0
Variational Distillation of Diffusion Policies into Mixture of Experts | 0
Variational Mixture of Gaussian Process Experts | 0
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts | 0
Visual Saliency Prediction Using a Mixture of Deep Neural Networks | 0
WDMoE: Wireless Distributed Large Language Models with Mixture of Experts | 0
WDMoE: Wireless Distributed Mixture of Experts for Large Language Models | 0
WeNet: Weighted Networks for Recurrent Network Architecture Search | 0
Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production | 0
Page 16 of 27

No leaderboard results yet.