Mixture-of-Experts

Papers

Showing 1251–1300 of 1312 papers

Title	Date	Tasks	Status
Extreme Classification in Log Memory using Count-Min Sketch: A Case Study of Amazon Search with 50M Products	Oct 28, 2019	Classification, General Classification	Code Available
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning	Apr 19, 2021	Deep Reinforcement Learning, Mixture-of-Experts	Code Available
Exploring Model Consensus to Generate Translation Paraphrases	Jul 1, 2020	Diversity, Machine Translation	Code Available
Probabilistic Rainfall Estimation from Automotive Lidar	Apr 23, 2021	Mixture-of-Experts	Code Available
An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language Inference	Oct 8, 2020	Data Augmentation, Mixture-of-Experts	Code Available
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion	Oct 6, 2023	Mixture-of-Experts	Code Available
VoiceGRPO: Modern MoE Transformers with Group Relative Policy Optimization GRPO for AI Voice Health Care Applications on Voice Pathology Detection	Mar 5, 2025	Diagnostic, Mixture-of-Experts	Code Available
Lifelong Mixture of Variational Autoencoders	Jul 9, 2021	Lifelong learning, Mixture-of-Experts	Code Available
A multi-scale lithium-ion battery capacity prediction using mixture of experts and patch-based MLP	Mar 26, 2025	Mixture-of-Experts	Code Available
Expert Sample Consensus Applied to Camera Re-Localization	Aug 7, 2019	Camera Localization, Mixture-of-Experts	Code Available
Specializing Versatile Skill Libraries using Local Mixture of Experts	Dec 8, 2021	Incremental Learning, Mixture-of-Experts	Code Available
Adaptive Expert Models for Personalization in Federated Learning	Jun 15, 2022	Federated Learning, Mixture-of-Experts	Code Available
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection	Apr 24, 2025	Graph Attention, Mixture-of-Experts	Code Available
PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning	May 14, 2025	Math, Mathematical Problem-Solving	Code Available
Learning to Adapt Clinical Sequences with Residual Mixture of Experts	Apr 6, 2022	Mixture-of-Experts	Code Available
Multi-Source Cross-Lingual Model Transfer: Learning What to Share	Oct 8, 2018	Cross-Lingual NER, Cross-Lingual Transfer	Code Available
Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectives	Sep 1, 2023	Mixture-of-Experts	Code Available
Equipping Computational Pathology Systems with Artifact Processing Pipelines: A Showcase for Computation and Performance Trade-offs	Mar 12, 2024	Airbubbles Detection, Anomaly Detection	Code Available
Weakly-Supervised Multimodal Learning on MIMIC-CXR	Nov 15, 2024	Data Integration, Mixture-of-Experts	Code Available
Adaptive 3D descattering with a dynamic synthesis network	Jul 1, 2021	Denoising, Mixture-of-Experts	Code Available
Ensemble and Mixture-of-Experts DeepONets For Operator Learning	May 20, 2024	Mixture-of-Experts, Operator learning	Code Available
Learning Mixture-of-Experts for General-Purpose Black-Box Discrete Optimization	May 29, 2024	Mixture-of-Experts	Code Available
Learning Gating ConvNet for Two-Stream based Methods in Action Recognition	Sep 12, 2017	Action Classification, Action Recognition	Code Available
Learning Deep Mixtures of Gaussian Process Experts Using Sum-Product Networks	Sep 12, 2018	Gaussian Processes, Mixture-of-Experts	Code Available
R^2MoE: Redundancy-Removal Mixture of Experts for Lifelong Concept Learning	Jul 17, 2025	Mixture-of-Experts	Code Available
Learning CHARME models with neural networks	Feb 8, 2020	Learning Theory, Mixture-of-Experts	Code Available
A Multi-Modal Deep Learning Framework for Pan-Cancer Prognosis	Jan 13, 2025	Deep Learning, Mixture-of-Experts	Code Available
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths	May 29, 2023	Image Generation, Mixture-of-Experts	Code Available
Embarrassingly Parallel Inference for Gaussian Processes	Feb 27, 2017	Gaussian Processes, Mixture-of-Experts	Code Available
Learning a Mixture of Granularity-Specific Experts for Fine-Grained Categorization	Oct 1, 2019	Diversity, Fine-Grained Image Classification	Code Available
Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation	Dec 16, 2024	Adversarial Robustness, Mixture-of-Experts	Code Available
Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-Experts	Jun 26, 2025	Mixture-of-Experts	Code Available
Elucidating Robust Learning with Uncertainty-Aware Corruption Pattern Estimation	Nov 2, 2021	Mixture-of-Experts	Code Available
STAMImputer: Spatio-Temporal Attention MoE for Traffic Data Imputation	Jun 9, 2025	Graph Attention, Imputation	Code Available
CompeteSMoE - Effective Training of Sparse Mixture of Experts via Competition	Feb 4, 2024	Mixture-of-Experts	Code Available
CoLA: Collaborative Low-Rank Adaptation	May 21, 2025	CoLA, Mixture-of-Experts	Code Available
What You Have is What You Track: Adaptive and Robust Multimodal Tracking	Jul 8, 2025	Mixture-of-Experts, Visual Tracking	Code Available
Beyond Sharing: Conflict-Aware Multivariate Time Series Anomaly Detection	Aug 17, 2023	Anomaly Detection, Mixture-of-Experts	Code Available
k-Winners-Take-All Ensemble Neural Network	Jan 4, 2024	All, Mixture-of-Experts	Code Available
Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity	May 3, 2023	Machine Translation, Mixture-of-Experts	Code Available
Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline	Feb 9, 2025	CPU, GPU	Code Available
Jamba: A Hybrid Transformer-Mamba Language Model	Mar 28, 2024	GPU, Language Modeling	Code Available
A Mixture of Experts Approach to 3D Human Motion Prediction	May 9, 2024	Human motion prediction, Mixture-of-Experts	Code Available
Understanding the Performance and Estimating the Cost of LLM Fine-Tuning	Aug 8, 2024	GPU, Mixture-of-Experts	Code Available
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration	Mar 10, 2025	Mixture-of-Experts	Code Available
Restoring Spatially-Heterogeneous Distortions using Mixture of Experts Network	Sep 30, 2020	Mixture-of-Experts, Multi-Task Learning	Code Available
Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate	May 26, 2025	Imputation, Mixture-of-Experts	Code Available
Intrinsic User-Centric Interpretability through Global Mixture of Experts	Feb 5, 2024	Mixture-of-Experts, News Classification	Code Available
Integrating Multi-view Analysis: Multi-view Mixture-of-Expert for Textual Personality Detection	Aug 16, 2024	Mixture-of-Experts	Code Available
Revisiting Hate Speech Benchmarks: From Data Curation to System Deployment	Jun 1, 2023	Benchmarking, Hate Speech Detection	Code Available
