Mixture-of-Experts

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 901–950 of 1312 papers

Title	Date	Tasks	Status
BadMoE: Backdooring Mixture-of-Experts LLMs via Optimizing Routing Triggers and Infecting Dormant Experts	Apr 24, 2025	Backdoor AttackMixture-of-Experts	—Unverified
Balanced and Elastic End-to-end Training of Dynamic LLMs	May 20, 2025	GPUMixture-of-Experts	—Unverified
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts	Aug 15, 2024	Mixture-of-Experts	—Unverified
Bayesian Hierarchical Mixtures of Experts	Oct 19, 2012	Mixture-of-ExpertsVariational Inference	—Unverified
Bayesian shrinkage in mixture of experts models: Identifying robust determinants of class membership	Jan 12, 2019	Bayesian InferenceMixture-of-Experts	—Unverified
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference	Sep 24, 2021	Mixture-of-ExpertsSentence	—Unverified
Beyond Parameter Count: Implicit Bias in Soft Mixture of Experts	Sep 2, 2024	Mixture-of-Experts	—Unverified
Beyond Standard MoE: Mixture of Latent Experts for Resource-Efficient Language Models	Mar 29, 2025	Computational EfficiencyMixture-of-Experts	—Unverified
Biased Mixtures Of Experts: Enabling Computer Vision Inference Under Data Transfer Limitations	Aug 21, 2020	Action ClassificationImage Super-Resolution	—Unverified
BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference	Feb 24, 2025	Mixture-of-Experts	—Unverified
BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts	Mar 25, 2025	Image SegmentationMixture-of-Experts	—Unverified
BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust Multilingual E2E ASR	Jan 22, 2025	Mixture-of-Experts	—Unverified
Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM	Sep 24, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering	Jul 15, 2024	Mixture-of-ExpertsNeRF	—Unverified
Brain-Like Processing Pathways Form in Models With Heterogeneous Experts	Jun 3, 2025	FormMixture-of-Experts	—Unverified
BrainNet-MoE: Brain-Inspired Mixture-of-Experts Learning for Neurological Disease Identification	Mar 5, 2025	Mixture-of-Experts	—Unverified
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM	Mar 12, 2024	Arithmetic ReasoningCode Generation	—Unverified
Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning	Jun 7, 2025	Continual LearningFederated Learning	—Unverified
Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts	Feb 5, 2024	GPUMixture-of-Experts	—Unverified
Breaking the gridlock in Mixture-of-Experts: Consistent and Efficient Algorithms	Feb 21, 2018	Ensemble LearningMixture-of-Experts	—Unverified
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts	Nov 11, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Brief analysis of DeepSeek R1 and it's implications for Generative AI	Feb 4, 2025	GPUMixture-of-Experts	—Unverified
Buffer Overflow in Mixture of Experts	Feb 8, 2024	Mixture-of-Experts	—Unverified
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition	Dec 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
CAME: Competitively Learning a Mixture-of-Experts Model for First-stage Retrieval	Nov 6, 2023	Mixture-of-ExpertsRetrieval	—Unverified
Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation	Aug 15, 2022	DiversityGraph Generation	—Unverified
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts	Mar 7, 2025	Mixture-of-Experts	—Unverified
Changing Model Behavior at Test-Time Using Reinforcement Learning	Feb 24, 2017	BIG-bench Machine LearningMixture-of-Experts	—Unverified
Channel Gain Cartography via Mixture of Experts	Dec 8, 2020	Mixture-of-Experts	—Unverified
Wonderful Matrices: More Efficient and Effective Architecture for Language Modeling Tasks	Jul 24, 2024	Language ModelingLanguage Modelling	—Unverified
CICADA: Cross-Domain Interpretable Coding for Anomaly Detection and Adaptation in Multivariate Time Series	May 1, 2025	Anomaly DetectionMeta-Learning	—Unverified
CLER: Cross-task Learning with Expert Representation to Generalize Reading and Understanding	Nov 1, 2019	Mixture-of-ExpertsMulti-Task Learning	—Unverified
ClimateLLM: Efficient Weather Forecasting via Frequency-Aware Large Language Models	Feb 16, 2025	energy managementMixture-of-Experts	—Unverified
CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling	Feb 3, 2025	Mixture-of-Experts	—Unverified
CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering	Mar 1, 2025	Continual LearningLanguage Modeling	—Unverified
CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection	Dec 31, 2024	Anomaly DetectionAttribute	—Unverified
CoCoAFusE: Beyond Mixtures of Experts via Model Fusion	May 2, 2025	Mixture-of-ExpertsPhilosophy	—Unverified
Combinations of Adaptive Filters	Dec 22, 2021	Mixture-of-Experts	—Unverified
Combining Parametric and Nonparametric Models for Off-Policy Evaluation	May 14, 2019	Mixture-of-ExpertsOff-policy evaluation	—Unverified
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation	Apr 5, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Complexity Experts are Task-Discriminative Learners for Any Image Restoration	Nov 27, 2024	AttributeBlind All-in-One Image Restoration	—Unverified
On the Adaptation to Concept Drift for CTR Prediction	Apr 1, 2022	Click-Through Rate PredictionIncremental Learning	—Unverified
Conditional computation in neural networks: principles and research trends	Mar 12, 2024	Mixture-of-Expertsscientific discovery	—Unverified
Configurable Foundation Models: Building LLMs from a Modular Perspective	Sep 4, 2024	Computational EfficiencyMixture-of-Experts	—Unverified
Connector-S: A Survey of Connectors in Multi-modal Large Language Models	Feb 17, 2025	Mixture-of-ExpertsSurvey	—Unverified
ConstitutionalExperts: Training a Mixture of Principle-based Prompts	Mar 7, 2024	Mixture-of-Experts	—Unverified
Contextual Mixture of Experts: Integrating Knowledge into Predictive Modeling	Nov 1, 2022	Mixture-of-Experts	—Unverified
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts	Feb 29, 2020	Mixture-of-ExpertsOpenAI Gym	—Unverified
ContextWIN: Whittle Index Based Mixture-of-Experts Neural Model For Restless Bandits Via Deep RL	Oct 13, 2024	Decision MakingMixture-of-Experts	—Unverified
Continual Learning Using Task Conditional Neural Networks	Sep 29, 2021	Continual LearningMixture-of-Experts	—Unverified

Show:10 25 50

← PrevPage 19 of 27Next →

No leaderboard results yet.