SOTAVerified

Mixture-of-Experts

Papers

Showing 1001–1050 of 1312 papers

Title | Status | Hype
Double Deep Q-Learning in Opponent Modeling | - | 0
Spatial Mixture-of-Experts | Code | 1
Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production | - | 0
A Bird's-eye View of Reranking: from List Level to Page Level | Code | 0
HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization | - | 0
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts | - | 0
PAD-Net: An Efficient Framework for Dynamic Networks | Code | 1
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations | - | 0
Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC | - | 0
Safe Real-World Autonomous Driving by Learning to Predict and Plan with a Mixture of Experts | - | 0
Contextual Mixture of Experts: Integrating Knowledge into Predictive Modeling | - | 0
Prediction Sets for High-Dimensional Mixture of Experts Models | - | 0
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | - | 0
Coordination with Humans via Strategy Matching | - | 0
M^3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design | Code | 1
On the Adversarial Robustness of Mixture of Experts | - | 0
Tiny-Attention Adapter: Contexts Are More Important Than the Number of Parameters | - | 0
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation | Code | 1
Mixture of Attention Heads: Selecting Attention Heads Per Token | Code | 1
FEAMOE: Fair, Explainable and Adaptive Mixture of Experts | - | 0
Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts | Code | 1
Deep Learning Mixture-of-Experts Approach for Cytotoxic Edema Assessment in Infants and Children | - | 0
Probabilistic partition of unity networks for high-dimensional regression problems | - | 0
Table-based Fact Verification with Self-labeled Keypoint Alignment | - | 0
Parameter-varying neural ordinary differential equations with partition-of-unity networks | - | 0
Sparsity-Constrained Optimal Transport | - | 0
Mixture of experts models for multilevel data: modelling framework and approximation theory | - | 0
Tuning of Mixture-of-Experts Mixed-Precision Neural Networks | - | 0
Diversified Dynamic Routing for Vision Tasks | - | 0
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition | - | 0
Sparse Video Representation Using Steered Mixture-of-Experts With Global Motion Compensation | - | 0
A Review of Sparse Expert Models in Deep Learning | - | 0
ADMoE: Anomaly Detection with Mixture-of-Experts from Noisy Labels | - | 0
Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries | Code | 1
Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation | - | 0
A Theoretical View on Sparsely Activated Networks | - | 0
Towards Understanding Mixture of Experts in Deep Learning | Code | 1
Edge-Aware Autoencoder Design for Real-Time Mixture-of-Experts Image Compression | - | 0
Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts | Code | 1
Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing | - | 0
MoEC: Mixture of Expert Clusters | - | 0
Learning Large-scale Universal User Representation with Sparse Mixture of Experts | - | 0
No Language Left Behind: Scaling Human-Centered Machine Translation | Code | 2
DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale | Code | 4
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval | Code | 0
Scalable Neural Data Server: A Data Recommender for Transfer Learning | - | 0
Adaptive Expert Models for Personalization in Federated Learning | Code | 0
Towards Universal Sequence Representation Learning for Recommender Systems | Code | 2
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs | Code | 2
Sparse Mixture-of-Experts are Domain Generalizable Learners | Code | 1
Page 21 of 27

No leaderboard results yet.