SOTAVerified

Mixture-of-Experts

Papers

Showing 12511300 of 1312 papers

TitleStatusHype
WeNet: Weighted Networks for Recurrent Network Architecture Search0
On the Functional Equivalence of TSK Fuzzy Systems to Neural Networks, Mixture of Experts, CART, and Stacking Ensemble Regression0
A mixture of experts model for predicting persistent weather patterns0
Affect in Tweets Using Experts Model0
Hierarchical Routing Mixture of Experts0
Tensor-variate Mixture of Experts for Proportional Myographic Control of a Robotic HandCode0
Uncertainty-Aware Driver Trajectory Prediction at Urban Intersections0
Improving Sepsis Treatment Strategies by Combining Deep and Kernel-Based Reinforcement Learning0
Bayesian shrinkage in mixture of experts models: Identifying robust determinants of class membership0
Dropout Regularization in Hierarchical Mixture of Experts0
Nesti-Net: Normal Estimation for Unstructured 3D Point Clouds using Convolutional Neural NetworksCode0
Mixture of Regression Experts in fMRI Encoding0
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning0
A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks0
Regularized Maximum Likelihood Estimation and Feature Selection in Mixtures-of-Experts Models0
Multi-Source Cross-Lingual Model Transfer: Learning What to ShareCode0
Zero-Resource Multilingual Model Transfer: Learning What to Share0
Learning Deep Mixtures of Gaussian Process Experts Using Sum-Product NetworksCode0
Multi-Source Domain Adaptation with Mixture of ExpertsCode0
Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-ExpertsCode0
MoE-SPNet: A Mixture-of-Experts Scene Parsing Network0
Modularity Matters: Learning Invariant Relational Reasoning Tasks0
Deep Mixture of Experts via Shallow EmbeddingCode0
MEGAN: Mixture of Experts of Generative Adversarial Networks for Multimodal Image Generation0
Stylistic Variation in Social Media Part-of-Speech Tagging0
Discontinuity-Sensitive Optimal Control Learning by Mixture of ExpertsCode0
Breaking the gridlock in Mixture-of-Experts: Consistent and Efficient Algorithms0
Granger-causal Attentive Mixtures of Experts: Learning Important Features with Neural NetworksCode0
Topic Compositional Neural Language Model0
Diversity-Promoting Bayesian Learning of Latent Variable Models0
Deep Gaussian Covariance Network0
Learning Gating ConvNet for Two-Stream based Methods in Action RecognitionCode0
UTS submission to Google YouTube-8M Challenge 2017Code0
An Introduction to the Practical and Theoretical Aspects of Mixture-of-Experts Modeling0
Hierarchical Deep Recurrent Architecture for Video UnderstandingCode0
Effective Approaches to Batch Parallelization for Dynamic Neural Network ArchitecturesCode0
Large-Scale YouTube-8M Video Understanding with Deep Neural Networks0
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision0
Quality Resilient Deep Neural Networks0
Embarrassingly Parallel Inference for Gaussian ProcessesCode0
Changing Model Behavior at Test-Time Using Reinforcement Learning0
Gated Multimodal Units for Information FusionCode1
Visual Saliency Prediction Using a Mixture of Deep Neural Networks0
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts LayerCode2
Robust mixture of experts modeling using the skew t distribution0
Robust mixture of experts modeling using the t distribution0
Opponent Modeling in Deep Reinforcement LearningCode0
LSTM-based Mixture-of-Experts for Knowledge-Aware Dialogues0
A Universal Approximation Theorem for Mixture of Experts Models0
Addressing Complex and Subjective Product-Related Queries with Customer Reviews0
Show:102550
← PrevPage 26 of 27Next →

No leaderboard results yet.