SOTAVerified

Mixture-of-Experts

Papers

Showing 1151-1200 of 1312 papers

Title | Status | Hype
MECATS: Mixture-of-Experts for Probabilistic Forecasts of Aggregated Time Series | - | 0
Continual Learning Using Task Conditional Neural Networks | - | 0
Full-Precision Free Binary Graph Neural Networks | - | 0
HydraSum - Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | - | 0
Unbiased Gradient Estimation with Balanced Assignments for Mixtures of Experts | - | 0
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference | - | 0
Scalable and Efficient MoE Training for Multitask Multilingual Models | - | 0
Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy | Code | 0
Cross-token Modeling with Conditional Computation | - | 0
Personalised Federated Learning: A Combinational Approach | - | 0
SPMoE: Generate Multiple Pattern-Aware Outputs with Sparse Pattern Mixture of Experts | - | 0
AIREX: Neural Network-based Approach for Air Quality Inference in Unmonitored Cities | - | 0
A Mixture-of-Experts Model for Antonym-Synonym Discrimination | Code | 0
Strength in Numbers: Averaging and Clustering Effects in Mixture of Experts for Graph-Based Dependency Parsing | - | 0
ExpertRank: A Multi-level Coarse-grained Expert-based Listwise Ranking Loss | - | 0
Federated Mixture of Experts | - | 0
Lifelong Mixture of Variational Autoencoders | Code | 0
AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style | - | 0
Adaptive 3D descattering with a dynamic synthesis network | Code | 0
On component interactions in two-stage recommender systems | - | 0
Mixtures of Deep Neural Experts for Automated Speech Scoring | - | 0
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | - | 0
Automatic Document Sketching: Generating Drafts from Analogous Texts | - | 0
DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning | Code | 0
AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding | - | 0
GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input | - | 0
M6-T: Exploring Sparse Expert Models and Beyond | - | 0
Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection | - | 0
Mixture of ELM based experts with trainable gating network | - | 0
Generalizable Person Re-identification with Relevance-aware Mixture of Experts | - | 0
MTNet: A Multi-Task Neural Network for On-Field Calibration of Low-Cost Air Monitoring Sensors | - | 0
KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation | - | 0
Robust Federated Learning by Mixture of Experts | Code | 0
Probabilistic Rainfall Estimation from Automotive Lidar | Code | 0
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Code | 0
Non-asymptotic model selection in block-diagonal mixture of polynomial experts models | - | 0
A non-asymptotic approach for model selection via penalization in high-dimensional mixture of experts models | Code | 0
Cross-Topic Rumor Detection using Topic-Mixtures | - | 0
Multi-GAT: A Graphical Attention-based Hierarchical Multimodal Representation Learning Approach for Human Activity Recognition | - | 0
Imitation Learning from MPC for Quadrupedal Multi-Gait Control | - | 0
An Autonomous Negotiating Agent Framework with Reinforcement Learning Based Strategies and Adaptive Strategy Switching Mechanism | - | 0
A Novel Cluster Classify Regress Model Predictive Controller Formulation; CCR-MPC | - | 0
Preferential Mixture-of-Experts: Interpretable Models that Rely on Human Expertise as much as Possible | - | 0
Federated learning using mixture of experts | - | 0
Exploring Routing Strategies for Multilingual Mixture-of-Experts Models | - | 0
Gated Ensemble of Spatio-temporal Mixture of Experts for Multi-task Learning in Ride-hailing System | - | 0
Self-Supervised Multimodal Domino: in Search of Biomarkers for Alzheimer's Disease | Code | 0
Channel Gain Cartography via Mixture of Experts | - | 0
A similarity-based Bayesian mixture-of-experts model | - | 0
A Mixture-of-Experts Model for Learning Multi-Facet Entity Embeddings | Code | 0
Page 24 of 27

No leaderboard results yet.