SOTAVerified

Mixture-of-Experts

Papers

Showing 9761000 of 1312 papers

TitleStatusHype
Direct Neural Machine Translation with Task-level Mixture of Experts models0
Discussion: Effective and Interpretable Outcome Prediction by Training Sparse Mixtures of Linear Experts0
Disguise without Disruption: Utility-Preserving Face De-Identification0
Distribution Learning for Molecular Regression0
Diverse Machine Translation with a Single Multinomial Latent Variable0
Diversified Dynamic Routing for Vision Tasks0
Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts0
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer0
Diversity-Promoting Bayesian Learning of Latent Variable Models0
Divide-and-Conquer: Cold-Start Bundle Recommendation via Mixture of Diffusion Experts0
Divide, Conquer, and Combine: Mixture of Semantic-Independent Experts for Zero-Shot Dialogue State Tracking0
Double Deep Q-Learning in Opponent Modeling0
Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework0
Double-Wing Mixture of Experts for Streaming Recommendations0
DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving0
Dropout Regularization in Hierarchical Mixture of Experts0
Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization0
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs0
DualComp: End-to-End Learning of a Unified Dual-Modality Lossless Compressor0
Duplex: A Device for Large Language Models with Mixture of Experts, Grouped Query Attention, and Continuous Batching0
Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning0
EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing0
ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition0
Edge-Aware Autoencoder Design for Real-Time Mixture-of-Experts Image Compression0
EEGMamba: Bidirectional State Space Model with Mixture of Experts for EEG Multi-task Classification0
Show:102550
← PrevPage 40 of 53Next →

No leaderboard results yet.