SOTAVerified

Mixture-of-Experts

Papers

Showing 901–950 of 1312 papers

Title | Status | Hype
BadMoE: Backdooring Mixture-of-Experts LLMs via Optimizing Routing Triggers and Infecting Dormant Experts | | 0
Balanced and Elastic End-to-end Training of Dynamic LLMs | | 0
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts | | 0
Bayesian Hierarchical Mixtures of Experts | | 0
Bayesian shrinkage in mixture of experts models: Identifying robust determinants of class membership | | 0
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference | | 0
Beyond Parameter Count: Implicit Bias in Soft Mixture of Experts | | 0
Beyond Standard MoE: Mixture of Latent Experts for Resource-Efficient Language Models | | 0
Biased Mixtures Of Experts: Enabling Computer Vision Inference Under Data Transfer Limitations | | 0
BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference | | 0
BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts | | 0
BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust Multilingual E2E ASR | | 0
Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM | | 0
Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering | | 0
Brain-Like Processing Pathways Form in Models With Heterogeneous Experts | | 0
BrainNet-MoE: Brain-Inspired Mixture-of-Experts Learning for Neurological Disease Identification | | 0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | | 0
Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning | | 0
Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts | | 0
Breaking the gridlock in Mixture-of-Experts: Consistent and Efficient Algorithms | | 0
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts | | 0
Brief analysis of DeepSeek R1 and it's implications for Generative AI | | 0
Buffer Overflow in Mixture of Experts | | 0
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition | | 0
CAME: Competitively Learning a Mixture-of-Experts Model for First-stage Retrieval | | 0
Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation | | 0
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts | | 0
Changing Model Behavior at Test-Time Using Reinforcement Learning | | 0
Channel Gain Cartography via Mixture of Experts | | 0
Wonderful Matrices: More Efficient and Effective Architecture for Language Modeling Tasks | | 0
CICADA: Cross-Domain Interpretable Coding for Anomaly Detection and Adaptation in Multivariate Time Series | | 0
CLER: Cross-task Learning with Expert Representation to Generalize Reading and Understanding | | 0
ClimateLLM: Efficient Weather Forecasting via Frequency-Aware Large Language Models | | 0
CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling | | 0
CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering | | 0
CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection | | 0
CoCoAFusE: Beyond Mixtures of Experts via Model Fusion | | 0
Combinations of Adaptive Filters | | 0
Combining Parametric and Nonparametric Models for Off-Policy Evaluation | | 0
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation | | 0
Complexity Experts are Task-Discriminative Learners for Any Image Restoration | | 0
On the Adaptation to Concept Drift for CTR Prediction | | 0
Conditional computation in neural networks: principles and research trends | | 0
Configurable Foundation Models: Building LLMs from a Modular Perspective | | 0
Connector-S: A Survey of Connectors in Multi-modal Large Language Models | | 0
ConstitutionalExperts: Training a Mixture of Principle-based Prompts | | 0
Contextual Mixture of Experts: Integrating Knowledge into Predictive Modeling | | 0
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts | | 0
ContextWIN: Whittle Index Based Mixture-of-Experts Neural Model For Restless Bandits Via Deep RL | | 0
Continual Learning Using Task Conditional Neural Networks | | 0
Page 19 of 27

No leaderboard results yet.