SOTAVerified

Mixture-of-Experts

Papers

Showing 851900 of 1312 papers

TitleStatusHype
A Mixture of Expert Based Deep Neural Network for Improved ASR0
A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds0
A mixture of experts model for predicting persistent weather patterns0
A Mixture of h-1 Heads is Better than h Heads0
A Mixture of h - 1 Heads is Better than h Heads0
A Modular Task-oriented Dialogue System Using a Neural Mixture-of-Experts0
AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale0
An Audio-centric Multi-task Learning Framework for Streaming Ads Targeting on Spotify0
An Autonomous Negotiating Agent Framework with Reinforcement Learning Based Strategies and Adaptive Strategy Switching Mechanism0
An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning0
Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings0
An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement0
An efficient application of Bayesian optimization to an industrial MDO framework for aircraft design0
An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training0
An Entailment Tree Generation Approach for Multimodal Multi-Hop Question Answering with Mixture-of-Experts and Iterative Feedback Mechanism0
An Introduction to the Practical and Theoretical Aspects of Mixture-of-Experts Modeling0
Non-asymptotic oracle inequalities for the Lasso in high-dimensional mixture of experts0
Non-asymptotic model selection in block-diagonal mixture of polynomial experts models0
A Novel A.I Enhanced Reservoir Characterization with a Combined Mixture of Experts -- NVIDIA Modulus based Physics Informed Neural Operator Forward Model0
A Novel Cluster Classify Regress Model Predictive Controller Formulation; CCR-MPC0
A Novel Temporal Multi-Gate Mixture-of-Experts Approach for Vehicle Trajectory and Driving Intention Prediction0
A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training0
A Novel Trustworthy Video Summarization Algorithm Through a Mixture of LoRA Experts0
An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio0
Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts0
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts0
A Review of DeepSeek Models' Key Innovative Techniques0
A Review of Sparse Expert Models in Deep Learning0
A similarity-based Bayesian mixture-of-experts model0
A Simple Architecture for Enterprise Large Language Model Applications based on Role based security and Clearance Levels using Retrieval-Augmented Generation or Mixture of Experts0
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset0
Astrea: A MOE-based Visual Understanding Model with Progressive Alignment0
A Survey on Dynamic Neural Networks for Natural Language Processing0
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning0
A Theoretical View on Sparsely Activated Networks0
AT-MoE: Adaptive Task-planning Mixture of Experts via LoRA Approach0
A Tree Architecture of LSTM Networks for Sequential Regression with Missing Data0
Attention Weighted Mixture of Experts with Contrastive Learning for Personalized Ranking in E-commerce0
A Two-Phase Deep Learning Framework for Adaptive Time-Stepping in High-Speed Flow Modeling0
A Unified Approach to Universal Prediction: Generalized Upper and Lower Bounds0
A Unified Framework for Iris Anti-Spoofing: Introducing IrisGeneral Dataset and Masked-MoE Method0
A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System0
A Universal Approximation Theorem for Mixture of Experts Models0
Automatically Extracting Information in Medical Dialogue: Expert System And Attention for Labelling0
Automatic Document Sketching: Generating Drafts from Analogous Texts0
Automatic Expert Selection for Multi-Scenario and Multi-Task Search0
Automatic Operator-level Parallelism Planning for Distributed Deep Learning -- A Mixed-Integer Programming Approach0
類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識(Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese]0
Autonomy-of-Experts Models0
Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts0
Show:102550
← PrevPage 18 of 27Next →

No leaderboard results yet.