SOTAVerified

Mixture-of-Experts

Papers

Showing 12511300 of 1312 papers

TitleStatusHype
Extreme Classification in Log Memory using Count-Min Sketch: A Case Study of Amazon Search with 50M ProductsCode0
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement LearningCode0
Exploring Model Consensus to Generate Translation ParaphrasesCode0
Probabilistic Rainfall Estimation from Automotive LidarCode0
An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language InferenceCode0
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts ConversionCode0
VoiceGRPO: Modern MoE Transformers with Group Relative Policy Optimization GRPO for AI Voice Health Care Applications on Voice Pathology DetectionCode0
Lifelong Mixture of Variational AutoencodersCode0
A multi-scale lithium-ion battery capacity prediction using mixture of experts and patch-based MLPCode0
Expert Sample Consensus Applied to Camera Re-LocalizationCode0
Specializing Versatile Skill Libraries using Local Mixture of ExpertsCode0
Adaptive Expert Models for Personalization in Federated LearningCode0
Unveiling the Hidden: Movie Genre and User Bias in Spoiler DetectionCode0
PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt TuningCode0
Learning to Adapt Clinical Sequences with Residual Mixture of ExpertsCode0
Multi-Source Cross-Lingual Model Transfer: Learning What to ShareCode0
Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectivesCode0
Equipping Computational Pathology Systems with Artifact Processing Pipelines: A Showcase for Computation and Performance Trade-offsCode0
Weakly-Supervised Multimodal Learning on MIMIC-CXRCode0
Adaptive 3D descattering with a dynamic synthesis networkCode0
Ensemble and Mixture-of-Experts DeepONets For Operator LearningCode0
Learning Mixture-of-Experts for General-Purpose Black-Box Discrete OptimizationCode0
Learning Gating ConvNet for Two-Stream based Methods in Action RecognitionCode0
Learning Deep Mixtures of Gaussian Process Experts Using Sum-Product NetworksCode0
R^2MoE: Redundancy-Removal Mixture of Experts for Lifelong Concept LearningCode0
Learning CHARME models with neural networksCode0
A Multi-Modal Deep Learning Framework for Pan-Cancer PrognosisCode0
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion PathsCode0
Embarrassingly Parallel Inference for Gaussian ProcessesCode0
Learning a Mixture of Granularity-Specific Experts for Fine-Grained CategorizationCode0
Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic SegmentationCode0
Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-ExpertsCode0
Elucidating Robust Learning with Uncertainty-Aware Corruption Pattern EstimationCode0
STAMImputer: Spatio-Temporal Attention MoE for Traffic Data ImputationCode0
CompeteSMoE - Effective Training of Sparse Mixture of Experts via CompetitionCode0
CoLA: Collaborative Low-Rank AdaptationCode0
What You Have is What You Track: Adaptive and Robust Multimodal TrackingCode0
Beyond Sharing: Conflict-Aware Multivariate Time Series Anomaly DetectionCode0
k-Winners-Take-All Ensemble Neural NetworkCode0
Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic CapacityCode0
Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch PipelineCode0
Jamba: A Hybrid Transformer-Mamba Language ModelCode0
A Mixture of Experts Approach to 3D Human Motion PredictionCode0
Understanding the Performance and Estimating the Cost of LLM Fine-TuningCode0
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual RestorationCode0
Restoring Spatially-Heterogeneous Distortions using Mixture of Experts NetworkCode0
Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided GateCode0
Intrinsic User-Centric Interpretability through Global Mixture of ExpertsCode0
Integrating Multi-view Analysis: Multi-view Mixture-of-Expert for Textual Personality DetectionCode0
Revisiting Hate Speech Benchmarks: From Data Curation to System DeploymentCode0
Show:102550
← PrevPage 26 of 27Next →

No leaderboard results yet.