| Extreme Classification in Log Memory using Count-Min Sketch: A Case Study of Amazon Search with 50M Products | Oct 28, 2019 | ClassificationGeneral Classification | CodeCode Available | 0 |
| Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Apr 19, 2021 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 |
| Exploring Model Consensus to Generate Translation Paraphrases | Jul 1, 2020 | DiversityMachine Translation | CodeCode Available | 0 |
| Probabilistic Rainfall Estimation from Automotive Lidar | Apr 23, 2021 | Mixture-of-Experts | CodeCode Available | 0 |
| An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language Inference | Oct 8, 2020 | Data AugmentationMixture-of-Experts | CodeCode Available | 0 |
| Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion | Oct 6, 2023 | Mixture-of-Experts | CodeCode Available | 0 |
| VoiceGRPO: Modern MoE Transformers with Group Relative Policy Optimization GRPO for AI Voice Health Care Applications on Voice Pathology Detection | Mar 5, 2025 | DiagnosticMixture-of-Experts | CodeCode Available | 0 |
| Lifelong Mixture of Variational Autoencoders | Jul 9, 2021 | Lifelong learningMixture-of-Experts | CodeCode Available | 0 |
| A multi-scale lithium-ion battery capacity prediction using mixture of experts and patch-based MLP | Mar 26, 2025 | Mixture-of-Experts | CodeCode Available | 0 |
| Expert Sample Consensus Applied to Camera Re-Localization | Aug 7, 2019 | Camera LocalizationMixture-of-Experts | CodeCode Available | 0 |
| Specializing Versatile Skill Libraries using Local Mixture of Experts | Dec 8, 2021 | Incremental LearningMixture-of-Experts | CodeCode Available | 0 |
| Adaptive Expert Models for Personalization in Federated Learning | Jun 15, 2022 | Federated LearningMixture-of-Experts | CodeCode Available | 0 |
| Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection | Apr 24, 2025 | Graph AttentionMixture-of-Experts | CodeCode Available | 0 |
| PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning | May 14, 2025 | MathMathematical Problem-Solving | CodeCode Available | 0 |
| Learning to Adapt Clinical Sequences with Residual Mixture of Experts | Apr 6, 2022 | Mixture-of-Experts | CodeCode Available | 0 |
| Multi-Source Cross-Lingual Model Transfer: Learning What to Share | Oct 8, 2018 | Cross-Lingual NERCross-Lingual Transfer | CodeCode Available | 0 |
| Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectives | Sep 1, 2023 | Mixture-of-Experts | CodeCode Available | 0 |
| Equipping Computational Pathology Systems with Artifact Processing Pipelines: A Showcase for Computation and Performance Trade-offs | Mar 12, 2024 | Airbubbles DetectionAnomaly Detection | CodeCode Available | 0 |
| Weakly-Supervised Multimodal Learning on MIMIC-CXR | Nov 15, 2024 | Data IntegrationMixture-of-Experts | CodeCode Available | 0 |
| Adaptive 3D descattering with a dynamic synthesis network | Jul 1, 2021 | DenoisingMixture-of-Experts | CodeCode Available | 0 |
| Ensemble and Mixture-of-Experts DeepONets For Operator Learning | May 20, 2024 | Mixture-of-ExpertsOperator learning | CodeCode Available | 0 |
| Learning Mixture-of-Experts for General-Purpose Black-Box Discrete Optimization | May 29, 2024 | Mixture-of-Experts | CodeCode Available | 0 |
| Learning Gating ConvNet for Two-Stream based Methods in Action Recognition | Sep 12, 2017 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Learning Deep Mixtures of Gaussian Process Experts Using Sum-Product Networks | Sep 12, 2018 | Gaussian ProcessesMixture-of-Experts | CodeCode Available | 0 |
| R^2MoE: Redundancy-Removal Mixture of Experts for Lifelong Concept Learning | Jul 17, 2025 | Mixture-of-Experts | CodeCode Available | 0 |