| Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts? | Jan 25, 2024 | Mixture-of-Expertsparameter estimation | —Unverified | 0 |
| M^3TN: Multi-gate Mixture-of-Experts based Multi-valued Treatment Network for Uplift Modeling | Jan 24, 2024 | Mixture-of-Experts | —Unverified | 0 |
| Towards A Better Metric for Text-to-Video Generation | Jan 15, 2024 | Mixture-of-ExpertsText-to-Video Generation | —Unverified | 0 |
| Prompt-based mental health screening from social media text | Jan 11, 2024 | Mixture-of-Experts | —Unverified | 0 |
| Robust Calibration For Improved Weather Prediction Under Distributional Shift | Jan 8, 2024 | Data AugmentationMixture-of-Experts | —Unverified | 0 |
| Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models | Jan 6, 2024 | Instruction FollowingMixture-of-Experts | —Unverified | 0 |
| Subjective and Objective Analysis of Indian Social Media Video Quality | Jan 5, 2024 | Mixture-of-ExpertsVisual Question Answering (VQA) | CodeCode Available | 0 |
| k-Winners-Take-All Ensemble Neural Network | Jan 4, 2024 | AllMixture-of-Experts | CodeCode Available | 0 |
| Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation | Dec 27, 2023 | Image RestorationMixture-of-Experts | —Unverified | 0 |
| Agent4Ranking: Semantic Robust Ranking via Personalized Query Rewriting Using Multi-agent LLM | Dec 24, 2023 | Mixture-of-Experts | —Unverified | 0 |
| Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning | Dec 19, 2023 | DiversityInstruction Following | —Unverified | 0 |
| Generator Assisted Mixture of Experts For Feature Acquisition in Batch | Dec 19, 2023 | Mixture-of-Experts | —Unverified | 0 |
| From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape | Dec 18, 2023 | Mixture-of-Experts | —Unverified | 0 |
| Online Action Recognition for Human Risk Prediction with Anticipated Haptic Alert via Wearables | Dec 14, 2023 | Action RecognitionMixture-of-Experts | CodeCode Available | 0 |
| Training of Neural Networks with Uncertain Data: A Mixture of Experts Approach | Dec 13, 2023 | Autonomous DrivingMixture-of-Experts | —Unverified | 0 |
| MoE-AMC: Enhancing Automatic Modulation Classification Performance Using Mixture-of-Experts | Dec 4, 2023 | ClassificationMixture-of-Experts | —Unverified | 0 |
| MoEC: Mixture of Experts Implicit Neural Compression | Dec 3, 2023 | Data CompressionMixture-of-Experts | —Unverified | 0 |
| Language-driven All-in-one Adverse Weather Removal | Dec 3, 2023 | AllDiversity | —Unverified | 0 |
| Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts | Dec 1, 2023 | Chart Question AnsweringDocument AI | —Unverified | 0 |
| HOMOE: A Memory-Based and Composition-Aware Framework for Zero-Shot Learning with Hopfield Network and Soft Mixture of Experts | Nov 23, 2023 | Compositional Zero-Shot LearningMixture-of-Experts | —Unverified | 0 |
| Efficient Model Agnostic Approach for Implicit Neural Representation Based Arbitrary-Scale Image Super-Resolution | Nov 20, 2023 | Computational EfficiencyDecoder | —Unverified | 0 |
| Memory Augmented Language Models through Mixture of Word Experts | Nov 15, 2023 | Mixture-of-Experts | —Unverified | 0 |
| Intentional Biases in LLM Responses | Nov 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CAME: Competitively Learning a Mixture-of-Experts Model for First-stage Retrieval | Nov 6, 2023 | Mixture-of-ExpertsRetrieval | —Unverified | 0 |
| Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE | Nov 5, 2023 | DecoderMixture-of-Experts | CodeCode Available | 0 |
| Mixture-of-Experts for Open Set Domain Adaptation: A Dual-Space Detection Approach | Nov 1, 2023 | Domain AdaptationMixture-of-Experts | —Unverified | 0 |
| A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts | Oct 22, 2023 | Density EstimationMixture-of-Experts | —Unverified | 0 |
| Manifold-Preserving Transformers are Effective for Short-Long Range Encoding | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Direct Neural Machine Translation with Task-level Mixture of Experts models | Oct 18, 2023 | Direct NMTLarge Language Model | —Unverified | 0 |
| Multi-view Contrastive Learning for Entity Typing over Knowledge Graphs | Oct 18, 2023 | Contrastive LearningEntity Typing | CodeCode Available | 0 |
| Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer | Oct 15, 2023 | DiversityMixture-of-Experts | —Unverified | 0 |
| Adaptive Gating in Mixture-of-Experts based Language Models | Oct 11, 2023 | Mixture-of-Experts | —Unverified | 0 |
| Beyond the Typical: Modeling Rare Plausible Patterns in Chemical Reactions by Leveraging Sequential Mixture-of-Experts | Oct 7, 2023 | Mixture-of-Experts | —Unverified | 0 |
| Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion | Oct 6, 2023 | Mixture-of-Experts | CodeCode Available | 0 |
| Reinforcement Learning-based Mixture of Vision Transformers for Video Violence Recognition | Oct 4, 2023 | Mixture-of-Expertsreinforcement-learning | —Unverified | 0 |
| Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness | Oct 3, 2023 | GPUMachine Translation | —Unverified | 0 |
| FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models | Oct 3, 2023 | Face TransferMixture-of-Experts | CodeCode Available | 0 |
| Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts | Sep 25, 2023 | Density EstimationMixture-of-Experts | —Unverified | 0 |
| Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts | Sep 8, 2023 | Mixture-of-Experts | —Unverified | 0 |
| Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectives | Sep 1, 2023 | Mixture-of-Experts | CodeCode Available | 0 |
| Task-Based MoE for Multitask Multilingual Machine Translation | Aug 30, 2023 | Machine TranslationMixture-of-Experts | —Unverified | 0 |
| SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget | Aug 29, 2023 | Mixture-of-Expertsobject-detection | —Unverified | 0 |
| EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE | Aug 23, 2023 | Image-text matchingImage-text Retrieval | —Unverified | 0 |
| Beyond Sharing: Conflict-Aware Multivariate Time Series Anomaly Detection | Aug 17, 2023 | Anomaly DetectionMixture-of-Experts | CodeCode Available | 0 |
| FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs | Aug 16, 2023 | GPUMixture-of-Experts | —Unverified | 0 |
| Experts Weights Averaging: A New General Training Scheme for Vision Transformers | Aug 11, 2023 | Mixture-of-Experts | —Unverified | 0 |
| A Novel Temporal Multi-Gate Mixture-of-Experts Approach for Vehicle Trajectory and Driving Intention Prediction | Aug 1, 2023 | Mixture-of-ExpertsPosition | —Unverified | 0 |
| Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving | Jul 30, 2023 | Autonomous DrivingMixture-of-Experts | —Unverified | 0 |
| Domain-Agnostic Neural Architecture for Class Incremental Continual Learning in Document Processing Platform | Jul 11, 2023 | Continual LearningMixture-of-Experts | CodeCode Available | 0 |
| Bidirectional Attention as a Mixture of Continuous Word Experts | Jul 8, 2023 | Language ModellingMixture-of-Experts | CodeCode Available | 0 |