SOTAVerified

Mixture-of-Experts

Papers

Showing 901–950 of 1312 papers

Title | Status | Hype
Half-Space Feature Learning in Neural Networks | - | 0
Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors | Code | 0
Revolutionizing Disease Diagnosis with simultaneous functional PET/MR and Deeply Integrated Brain Metabolic, Hemodynamic, and Perfusion Networks | - | 0
Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity | - | 0
Jamba: A Hybrid Transformer-Mamba Language Model | Code | 0
Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study | - | 0
DESIRE-ME: Domain-Enhanced Supervised Information REtrieval using Mixture-of-Experts | Code | 0
GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot | - | 0
Skeleton-Based Human Action Recognition with Noisy Labels | Code | 0
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training | - | 0
Equipping Computational Pathology Systems with Artifact Processing Pipelines: A Showcase for Computation and Performance Trade-offs | Code | 0
Conditional computation in neural networks: principles and research trends | - | 0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | - | 0
Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts | - | 0
MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts | - | 0
ConstitutionalExperts: Training a Mixture of Principle-based Prompts | - | 0
Mixture-of-LoRAs: An Efficient Multitask Tuning for Large Language Models | - | 0
Video Relationship Detection Using Mixture of Experts | Code | 0
Vanilla Transformers are Transfer Capability Teachers | - | 0
Hypertext Entity Extraction in Webpage | - | 0
How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE Transformers | - | 0
Enhancing the "Immunity" of Mixture-of-Experts Networks for Adversarial Defense | - | 0
An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement | - | 0
m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers | Code | 0
ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling | Code | 0
Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts | Code | 0
PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning | - | 0
MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models | - | 0
Denoising OCT Images Using Steered Mixture of Experts with Multi-Model Inference | - | 0
Towards an empirical understanding of MoE design choices | - | 0
Turn Waste into Worth: Rectifying Top-k Router of MoE | - | 0
MoRAL: MoE Augmented LoRA for LLMs' Lifelong Learning | - | 0
Mixture of Link Predictors on Graphs | Code | 0
AMEND: A Mixture of Experts Framework for Long-tailed Trajectory Prediction | - | 0
P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation | - | 0
Differentially Private Training of Mixture of Experts Models | - | 0
Buffer Overflow in Mixture of Experts | - | 0
Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts | - | 0
On Parameter Estimation in Deviated Gaussian Mixture of Experts | - | 0
Intrinsic User-Centric Interpretability through Global Mixture of Experts | Code | 0
Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts | - | 0
On Least Square Estimation in Softmax Gating Mixture of Experts | - | 0
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion | - | 0
CompeteSMoE - Effective Training of Sparse Mixture of Experts via Competition | Code | 0
pFedMoE: Data-Level Personalization with Mixture of Experts for Model-Heterogeneous Personalized Federated Learning | Code | 0
MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts | - | 0
Explainable data-driven modeling via mixture of experts: towards effective blending of grey and black-box models | - | 0
Checkmating One, by Using Many: Combining Mixture of Experts with MCTS to Improve in Chess | Code | 0
LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs | - | 0
Routers in Vision Mixture of Experts: An Empirical Study | - | 0
Page 19 of 27

No leaderboard results yet.