SOTAVerified

Mixture-of-Experts

Papers

Showing 601–650 of 1312 papers

Title | Status | Hype
Sigmoid Self-Attention has Lower Sample Complexity than Softmax Self-Attention: A Mixture-of-Experts Perspective | - | 0
Adaptive Prompt: Unlocking the Power of Visual Prompt Tuning | - | 0
Pheromone-based Learning of Optimal Reasoning Paths | - | 0
MolGraph-xLSTM: A graph-based dual-level xLSTM framework with multi-head mixture-of-experts for enhanced molecular representation and interpretability | - | 0
Heuristic-Informed Mixture of Experts for Link Prediction in Multilayer Networks | - | 0
Free Agent in Agent-Based Mixture-of-Experts Generative AI Framework | - | 0
3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | - | 0
Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference | - | 0
ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning | - | 0
Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning | - | 0
Mean-field limit from general mixtures of experts to quantum neural networks | - | 0
Sparse Mixture-of-Experts for Non-Uniform Noise Reduction in MRI Images | - | 0
CSAOT: Cooperative Multi-Agent System for Active Object Tracking | - | 0
UniUIR: Considering Underwater Image Restoration as An All-in-One Learner | - | 0
BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust Multilingual E2E ASR | - | 0
LLM4WM: Adapting LLM for Wireless Multi-Tasking | - | 0
Autonomy-of-Experts Models | - | 0
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models | - | 0
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models | - | 0
SCFCRC: Simultaneously Counteract Feature Camouflage and Relation Camouflage for Fraud Detection | - | 0
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models | - | 0
OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning | - | 0
LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | - | 0
GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism | - | 0
PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration | - | 0
A Multi-Modal Deep Learning Framework for Pan-Cancer Prognosis | Code | 0
TAMER: A Test-Time Adaptive MoE-Driven Framework for EHR Representation Learning | Code | 0
Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing | - | 0
mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training | - | 0
Mixture-of-Experts Graph Transformers for Interpretable Particle Collision Detection | Code | 0
Fresh-CL: Feature Realignment through Experts on Hypersphere in Continual Learning | - | 0
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders | - | 0
MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification | - | 0
Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning | - | 0
Towards Efficient Foundation Model for Zero-shot Amodal Segmentation | - | 0
REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization | - | 0
UNIALIGN: Scaling Multimodal Alignment within One Unified Model | - | 0
Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images | - | 0
CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection | - | 0
Multimodal Variational Autoencoder: a Barycentric View | - | 0
UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity | Code | 0
Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection | - | 0
AskChart: Universal Chart Understanding through Textual Enhancement | Code | 0
BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing | Code | 0
UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition | - | 0
Part-Of-Speech Sensitivity of Routers in Mixture of Experts Models | - | 0
Theory of Mixture-of-Experts for Mobile Edge Computing | - | 0
SEKE: Specialised Experts for Keyword Extraction | Code | 0
SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks | Code | 0
DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE Inference | Code | 0
Page 13 of 27

No leaderboard results yet.