Mixture-of-Experts

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 126–150 of 1312 papers

Title	Date	Tasks	Status	Hype
Seed1.5-VL Technical Report	May 11, 2025	Mixture-of-ExpertsMultimodal Reasoning	—Unverified	0
QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration	May 10, 2025	GPUMixture-of-Experts	—Unverified	0
Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding	May 10, 2025	DescriptiveEmotion Recognition	CodeCode Available	1
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free	May 10, 2025	AttributeMixture-of-Experts	CodeCode Available	4
FloE: On-the-Fly MoE Inference on Memory-constrained GPU	May 9, 2025	CPUGPU	—Unverified	0
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design	May 9, 2025	Mixture-of-ExpertsQuantization	CodeCode Available	1
Divide-and-Conquer: Cold-Start Bundle Recommendation via Mixture of Diffusion Experts	May 8, 2025	Mixture-of-Experts	—Unverified	0
Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs	May 7, 2025	Mixture-of-Experts	—Unverified	0
SToLa: Self-Adaptive Touch-Language Framework with Tactile Commonsense Reasoning in Open-Ended Scenarios	May 7, 2025	DiversityMixture-of-Experts	—Unverified	0
LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress?	May 7, 2025	Large Language ModelMixture-of-Experts	CodeCode Available	0
STAR-Rec: Making Peace with Length Variance and Pattern Diversity in Sequential Recommendation	May 6, 2025	DiversityMixture-of-Experts	—Unverified	0
Faster MoE LLM Inference for Extremely Large Models	May 6, 2025	Inference OptimizationMixture-of-Experts	—Unverified	0
3D Gaussian Splatting Data Compression with Mixture of Priors	May 6, 2025	3DGSData Compression	—Unverified	0
Towards Smart Point-and-Shoot Photography	May 6, 2025	Mixture-of-ExpertsWord Embeddings	—Unverified	0
Multimodal Deep Learning-Empowered Beam Prediction in Future THz ISAC Systems	May 5, 2025	Beam PredictionDeep Learning	—Unverified	0
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques	May 5, 2025	Knowledge DistillationMixture-of-Experts	—Unverified	0
Finger Pose Estimation for Under-screen Fingerprint Sensor	May 5, 2025	Mixture-of-ExpertsPose Estimation	CodeCode Available	0
Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields	May 4, 2025	Mixture-of-ExpertsNeRF	CodeCode Available	3
Perception-Informed Neural Networks: Beyond Physics-Informed Neural Networks	May 2, 2025	Mixture-of-Experts	—Unverified	0
CoCoAFusE: Beyond Mixtures of Experts via Model Fusion	May 2, 2025	Mixture-of-ExpertsPhilosophy	—Unverified	0
CICADA: Cross-Domain Interpretable Coding for Anomaly Detection and Adaptation in Multivariate Time Series	May 1, 2025	Anomaly DetectionMeta-Learning	—Unverified	0
Mixture of Sparse Attention: Content-Based Learnable Sparse Attention via Expert-Choice Routing	May 1, 2025	Mixture-of-Experts	CodeCode Available	1
MoxE: Mixture of xLSTM Experts with Entropy-Aware Routing for Efficient Language Modeling	May 1, 2025	Language ModelingLanguage Modelling	—Unverified	0
MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation	Apr 29, 2025	cross-modal alignmentDecoder	CodeCode Available	0
Accelerating Mixture-of-Experts Training with Adaptive Expert Replication	Apr 28, 2025	GPUMixture-of-Experts	—Unverified	0

Show:10 25 50

← PrevPage 6 of 53Next →

No leaderboard results yet.