Mixture-of-Experts

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–550 of 1312 papers

Title	Date	Tasks	Status
BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts	Mar 25, 2025	Image SegmentationMixture-of-Experts	—Unverified
Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding	Mar 24, 2025	Mixture-of-ExpertsMorphology classification	—Unverified
ExpertRAG: Efficient RAG with Mixture of Experts -- Optimizing Context Retrieval for Adaptive LLM Responses	Mar 23, 2025	Language ModelingLanguage Modelling	—Unverified
Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM	Mar 22, 2025	Code GenerationMixture-of-Experts	—Unverified
UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations	Mar 20, 2025	Image RestorationMixture-of-Experts	—Unverified
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts	Mar 20, 2025	Mixture-of-Experts	—Unverified
SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation	Mar 19, 2025	Mixture-of-Experts	—Unverified
Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication	Mar 19, 2025	Language ModelingLanguage Modelling	—Unverified
Core-Periphery Principle Guided State Space Model for Functional Connectome Classification	Mar 18, 2025	Functional ConnectivityMamba	—Unverified
MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts	Mar 18, 2025	Mixture-of-Expertsparameter-efficient fine-tuning	—Unverified
Fast filtering of non-Gaussian models using Amortized Optimal Transport Maps	Mar 16, 2025	Mixture-of-Experts	CodeCode Available
Adaptive Mixture of Low-Rank Experts for Robust Audio Spoofing Detection	Mar 15, 2025	Mixture-of-Experts	—Unverified
A Review of DeepSeek Models' Key Innovative Techniques	Mar 14, 2025	Mixture-of-Expertsreinforcement-learning	—Unverified
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling	Mar 14, 2025	Mixture-of-Expertsparameter-efficient fine-tuning	CodeCode Available
Ensemble Learning for Large Language Models in Text and Code Generation: A Survey	Mar 13, 2025	Code GenerationEnsemble Learning	—Unverified
dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis	Mar 13, 2025	Federated LearningMixture-of-Experts	—Unverified
Astrea: A MOE-based Visual Understanding Model with Progressive Alignment	Mar 12, 2025	Contrastive LearningCross-Modal Retrieval	—Unverified
FaVChat: Unlocking Fine-Grained Facail Video Understanding with Multimodal Large Language Models	Mar 12, 2025	Mixture-of-ExpertsQuestion Answering	—Unverified
Towards Robust Multimodal Representation: A Unified Approach with Adaptive Experts and Alignment	Mar 12, 2025	Contrastive LearningDecision Making	CodeCode Available
Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference	Mar 12, 2025	BlockingGPU	—Unverified
Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework	Mar 12, 2025	ClusteringDiversity	—Unverified
Automatic Operator-level Parallelism Planning for Distributed Deep Learning -- A Mixed-Integer Programming Approach	Mar 12, 2025	Computational EfficiencyMixture-of-Experts	—Unverified
UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models	Mar 11, 2025	AttributeMixture-of-Experts	—Unverified
MoE-Loco: Mixture of Experts for Multitask Locomotion	Mar 11, 2025	Mixture-of-Experts	—Unverified
MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models	Mar 11, 2025	Large Language ModelMixture-of-Experts	—Unverified
Accelerating MoE Model Inference with Expert Sharding	Mar 11, 2025	DecoderGPU	—Unverified
GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts	Mar 10, 2025	3D ReconstructionAutonomous Driving	—Unverified
eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference	Mar 10, 2025	Mixture-of-ExpertsScheduling	—Unverified
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration	Mar 10, 2025	Mixture-of-Experts	CodeCode Available
MoFE: Mixture of Frozen Experts Architecture	Mar 9, 2025	Mixture-of-Expertsparameter-efficient fine-tuning	—Unverified
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models	Mar 9, 2025	Anomaly DetectionMamba	CodeCode Available
MANDARIN: Mixture-of-Experts Framework for Dynamic Delirium and Coma Prediction in ICU Patients: Development and Validation of an Acute Brain Dysfunction Prediction Model	Mar 8, 2025	Mixture-of-Experts	—Unverified
A Novel Trustworthy Video Summarization Algorithm Through a Mixture of LoRA Experts	Mar 8, 2025	Mixture-of-ExpertsVideo Summarization	—Unverified
MoEMoE: Question Guided Dense and Scalable Sparse Mixture-of-Expert for Multi-source Multi-modal Answering	Mar 8, 2025	Answer GenerationMixture-of-Experts	—Unverified
FMT:A Multimodal Pneumonia Detection Model Based on Stacking MOE Framework	Mar 7, 2025	DiagnosticMedical Image Analysis	—Unverified
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs	Mar 7, 2025	Knowledge GraphsMixture-of-Experts	—Unverified
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts	Mar 7, 2025	Mixture-of-Experts	—Unverified
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning	Mar 7, 2025	GPUMath	—Unverified
TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster	Mar 6, 2025	Domain AdaptationMixture-of-Experts	—Unverified
Continual Pre-training of MoEs: How robust is your router?	Mar 6, 2025	DecoderMixture-of-Experts	—Unverified
A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery	Mar 6, 2025	DenoisingDrug Discovery	—Unverified
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining	Mar 6, 2025	GPUHyperparameter Optimization	—Unverified
Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling	Mar 6, 2025	Mixture-of-ExpertsScheduling	—Unverified
Convergence Rates for Softmax Gating Mixture of Experts	Mar 5, 2025	Mixture-of-Expertsparameter estimation	—Unverified
BrainNet-MoE: Brain-Inspired Mixture-of-Experts Learning for Neurological Disease Identification	Mar 5, 2025	Mixture-of-Experts	—Unverified
VoiceGRPO: Modern MoE Transformers with Group Relative Policy Optimization GRPO for AI Voice Health Care Applications on Voice Pathology Detection	Mar 5, 2025	DiagnosticMixture-of-Experts	CodeCode Available
Tabby: Tabular Data Synthesis with Language Models	Mar 4, 2025	Language ModelingLanguage Modelling	—Unverified
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer	Mar 4, 2025	Computational EfficiencyMixture-of-Experts	CodeCode Available
How Do Consumers Really Choose: Exposing Hidden Preferences with the Mixture of Experts Model	Mar 3, 2025	Decision MakingDemand Forecasting	—Unverified
PROPER: A Progressive Learning Framework for Personalized Large Language Models with Group-Level Adaptation	Mar 3, 2025	Mixture-of-Expertsparameter-efficient fine-tuning	—Unverified

Show:10 25 50

← PrevPage 11 of 27Next →

No leaderboard results yet.