Mixture-of-Experts

Papers

Showing 551–575 of 1312 papers

Title	Date	Tasks	Status
PROPER: A Progressive Learning Framework for Personalized Large Language Models with Group-Level Adaptation	Mar 3, 2025	Mixture-of-Experts, parameter-efficient fine-tuning	Unverified
Unify and Anchor: A Context-Aware Transformer for Cross-Domain Time Series Forecasting	Mar 3, 2025	Domain Generalization, Mixture-of-Experts	Unverified
DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models	Mar 3, 2025	Mixture-of-Experts, Quantization	Unverified
Explainable Classifier for Malignant Lymphoma Subtyping via Cell Graph and Image Fusion	Mar 2, 2025	Mixture-of-Experts, whole slide images	Unverified
CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering	Mar 1, 2025	Continual Learning, Language Modeling	Unverified
CoSMoEs: Compact Sparse Mixture of Experts	Feb 28, 2025	Mixture-of-Experts	Unverified
UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook	Feb 27, 2025	Language Modeling, Language Modelling	Unverified
Mixture of Experts for Recognizing Depression from Interview and Reading Tasks	Feb 27, 2025	Mixture-of-Experts	Unverified
Mixture of Experts-augmented Deep Unfolding for Activity Detection in IRS-aided Systems	Feb 27, 2025	Action Detection, Activity Detection	Unverified
OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment	Feb 26, 2025	Mixture-of-Experts, Recommendation Systems	Unverified
Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization	Feb 26, 2025	Mixture-of-Experts	Unverified
Evaluating Expert Contributions in a MoE LLM for Quiz-Based Tasks	Feb 24, 2025	Mixture-of-Experts, MMLU	Unverified
ENACT-Heart -- ENsemble-based Assessment Using CNN and Transformer on Heart Sounds	Feb 24, 2025	Diagnostic, Mixture-of-Experts	Unverified
The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE	Feb 24, 2025	Linear Mode Connectivity, Mixture-of-Experts	Unverified
BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference	Feb 24, 2025	Mixture-of-Experts	Unverified
An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning	Feb 22, 2025	ARC, Continual Learning	Unverified
Tight Clusters Make Specialized Experts	Feb 21, 2025	Clustering, Language Modeling	Code Available
Binary-Integer-Programming Based Algorithm for Expert Load Balancing in Mixture-of-Experts Models	Feb 21, 2025	Mixture-of-Experts	Code Available
Ray-Tracing for Conditionally Activated Neural Networks	Feb 20, 2025	Mixture-of-Experts	Unverified
Unraveling the Localized Latents: Learning Stratified Manifold Structures in LLM Embedding Space with Sparse Mixture-of-Experts	Feb 19, 2025	Dictionary Learning, Mixture-of-Experts	Unverified
Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models	Feb 18, 2025	Knowledge Distillation, Mixture-of-Experts	Unverified
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs	Feb 18, 2025	Computational Efficiency, Language Modeling	Unverified
How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines	Feb 17, 2025	Mixture-of-Experts	Unverified
Connector-S: A Survey of Connectors in Multi-modal Large Language Models	Feb 17, 2025	Mixture-of-Experts, Survey	Unverified
Fate: Fast Edge Inference of Mixture-of-Experts Models via Cross-Layer Gate	Feb 17, 2025	GPU, Mixture-of-Experts	Code Available

