
Mixture-of-Experts

Papers

Showing 76–100 of 1312 papers

Title | Status | Hype
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset | Code | 2
Monet: Mixture of Monosemantic Experts for Transformers | Code | 2
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training | Code | 2
CNMBERT: A Model for Converting Hanyu Pinyin Abbreviations to Chinese Characters | Code | 2
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models | Code | 2
LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration | Code | 2
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts | Code | 2
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free | Code | 2
Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts | Code | 2
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More | Code | 2
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Code | 2
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling | Code | 2
MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Code | 2
KAN4TSF: Are KAN and KAN-based models Effective for Time Series Forecasting? | Code | 2
Mixture of A Million Experts | Code | 2
A Closer Look into Mixture-of-Experts in Large Language Models | Code | 2
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks | Code | 2
Demystifying the Compression of Mixture-of-Experts Through a Unified Framework | Code | 2
Yuan 2.0-M32: Mixture of Experts with Attention Router | Code | 2
XTrack: Multimodal Training Boosts RGB-X Video Object Trackers | Code | 2
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation | Code | 2
MoEUT: Mixture-of-Experts Universal Transformers | Code | 2
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models | Code | 2
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | Code | 2
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts | Code | 2
Page 4 of 53

No leaderboard results yet.