Mixture-of-Experts

Papers

Showing 951–1000 of 1312 papers

Title	Date	Tasks	Status
Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts?	Jan 25, 2024	Mixture-of-Experts, parameter estimation	Unverified
M^3TN: Multi-gate Mixture-of-Experts based Multi-valued Treatment Network for Uplift Modeling	Jan 24, 2024	Mixture-of-Experts	Unverified
Towards A Better Metric for Text-to-Video Generation	Jan 15, 2024	Mixture-of-Experts, Text-to-Video Generation	Unverified
Prompt-based mental health screening from social media text	Jan 11, 2024	Mixture-of-Experts	Unverified
Robust Calibration For Improved Weather Prediction Under Distributional Shift	Jan 8, 2024	Data Augmentation, Mixture-of-Experts	Unverified
Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models	Jan 6, 2024	Instruction Following, Mixture-of-Experts	Unverified
Subjective and Objective Analysis of Indian Social Media Video Quality	Jan 5, 2024	Mixture-of-Experts, Visual Question Answering (VQA)	Code Available
k-Winners-Take-All Ensemble Neural Network	Jan 4, 2024	All, Mixture-of-Experts	Code Available
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation	Dec 27, 2023	Image Restoration, Mixture-of-Experts	Unverified
Agent4Ranking: Semantic Robust Ranking via Personalized Query Rewriting Using Multi-agent LLM	Dec 24, 2023	Mixture-of-Experts	Unverified
Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning	Dec 19, 2023	Diversity, Instruction Following	Unverified
Generator Assisted Mixture of Experts For Feature Acquisition in Batch	Dec 19, 2023	Mixture-of-Experts	Unverified
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape	Dec 18, 2023	Mixture-of-Experts	Unverified
Online Action Recognition for Human Risk Prediction with Anticipated Haptic Alert via Wearables	Dec 14, 2023	Action Recognition, Mixture-of-Experts	Code Available
Training of Neural Networks with Uncertain Data: A Mixture of Experts Approach	Dec 13, 2023	Autonomous Driving, Mixture-of-Experts	Unverified
MoE-AMC: Enhancing Automatic Modulation Classification Performance Using Mixture-of-Experts	Dec 4, 2023	Classification, Mixture-of-Experts	Unverified
MoEC: Mixture of Experts Implicit Neural Compression	Dec 3, 2023	Data Compression, Mixture-of-Experts	Unverified
Language-driven All-in-one Adverse Weather Removal	Dec 3, 2023	All, Diversity	Unverified
Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts	Dec 1, 2023	Chart Question Answering, Document AI	Unverified
HOMOE: A Memory-Based and Composition-Aware Framework for Zero-Shot Learning with Hopfield Network and Soft Mixture of Experts	Nov 23, 2023	Compositional Zero-Shot Learning, Mixture-of-Experts	Unverified
Efficient Model Agnostic Approach for Implicit Neural Representation Based Arbitrary-Scale Image Super-Resolution	Nov 20, 2023	Computational Efficiency, Decoder	Unverified
Memory Augmented Language Models through Mixture of Word Experts	Nov 15, 2023	Mixture-of-Experts	Unverified
Intentional Biases in LLM Responses	Nov 11, 2023	Language Modeling, Language Modelling	Unverified
CAME: Competitively Learning a Mixture-of-Experts Model for First-stage Retrieval	Nov 6, 2023	Mixture-of-Experts, Retrieval	Unverified
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE	Nov 5, 2023	Decoder, Mixture-of-Experts	Code Available
Mixture-of-Experts for Open Set Domain Adaptation: A Dual-Space Detection Approach	Nov 1, 2023	Domain Adaptation, Mixture-of-Experts	Unverified
A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts	Oct 22, 2023	Density Estimation, Mixture-of-Experts	Unverified
Manifold-Preserving Transformers are Effective for Short-Long Range Encoding	Oct 22, 2023	Language Modeling, Language Modelling	Code Available
Direct Neural Machine Translation with Task-level Mixture of Experts models	Oct 18, 2023	Direct NMT, Large Language Model	Unverified
Multi-view Contrastive Learning for Entity Typing over Knowledge Graphs	Oct 18, 2023	Contrastive Learning, Entity Typing	Code Available
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer	Oct 15, 2023	Diversity, Mixture-of-Experts	Unverified
Adaptive Gating in Mixture-of-Experts based Language Models	Oct 11, 2023	Mixture-of-Experts	Unverified
Beyond the Typical: Modeling Rare Plausible Patterns in Chemical Reactions by Leveraging Sequential Mixture-of-Experts	Oct 7, 2023	Mixture-of-Experts	Unverified
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion	Oct 6, 2023	Mixture-of-Experts	Code Available
Reinforcement Learning-based Mixture of Vision Transformers for Video Violence Recognition	Oct 4, 2023	Mixture-of-Experts, reinforcement-learning	Unverified
Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness	Oct 3, 2023	GPU, Machine Translation	Unverified
FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models	Oct 3, 2023	Face Transfer, Mixture-of-Experts	Code Available
Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts	Sep 25, 2023	Density Estimation, Mixture-of-Experts	Unverified
Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts	Sep 8, 2023	Mixture-of-Experts	Unverified
Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectives	Sep 1, 2023	Mixture-of-Experts	Code Available
Task-Based MoE for Multitask Multilingual Machine Translation	Aug 30, 2023	Machine Translation, Mixture-of-Experts	Unverified
SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget	Aug 29, 2023	Mixture-of-Experts, object-detection	Unverified
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE	Aug 23, 2023	Image-text matching, Image-text Retrieval	Unverified
Beyond Sharing: Conflict-Aware Multivariate Time Series Anomaly Detection	Aug 17, 2023	Anomaly Detection, Mixture-of-Experts	Code Available
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs	Aug 16, 2023	GPU, Mixture-of-Experts	Unverified
Experts Weights Averaging: A New General Training Scheme for Vision Transformers	Aug 11, 2023	Mixture-of-Experts	Unverified
A Novel Temporal Multi-Gate Mixture-of-Experts Approach for Vehicle Trajectory and Driving Intention Prediction	Aug 1, 2023	Mixture-of-Experts, Position	Unverified
Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving	Jul 30, 2023	Autonomous Driving, Mixture-of-Experts	Unverified
Domain-Agnostic Neural Architecture for Class Incremental Continual Learning in Document Processing Platform	Jul 11, 2023	Continual Learning, Mixture-of-Experts	Code Available
Bidirectional Attention as a Mixture of Continuous Word Experts	Jul 8, 2023	Language Modelling, Mixture-of-Experts	Code Available
