SOTAVerified

Inference Optimization

Papers

Showing 21-30 of 56 papers

Title | Status | Hype
An approach to optimize inference of the DIART speaker diarization pipeline | - | 0
LLaSA: Large Language and E-Commerce Shopping Assistant | Code | 0
Patched MOA: optimizing inference for diverse software development tasks | Code | 0
Inference Optimization of Foundation Models on AI Accelerators | - | 0
Inference Performance Optimization for Large Language Models on CPUs | Code | 3
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization | - | 0
Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval | - | 0
Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks | - | 0
Advances and Open Challenges in Federated Foundation Models | - | 0
Federated Learning While Providing Model as a Service: Joint Training and Inference Optimization | - | 0
Page 3 of 6

No leaderboard results yet.