SOTAVerified

Inference Optimization

Papers

Showing 2650 of 56 papers

TitleStatusHype
The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities0
An approach to optimize inference of the DIART speaker diarization pipeline0
LLaSA: Large Language and E-Commerce Shopping AssistantCode0
Patched MOA: optimizing inference for diverse software development tasksCode0
Inference Optimization of Foundation Models on AI Accelerators0
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization0
Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval0
Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks0
Advances and Open Challenges in Federated Foundation Models0
Federated Learning While Providing Model as a Service: Joint Training and Inference Optimization0
Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition0
SySMOL: Co-designing Algorithms and Hardware for Neural Networks with Heterogeneous Precisions0
Bayesian Active Learning in the Presence of Nuisance Parameters0
Representing Edge Flows on Graphs via Sparse Cell ComplexesCode0
Residual-Based Error Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems0
Networked Signal and Information Processing0
Enhanced graph-learning schemes driven by similar distributions of motifsCode0
Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation0
Easy and Efficient Transformer: Scalable Inference Solution For Large NLP Model0
SBbadger: Biochemical Reaction Networks with Definable Degree Distributions0
Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization0
Developing efficient transfer learning strategies for robust scene recognition in mobile robotics using pre-trained convolutional neural networks0
Investigations on the inference optimization techniques and their impact on multiple hardware platforms for Semantic Segmentation0
SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition0
A bi-partite generative model framework for analyzing and simulating large scale multiple discrete-continuous travel behaviour data0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.