SOTAVerified

Inference Optimization

Papers

Showing 1-25 of 56 papers

| Title | Status | Hype |
| --- | --- | --- |
| The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation | Code | 5 |
| Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Code | 4 |
| Inference Performance Optimization for Large Language Models on CPUs | Code | 3 |
| A Survey on Inference Optimization Techniques for Mixture of Experts Models | Code | 3 |
| SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL | Code | 3 |
| CycleBNN: Cyclic Precision Training in Binary Neural Networks | Code | 2 |
| ADJUST: A Dictionary-Based Joint Reconstruction and Unmixing Method for Spectral Tomography | Code | 1 |
| A Novel 1D State Space for Efficient Music Rhythmic Analysis | Code | 1 |
| Painterly Image Harmonization using Diffusion Model | Code | 1 |
| Easy and Efficient Transformer : Scalable Inference Solution For large NLP model | Code | 1 |
| Adaptive Deep Neural Network Inference Optimization with EENet | Code | 1 |
| CRVI: Convex Relaxation for Variational Inference | | 0 |
| Federated Learning While Providing Model as a Service: Joint Training and Inference Optimization | | 0 |
| Residual-Based Error Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems | | 0 |
| Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization | | 0 |
| Advances and Open Challenges in Federated Foundation Models | | 0 |
| FluidML: Fast and Memory Efficient Inference Optimization | | 0 |
| EdgeRL: Reinforcement Learning-driven Deep Learning Model Inference Optimization at Edge | | 0 |
| Easy and Efficient Transformer: Scalable Inference Solution For Large NLP Model | | 0 |
| An approach to optimize inference of the DIART speaker diarization pipeline | | 0 |
| DVFS-Aware DNN Inference on GPUs: Latency Modeling and Performance Analysis | | 0 |
| DSMentor: Enhancing Data Science Agents with Curriculum Learning and Online Knowledge Accumulation | | 0 |
| Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks | | 0 |
| Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification | | 0 |
| Developing efficient transfer learning strategies for robust scene recognition in mobile robotics using pre-trained convolutional neural networks | | 0 |

No leaderboard results yet.