SOTAVerified

Computational Efficiency

Methods and optimizations to reduce the computational resources (e.g., time, memory, or power) needed for training and inference in models. This involves techniques that streamline processing, optimize algorithms, or leverage hardware to enhance performance without compromising accuracy.

Papers

Showing 771780 of 4891 papers

TitleStatusHype
Consistent Accelerated Inference via Confident Adaptive TransformersCode1
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM InferenceCode1
Content-aware Token Sharing for Efficient Semantic Segmentation with Vision TransformersCode1
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement LearningCode1
Automated Lane Merging via Game Theory and Branch Model Predictive ControlCode1
Pyramidal Reservoir Graph Neural NetworkCode1
CondenseNet V2: Sparse Feature Reactivation for Deep NetworksCode1
Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained EnvironmentsCode1
A Flexible 2.5D Medical Image Segmentation Approach with In-Slice and Cross-Slice AttentionCode1
Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)Code1
Show:102550
← PrevPage 78 of 490Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ViTaLHamming Loss0.05Unverified