SOTAVerified

Computational Efficiency

Methods and optimizations to reduce the computational resources (e.g., time, memory, or power) needed for training and inference in models. This involves techniques that streamline processing, optimize algorithms, or leverage hardware to enhance performance without compromising accuracy.

Papers

Showing 5160 of 4891 papers

TitleStatusHype
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context TrainingCode3
Effects of charging and discharging capabilities on trade-offs between model accuracy and computational efficiency in pumped thermal electricity storageCode3
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion ModelCode3
Residual Kolmogorov-Arnold Network for Enhanced Deep LearningCode3
SOAP: Improving and Stabilizing Shampoo using AdamCode3
Apollo: Band-sequence Modeling for High-Quality Audio RestorationCode3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge DistillationCode3
GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF FusionCode3
FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution RenderingCode3
Human-like Episodic Memory for Infinite Context LLMsCode3
Show:102550
← PrevPage 6 of 490Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ViTaLHamming Loss0.05Unverified