SOTAVerified

Computational Efficiency

Methods and optimizations to reduce the computational resources (e.g., time, memory, or power) needed for training and inference in models. This involves techniques that streamline processing, optimize algorithms, or leverage hardware to enhance performance without compromising accuracy.

Papers

Showing 11811190 of 4891 papers

TitleStatusHype
Latenrgy: Model Agnostic Latency and Energy Consumption Prediction for Binary Classifiers0
Exploring GLU Expansion Ratios: A Study of Structured Pruning in LLaMA-3.2 ModelsCode5
Optimizing Large Language Models with an Enhanced LoRA Fine-Tuning Algorithm for Efficiency and Robustness in NLP Tasks0
Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models0
Ister: Inverted Seasonal-Trend Decomposition Transformer for Explainable Multivariate Time Series Forecasting0
HAND: Hierarchical Attention Network for Multi-Scale Handwritten Document Recognition and Layout AnalysisCode0
1.58-bit FLUX0
VORTEX: A Spatial Computing Framework for Optimized Drone Telemetry Extraction from First-Person View Flight Data0
Quantum framework for Reinforcement Learning: Integrating Markov decision process, quantum arithmetic, and trajectory search0
Exact Acceleration of Subgraph Graph Neural Networks by Eliminating Computation Redundancy0
Show:102550
← PrevPage 119 of 490Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ViTaLHamming Loss0.05Unverified