SOTAVerified

Computational Efficiency

Methods and optimizations to reduce the computational resources (e.g., time, memory, or power) needed for training and inference in models. This involves techniques that streamline processing, optimize algorithms, or leverage hardware to enhance performance without compromising accuracy.

Papers

Showing 926950 of 4891 papers

TitleStatusHype
Provably Efficient RLHF Pipeline: A Unified View from Contextual Bandits0
Latent Convergence Modulation in Large Language Models: A Novel Approach to Iterative Contextual Realignment0
Calibrating LLMs with Information-Theoretic Evidential Deep LearningCode1
Smell of Source: Learning-Based Odor Source Localization with Molecular Communication0
Inventory Consensus Control in Supply Chain Networks using Dissipativity-Based Control and Topology Co-Design0
Bayesian Optimization by Kernel Regression and Density-based Exploration0
Kolmogorov-Arnold Fourier Networks0
Hierarchical Lexical Manifold Projection in Large Language Models: A Novel Mechanism for Multi-Scale Semantic Representation0
Federated Learning with Reservoir State Analysis for Time Series Anomaly DetectionCode0
Graph Neural Network Enabled Pinching Antennas0
CluStRE: Streaming Graph Clustering with Multi-Stage Refinement0
Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning0
Native Fortran Implementation of TensorFlow-Trained Deep and Bayesian Neural NetworksCode0
Cached Multi-Lora Composition for Multi-Concept Image GenerationCode1
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video GenerationCode3
Flopping for FLOPs: Leveraging equivariance for computational efficiency0
Hybrid machine learning based scale bridging framework for permeability prediction of fibrous structures0
Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient0
Tighter sparse variational Gaussian processes0
Hierarchical Contextual Manifold Alignment for Structuring Latent Representations in Large Language Models0
Gaussian Process Regression for Inverse Problems in Linear PDEs0
UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation0
OneTrack-M: A multitask approach to transformer-based MOT models0
TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers0
Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression0
Show:102550
← PrevPage 38 of 196Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ViTaLHamming Loss0.05Unverified