SOTAVerified

Computational Efficiency

Methods and optimizations that reduce the computational resources (e.g., time, memory, or power) needed for model training and inference. These include techniques that streamline processing, optimize algorithms, or exploit hardware to improve performance without compromising accuracy.

Papers

Showing 301–325 of 4891 papers

Title | Status | Hype
TabMixer: advancing tabular data analysis with an enhanced MLP-mixer approach | Code | 1
H3DE-Net: Efficient and Accurate 3D Landmark Detection in Medical Imaging | Code | 1
Exploiting Deblurring Networks for Radiance Fields | Code | 1
Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning | Code | 1
CalibQuant: 1-Bit KV Cache Quantization for Multimodal LLMs | Code | 1
Reduced Order Modeling with Shallow Recurrent Decoder Networks | Code | 1
Medicine on the Edge: Comparative Performance Analysis of On-Device LLMs for Clinical Reasoning | Code | 1
PTZ-Calib: Robust Pan-Tilt-Zoom Camera Calibration | Code | 1
Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation | Code | 1
InTAR: Inter-Task Auto-Reconfigurable Accelerator Design for High Data Volume Variation in DNNs | Code | 1
SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence | Code | 1
Calibrating LLMs with Information-Theoretic Evidential Deep Learning | Code | 1
Cached Multi-Lora Composition for Multi-Concept Image Generation | Code | 1
Activation-Informed Merging of Large Language Models | Code | 1
FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation | Code | 1
MatIR: A Hybrid Mamba-Transformer Image Restoration Model | Code | 1
Return of the Encoder: Maximizing Parameter Efficiency for SLMs | Code | 1
CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter | Code | 1
Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation | Code | 1
A Survey of World Models for Autonomous Driving | Code | 1
DualOpt: A Dual Divide-and-Optimize Algorithm for the Large-scale Traveling Salesman Problem | Code | 1
Poseidon: A ViT-based Architecture for Multi-Frame Pose Estimation with Adaptive Frame Weighting and Multi-Scale Feature Fusion | Code | 1
Flash Window Attention: speedup the attention computation for Swin Transformer | Code | 1
DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications | Code | 1
CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems | Code | 1
Page 13 of 196

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ViTaL | Hamming Loss | 0.05 | - | Unverified