SOTAVerified

GPU

Papers

Showing 51–100 of 5629 papers

Title | Status | Hype
FlatCAD: Fast Curvature Regularization of Neural SDFs for CAD Models | - | 0
LazyEviction: Lagged KV Eviction with Attention Pattern Observation for Efficient Long Reasoning | - | 0
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding | - | 0
Utility-Driven Speculative Decoding for Mixture-of-Experts | - | 0
NeuralPDR: Neural Differential Equations as surrogate models for Photodissociation Regions | Code | 0
VideoMAR: Autoregressive Video Generation with Continuous Tokens | - | 0
MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration | - | 0
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences | Code | 3
From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars | - | 0
TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting | Code | 0
Vine Copulas as Differentiable Computational Graphs | Code | 3
Parallel Branch Model Predictive Control on GPUs | - | 0
Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the Torus | Code | 0
ECLIP: Energy-efficient and Practical Co-Location of ML Inference on Spatially Partitioned GPUs | - | 0
Deploying and Evaluating Multiple Deep Learning Models on Edge Devices for Diabetic Retinopathy Detection | - | 0
GroupNL: Low-Resource and Robust CNN Design over Cloud and Device | - | 0
GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction | - | 0
SecONNds: Secure Outsourced Neural Network Inference on ImageNet | Code | 0
FeNN: A RISC-V vector processor for Spiking Neural Network acceleration | - | 0
Farseer: A Refined Scaling Law in Large Language Models | Code | 1
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization | - | 0
GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning | - | 0
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | - | 0
Vector Representations of Vessel Trees | - | 0
Mutual-Supervised Learning for Sequential-to-Parallel Code Translation | Code | 1
AtmosMJ: Revisiting Gating Mechanism for AI Weather Forecasting Beyond the Year Scale | Code | 0
GPU-accelerated Modeling of Biological Regulatory Networks | - | 0
Can A Gamer Train A Mathematical Reasoning Model? | Code | 0
A PDE-Based Image Dehazing Method via Atmospheric Scattering Theory | - | 0
Towards Secure and Private Language Models for Nuclear Power Plants | - | 0
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning | Code | 2
ECMNet: Lightweight Semantic Segmentation with Efficient CNN-Mamba Network | - | 0
Olica: Efficient Structured Pruning of Large Language Models without Retraining | Code | 0
PerfTracker: Online Performance Troubleshooting for Large-scale Model Training in Production | - | 0
FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed | - | 0
Plug-and-Play Linear Attention for Pre-trained Image and Video Restoration Models | Code | 0
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion | - | 0
GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution | - | 0
NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models | - | 0
ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning | - | 0
MoE-GPS: Guidelines for Prediction Strategy for Dynamic Expert Duplication in MoE Load Balancing | - | 0
Fractional-order Jacobian Matrix Differentiation and Its Application in Artificial Neural Networks | - | 0
Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference | - | 0
E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models | - | 0
Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs | - | 0
FuncGNN: Learning Functional Semantics of Logic Circuits with Graph Neural Networks | Code | 0
BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures | - | 0
Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage | - | 0
On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images | - | 0
Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis | - | 0

No leaderboard results yet.