SOTAVerified

GPU

Papers

Showing 1701-1750 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| TrainVerify: Equivalence-Based Verification for Distributed LLM Training | | 0 |
| FlatCAD: Fast Curvature Regularization of Neural SDFs for CAD Models | | 0 |
| LazyEviction: Lagged KV Eviction with Attention Pattern Observation for Efficient Long Reasoning | | 0 |
| InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding | | 0 |
| VideoMAR: Autoregressive Video Generatio with Continuous Tokens | | 0 |
| Utility-Driven Speculative Decoding for Mixture-of-Experts | | 0 |
| NeuralPDR: Neural Differential Equations as surrogate models for Photodissociation Regions | Code | 0 |
| From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars | | 0 |
| Parallel Branch Model Predictive Control on GPUs | | 0 |
| TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting | Code | 0 |
| MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration | | 0 |
| Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the Torus | Code | 0 |
| ECLIP: Energy-efficient and Practical Co-Location of ML Inference on Spatially Partitioned GPUs | | 0 |
| Deploying and Evaluating Multiple Deep Learning Models on Edge Devices for Diabetic Retinopathy Detection | | 0 |
| GroupNL: Low-Resource and Robust CNN Design over Cloud and Device | | 0 |
| FeNN: A RISC-V vector processor for Spiking Neural Network acceleration | | 0 |
| GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction | | 0 |
| SecONNds: Secure Outsourced Neural Network Inference on ImageNet | Code | 0 |
| MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | | 0 |
| GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning | | 0 |
| Prompts to Summaries: Zero-Shot Language-Guided Video Summarization | | 0 |
| Vector Representations of Vessel Trees | | 0 |
| AtmosMJ: Revisiting Gating Mechanism for AI Weather Forecasting Beyond the Year Scale | Code | 0 |
| A PDE-Based Image Dehazing Method via Atmospheric Scattering Theory | | 0 |
| Plug-and-Play Linear Attention for Pre-trained Image and Video Restoration Models | Code | 0 |
| Can A Gamer Train A Mathematical Reasoning Model? | Code | 0 |
| Towards Secure and Private Language Models for Nuclear Power Plants | | 0 |
| FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed | | 0 |
| ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network | | 0 |
| GPU-accelerated Modeling of Biological Regulatory Networks | | 0 |
| Olica: Efficient Structured Pruning of Large Language Models without Retraining | Code | 0 |
| PerfTracker: Online Performance Troubleshooting for Large-scale Model Training in Production | | 0 |
| GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution | | 0 |
| Fractional-order Jacobian Matrix Differentiation and Its Application in Artificial Neural Networks | | 0 |
| NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models | | 0 |
| ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning | | 0 |
| MoE-GPS: Guidlines for Prediction Strategy for Dynamic Expert Duplication in MoE Load Balancing | | 0 |
| Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion | | 0 |
| Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference | | 0 |
| Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs | | 0 |
| E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models | | 0 |
| FuncGNN: Learning Functional Semantics of Logic Circuits with Graph Neural Networks | Code | 0 |
| BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures | | 0 |
| Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage | | 0 |
| Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis | | 0 |
| On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images | | 0 |
| Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers | | 0 |
| Generalizable, real-time neural decoding with hybrid state-space models | | 0 |
| High-Speed Ultra-Energy-Efficient Memristor-Based Massive MIMO SIC Detector Circuit with Hybrid Analog-Digital Computing Architecture | | 0 |
| FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices | | 0 |

No leaderboard results yet.