SOTAVerified

GPU

Papers

Showing 11011150 of 5629 papers

TitleStatusHype
AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuningCode1
Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage AccessesCode1
LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA ImplementationsCode1
Efficient Lifelong Model Evaluation in an Era of Rapid ProgressCode1
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-ExpertsCode1
AutoDNNchip: An Automated DNN Chip Predictor and Builder for Both FPGAs and ASICsCode1
EdgeNAT: Transformer for Efficient Edge DetectionCode1
Edge and Identity Preserving Network for Face Super-ResolutionCode1
A GPU-accelerated Large-scale Simulator for Transportation System Optimization BenchmarkingCode1
EEEA-Net: An Early Exit Evolutionary Neural Architecture SearchCode1
Learning Tracking Representations via Dual-Branch Fully Transformer NetworksCode1
Learning to Generate Wasserstein BarycentersCode1
Learning to Upsample by Learning to SampleCode1
Kindling the Darkness: A Practical Low-light Image EnhancerCode1
Effective Batching for Recurrent Neural Network GrammarsCode1
Learning Universal Shape Dictionary for Realtime Instance SegmentationCode1
EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANsCode1
A Unified Framework for Implicit Sinkhorn DifferentiationCode1
A Unified Framework for 3D Point Cloud Visual GroundingCode1
Dynamic Structure Pruning for Compressing CNNsCode1
Learning Partial Correlation based Deep Visual Representation for Image ClassificationCode1
Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-timeCode1
A Generic Inverted Index Framework for Similarity Search on the GPU - Technical ReportCode1
Marius: Learning Massive Graph Embeddings on a Single MachineCode1
Dynamic Sparse Training with Structured SparsityCode1
Learning Neural Volumetric Representations of Dynamic Humans in MinutesCode1
Learning Rich Features at High-Speed for Single-Shot Object DetectionCode1
Dynamic Mesh-Aware Radiance FieldsCode1
Dynamic Low-Rank Sparse Adaptation for Large Language ModelsCode1
Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded PlatformsCode1
Dynamic Perceiver for Efficient Visual RecognitionCode1
Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded DevicesCode1
Dynamic GPU Energy Optimization for Machine Learning Training WorkloadsCode1
Dynamic Pooling Improves Nanopore Base Calling AccuracyCode1
Easy and Efficient Transformer : Scalable Inference Solution For large NLP modelCode1
Learning from Event Cameras with Sparse Spiking Convolutional Neural NetworksCode1
Learning to Enhance Low-Light Image via Zero-Reference Deep Curve EstimationCode1
LightAvatar: Efficient Head Avatar as Dynamic Neural Light FieldCode1
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence DraftingCode1
DVIS: Decoupled Video Instance Segmentation FrameworkCode1
Accelerating Neural Architecture Search via Proxy DataCode1
DXSLAM: A Robust and Efficient Visual SLAM System with Deep FeaturesCode1
Attention in SRAM on Tenstorrent GrayskullCode1
Attention-based Proposals Refinement for 3D Object DetectionCode1
Latent Variable Sequential Set Transformers For Joint Multi-Agent Motion PredictionCode1
Attention as an RNNCode1
Attaining Real-Time Super-Resolution for Microscopic Images Using GANCode1
3rd Place: A Global and Local Dual Retrieval Solution to Facebook AI Image Similarity ChallengeCode1
DSNAS: Direct Neural Architecture Search without Parameter RetrainingCode1
3D Small Object Detection with Dynamic Spatial PruningCode1
Show:102550
← PrevPage 23 of 113Next →

No leaderboard results yet.