SOTAVerified

GPU

Papers

Showing 25012550 of 5629 papers

TitleStatusHype
Exploring shared memory architectures for end-to-end gigapixel deep learning0
Learning Partial Correlation based Deep Visual Representation for Image ClassificationCode1
Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations0
eWaSR -- an embedded-compute-ready maritime obstacle detection networkCode1
FindVehicle and VehicleFinder: A NER dataset for natural language-based vehicle retrieval and a keyword-based cross-modal vehicle retrieval systemCode1
Scaling the leading accuracy of deep equivariant models to biomolecular simulations of realistic sizeCode2
Local object crop collision network for efficient simulation of non-convex objects in GPU-based simulators0
DADFNet: Dual Attention and Dual Frequency-Guided Dehazing Network for Video-Empowered Intelligent Transportation0
Cooperative Multi-Agent Reinforcement Learning for Inventory Management0
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP TrainingCode1
Sustainable AIGC Workload Scheduling of Geo-Distributed Data Centers: A Multi-Agent Reinforcement Learning Approach0
DETRs Beat YOLOs on Real-time Object DetectionCode8
HEAT: A Highly Efficient and Affordable Training System for Collaborative Filtering Based Recommendation on CPUsCode0
SEA: A Scalable Entity Alignment SystemCode0
Unsupervised ANN-Based Equalizer and Its Trainable FPGA Implementation0
EWT: Efficient Wavelet-Transformer for Single Image Denoising0
Representing Volumetric Videos as Dynamic MLP Maps0
DGNN-Booster: A Generic FPGA Accelerator Framework For Dynamic Graph Neural Network InferenceCode1
Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-AttentionCode0
Gradient-Free Textual Inversion0
Semantic Segmentation with High Inference Speed in Off-Road EnvironmentsCode0
Rotation-Scale Equivariant Steerable FiltersCode0
Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion ModelsCode1
ADS_UNet: A Nested UNet for Histopathology Image Segmentation0
AI-assisted Automated Workflow for Real-time X-ray Ptychography Data Analysis via Federated Resources0
ParaGraph: Weighted Graph Representation for Performance Optimization of HPC Kernels0
Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge0
EZClone: Improving DNN Model Extraction Attack via Shape Distillation from GPU Execution Profiles0
Hyper-parameter Tuning for Adversarially Robust ModelsCode0
FakET: Simulating Cryo-Electron Tomograms with Neural Style TransferCode1
DLRover-RM: Resource Optimization for Deep Recommendation Models Training in the Cloud0
FisHook -- An Optimized Approach to Marine Specie Classification using MobileNetV20
A real-time algorithm for human action recognition in RGB and thermal video0
TransPimLib: A Library for Efficient Transcendental Functions on Processing-in-Memory SystemsCode1
X-TIME: An in-memory engine for accelerating machine learning on tabular data with CAMsCode1
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?0
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement LearningCode2
PopSparse: Accelerated block sparse matrix multiplication on IPU0
Fast inference of latent space dynamics in huge relational event networks0
GNNBuilder: An Automated Framework for Generic Graph Neural Network Accelerator Generation, Simulation, and OptimizationCode1
Instant Photorealistic Neural Radiance Fields StylizationCode0
FMAS: Fast Multi-Objective SuperNet Architecture Search for Semantic Segmentation0
Hard-normal Example-aware Template Mutual Matching for Industrial Anomaly DetectionCode1
CARTO: Category and Joint Agnostic Reconstruction of ARTiculated ObjectsCode1
4K-HAZE: A Dehazing Benchmark with 4K Resolution Hazy and Haze-Free ImagesCode1
Seer: Language Instructed Video Prediction with Latent Diffusion ModelsCode1
Single-subject Multi-contrast MRI Super-resolution via Implicit Neural RepresentationsCode1
SimpleNet: A Simple Network for Image Anomaly Detection and LocalizationCode2
EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level LatenciesCode2
GPU-accelerated Matrix Cover Algorithm for Multiple Patterning Layout Decomposition0
Show:102550
← PrevPage 51 of 113Next →

No leaderboard results yet.