SOTAVerified

GPU

Papers

Showing 55515600 of 5629 papers

TitleStatusHype
Salus: Fine-Grained GPU Sharing Primitives for Deep Learning ApplicationsCode0
Development of Fast Refinement Detectors on AI Edge PlatformsCode0
Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision KernelsCode0
TensorLy: Tensor Learning in PythonCode0
Fast Algorithms for Spiking Neural Network Simulation with FPGAsCode0
SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing SurrogateCode0
Tensor Monte Carlo: particle methods for the GPU eraCode0
Comparative Analysis of FPGA and GPU Performance for Machine Learning-Based Track Reconstruction at LHCbCode0
TensorNetwork for Machine LearningCode0
TensorNetwork on TensorFlow: A Spin Chain Application Using Tree Tensor NetworksCode0
FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNsCode0
A GPU-based Hydrodynamic Simulator with Boid InteractionsCode0
A unified framework for 21cm tomography sample generation and parameter inference with Progressively Growing GANsCode0
A Generative Appearance Model for End-to-end Video Object SegmentationCode0
Faith: An Efficient Framework for Transformer Verification on GPUsCode0
AttriReBoost: A Gradient-Free Propagation Optimization Method for Cold Start Mitigation in Attribute Missing GraphsCode0
Factored Latent-Dynamic Conditional Random Fields for Single and Multi-label Sequence ModelingCode0
Scalable Data Assimilation with Message PassingCode0
TernaryNet: Faster Deep Model Inference without GPUs for Medical 3D Segmentation using Sparse and Binary ConvolutionsCode0
Attention on Attention: Architectures for Visual Question Answering (VQA)Code0
UberNet: Training a `Universal' Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited MemoryCode0
Compact Convolutional Neural Network Cascade for Face DetectionCode0
Scalable Graph Networks for Particle SimulationsCode0
Face-NMS: A Core-set Selection Approach for Efficient Face RecognitionCode0
Scalable K-FAC Training for Deep Neural Networks with Distributed PreconditioningCode0
Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training StrategyCode0
FaceBoxes: A CPU Real-time Face Detector with High AccuracyCode0
Scalable Multitask Learning Using Gradient-based Estimation of Task AffinityCode0
Accelerating the Training of Video Super-Resolution ModelsCode0
Enhanced Recurrent Neural Tangent Kernels for Non-Time-Series DataCode0
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement LearningCode0
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-TuningCode0
Extensions and Limitations of the Neural GPUCode0
A Frequency-aware Software Cache for Large Recommendation System EmbeddingsCode0
Expressive Higher-Order Link Prediction through Hypergraph Symmetry BreakingCode0
reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive LearningCode0
ScaleFreeCTR: MixCache-based Distributed Training System for CTR Models with Huge Embedding TableCode0
A Truncated Newton Method for Optimal TransportCode0
Scaling Attention to Very Long Sequences in Linear Time with Wavelet-Enhanced Random Spectral Attention (WERSA)Code0
Ultra-High-Definition Image Deblurring via Multi-scale Cubic-MixerCode0
Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic SimilarityCode0
Explore as a Storm, Exploit as a Raindrop: On the Benefit of Fine-Tuning Kernel Schedulers with Coordinate DescentCode0
Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUsCode0
Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic SegmentationCode0
Exact Gaussian Processes on a Million Data PointsCode0
Accelerating Simulation-based Inference with Emerging AI HardwareCode0
Evolving Neural Architecture Using One Shot ModelCode0
Evolutionary NAS with Gene Expression Programming of Cellular EncodingCode0
Communication-Efficient Graph Neural Networks with Probabilistic Neighborhood Expansion Analysis and CachingCode0
Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language BenchmarksCode0
Show:102550
← PrevPage 112 of 113Next →

No leaderboard results yet.