SOTAVerified

GPU

Papers

Showing 23512400 of 5629 papers

TitleStatusHype
Processing Energy Modeling for Neural Network Based Image Compression0
Rapid-INR: Storage Efficient CPU-free DNN Training Using Implicit Neural RepresentationCode0
Separable Physics-Informed Neural Networks0
DenseBAM-GI: Attention Augmented DeneseNet with momentum aided GRU for HMER0
Accelerating Transducers through Adjacent Token Merging0
Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage AccessesCode1
cuSLINK: Single-linkage Agglomerative Clustering on the GPUCode2
Reduce Computational Complexity for Convolutional Layers by Skipping Zeros0
LeanDojo: Theorem Proving with Retrieval-Augmented Language ModelsCode2
Physics-inspired spatiotemporal-graph AI ensemble for the detection of higher order wave mode signals of spinning binary black hole mergersCode0
A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms0
DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species GenomeCode2
Fauno: The Italian Large Language Model that will leave you senza parole!Code1
Faster Segment Anything: Towards Lightweight SAM for Mobile ApplicationsCode5
Im2win: An Efficient Convolution Paradigm on GPUCode1
H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language ModelsCode2
Computron: Serving Distributed Deep Learning Models with Model Parallel SwappingCode0
Scaling MLPs: A Tale of Inductive BiasCode1
Implementing contextual biasing in GPU decoder for online ASRCode1
BatchGNN: Efficient CPU-Based Distributed GNN Training on Very Large Graphs0
Accelerating SNN Training with Stochastic Parallelizable Spiking NeuronsCode1
Slimmable Encoders for Flexible Split DNNs in Bandwidth and Resource Constrained IoT Systems0
FFCV: Accelerating Training by Removing Data BottlenecksCode4
Low Latency Edge Classification GNN for Particle Trajectory Tracking on FPGAs0
Visual Analysis of Large Multi-Field AMR Data on GPUs Using Interactive Volume Lines0
Dynamic Perceiver for Efficient Visual RecognitionCode1
RoMe: Towards Large Scale Road Surface Reconstruction via Mesh RepresentationCode2
GPU-Accelerated Verification of Machine Learning Models for Power Systems0
Air Traffic Management Using a GPU-Accelerated Genetic AlgorithmCode0
Efficient HDR Reconstruction from Real-World Raw Images0
Runtime Construction of Large-Scale Spiking Neuronal Network Models on GPU Devices0
Implementation of Real-Time Automotive SAR Imaging0
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training0
Full Parameter Fine-tuning for Large Language Models with Limited ResourcesCode2
Deformation Monitoring of Tunnel using Phase-based Motion Magnification and Optical Flow0
Evaluation and Optimization of Gradient Compression for Distributed Deep LearningCode1
FFB: A Fair Fairness Benchmark for In-Processing Group Fairness MethodsCode1
Datasets and Benchmarks for Offline Safe Reinforcement LearningCode2
Generate to Understand for RepresentationCode1
Expanding Versatility of Agile Locomotion through Policy Transitions Using Latent State Representation0
TAPIR: Tracking Any Point with per-frame Initialization and temporal RefinementCode3
Efficient 3D Semantic Segmentation with Superpoint TransformerCode2
Flexible Channel Dimensions for Differentiable Architecture Search0
SqueezeLLM: Dense-and-Sparse QuantizationCode6
Practice with Graph-based ANN Algorithms on Sparse Data: Chi-square Two-tower model, HNSW, Sign Cauchy Projections0
Towards a Machine-Learned Poisson Solver for Low-Temperature Plasma Simulations in Complex Geometries0
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-SecondCode1
Slicing Unbalanced Optimal TransportCode0
Resource Efficient Neural Networks Using Hessian Based Pruning0
Polyhedral Complex Extraction from ReLU Networks using Edge SubdivisionCode0
Show:102550
← PrevPage 48 of 113Next →

No leaderboard results yet.