SOTAVerified

GPU

Papers

Showing 11511200 of 5629 papers

TitleStatusHype
Efficient fine-tuning of 37-level GraphCast with the Canadian global deterministic analysisCode1
Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects0
Theoretical Proportion Label Perturbation for Learning from Label Proportions in Large BagsCode0
More Pictures Say More: Visual Intersection Network for Open Set Object Detection0
Quantum-Powered Personalized Learning0
Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection0
Batch-FPM: Random batch-update multi-parameter physical Fourier ptychography neural network0
HGNAS: Hardware-Aware Graph Neural Architecture Search for Edge Devices0
S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control PointsCode1
Energy-Efficient Spiking Recurrent Neural Network for Gesture Recognition on Embedded GPUs0
Exploiting Student Parallelism for Low-latency GPU Inference of BERT-like Models in Online Services0
PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators0
Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations0
Mixed Sparsity Training: Achieving 4 FLOP Reduction for Transformer Pretraining0
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language ModelsCode5
Vision HgNN: An Electron-Micrograph is Worth Hypergraph of Hypernodes0
Slicing Input Features to Accelerate Deep Learning: A Case Study with Graph Neural Networks0
EmbodiedSAM: Online Segment Any 3D Thing in Real TimeCode4
Practical Aspects on Solving Differential Equations Using Deep Learning: A PrimerCode0
deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural NetworksCode2
UKAN: Unbound Kolmogorov-Arnold Network Accompanied with Accelerated Library0
ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining0
Fine-Tuning a Local LLaMA-3 Large Language Model for Automated Privacy-Preserving Physician Letter Generation in Radiation Oncology0
EdgeNAT: Transformer for Efficient Edge DetectionCode1
HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language ModelsCode1
Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches0
Near, far: Patch-ordering enhances vision foundation models' scene understanding0
LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language ModelsCode0
Accelerating Goal-Conditioned RL Algorithms and ResearchCode3
Stream-Based Ground Segmentation for Real-Time LiDAR Point Cloud Processing on FPGA0
Characteristic Performance Study on Solving Oscillator ODEs via Soft-constrained Physics-informed Neural Network with Small DataCode0
MoDeGPT: Modular Decomposition for Large Language Model Compression0
Liquid Fourier Latent Dynamics Networks for fast GPU-based numerical simulations in computational cardiologyCode0
SSDTrain: An Activation Offloading Framework to SSDs for Faster Large Language Model Training0
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and CompetitionCode0
Demystifying the Communication Characteristics for Distributed Transformer Models0
Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs0
ELASTIC: Efficient Linear Attention for Sequential Interest Compression0
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language ModelsCode3
Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems0
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference0
Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method0
Bridging LLMs and KGs without Fine-Tuning: Intermediate Probing Meets Subgraph-Aware Entity Descriptions0
Breast-NET: a lightweight DCNN model for breast cancer detection and grading using histological samplesCode0
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at ScaleCode3
A Versatile Framework for Attributed Network Clustering via K-Nearest Neighbor AugmentationCode0
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond ScalingCode3
reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive LearningCode0
Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applicationsCode0
An Edge AI System Based on FPGA Platform for Railway Fault Detection0
Show:102550
← PrevPage 24 of 113Next →

No leaderboard results yet.