SOTAVerified

GPU

Papers

Showing 2501–2550 of 5629 papers

Title | Status | Hype
H-SGANet: Hybrid Sparse Graph Attention Network for Deformable Medical Image Registration |  | 0
microYOLO: Towards Single-Shot Object Detection on Microcontrollers |  | 0
3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability | Code | 0
Conan-embedding: General Text Embedding with More and Better Negative Samples |  | 0
SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search |  | 0
Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification |  | 0
More Pictures Say More: Visual Intersection Network for Open Set Object Detection |  | 0
Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects |  | 0
Theoretical Proportion Label Perturbation for Learning from Label Proportions in Large Bags | Code | 0
Quantum-Powered Personalized Learning |  | 0
Batch-FPM: Random batch-update multi-parameter physical Fourier ptychography neural network |  | 0
Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection |  | 0
Energy-Efficient Spiking Recurrent Neural Network for Gesture Recognition on Embedded GPUs |  | 0
HGNAS: Hardware-Aware Graph Neural Architecture Search for Edge Devices |  | 0
Exploiting Student Parallelism for Low-latency GPU Inference of BERT-like Models in Online Services |  | 0
PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators |  | 0
Vision HgNN: An Electron-Micrograph is Worth Hypergraph of Hypernodes |  | 0
Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations |  | 0
Slicing Input Features to Accelerate Deep Learning: A Case Study with Graph Neural Networks |  | 0
Practical Aspects on Solving Differential Equations Using Deep Learning: A Primer | Code | 0
Mixed Sparsity Training: Achieving 4× FLOP Reduction for Transformer Pretraining |  | 0
Near, far: Patch-ordering enhances vision foundation models' scene understanding |  | 0
LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models | Code | 0
UKAN: Unbound Kolmogorov-Arnold Network Accompanied with Accelerated Library |  | 0
ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining |  | 0
Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches |  | 0
Fine-Tuning a Local LLaMA-3 Large Language Model for Automated Privacy-Preserving Physician Letter Generation in Radiation Oncology |  | 0
Characteristic Performance Study on Solving Oscillator ODEs via Soft-constrained Physics-informed Neural Network with Small Data | Code | 0
Stream-Based Ground Segmentation for Real-Time LiDAR Point Cloud Processing on FPGA |  | 0
MoDeGPT: Modular Decomposition for Large Language Model Compression |  | 0
Demystifying the Communication Characteristics for Distributed Transformer Models |  | 0
SSDTrain: An Activation Offloading Framework to SSDs for Faster Large Language Model Training |  | 0
Liquid Fourier Latent Dynamics Networks for fast GPU-based numerical simulations in computational cardiology | Code | 0
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition | Code | 0
ELASTIC: Efficient Linear Attention for Sequential Interest Compression |  | 0
Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs |  | 0
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference |  | 0
Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems |  | 0
Bridging LLMs and KGs without Fine-Tuning: Intermediate Probing Meets Subgraph-Aware Entity Descriptions |  | 0
Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method |  | 0
Breast-NET: a lightweight DCNN model for breast cancer detection and grading using histological samples | Code | 0
A Versatile Framework for Attributed Network Clustering via K-Nearest Neighbor Augmentation | Code | 0
reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning | Code | 0
Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applications | Code | 0
An Edge AI System Based on FPGA Platform for Railway Fault Detection |  | 0
Understanding the Performance and Estimating the Cost of LLM Fine-Tuning | Code | 0
Design of a Quality Management System based on the EU Artificial Intelligence Act | Code | 0
Sparse Spiking Neural-like Membrane Systems on Graphics Processing Units | Code | 0
Optimization-Driven Adaptive Experimentation |  | 0
Arctic-TILT. Business Document Understanding at Sub-Billion Scale |  | 0
