SOTAVerified

GPU

Papers

Showing 23012350 of 5629 papers

TitleStatusHype
Integrating Homomorphic Encryption and Trusted Execution Technology for Autonomous and Confidential Model Refining in Cloud0
DiviML: A Module-based Heuristic for Mapping Neural Networks onto Heterogeneous PlatformsCode0
LaFiCMIL: Rethinking Large File Classification from the Perspective of Correlated Multiple Instance Learning0
Interpolation-Split: a data-centric deep learning approach with big interpolated data to boost airway segmentation performance0
MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image CompressionCode1
Detection of Children Abuse by Voice and Audio Classification by Short-Time Fourier Transform Machine Learning implemented on Nvidia Edge GPU device0
To Adapt or Not to Adapt? Real-Time Adaptation for Semantic SegmentationCode1
Guaranteed Approximation Bounds for Mixed-Precision Neural OperatorsCode4
Benchmarking Performance of Deep Learning Model for Material Segmentation on Two HPC Systems0
EasyNet: An Easy Network for 3D Industrial Anomaly Detection0
YOLOBench: Benchmarking Efficient Object Detectors on Embedded SystemsCode0
Implementing and Benchmarking the Locally Competitive Algorithm on the Loihi 2 Neuromorphic Processor0
Multi-GPU Approach for Training of Graph ML Models on large CFD Meshes0
Duet: efficient and scalable hybriD neUral rElation undersTandingCode0
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation0
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel SimulationCode0
Tackling the Curse of Dimensionality with Physics-Informed Neural NetworksCode1
Rail-only: A Low-Cost High-Performance Network for Training LLMs with Trillion Parameters0
Character Time-series Matching For Robust License Plate RecognitionCode1
TwinLiteNet: An Efficient and Lightweight Model for Driveable Area and Lane Segmentation in Self-Driving CarsCode1
Radar-STDA: A High-Performance Spatial-Temporal Denoising Autoencoder for Interference Mitigation of FMCW RadarsCode1
Systematic comparison of semi-supervised and self-supervised learning for medical image classificationCode1
Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection0
Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate GradientsCode0
Retentive Network: A Successor to Transformer for Large Language ModelsCode3
FlashAttention-2: Faster Attention with Better Parallelism and Work PartitioningCode6
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and CalibrationCode0
Adaptively Placed Multi-Grid Scene Representation Networks for Large-Scale Data VisualizationCode1
MaGNAS: A Mapping-Aware Graph Neural Architecture Search Framework for Heterogeneous MPSoC Deployment0
DistTGL: Distributed Memory-Based Temporal Graph Neural Network Training0
CoTracker: It is Better to Track TogetherCode4
Machine-learned molecular mechanics force field for the simulation of protein-ligand systems and beyondCode2
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image ModelsCode1
In-context Autoencoder for Context Compression in a Large Language ModelCode1
Differentiable Forward Projector for X-ray Computed TomographyCode2
PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR0
ReLoRA: High-Rank Training Through Low-Rank UpdatesCode5
Fast Neural Network Inference on FPGAs for Triggering on Long-Lived Particles at Colliders0
Miriam: Exploiting Elastic Kernels for Real-time Multi-DNN Inference on Edge GPU0
Weakly-supervised positional contrastive learning: application to cirrhosis classificationCode1
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information RetrievalCode2
Carbon-Efficient Neural Architecture Search0
GP-guided MPPI for Efficient Navigation in Complex Unknown Cluttered EnvironmentsCode1
Improving Automatic Parallel Training via Balanced Memory Workload Optimization0
Improving Address Matching using Siamese Transformer NetworksCode0
Unbalanced Optimal Transport: A Unified Framework for Object DetectionCode1
Neural Fields for Interactive Visualization of Statistical Dependencies in 3D Simulation Ensembles0
GHOST: A Graph Neural Network Accelerator using Silicon Photonics0
TinySiamese Network for Biometric Analysis0
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning0
Show:102550
← PrevPage 47 of 113Next →

No leaderboard results yet.