SOTAVerified

GPU

Papers

Showing 1851–1900 of 5629 papers

Title | Status | Hype
EnergonAI: An Inference System for 10-100 Billion Parameter Transformer Models | | 0
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | | 0
EnTranNAS: Towards Closing the Gap between the Architectures in Search and Evaluation | | 0
Flexible Piecewise Curves Estimation for Photo Enhancement | | 0
End-to-end Transformer for Compressed Video Quality Enhancement | | 0
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | | 0
FleetX | | 0
end-to-end training of a large vocabulary end-to-end speech recognition system | | 0
EO-VLM: VLM-Guided Energy Overload Attacks on Vision Models | | 0
End-to-end Optimization of Machine Learning Prediction Queries | | 0
EPAM: A Predictive Energy Model for Mobile AI | | 0
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting | | 0
EPNAS: Efficient Progressive Neural Architecture Search | | 0
End-to-end Learning of Cost-Volume Aggregation for Real-time Dense Stereo | | 0
EQ-Net: A Unified Deep Learning Framework for Log-Likelihood Ratio Estimation and Quantization | | 0
Characterizing and Understanding HGNN Training on GPUs | | 0
CBQ: Cross-Block Quantization for Large Language Models | | 0
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA | | 0
ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo | | 0
ESNet: An Efficient Symmetric Network for Real-time Semantic Segmentation | | 0
End-to-End Learning for the Deep Multivariate Probit Model | | 0
End-to-End JPEG Decoding and Artifacts Suppression Using Heterogeneous Residual Convolutional Neural Network | | 0
An Investigation on Hardware-Aware Vision Transformer Scaling | | 0
Character-level Transformer-based Neural Machine Translation | | 0
Flexible and Scalable Deep Dendritic Spiking Neural Networks with Multiple Nonlinear Branching | | 0
Ansor: Generating High-Performance Tensor Programs for Deep Learning | | 0
FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs | | 0
Estimating the randomness of quantum circuit ensembles up to 50 qubits | | 0
End to End Brain Fiber Orientation Estimation using Deep Learning | | 0
Estimator-Coupled Reinforcement Learning for Robust Purely Tactile In-Hand Manipulation | | 0
Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection | | 0
ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera | | 0
Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs | | 0
Evaluating Neural Radiance Fields (NeRFs) for 3D Plant Geometry Reconstruction in Field Conditions | | 0
End-to-end Adaptive Distributed Training on PaddlePaddle | | 0
Evaluating Performance of an Adult Pornography Classifier for Child Sexual Abuse Detection | | 0
Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference | | 0
FlatCAD: Fast Curvature Regularization of Neural SDFs for CAD Models | | 0
Encoding Motion Primitives for Autonomous Vehicles using Virtual Velocity Constraints and Neural Network Scheduling | | 0
An Integrated Artificial Intelligence Operating System for Advanced Low-Altitude Aviation Applications | | 0
Enabling On-Device Smartphone GPU based Training: Lessons Learned | | 0
Ev-Edge: Efficient Execution of Event-based Vision Algorithms on Commodity Edge Platforms | | 0
Multi-tiling Neural Radiance Field (NeRF) -- Geometric Assessment on Large-scale Aerial Datasets | | 0
Eventprop training for efficient neuromorphic applications | | 0
Event Transformer+. A multi-purpose solution for efficient event data processing | | 0
Category-level Meta-learned NeRF Priors for Efficient Object Mapping | | 0
Accelerated Training on Low-Power Edge Devices | | 0
FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization | | 0
Everything Perturbed All at Once: Enabling Differentiable Graph Attacks | | 0
Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design | | 0