SOTAVerified

GPU

Papers

Showing 26512700 of 5629 papers

TitleStatusHype
Fast and Cost-effective Speculative Edge-Cloud Decoding with Early Exits0
Combining Local and Global Pose Estimation for Precise Tracking of Similar Objects0
Fast and Accurate Poisson Denoising with Optimized Nonlinear Diffusion0
Fast and Accurate Point Cloud Registration using Trees of Gaussian Mixtures0
Fast and Accurate FSA System Using ELBERT: An Efficient and Lightweight BERT0
Combining Efficient and Precise Sign Language Recognition: Good pose estimation library is all you need0
Interpolation-Split: a data-centric deep learning approach with big interpolated data to boost airway segmentation performance0
Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method0
Fast 3D Acoustic Scattering via Discrete Laplacian Based Implicit Function Encoders0
FASP: Fast and Accurate Structured Pruning of Large Language Models0
Collaborative Texture Filtering0
FarSee-Net: Real-Time Semantic Segmentation by Efficient Multi-scale Context Aggregation and Feature Space Super-resolution0
FANNG: Fast Approximate Nearest Neighbour Graphs0
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs0
FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices0
Approximating High-Dimensional Minimal Surfaces with Physics-Informed Neural Networks0
A Data-Center FPGA Acceleration Platform for Convolutional Neural Networks0
3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization0
FairKV: Balancing Per-Head KV Cache for Fast Multi-GPU Inference0
Failure Tolerant Training with Persistent Memory Disaggregation over CXL0
Cognitively Inspired Energy-Based World Models0
FADNet++: Real-Time and Accurate Disparity Estimation with Configurable Networks0
COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework0
Approximate Caching for Efficiently Serving Diffusion Models0
Facial Expression Recognition at the Edge: CPU vs GPU vs VPU vs TPU0
FACETS: Efficient Once-for-all Object Detection via Constrained Iterative Search0
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs0
CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth0
ApproxDARTS: Differentiable Neural Architecture Search with Approximate Multipliers0
AdaptSR: Low-Rank Adaptation for Efficient and Scalable Real-World Super-Resolution0
Face Parsing via Recurrent Propagation0
Co-design of Embodied Neural Intelligence via Constrained Evolution0
Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs0
EZLDA: Efficient and Scalable LDA on GPUs0
EZClone: Improving DNN Model Extraction Attack via Shape Distillation from GPU Execution Profiles0
Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference0
Accelerating DNN Training through Selective Localized Learning0
Extreme Software Defined Radio -- GHz in Real Time0
COBRA: Cpu-Only aBdominal oRgan segmentAtion0
Extreme Classification in Log Memory0
Extensive networks would eliminate the demand for pricing formulas0
Coarse-to-Fine Searching for Efficient Generative Adversarial Networks0
Apply Distributed CNN on Genomics to accelerate Transcription-Factor TAL1 Motif Prediction0
Extensible and Efficient Proxy for Neural Architecture Search0
Coarformer: Transformer for large graph via graph coarsening0
Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network0
Extend the FFmpeg Framework to Analyze Media Content0
COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents0
Adaptive Periodic Averaging: A Practical Approach to Reducing Communication in Distributed Learning0
Extending Llama-3's Context Ten-Fold Overnight0
Show:102550
← PrevPage 54 of 113Next →

No leaderboard results yet.