SOTAVerified

GPU

Papers

Showing 32013250 of 5629 papers

TitleStatusHype
Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference0
FlashMask: Efficient and Rich Mask Extension of FlashAttention0
FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation0
FlatCAD: Fast Curvature Regularization of Neural SDFs for CAD Models0
FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization0
FleetX0
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting0
Flexible and Scalable Deep Dendritic Spiking Neural Networks with Multiple Nonlinear Branching0
Flexible Channel Dimensions for Differentiable Architecture Search0
Flexible Piecewise Curves Estimation for Photo Enhancement0
Flexible Techniques for Differentiable Rendering with 3D Gaussians0
FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs0
FL-MISR: Fast Large-Scale Multi-Image Super-Resolution for Computed Tomography Based on Multi-GPU Acceleration0
FloE: On-the-Fly MoE Inference on Memory-constrained GPU0
Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction0
FLOPs as a Direct Optimization Objective for Learning Sparse Neural Networks0
FlowIBR: Leveraging Pre-Training for Efficient Neural Image-Based Rendering of Dynamic Scenes0
FlowR: Flowing from Sparse to Dense 3D Reconstructions0
FMAS: Fast Multi-Objective SuperNet Architecture Search for Semantic Segmentation0
fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving0
FNAS: Uncertainty-Aware Fast Neural Architecture Search0
Focal-PETR: Embracing Foreground for Efficient Multi-Camera 3D Object Detection0
Focus: Querying Large Video Datasets with Low Latency and Low Cost0
Folding@home: achievements from over twenty years of citizen science herald the exascale era0
Foreground object segmentation in RGB-D data implemented on GPU0
Forensic Video Analytic Software0
Formulations and scalability of neural network surrogates in nonlinear optimization problems0
Fourier or Wavelet bases as counterpart self-attention in spikformer for efficient visual classification0
Foveated image processing for faster object detection and recognition in embedded systems using deep convolutional neural networks0
FPGA-Accelerated SpeckleNN with SNL for Real-time X-ray Single-Particle Imaging0
FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI0
FPGA Based Implementation of Deep Neural Networks Using On-chip Memory Only0
FPGA-based Neural Network Accelerator for Millimeter-Wave Radio-over-Fiber Systems0
fpgaConvNet: A Toolflow for Mapping Diverse Convolutional Neural Networks on Embedded FPGAs0
FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications0
Fractional-order Jacobian Matrix Differentiation and Its Application in Artificial Neural Networks0
FRDet: Balanced and Lightweight Object Detector based on Fire-Residual Modules for Embedded Processor of Autonomous Driving0
FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference0
FreeRide: Harvesting Bubbles in Pipeline Parallelism0
Fried Parameter Estimation from Single Wavefront Sensor Image with Artificial Neural Networks0
From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems0
From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars0
From Hand-Crafted Metrics to Evolved Training-Free Performance Predictors for Neural Architecture Search via Genetic Programming0
From Point Clouds to Mesh Using Regression0
From Research to Production and Back: Ludicrously Fast Neural Machine Translation0
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models0
From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference0
FROST: Towards Energy-efficient AI-on-5G Platforms -- A GPU Power Capping Evaluation0
Frozen Layers: Memory-efficient Many-fidelity Hyperparameter Optimization0
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models0
Show:102550
← PrevPage 65 of 113Next →

No leaderboard results yet.