SOTAVerified

GPU

Papers

Showing 651700 of 5629 papers

TitleStatusHype
TimeRL: Efficient Deep Reinforcement Learning with Polyhedral Dependence Graphs0
Decentralized Diffusion Models0
iServe: An Intent-based Serving System for LLMs0
Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning0
asanAI: In-Browser, No-Code, Offline-First Machine Learning Toolkit0
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision TokenCode4
A GPU Implementation of Multi-Guiding Spark Fireworks Algorithm for Efficient Black-Box Neural Network OptimizationCode0
mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training0
The Artificial Scientist -- in-transit Machine Learning of Plasma Simulations0
Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments0
TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms0
LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA ImplementationsCode1
DeServe: Towards Affordable Offline LLM Inference via Decentralization0
RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging RadarCode1
The Race to Efficiency: A New Perspective on AI Scaling Laws0
Operator Learning for Reconstructing Flow Fields from Sparse Measurements: an Energy Transformer Approach0
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference ServingCode9
FED: Fast and Efficient Dataset Deduplication Framework with GPU AccelerationCode0
Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space ModelsCode1
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting0
DaCapo: Score Distillation as Stacked Bridge for Fast and High-quality 3D Editing0
Efficient Video Super-Resolution for Real-time Rendering with Decoupled G-buffer Guidance0
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction0
Building Vision Models upon Heat Conduction0
Breaking the Memory Barrier of Contrastive Loss via Tile-Based Strategy0
Dataset Distillation with Neural Characteristic Function: A Minmax PerspectiveCode3
Higher-Order Ratio Cycles for Fast and Globally Optimal Shape MatchingCode0
ICP: Immediate Compensation Pruning for Mid-to-high Sparsity0
Minimal Interaction Seperated Tuning: A New Paradigm for Visual Adaptation0
IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently0
AttriReBoost: A Gradient-Free Propagation Optimization Method for Cold Start Mitigation in Attribute Missing GraphsCode0
Adjoint sharding for very long context training of state space models0
Towards Sustainable Large Language Model Serving0
Lightweight G-YOLOv11: Advancing Efficient Fracture Detection in Pediatric Wrist X-raysCode1
Debunking the CUDA Myth Towards GPU-based AI Systems0
LTX-Video: Realtime Video Latent DiffusionCode9
FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs0
Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring0
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference OptimizationCode4
FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI0
IMSSA: Deploying modern state-space models on memristive in-memory compute hardware0
MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing0
Towards Ideal Temporal Graph Neural Networks: Evaluations and Conclusions after 10,000 GPU Hours0
Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation0
LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System0
Assessing Text Classification Methods for Cyberbullying Detection on Social Media Platforms0
Learning to Forget: Bayesian Time Series Forecasting using Recurrent Sparse Spectrum Signature Gaussian Processes0
Paleoinspired Vision: From Exploring Colour Vision Evolution to Inspiring Camera Design0
RAIN: Real-time Animation of Infinite Video Stream0
MBQ: Modality-Balanced Quantization for Large Vision-Language ModelsCode2
Show:102550
← PrevPage 14 of 113Next →

No leaderboard results yet.