SOTAVerified

GPU

Papers

Showing 12011250 of 5629 papers

TitleStatusHype
Sparse Spiking Neural-like Membrane Systems on Graphics Processing UnitsCode0
Understanding the Performance and Estimating the Cost of LLM Fine-TuningCode0
Design of a Quality Management System based on the EU Artificial Intelligence ActCode0
Optimization-Driven Adaptive Experimentation0
Arctic-TILT. Business Document Understanding at Sub-Billion Scale0
Quantum Annealing based Power Grid Partitioning for Parallel Simulation0
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clustersCode2
PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training0
Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation0
Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware AccelerationCode1
Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient AdaptationCode1
Image-to-LaTeX Converter for Mathematical Formulas and TextCode1
L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization0
A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighborhood Search0
SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving0
VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking0
RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential RecommendersCode1
PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance0
Deep Patch Visual SLAMCode4
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPSCode4
FT K-means: A High-Performance K-means on GPU with Fault ToleranceCode0
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines0
Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-MakingCode1
Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research0
Data-Driven Traffic Simulation for an Intersection in a Metropolis0
Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative DrivingCode1
Towards Scalable GPU-Accelerated SNN Training via Temporal FusionCode0
Finch: Prompt-guided Key-Value Cache Compression0
GPU-based data processing for speeding-up correlation plenoptic imaging0
CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-TuningCode1
Toward Efficient Permutation for Hierarchical N:M Sparsity on GPUs0
Palu: Compressing KV-Cache with Low-Rank ProjectionCode2
NeuroSEM: A hybrid framework for simulating multiphysics problems by coupling PINNs and spectral elementsCode0
ThinK: Thinner Key Cache by Query-Driven Pruning0
OmniBal: Towards Fast Instruct-tuning for Vision-Language Models via Omniverse Computation BalanceCode1
Pruning Large Language Models with Semi-Structural Adaptive Sparse TrainingCode1
Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation0
SAPG: Split and Aggregate Policy Gradients0
ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development0
Practical Video Object Detection via Feature Selection and AggregationCode3
Simply Trainable Nearest Neighbour Machine Translation with GPU Inference0
Mini-batch Coresets for Memory-efficient Training of Large Language Models0
WindsorML: High-Fidelity Computational Fluid Dynamics Dataset For Automotive Aerodynamics0
NARVis: Neural Accelerated Rendering for Real-Time Scientific Point Cloud Visualization0
Textile Anomaly Detection: Evaluation of the State-of-the-Art for Automated Quality Inspection of Carpet0
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image PriorsCode2
HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline0
Keep the Cost Down: A Review on Methods to Optimize LLM' s KV-Cache ConsumptionCode0
SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention0
ESOD: Efficient Small Object Detection on High-Resolution ImagesCode2
Show:102550
← PrevPage 25 of 113Next →

No leaderboard results yet.