SOTAVerified

GPU

Papers

Showing 1501-1550 of 5629 papers

Title | Status | Hype
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | Code | 2
Parallelization of the K-Means Algorithm with Applications to Big Data Clustering | - | 0
Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging | Code | 1
Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries | Code | 2
MAMCA -- Optimal on Accuracy and Efficiency for Automatic Modulation Classification with Extended Signal Length | Code | 2
ENOVA: Autoscaling towards Cost-effective and Stable Serverless LLM Serving | - | 0
Specialising and Analysing Instruction-Tuned and Byte-Level Language Models for Organic Reaction Prediction | - | 0
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models | Code | 1
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining | - | 0
Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model | Code | 2
The Developing Human Connectome Project: A Fast Deep Learning-based Pipeline for Neonatal Cortical Surface Reconstruction | Code | 1
Computation-Aware Kalman Filtering and Smoothing | Code | 1
Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach | - | 0
Challenges in Deploying Long-Context Transformers: A Theoretical Peak Performance Analysis | - | 0
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding | Code | 1
Do Bayesian imaging methods report trustworthy probabilities? | - | 0
Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | - | 0
Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation | - | 0
Sparse Sampling is All You Need for Fast Wrong-way Cycling Detection in CCTV Videos | - | 0
NGD-SLAM: Towards Real-Time Dynamic SLAM without GPU | Code | 3
Differentiable Model Scaling using Differentiable Topk | Code | 1
Input Snapshots Fusion for Scalable Discrete Dynamic Graph Nerual Networks | - | 0
SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models | - | 0
Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | - | 0
Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection | - | 0
Mirage: A Multi-Level Superoptimizer for Tensor Programs | Code | 7
Preble: Efficient Distributed Prompt Scheduling for LLM Serving | Code | 2
You Only Cache Once: Decoder-Decoder Architectures for Language Models | Code | 0
Vidur: A Large-Scale Simulation Framework For LLM Inference | Code | 4
Open Implementation and Study of BEST-RQ for Speech Processing | - | 0
A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields | - | 0
Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression | - | 0
DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid | - | 0
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving | Code | 4
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention | Code | 3
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization | - | 0
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems | Code | 0
Neural Graphics Texture Compression Supporting Random Access | - | 0
QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation | - | 0
Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs | - | 0
Labeling supervised fine-tuning data with the scaling law | Code | 7
UniDEC : Unified Dual Encoder and Classifier Training for Extreme Multi-Label Classification | - | 0
Fast Algorithms for Spiking Neural Network Simulation with FPGAs | Code | 0
SoftMCL: Soft Momentum Contrastive Learning for Fine-grained Sentiment-aware Pre-training | Code | 0
Structural Pruning of Pre-trained Language Models via Neural Architecture Search | Code | 0
MTDT: A Multi-Task Deep Learning Digital Twin | - | 0
Self-Supervised Learning for Interventional Image Analytics: Towards Robust Device Trackers | - | 0
FeNNol: an Efficient and Flexible Library for Building Force-field-enhanced Neural Network Potentials | Code | 2
Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment | Code | 0
Addressing Diverging Training Costs using BEVRestore for High-resolution Bird's Eye View Map Construction | - | 0
Page 31 of 113

No leaderboard results yet.