SOTAVerified

GPU

Papers

Showing 18011850 of 5629 papers

TitleStatusHype
Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution0
Small Language Models in the Real World: Insights from Industrial Text Classification0
RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation0
DeepCEE: Efficient Cross-Region Model Distributed Training System under Heterogeneous GPUs and Networks0
Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection0
Guidelines for the Quality Assessment of Energy-Aware NAS Benchmarks0
ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs0
UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache0
Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual SparsityCode0
Balanced and Elastic End-to-end Training of Dynamic LLMs0
4D-ROLLS: 4D Radar Occupancy Learning via LiDAR SupervisionCode0
TSPulse: Dual Space Tiny Pre-Trained Models for Rapid Time-Series Analysis0
FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference0
Half Search Space is All You Need0
Frozen Backpropagation: Relaxing Weight Symmetry in Temporally-Coded Deep Spiking Neural NetworksCode0
CALM: Co-evolution of Algorithms and Language Model for Automatic Heuristic Design0
A Case for Library-Level k-Means Binning in Histogram Gradient-Boosted TreesCode0
HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing0
ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates0
VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold0
HessFormer: Hessians at Foundation Scale0
Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity0
From Hand-Crafted Metrics to Evolved Training-Free Performance Predictors for Neural Architecture Search via Genetic Programming0
Entropy-Driven Genetic Optimization for Deep-Feature-Guided Low-Light Image EnhancementCode0
Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training0
Single-shot prediction of parametric partial differential equations0
Generative Molecular Design with Steerable and Granular Synthesizability Control0
Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles0
AI Accelerators for Large Language Model In-ference: Architecture Analysis and Scaling Strategies0
Fused3S: Fast Sparse Attention on Tensor CoresCode0
Private LoRA Fine-tuning of Open-Source LLMs with Homomorphic Encryption0
L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers0
Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains0
SLAG: Scalable Language-Augmented Gaussian Splatting0
On the Cost and Benefits of Training Context with Utterance or Full Conversation Training: A Comparative Stud0
Matrix Is All You Need0
Streaming Krylov-Accelerated Stochastic Gradient Descent0
QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration0
Challenging GPU Dominance: When CPUs Outperform for On-Device LLM Inference0
FloE: On-the-Fly MoE Inference on Memory-constrained GPU0
UltraGauss: Ultrafast Gaussian Reconstruction of 3D Ultrasound Volumes0
Boosting Performance on ARC is a Matter of Perspective0
Steepest Descent Density Control for Compact 3D Gaussian Splatting0
Supporting renewable energy planning and operation with data-driven high-resolution ensemble weather forecast0
Leveraging Simultaneous Usage of Edge GPU Hardware Engines for Video Face Detection and Recognition0
Plexus: Taming Billion-edge Graphs with 3D Parallel GNN Training0
Edge-GPU Based Face Tracking for Face Detection and Recognition Acceleration0
LONGER: Scaling Up Long Sequence Modeling in Industrial Recommenders0
AnomalyMatch: Discovering Rare Objects of Interest with Semi-supervised and Active LearningCode0
Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving0
Show:102550
← PrevPage 37 of 113Next →

No leaderboard results yet.