SOTAVerified

GPU

Papers

Showing 151-200 of 5629 papers

| Title | Status | Hype |
|---|---|---|
| ADGSyn: Dual-Stream Learning for Efficient Anticancer Drug Synergy Prediction | Code | 1 |
| Is Architectural Complexity Overrated? Competitive and Interpretable Knowledge Graph Completion with RelatE | | 0 |
| KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning | | 0 |
| GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning | | 0 |
| HD-PiSSA: High-Rank Distributed Orthogonal Adaptation | | 0 |
| Climate Implications of Diffusion-based Generative Visual AI Systems and their Mass Adoption | | 0 |
| VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning | Code | 3 |
| A DSP-Free Carrier Phase Recovery System using 16-Offset-QAM Laser Forwarded Links for 400Gb/s and Beyond | | 0 |
| Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models | Code | 0 |
| Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence | | 0 |
| Dynamic Risk Assessments for Offensive Cybersecurity Agents | Code | 0 |
| A deep solver for backward stochastic Volterra integral equations | Code | 0 |
| JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model | Code | 1 |
| FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design | Code | 0 |
| Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | | 0 |
| GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation | | 0 |
| QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design | Code | 2 |
| Training Long-Context LLMs Efficiently via Chunk-wise Optimization | Code | 2 |
| CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark | Code | 1 |
| LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols | | 0 |
| PICT -- A Differentiable, GPU-Accelerated Multi-Block PISO Solver for Simulation-Coupled Learning Tasks in Fluid Dynamics | Code | 1 |
| The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm | | 0 |
| Small Language Models in the Real World: Insights from Industrial Text Classification | | 0 |
| DeepCEE: Efficient Cross-Region Model Distributed Training System under Heterogeneous GPUs and Networks | | 0 |
| Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution | | 0 |
| Efficient Differentiable Approximation of Generalized Low-rank Regularization | Code | 0 |
| Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection | | 0 |
| Guidelines for the Quality Assessment of Energy-Aware NAS Benchmarks | | 0 |
| RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | | 0 |
| Quaff: Quantized Parameter-Efficient Fine-Tuning under Outlier Spatial Stability Hypothesis | Code | 1 |
| Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity | Code | 0 |
| Balanced and Elastic End-to-end Training of Dynamic LLMs | | 0 |
| UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models | Code | 2 |
| UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache | | 0 |
| Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | Code | 2 |
| ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs | | 0 |
| 4D-ROLLS: 4D Radar Occupancy Learning via LiDAR Supervision | Code | 0 |
| Multi-head Temporal Latent Attention | Code | 4 |
| Frozen Backpropagation: Relaxing Weight Symmetry in Temporally-Coded Deep Spiking Neural Networks | Code | 0 |
| Half Search Space is All You Need | | 0 |
| MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning | Code | 1 |
| Fine-tuning Quantized Neural Networks with Zeroth-order Optimization | Code | 1 |
| FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference | | 0 |
| TSPulse: Dual Space Tiny Pre-Trained Models for Rapid Time-Series Analysis | | 0 |
| CALM: Co-evolution of Algorithms and Language Model for Automatic Heuristic Design | | 0 |
| HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing | | 0 |
| ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates | | 0 |
| LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference | Code | 1 |
| VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold | | 0 |
| A Case for Library-Level k-Means Binning in Histogram Gradient-Boosted Trees | Code | 0 |

No leaderboard results yet.