SOTAVerified

GPU

Papers

Showing 2576-2600 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| Demystifying the MLPerf Benchmark Suite | | 0 |
| Demystifying the Communication Characteristics for Distributed Transformer Models | | 0 |
| Backpropagation Training for Fisher Vectors within Neural Networks | | 0 |
| PerfTracker: Online Performance Troubleshooting for Large-scale Model Training in Production | | 0 |
| InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU | | 0 |
| Demonstration of 3D ISAR Security Imaging at 24GHz with a Sparse MIMO Array | | 0 |
| Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers | | 0 |
| BackLink: Supervised Local Training with Backward Links | | 0 |
| Democracy of AI Numerical Weather Models: An Example of Global Forecasting with FourCastNetv2 Made by a University Research Lab Using GPU | | 0 |
| Demand Layering for Real-Time DNN Inference with Minimized Memory Usage | | 0 |
| AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content | | 0 |
| AI-assisted Automated Workflow for Real-time X-ray Ptychography Data Analysis via Federated Resources | | 0 |
| DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis | | 0 |
| aweSOM: a CPU/GPU-accelerated Self-organizing Map and Statistically Combined Ensemble Framework for Machine-learning Clustering Analysis | | 0 |
| DeGraF-Flow: Extending DeGraF Features for accurate and efficient sparse-to-dense optical flow estimation | | 0 |
| Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving | | 0 |
| AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks | | 0 |
| 6D Object Pose Estimation without PnP | | 0 |
| InfiniteHBD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers | | 0 |
| Input Reconstruction Attack against Vertical Federated Large Language Models | | 0 |
| Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline | | 0 |
| DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference | | 0 |
| Deformation Monitoring of Tunnel using Phase-based Motion Magnification and Optical Flow | | 0 |
| AI Accelerators for Large Language Model Inference: Architecture Analysis and Scaling Strategies | | 0 |
| A Variant of Concurrent Constraint Programming on GPU | | 0 |
