SOTAVerified

GPU

Papers

Showing 3851-3875 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach | | 0 |
| QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects | | 0 |
| QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration | | 0 |
| QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation | | 0 |
| QuAILoRA: Quantization-Aware Initialization for LoRA | | 0 |
| Qualities, challenges and future of genetic algorithms: a literature review | | 0 |
| QuantEase: Optimization-based Quantization for Language Models | | 0 |
| Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control | | 0 |
| Quantized Neural Network Inference with Precision Batching | | 0 |
| QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache | | 0 |
| 10,000 km Straight-line Transmission using a Real-time Software-defined GPU-Based Receiver | | 0 |
| Quantum-Enhanced Support Vector Machine for Large-Scale Stellar Classification with GPU Acceleration | | 0 |
| Quantum-inspired tensor network for Earth science | | 0 |
| Quantum-Powered Personalized Learning | | 0 |
| Quantum Walks-Based Adaptive Distribution Generation with Efficient CUDA-Q Acceleration | | 0 |
| Query-focused Sentence Compression in Linear Time | | 0 |
| Query-focused Sentence Compression in Linear Time | | 0 |
| Query Processing on Tensor Computation Runtimes | | 0 |
| Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization | | 0 |
| RADARS: Memory Efficient Reinforcement Learning Aided Differentiable Neural Architecture Search | | 0 |
| RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation | | 0 |
| RAIN: Real-time Animation of Infinite Video Stream | | 0 |
| Ramanujan Bipartite Graph Products for Efficient Block Sparse Neural Networks | | 0 |
| Random 2.5D U-net for Fully 3D Segmentation | | 0 |
| Random Offset Block Embedding Array (ROBE) for CriteoTB Benchmark MLPerf DLRM Model: 1000× Compression and 3.1× Faster Inference | | 0 |

No leaderboard results yet.