SOTAVerified

GPU

Papers

Showing 26-50 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| Omniwise: Predicting GPU Kernels Performance with LLMs | | 0 |
| GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization | | 0 |
| Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking | Code | 1 |
| Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch | Code | 1 |
| DuoGPT: Training-free Dual Sparsity through Activation-aware Pruning in LLMs | | 0 |
| Scaling Speculative Decoding with Lookahead Reasoning | Code | 0 |
| MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models | Code | 2 |
| Virtual Memory for 3D Gaussian Splatting | | 0 |
| PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning | Code | 2 |
| DIP: Unsupervised Dense In-Context Post-training of Visual Representations | Code | 1 |
| Efficient and Generalizable Speaker Diarization via Structured Pruning of Self-Supervised Models | Code | 3 |
| Let Your Video Listen to Your Music! | | 0 |
| Survey of HPC in US Research Institutions | | 0 |
| 4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time | | 0 |
| CommVQ: Commutative Vector Quantization for KV Cache Compression | Code | 1 |
| TDACloud: Point Cloud Recognition Using Topological Data Analysis | | 0 |
| Lightweight RGB-T Tracking with Mobile Vision Transformers | | 0 |
| Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Code | 2 |
| ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Code | 3 |
| Collaborative Texture Filtering | | 0 |
| ConsumerBench: Benchmarking Generative AI Applications on End-User Devices | Code | 1 |
| VeriLocc: End-to-End Cross-Architecture Register Allocation via LLM | | 0 |
| Beyond Blur: A Fluid Perspective on Generative Diffusion Models | | 0 |
| Speeding up Local Optimization in Vehicle Routing with Tensor-based GPU Acceleration | | 0 |
| TrainVerify: Equivalence-Based Verification for Distributed LLM Training | | 0 |

No leaderboard results yet.