SOTAVerified

GPU

Papers

Showing 701750 of 5629 papers

TitleStatusHype
DeepSeek-V3 Technical ReportCode16
MBQ: Modality-Balanced Quantization for Large Vision-Language ModelsCode2
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference0
GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural NetworkCode1
KunServe: Efficient Parameter-centric Memory Management for LLM Serving0
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference0
Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition0
Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain TestingCode1
Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling0
Power- and Fragmentation-aware Online Scheduling for GPU DatacentersCode0
CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction0
Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry LocalityCode1
Lillama: Large Language Models Compression via Low-Rank Feature Distillation0
Less is More: Towards Green Code Large Language Models via Unified Structural Pruning0
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers UpCode3
WebLLM: A High-Performance In-Browser LLM Inference EngineCode11
MUSTER: Longitudinal Deformable Registration by Composition of Consecutive DeformationsCode0
Taming the Memory Beast: Strategies for Reliable ML Training on Kubernetes0
IDOL: Instant Photorealistic 3D Human Creation from a Single Image0
HashAttention: Semantic Sparsity for Faster Inference0
DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation0
SqueezeMe: Efficient Gaussian Avatars for VR0
Channel Merging: Preserving Specialization for Merged Experts0
Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection0
SocialED: A Python Library for Social Event DetectionCode4
Language verY Rare for All0
Crabs: Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box SettingsCode1
ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecastingCode2
What is YOLOv6? A Deep Insight into the Object Detection Model0
Echo: Simulating Distributed Training At Scale0
Three Things to Know about Deep Metric Learning0
Exploring AI-Enabled Cybersecurity Frameworks: Deep-Learning Techniques, GPU Support, and Future Enhancements0
DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE InferenceCode0
Accelerating Sparse Graph Neural Networks with Tensor Core Optimization0
FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation0
What Matters in Learning A Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study0
Formulations and scalability of neural network surrogates in nonlinear optimization problems0
Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning0
GS-ProCams: Gaussian Splatting-based Projector-Camera Systems0
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation0
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian SplattingCode3
Dynamic Graph Attention Networks for Travel Time Distribution Prediction in Urban Arterial Roads0
NITRO: LLM Inference on Intel Laptop NPUsCode1
Light-T2M: A Lightweight and Fast Model for Text-to-motion GenerationCode1
Advancing Vehicle Plate Recognition: Multitasking Visual Language Models with VehiclePaliGemma0
KVDirect: Distributed Disaggregated LLM Inference0
HashEvict: A Pre-Attention KV Cache Eviction Strategy using Locality-Sensitive Hashing0
Real-time Identity Defenses against Malicious Personalization of Diffusion ModelsCode1
SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians0
Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories0
Show:102550
← PrevPage 15 of 113Next →

No leaderboard results yet.