SOTAVerified

GPU

Papers

Showing 251–300 of 5629 papers

Title | Status | Hype
VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion | Code | 3
Cramming: Training a Language Model on a Single GPU in One Day | Code | 3
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts | Code | 3
What Language Model to Train if You Have One Million GPU Hours? | Code | 3
A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation Models | Code | 3
PyTorch Image Quality: Metrics for Image Quality Assessment | Code | 3
USB: A Unified Semi-supervised Learning Benchmark for Classification | Code | 3
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech | Code | 3
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters | Code | 3
Fast Sampling of Diffusion Models with Exponential Integrator | Code | 3
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates | Code | 3
Robust High-Resolution Video Matting with Temporal Guidance | Code | 3
Real-Time High-Resolution Background Matting | Code | 3
Biomedical and Clinical English Model Packages in the Stanza Python NLP Library | Code | 3
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection | Code | 3
U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection | Code | 3
Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence | Code | 3
mlpack 3: a fast, flexible machine learning library | Code | 3
Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded Modes | Code | 3
U-Net: Convolutional Networks for Biomedical Image Segmentation | Code | 3
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs | Code | 2
any4: Learned 4-bit Numeric Representation for LLMs | Code | 2
MathOptAI.jl: Embed trained machine learning predictors into JuMP models | Code | 2
MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation | Code | 2
VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions | Code | 2
MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models | Code | 2
PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning | Code | 2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Code | 2
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning | Code | 2
Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos | Code | 2
ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS | Code | 2
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design | Code | 2
Training Long-Context LLMs Efficiently via Chunk-wise Optimization | Code | 2
Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | Code | 2
UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models | Code | 2
VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality | Code | 2
GPU Performance Portability needs Autotuning | Code | 2
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction | Code | 2
CaRL: Learning Scalable Planning Policies with Simple Rewards | Code | 2
SG-Reg: Generalizable and Efficient Scene Graph Registration | Code | 2
Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation | Code | 2
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images | Code | 2
TorchFX: A modern approach to Audio DSP with PyTorch and GPU acceleration | Code | 2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Code | 2
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors | Code | 2
GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric Calibration | Code | 2
Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation | Code | 2
THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models | Code | 2
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning | Code | 2
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models | Code | 2
