SOTAVerified

GPU

Papers

Showing 351-400 of 5629 papers

Title | Status | Hype
Datasets and Benchmarks for Offline Safe Reinforcement Learning | Code | 2
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization | Code | 2
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space | Code | 2
deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural Networks | Code | 2
LightSeq2: Accelerated Training for Transformer-based Models on GPUs | Code | 2
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs | Code | 2
Latent Neural Operator for Solving Forward and Inverse PDE Problems | Code | 2
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models | Code | 2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation | Code | 2
cuSLINK: Single-linkage Agglomerative Clustering on the GPU | Code | 2
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation | Code | 2
Learning to Fly in Seconds | Code | 2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Code | 2
JAX MD: A Framework for Differentiable Physics | Code | 2
MODNet: Real-Time Trimap-Free Portrait Matting via Objective Decomposition | Code | 2
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs | Code | 2
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models | Code | 2
JAX, M.D.: A Framework for Differentiable Physics | Code | 2
INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Code | 2
Invertible Diffusion Models for Compressed Sensing | Code | 2
Forecasting GPU Performance for Deep Learning Training and Inference | Code | 2
Instant Volumetric Head Avatars | Code | 2
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning | Code | 2
DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training | Code | 2
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning | Code | 2
LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism | Code | 2
ImMesh: An Immediate LiDAR Localization and Meshing Framework | Code | 2
I-BERT: Integer-only BERT Quantization | Code | 2
Cross-domain Neural Pitch and Periodicity Estimation | Code | 2
Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes | Code | 2
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Code | 2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors | Code | 2
AutoFocus: Efficient Multi-Scale Inference | Code | 2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing | Code | 2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Code | 2
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval | Code | 2
Accelerating Transformer Pre-training with 2:4 Sparsity | Code | 2
A User's Guide to KSig: GPU-Accelerated Computation of the Signature Kernel | Code | 2
HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation | Code | 2
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models | Code | 2
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Code | 2
CrypTen: Secure Multi-Party Computation Meets Machine Learning | Code | 2
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning | Code | 2
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Code | 2
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Code | 2
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference | Code | 2
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Code | 2
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis | Code | 2
Habitat: A Platform for Embodied AI Research | Code | 2
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis | Code | 2

No leaderboard results yet.