SOTAVerified

GPU

Papers

Showing 401–450 of 5629 papers

Title | Status | Hype
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning | Code | 2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Code | 2
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Code | 2
DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome | Code | 2
Instant Volumetric Head Avatars | Code | 2
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Code | 2
INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Code | 2
deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural Networks | Code | 2
Invertible Diffusion Models for Compressed Sensing | Code | 2
JAX MD: A Framework for Differentiable Physics | Code | 2
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs | Code | 2
Accelerating Transformer Pre-training with 2:4 Sparsity | Code | 2
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval | Code | 2
Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes | Code | 2
I-BERT: Integer-only BERT Quantization | Code | 2
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs | Code | 2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Code | 2
ImMesh: An Immediate LiDAR Localization and Meshing Framework | Code | 2
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control | Code | 2
JAX, M.D.: A Framework for Differentiable Physics | Code | 2
HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation | Code | 2
Datasets and Benchmarks for Offline Safe Reinforcement Learning | Code | 2
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning | Code | 2
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Code | 2
CaRL: Learning Scalable Planning Policies with Simple Rewards | Code | 2
Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds | Code | 2
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Code | 2
OctFormer: Octree-based Transformers for 3D Point Clouds | Code | 2
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control | Code | 2
OmniGS: Fast Radiance Field Reconstruction using Omnidirectional Gaussian Splatting | Code | 2
Forecasting GPU Performance for Deep Learning Training and Inference | Code | 2
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis | Code | 2
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Code | 2
Habitat: A Platform for Embodied AI Research | Code | 2
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration | Code | 2
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding | Code | 2
Habitat 2.0: Training Home Assistants to Rearrange their Habitat | Code | 2
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram | Code | 2
A User's Guide to KSig: GPU-Accelerated Computation of the Signature Kernel | Code | 2
GS^3: Efficient Relighting with Triple Gaussian Splatting | Code | 2
H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models | Code | 2
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference | Code | 2
Accelerating Sparse Deep Neural Networks | Code | 2
AutoFocus: Efficient Multi-Scale Inference | Code | 2
Deep Snake for Real-Time Instance Segmentation | Code | 2
PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning | Code | 2
Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | Code | 2
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Code | 2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors | Code | 2
GPU Performance Portability needs Autotuning | Code | 2
