GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 5629 papers

Title	Date	Tasks	Status	Hype
LoRA: Low-Rank Adaptation of Large Language Models	Jun 17, 2021	GPULanguage Modelling	CodeCode Available	2
LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search	Oct 24, 2024	ClusteringGPU	CodeCode Available	2
LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism	Apr 15, 2024	GPU	CodeCode Available	2
Accelerated Quality-Diversity through Massive Parallelism	Feb 2, 2022	DiversityGPU	CodeCode Available	2
LoQT: Low-Rank Adapters for Quantized Pretraining	May 26, 2024	GPULanguage Modeling	CodeCode Available	2
Low-Rank Quantization-Aware Training for LLMs	Jun 10, 2024	GPUparameter-efficient fine-tuning	CodeCode Available	2
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models	Aug 31, 2024	8kGPU	CodeCode Available	2
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models	Mar 28, 2025	GPUGSM8K	CodeCode Available	2
Low-resource finetuning of foundation models beats state-of-the-art in histopathology	Jan 9, 2024	GPUSelf-Supervised Learning	CodeCode Available	2
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models	Mar 4, 2022	DecoderGPU	CodeCode Available	2
DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training	Oct 5, 2023	GPU	CodeCode Available	2
LightSeq2: Accelerated Training for Transformer-based Models on GPUs	Oct 12, 2021	DecoderGPU	CodeCode Available	2
Cross-domain Neural Pitch and Periodicity Estimation	Jan 28, 2023	CPUGPU	CodeCode Available	2
LightSeq: A High Performance Inference Library for Transformers	Oct 23, 2020	GPUMachine Translation	CodeCode Available	2
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space	Feb 7, 2024	Concept AlignmentGPU	CodeCode Available	2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning	Jun 23, 2025	GPULarge Language Model	CodeCode Available	2
360MonoDepth: High-Resolution 360deg Monocular Depth Estimation	Jan 1, 2022	2kDepth Estimation	CodeCode Available	2
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning	Sep 24, 2021	Deep Reinforcement LearningGPU	CodeCode Available	2
A Case Study in CUDA Kernel Fusion: Implementing FlashAttention-2 on NVIDIA Hopper Architecture using the CUTLASS Library	Dec 19, 2023	GPU	CodeCode Available	2
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models	Jun 27, 2023	Automated Theorem ProvingGPU	CodeCode Available	2
Latent Neural Operator for Solving Forward and Inverse PDE Problems	Jun 6, 2024	Computational EfficiencyGPU	CodeCode Available	2
Learning to Fly in Seconds	Nov 22, 2023	GPUReinforcement Learning (RL)	CodeCode Available	2
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization	Mar 11, 2025	GPUImage Generation	CodeCode Available	2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation	Feb 21, 2025	Audio GenerationFAD	CodeCode Available	2
JAX, M.D.: A Framework for Differentiable Physics	Dec 9, 2019	Drug DiscoveryGPU	CodeCode Available	2
JAX MD: A Framework for Differentiable Physics	Dec 1, 2020	GPU	CodeCode Available	2
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model	May 11, 2023	DenoisingGPU	CodeCode Available	2
CoMoSVC: Consistency Model-based Singing Voice Conversion	Jan 3, 2024	GPUmodel	CodeCode Available	2
2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object Detection	Jun 16, 2021	2D Object DetectionAutonomous Driving	CodeCode Available	2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing	Jan 29, 2024	GPURepresentation Learning	CodeCode Available	2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX	Nov 16, 2023	CPUGPU	CodeCode Available	2
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation	Oct 16, 2023	GPUImage Animation	CodeCode Available	2
Instant Volumetric Head Avatars	Nov 22, 2022	Face ModelGPU	CodeCode Available	2
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way	Dec 1, 2023	GPUparameter-efficient fine-tuning	CodeCode Available	2
2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D Object Detection	Jun 16, 2021	2D Object DetectionAutonomous Driving	CodeCode Available	2
INT-FlashAttention: Enabling Flash Attention for INT8 Quantization	Sep 25, 2024	GPUQuantization	CodeCode Available	2
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient	Nov 26, 2024	GPUImage Generation	CodeCode Available	2
CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra	Sep 6, 2023	CoLAGaussian Processes	CodeCode Available	2
ImMesh: An Immediate LiDAR Localization and Meshing Framework	Jan 12, 2023	CPUDimensionality Reduction	CodeCode Available	2
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval	Jul 10, 2023	GPUInformation Retrieval	CodeCode Available	2
Invertible Diffusion Models for Compressed Sensing	Mar 25, 2024	compressed sensingGPU	CodeCode Available	2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors	Jul 26, 2024	Depth EstimationGPU	CodeCode Available	2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference	Apr 8, 2025	CPUGPU	CodeCode Available	2
HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation	Apr 27, 2022	Domain AdaptationGPU	CodeCode Available	2
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection	Feb 2, 2022	Audio ClassificationEvent Detection	CodeCode Available	2
I-BERT: Integer-only BERT Quantization	Jan 5, 2021	GPUNatural Language Inference	CodeCode Available	2
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis	Apr 29, 2024	CPUEdge-computing	CodeCode Available	2
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning	Oct 24, 2022	GPUSelf-Supervised Learning	CodeCode Available	2
Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes	Oct 12, 2023	GPUNovel View Synthesis	CodeCode Available	2
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning	Aug 24, 2021	CPUGPU	CodeCode Available	2

Show:10 25 50

← PrevPage 7 of 113Next →

No leaderboard results yet.