SOTAVerified

GPU

Papers

Showing 501550 of 5629 papers

TitleStatusHype
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMsCode2
TransVOD: End-to-End Video Object Detection with Spatial-Temporal TransformersCode2
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM InferenceCode2
Collaborative Decoding Makes Visual Auto-Regressive Modeling EfficientCode2
Habitat: A Platform for Embodied AI ResearchCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised LearningCode2
Learning to Fly in SecondsCode2
AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space modelsCode2
Gradient Boosting Reinforcement LearningCode2
Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion TransformersCode2
GPU Performance Portability needs AutotuningCode2
Characterization of Large Language Model Development in the DatacenterCode2
GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric CalibrationCode2
GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View DiffusionCode2
GS^3: Efficient Relighting with Triple Gaussian SplattingCode2
Geomstats: A Python Package for Riemannian Geometry in Machine LearningCode2
geomstats: a Python Package for Riemannian Geometry in Machine LearningCode2
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and MemoryCode2
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One StepCode2
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian GenerationCode2
GPflow: A Gaussian process library using TensorFlowCode2
gCastle: A Python Toolbox for Causal DiscoveryCode2
CaRL: Learning Scalable Planning Policies with Simple RewardsCode2
Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-ResolutionCode2
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic ParallelismCode2
Full Parameter Fine-tuning for Large Language Models with Limited ResourcesCode2
Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging GeometriesCode2
Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUsCode2
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstructionCode2
H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language ModelsCode2
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length InputsCode2
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content GenerationCode2
Brain Tumour Removing and Missing Modality Generation using 3D WDMCode2
FlashRNN: Optimizing Traditional RNNs on Modern HardwareCode2
Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified FrameworkCode2
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid ManipulationCode2
FP8-LM: Training FP8 Large Language ModelsCode2
BMInf: An Efficient Toolkit for Big Model Inference and TuningCode2
Black-Box Prompt Optimization: Aligning Large Language Models without Model TrainingCode2
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV CacheCode2
Boundary-Aware Segmentation Network for Mobile and Web ApplicationsCode2
Birbal: An efficient 7B instruct-model fine-tuned with curated datasetsCode2
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured SparsityCode2
Positive-Unlabeled Compression on the CloudCode2
FRA-RIR: Fast Random Approximation of the Image-source MethodCode2
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative FinanceCode2
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion ModelsCode2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion InferenceCode2
Show:102550
← PrevPage 11 of 113Next →

No leaderboard results yet.