SOTAVerified

GPU

Papers

Showing 13511400 of 5629 papers

TitleStatusHype
Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D ScenesCode1
Mooncake: A KVCache-centric Disaggregated Architecture for LLM ServingCode7
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism0
Video-Infinity: Distributed Long Video Generation0
MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary NetworkCode0
Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGACode0
LaneSegNet Design Study0
MoA: Mixture of Sparse Attention for Automatic Large Language Model CompressionCode2
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian GenerationCode2
Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGACode1
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-TuningCode0
Consistency Models Made EasyCode3
UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture0
CE-SSL: Computation-Efficient Semi-Supervised Learning for ECG-based Cardiovascular Diseases DetectionCode1
GPU-Accelerated DCOPF using Gradient-Based OptimizationCode0
VisualRWKV: Exploring Recurrent Neural Networks for Visual Language ModelsCode3
Sparse High Rank Adapters0
Under the Hood of Tabular Data Generation Models: Benchmarks with Extensive Tuning0
LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional AdaptationCode1
MCSD: An Efficient Language Model with Diverse Fusion0
Contraction rates for conjugate gradient and Lanczos approximate posteriors in Gaussian process regression0
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead0
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction NetworkCode0
Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference0
Duoduo CLIP: Efficient 3D Understanding with Multi-View ImagesCode2
VideoLLM-online: Online Video Large Language Model for Streaming Video0
What Operations can be Performed Directly on Compressed Arrays, and with What Error?0
Optimized Speculative Sampling for GPU Hardware AcceleratorsCode0
CancerLLM: A Large Language Model in Cancer Domain0
Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient0
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & LocalizationCode3
A GPU-accelerated Large-scale Simulator for Transportation System Optimization BenchmarkingCode1
GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View DiffusionCode2
Deep Symbolic Optimization for Combinatorial Optimization: Accelerating Node Selection by Discovering Potential HeuristicsCode0
PixRO: Pixel-Distributed Rotational Odometry with Gaussian Belief Propagation0
A Training-free Sub-quadratic Cost Transformer Model Serving Framework With Hierarchically Pruned Attention0
Practical offloading for fine-tuning LLM on commodity GPU via learned sparse projectorsCode0
Coralai: Intrinsic Evolution of Embodied Neural Cellular Automata EcosystemsCode1
Cognitively Inspired Energy-Based World Models0
Optimal Kernel Orchestration for Tensor Programs with KorchCode1
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related TasksCode0
Modeling Ambient Scene Dynamics for Free-view Synthesis0
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image DeblurringCode3
Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation0
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement LearningCode0
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMsCode2
WonderWorld: Interactive 3D Scene Generation from a Single Image0
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video EditingCode1
ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models0
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models0
Show:102550
← PrevPage 28 of 113Next →

No leaderboard results yet.