SOTAVerified

GPU

Papers

Showing 301325 of 5629 papers

TitleStatusHype
Scaling Down Text Encoders of Text-to-Image Diffusion ModelsCode2
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV CacheCode2
Splat-LOAM: Gaussian Splatting LiDAR Odometry and MappingCode2
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image UnderstandingCode2
Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM KernelsCode2
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language ModelingCode2
RENO: Real-Time Neural Compression for 3D LiDAR Point CloudsCode2
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference OptimizationCode2
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space ModelsCode2
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention DistillationCode2
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian ProcessCode2
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal ModelsCode2
Streaming Video Question-Answering with In-context Video KV-Cache RetrievalCode2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio GenerationCode2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton OperatorsCode2
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language ModelsCode2
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear DistillationCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
Saving 77% of the Parameters in Large Language Models Technical ReportCode2
QuEST: Stable Training of LLMs with 1-Bit Weights and ActivationsCode2
WaferLLM: Large Language Model Inference at Wafer ScaleCode2
An Efficient Sparse Kernel Generator for O(3)-Equivariant Deep NetworksCode2
Recurrent Diffusion for Large-Scale Parameter GenerationCode2
A User's Guide to KSig: GPU-Accelerated Computation of the Signature KernelCode2
Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-ResolutionCode2
Show:102550
← PrevPage 13 of 226Next →

No leaderboard results yet.