SOTAVerified

GPU

Papers

Showing 76100 of 5629 papers

TitleStatusHype
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory ConstraintsCode4
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length FloatCode4
LettuceDetect: A Hallucination Detection Framework for RAG ApplicationsCode4
Building reliable sim driving agents by scaling self-playCode4
KernelBench: Can LLMs Write Efficient GPU Kernels?Code4
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision TokenCode4
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference OptimizationCode4
SocialED: A Python Library for Social Event DetectionCode4
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion ModelsCode4
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming HeadsCode4
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation ExpertsCode4
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video UnderstandingCode4
EmbodiedSAM: Online Segment Any 3D Thing in Real TimeCode4
Deep Patch Visual SLAMCode4
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPSCode4
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model InternalsCode4
fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial IntelligenceCode4
On Scaling Up 3D Gaussian Splatting TrainingCode4
Mamba YOLO: A Simple Baseline for Object Detection with State Space ModelCode4
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image GenerationCode4
Looking Backward: Streaming Video-to-Video Translation with Feature BanksCode4
Vidur: A Large-Scale Simulation Framework For LLM InferenceCode4
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM ServingCode4
Mamba-FETrack: Frame-Event Tracking via State Space ModelCode4
JetMoE: Reaching Llama2 Performance with 0.1M DollarsCode4
Show:102550
← PrevPage 4 of 226Next →

No leaderboard results yet.