SOTAVerified

GPU

Papers

Showing 101125 of 5629 papers

TitleStatusHype
Generalizable, real-time neural decoding with hybrid state-space models0
Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and VideosCode2
Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long ContextsCode1
MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet ParadigmCode9
Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis0
FlashDMoE: Fast Distributed MoE in a Single KernelCode3
Similarity-based fuzzy clustering scientific articles: potentials and challenges from mathematical and computational perspectives0
High-Speed Ultra-Energy-Efficient Memristor-Based Massive MIMO SIC Detector Circuit with Hybrid Analog-Digital Computing Architecture0
FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices0
Diffusion Buffer: Online Diffusion-based Speech Enhancement with Sub-Second Latency0
VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians0
Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem0
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient RoboticsCode11
COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents0
Fine-tune Before Structured Pruning: Towards Compact and Accurate Self-Supervised Models for Speaker Diarization0
Recipes for Pre-training LLMs with MXFP80
Pushing the Limits of Beam Search Decoding for Transducer-based ASR models0
NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic SegmentationCode0
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language ReasoningCode7
TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks0
LlamaRL: A Distributed Asynchronous Reinforcement Learning Framework for Efficient Large-scale LLM Trainin0
LoLA: Low-Rank Linear Attention With Sparse Caching0
Accelerating AllReduce with a Persistent StragglerCode1
LUMION: Fast Fault Recovery for ML Jobs Using Programmable Optical Fabrics0
CF-DETR: Coarse-to-Fine Transformer for Real-Time Object Detection0
Show:102550
← PrevPage 5 of 226Next →

No leaderboard results yet.