SOTAVerified

GPU

Papers

Showing 201225 of 5629 papers

TitleStatusHype
MetaDE: Evolving Differential Evolution by Differential EvolutionCode3
MobileMamba: Lightweight Multi-Receptive Visual Mamba NetworkCode3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video UnderstandingCode3
Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligenceCode3
BiLLM: Pushing the Limit of Post-Training Quantization for LLMsCode3
MagicPIG: LSH Sampling for Efficient LLM GenerationCode3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesCode3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at ScaleCode3
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single ImageCode3
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid ArchitectureCode3
LiteGS: A High-Performance Modular Framework for Gaussian Splatting TrainingCode3
Data Generation for Hardware-Friendly Post-Training QuantizationCode3
Dataset Distillation with Neural Characteristic Function: A Minmax PerspectiveCode3
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language ModelsCode3
LinFusion: 1 GPU, 1 Minute, 16K ImageCode3
CtrLoRA: An Extensible and Efficient Framework for Controllable Image GenerationCode3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache ManagementCode3
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement LearningCode3
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-DesignCode3
Consistency Models Made EasyCode3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentationCode3
Inference Performance Optimization for Large Language Models on CPUsCode3
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language ModelsCode3
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image DeblurringCode3
Cramming: Training a Language Model on a Single GPU in One DayCode3
Show:102550
← PrevPage 9 of 226Next →

No leaderboard results yet.