SOTAVerified

GPU

Papers

Showing 851900 of 5629 papers

TitleStatusHype
GateNet: A novel Neural Network Architecture for Automated Flow Cytometry GatingCode1
Rethinking Compression: Reduced Order Modelling of Latent Features in Large Language ModelsCode1
Compound Text-Guided Prompt Tuning via Image-Adaptive CuesCode1
Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor CollectionsCode1
SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLMCode1
On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation ParadigmCode1
MMM: Generative Masked Motion ModelCode1
FlexModel: A Framework for Interpretability of Distributed Large Language ModelsCode1
Minuet: Accelerating 3D Sparse Convolutions on GPUsCode1
A Simple Video Segmenter by Tracking Objects Along Axial TrajectoriesCode1
Language Embedded 3D Gaussians for Open-Vocabulary Scene UnderstandingCode1
GNNFlow: A Distributed Framework for Continuous Temporal GNN Learning on Dynamic GraphsCode1
Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human AvatarsCode1
SpotServe: Serving Generative Large Language Models on Preemptible InstancesCode1
vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model TrainingCode1
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and QuantizationCode1
Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile RobotsCode1
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model FinetuningCode1
4K-Resolution Photo Exposure Correction at 125 FPS with ~8K ParametersCode1
InfMLLM: A Unified Framework for Visual-Language TasksCode1
GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech RecognitionCode1
Prompt Cache: Modular Attention Reuse for Low-Latency InferenceCode1
VR-NeRF: High-Fidelity Virtualized Walkable SpacesCode1
In Search of Lost Online Test-time Adaptation: A SurveyCode1
Network Contention-Aware Cluster Scheduling with Reinforcement LearningCode1
Prediction of Effective Elastic Moduli of Rocks using Graph Neural NetworksCode1
DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object DetectionCode1
SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts ModelsCode1
LLMSTEP: LLM proofstep suggestions in LeanCode1
RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUsCode1
Metrically Scaled Monocular Depth Estimation through Sparse Priors for Underwater RobotsCode1
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge RecoveryCode1
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource LanguagesCode1
CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image ManipulationCode1
MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation CoefficientCode1
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in ConversationsCode1
TRANSOM: An Efficient Fault-Tolerant System for Training LLMsCode1
ConsistNet: Enforcing 3D Consistency for Multi-view Images DiffusionCode1
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language ModelsCode1
G10: Enabling An Efficient Unified GPU Memory and Storage Architecture with Smart Tensor MigrationsCode1
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language ModelsCode1
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language ModelsCode1
No Privacy Left Outside: On the (In-)Security of TEE-Shielded DNN Partition for On-Device MLCode1
Sparse Fine-tuning for Inference Acceleration of Large Language ModelsCode1
Persis: A Persian Font Recognition Pipeline Using Convolutional Neural NetworksCode1
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning ModelsCode1
Surgical Gym: A high-performance GPU-based platform for reinforcement learning with surgical robotsCode1
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMsCode1
Label Supervised LLaMA FinetuningCode1
Training a Large Video Model on a Single Machine in a DayCode1
Show:102550
← PrevPage 18 of 113Next →

No leaderboard results yet.