SOTAVerified

GPU

Papers

Showing 21012150 of 5629 papers

TitleStatusHype
RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUsCode1
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter ModelsCode2
Metrically Scaled Monocular Depth Estimation through Sparse Priors for Underwater RobotsCode1
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge RecoveryCode1
Anchor Space Optimal Transport as a Fast Solution to Multiple Optimal Transport ProblemsCode0
Performance Tuning for GPU-Embedded Systems: Machine-Learning-based and Analytical Model-driven Tuning Methodologies0
UncertaintyPlayground: A Fast and Simplified Python Library for Uncertainty EstimationCode0
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource LanguagesCode1
CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image ManipulationCode1
Benchmarking GPUs on SVBRDF Extractor Model0
Fine-Tuning Generative Models as an Inference Method for Robotic TasksCode0
Cooperative Minibatching in Graph Neural NetworksCode0
Take the aTrain. Introducing an Interface for the Accessible Transcription of InterviewsCode3
Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization0
Learning to Generate Parameters of ConvNets for Unseen Image DataCode0
FROST: Towards Energy-efficient AI-on-5G Platforms -- A GPU Power Capping Evaluation0
MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation CoefficientCode1
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in ConversationsCode1
4K4D: Real-Time 4D View Synthesis at 4K Resolution0
LAMP: Learn A Motion Pattern for Few-Shot-Based Video GenerationCode2
TRANSOM: An Efficient Fault-Tolerant System for Training LLMsCode1
Leveraging Knowledge Distillation for Efficient Deep Reinforcement Learning in Resource-Constrained EnvironmentsCode0
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsCode2
ConsistNet: Enforcing 3D Consistency for Multi-view Images DiffusionCode1
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language ModelsCode1
Can LSH (Locality-Sensitive Hashing) Be Replaced by Neural Network?0
Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models0
PC-bzip2: a phase-space continuity enhanced lossless compression algorithm for light field microscopy data0
Neural network scoring for efficient computing0
G10: Enabling An Efficient Unified GPU Memory and Storage Architecture with Smart Tensor MigrationsCode1
Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving0
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language ModelsCode1
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language ModelsCode1
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion ModelsCode2
Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic ScenesCode2
4D Gaussian Splatting for Real-Time Dynamic Scene RenderingCode4
Polynomial Time Cryptanalytic Extraction of Neural Network ModelsCode0
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining0
No Privacy Left Outside: On the (In-)Security of TEE-Shielded DNN Partition for On-Device MLCode1
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources0
Transformers for Green Semantic Communication: Less Energy, More SemanticsCode0
Distributed Transfer Learning with 4th Gen Intel Xeon Processors0
Sparse Fine-tuning for Inference Acceleration of Large Language ModelsCode1
Look-Up mAI GeMM: Increasing AI GeMMs Performance by Nearly 2.5x via msGeMM0
Scaling Studies for Efficient Parameter Search and Parallelism for Large Language Model Pre-training0
Exploiting Manifold Structured Data Priors for Improved MR Fingerprinting Reconstruction0
Persis: A Persian Font Recognition Pipeline Using Convolutional Neural NetworksCode1
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning ModelsCode1
Surgical Gym: A high-performance GPU-based platform for reinforcement learning with surgical robotsCode1
Memory-Constrained Semantic Segmentation for Ultra-High Resolution UAV Imagery0
Show:102550
← PrevPage 43 of 113Next →

No leaderboard results yet.