SOTAVerified

GPU

Papers

Showing 851900 of 5629 papers

TitleStatusHype
REDUCIO! Generating 10241024 Video within 16 Seconds using Extremely Compressed Motion LatentsCode3
Video-RAG: Visually-aligned Retrieval-Augmented Long Video ComprehensionCode3
Faster Multi-GPU Training with PPLL: A Pipeline Parallelism Framework Leveraging Local Learning0
Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting0
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction0
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
Graph Retention Networks for Dynamic GraphsCode0
Modeling Multivariable High-resolution 3D Urban Microclimate Using Localized Fourier Neural Operator0
LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models0
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs0
Towards Accurate and Efficient Sub-8-Bit Integer Training0
Improving training time and GPU utilization in geo-distributed language model training0
NeuroNURBS: Learning Efficient Surface Representations for 3D Solids0
MDHP-Net: Detecting an Emerging Time-exciting Threat in IVN0
TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models0
Pie: Pooling CPU Memory for LLM Inference0
SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing SurrogateCode0
On Adapting Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction0
FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable TrainingCode0
Optimizing LLM Inference for Database Systems: Cost-Aware Scheduling for Concurrent Requests0
ITER: Iterative Transformer-based Entity Recognition and Relation ExtractionCode1
GPU-Accelerated Inverse Lithography Towards High Quality Curvy Mask GenerationCode1
OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model0
AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space modelsCode2
Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator0
Diffusion Sampling Correction via Approximately 10 ParametersCode1
KeyB2: Selecting Key Blocks is Also Important for Long Document Ranking with Large Language Models0
Benchmarking 3D multi-coil NC-PDNet MRI reconstruction0
Hardware and Software Platform Inference0
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion ModelsCode4
Brain Tumour Removing and Missing Modality Generation using 3D WDMCode2
LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration0
PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks0
Reducing Hyperparameter Tuning Costs in ML, Vision and Language Model Training Pipelines via Memoization-AwarenessCode0
HRDecoder: High-Resolution Decoder Network for Fundus Image Lesion SegmentationCode1
LiVOS: Light Video Object Segmentation with Gated Linear MatchingCode1
Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation0
Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair ClimbingCode2
Context Parallelism for Scalable Million-Token Inference0
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot ExecutionCode2
xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive ParallelismCode7
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization0
RAGViz: Diagnose and Visualize Retrieval-Augmented GenerationCode2
Stochastic Communication Avoidance for Recommendation Systems0
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks0
Fast and Memory-Efficient Video Diffusion Using Streamlined InferenceCode1
NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM InferenceCode0
Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models0
Computation-Aware Gaussian Processes: Model Selection And Linear-Time Inference0
HopTrack: A Real-time Multi-Object Tracking System for Embedded DevicesCode0
Show:102550
← PrevPage 18 of 113Next →

No leaderboard results yet.