SOTAVerified

GPU

Papers

Showing 751–800 of 5629 papers

Title | Status | Hype
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity | - | 0
Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | - | 0
Stellar parameter prediction and spectral simulation using machine learning | - | 0
Benchmarking of GPU-optimized Quantum-Inspired Evolutionary Optimization Algorithm using Functional Analysis | - | 0
Dimensionality Reduction Techniques for Global Bayesian Optimisation | - | 0
HadaCore: Tensor Core Accelerated Hadamard Transform Kernel | Code | 3
All You Need in Knowledge Distillation Is a Tailored Coordinate System | - | 0
Representing Long Volumetric Video with Temporal Gaussian Hierarchy | Code | 5
COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework | - | 0
EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Code | 1
Protecting Confidentiality, Privacy and Integrity in Collaborative Learning | - | 0
Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Code | 0
CEEMS: A Resource Manager Agnostic Energy and Emissions Monitoring Stack | - | 0
Low-Latency Scalable Streaming for Event-Based Vision | - | 0
Machine learning-driven conservative-to-primitive conversion in hybrid piecewise polytropic and tabulated equations of state | - | 0
FlashRNN: Optimizing Traditional RNNs on Modern Hardware | Code | 2
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models | - | 0
Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds | - | 0
LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | - | 0
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks | Code | 2
Flexible and Scalable Deep Dendritic Spiking Neural Networks with Multiple Nonlinear Branching | - | 0
GraphNeuralNetworks.jl: Deep Learning on Graphs with Julia | Code | 3
Improving text-conditioned latent diffusion for cancer pathology | Code | 0
ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance | - | 0
Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios | - | 0
MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day | Code | 1
Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression | - | 0
Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs | - | 0
APOLLO: SGD-like Memory, AdamW-level Performance | Code | 3
Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference | - | 0
Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction | Code | 2
DHIL-GT: Scalable Graph Transformer with Decoupled Hierarchy Labeling | - | 0
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection | - | 0
GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments | - | 0
Transformers Can Navigate Mazes With Multi-Step Prediction | Code | 1
DEIM: DETR with Improved Matching for Fast Convergence | Code | 5
SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization | - | 0
Assessing and Learning Alignment of Unimodal Vision and Language Models | - | 0
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay | Code | 1
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations | Code | 1
FlashAttention on a Napkin: A Diagrammatic Approach to Deep Learning IO-Awareness | - | 0
Diffusion-VLA: Generalizable and Interpretable Robot Foundation Model via Self-Generated Reasoning | - | 0
Unifying KV Cache Compression for Large Language Models with LeanKV | - | 0
CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning | - | 0
SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection | Code | 0
Can't Slow me Down: Learning Robust and Hardware-Adaptive Object Detectors against Latency Attacks for Edge Devices | - | 0
Improving feature interactions at Pinterest under industry constraints | - | 0
Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control | - | 0
MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection | - | 0
Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification | - | 0