SOTAVerified

GPU

Papers

Showing 801850 of 5629 papers

TitleStatusHype
BlendPCR: Seamless and Efficient Rendering of Dynamic Point Clouds captured by Multiple RGB-D CamerasCode0
SPILDL: A Scalable and Parallel Inductive Learner in Description Logic0
HT-HEDL: High-Throughput Hypothesis Evaluation in Description Logic0
Playable Game GenerationCode2
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context SparsificationCode2
Real-Time Metric-Semantic Mapping for Autonomous Navigation in Outdoor EnvironmentsCode2
PAL -- Parallel active learning for machine-learned potentialsCode0
BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batching0
VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion ModelsCode1
Open source Differentiable ODE Solving Infrastructure0
A Simple Sparse Matrix Vector Multiplication Approach to Padded ConvolutionCode0
Look Every Frame All at Once: Video-Ma^2mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing0
Act Now: A Novel Online Forecasting Framework for Large-Scale Streaming DataCode1
Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads0
Differentiable Topology Estimating from Curvatures for 3D Shapes0
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs0
Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach0
An Integrated Artificial Intelligence Operating System for Advanced Low-Altitude Aviation Applications0
Global Tensor Motion PlanningCode1
PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers0
Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operatorsCode2
Towards Chunk-Wise Generation for Long Videos0
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model ServingCode7
A Runtime-Adaptive Transformer Neural Network Accelerator on FPGAsCode0
A High Energy-Efficiency Multi-core Neuromorphic Architecture for Deep SNN Training0
Collaborative Decoding Makes Visual Auto-Regressive Modeling EfficientCode2
k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning0
Pushing the Limits of Large Language Model Quantization via the Linearity TheoremCode3
Automatic Skull Reconstruction by Deep Learnable Symmetry Enforcement0
Knowledge-aware Evolutionary Graph Neural Architecture SearchCode0
KVPR: Efficient LLM Inference with I/O-Aware KV Cache Partial RecomputationCode0
ADAF: An Artificial Intelligence Data Assimilation Framework for Weather ForecastingCode1
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE0
A Data-Driven Approach to Dataflow-Aware Online Scheduling for Graph Neural Network Inference0
Plastic Arbor: a modern simulation framework for synaptic plasticity x2013 from single synapses to networks of morphological neuronsCode0
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking0
MobileMamba: Lightweight Multi-Receptive Visual Mamba NetworkCode3
Anda: Unlocking Efficient LLM Inference with a Variable-Length Grouped Activation Data Format0
Enabling Efficient Serverless Inference Serving for LLM (Large Language Model) in the Cloud0
Multi-scale Cascaded Large-Model for Whole-body ROI SegmentationCode0
Reassessing Layer Pruning in LLMs: New Insights and MethodsCode0
Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data ProcessingCode3
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language ModelsCode5
Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers0
Deep operator network models for predicting post-burn contraction0
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction0
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting0
Quantization without TearsCode1
FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting0
Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training0
Show:102550
← PrevPage 17 of 113Next →

No leaderboard results yet.