SOTAVerified

GPU

Papers

Showing 18011850 of 5629 papers

TitleStatusHype
Fully Differentiable Lagrangian Convolutional Neural Network for Continuity-Consistent Physics-Informed Precipitation Nowcasting0
Evaluating Neural Radiance Fields (NeRFs) for 3D Plant Geometry Reconstruction in Field Conditions0
ME-ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision Transformers0
QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inferenceCode2
Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish0
BitDelta: Your Fine-Tune May Only Be Worth One BitCode3
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference AdjustmentCode1
TinyCL: An Efficient Hardware Architecture for Continual Learning on Autonomous Systems0
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech0
Listening to Multi-talker Conversations: Modular and End-to-end Perspectives0
Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning0
MLTCP: Congestion Control for DNN Training0
DisGNet: A Distance Graph Neural Network for Forward Kinematics Learning of Gough-Stewart PlatformCode0
Stochastic Spiking Attention: Accelerating Attention with Stochastic Computing in Spiking Networks0
HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding0
Multi-Level GNN Preconditioner for Solving Large Scale Problems0
Graph Feature Preprocessor: Real-time Subgraph-based Feature Extraction for Financial Crime Detection0
Accelerating Distributed Deep Learning using Lossless Homomorphic CompressionCode0
Anchor-based Large Language ModelsCode1
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT0
The I/O Complexity of Attention, or How Optimal is Flash Attention?0
Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute SystemsCode0
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts ModelsCode3
Cardiac ultrasound simulation for autonomous ultrasound navigation0
On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model InferenceCode2
Anatomizing Deep Learning Inference in Web Browsers0
Everybody Prune Now: Structured Pruning of LLMs with only Forward PassesCode1
On the Convergence of Zeroth-Order Federated Tuning for Large Language Models0
Improving Token-Based World Models with Parallel Observation PredictionCode1
TASER: Temporal Adaptive Sampling for Fast and Accurate Dynamic Graph Representation LearningCode1
A Lightweight Inception Boosted U-Net Neural Network for Routability PredictionCode1
ApiQ: Finetuning of 2-Bit Quantized Large Language ModelCode1
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent SpaceCode2
Graph convolutional network as a fast statistical emulator for numerical ice sheet modeling0
JAX-Fluids 2.0: Towards HPC for Differentiable CFD of Compressible Two-phase FlowsCode4
EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss0
BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery0
Fast Timing-Conditioned Latent Audio DiffusionCode7
BiLLM: Pushing the Limit of Post-Training Quantization for LLMsCode3
Towards Deterministic End-to-end Latency for Medical AI Systems in NVIDIA Holoscan0
EscherNet: A Generative Model for Scalable View SynthesisCode3
torchmSAT: A GPU-Accelerated Approximation To The Maximum Satisfiability Problem0
Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuning0
Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts0
Single-GPU GNN Systems: Traps and Pitfalls0
Time-, Memory- and Parameter-Efficient Visual Adaptation0
4D-Rotor Gaussian Splatting: Towards Efficient Novel View Synthesis for Dynamic ScenesCode2
GPU-Accelerated 3D Polygon Visibility Volumes for Synergistic Perception and Navigation0
Spin: An Efficient Secure Computation Framework with GPU Acceleration0
DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms in Vision TransformersCode0
Show:102550
← PrevPage 37 of 113Next →

No leaderboard results yet.