SOTAVerified

GPU

Papers

Showing 55515600 of 5629 papers

TitleStatusHype
Single-Shot Object Detection with Enriched Semantics0
Single-shot prediction of parametric partial differential equations0
Single Storage Semi-Global Matching for Real Time Depth Processing0
Single stream parallelization of generalized LSTM-like RNNs on a GPU0
Sionna RT: Technical Report0
SIP: Autotuning GPU Native Schedules via Stochastic Instruction Perturbation0
SketchColour: Channel Concat Guided DiT-based Sketch-to-Colour Pipeline for 2D Animation0
SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization0
SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models0
SLAG: Scalable Language-Augmented Gaussian Splatting0
Slice3D: Multi-Slice, Occlusion-Revealing, Single View 3D Reconstruction0
Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction0
Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget0
Improving compute efficacy frontiers with SliceOut0
Slicing Input Features to Accelerate Deep Learning: A Case Study with Graph Neural Networks0
Sliding Window Sum Algorithms for Deep Neural Networks0
SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics0
Slimmable Encoders for Flexible Split DNNs in Bandwidth and Resource Constrained IoT Systems0
ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation0
SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving0
SLOs-Serve: Optimized Serving of Multi-SLO LLMs0
SLSDeep: Skin Lesion Segmentation Based on Dilated Residual and Pyramid Pooling Networks0
Small Language Models in the Real World: Insights from Industrial Text Classification0
Small-Text: Active Learning for Text Classification in Python0
SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization0
SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training0
SMDP-Based Dynamic Batching for Efficient Inference on GPU-Based Platforms0
SM-NAS: Structural-to-Modular Neural Architecture Search for Object Detection0
SmolVLM: Redefining small and efficient multimodal models0
Snap ML: A Hierarchical Framework for Machine Learning0
SNeRF: Stylized Neural Implicit Representations for 3D Scenes0
S-Net: A Scalable Convolutional Neural Network for JPEG Compression Artifact Reduction0
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation0
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search0
Software implemented fault diagnosis of natural gas pumping unit based on feedforward neural network0
SOLIS -- The MLOps journey from data acquisition to actionable insights0
SOL: Reducing the Maintenance Overhead for Integrating Hardware Support into AI Frameworks0
Solving Large Sequential Games with the Excessive Gap Technique0
Solving machine learning optimization problems using quantum computers0
Solving the Uncapacitated Single Allocation p-Hub Median Problem on GPU0
Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge0
Sort-free Gaussian Splatting via Weighted Sum Rendering0
SOT-MRAM based Sigmoidal Neuron for Neuromorphic Architectures0
Source Code Classification for Energy Efficiency in Parallel Ultra Low-Power Microcontrollers0
SPA-GCN: Efficient and Flexible GCN Accelerator with an Application for Graph Similarity Computation0
DeepCEE: Efficient Cross-Region Model Distributed Training System under Heterogeneous GPUs and Networks0
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation0
Sparfels: Fast Reconstruction from Sparse Unposed Imagery0
SparseDM: Toward Sparse Efficient Diffusion Models0
Sparse High Rank Adapters0
Show:102550
← PrevPage 112 of 113Next →

No leaderboard results yet.