SOTAVerified

GPU

Papers

Showing 2401-2450 of 5629 papers

Title | Status | Hype
Automated Quality Control System for Canned Tuna Production using Artificial Vision | | 0
CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation | | 0
PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms | | 0
Fast Object Detection with a Machine Learning Edge Device | | 0
Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning | | 0
Compute Or Load KV Cache? Why Not Both? | | 0
LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy | | 0
Learning from Offline Foundation Features with Tensor Augmentations | | 0
Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network | | 0
Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping | | 0
An Efficient Inference Frame for SMLM (Single-Molecule Localization Microscopy) | Code | 0
Online Energy Optimization in GPUs: A Multi-Armed Bandit Approach | Code | 0
Contextual Document Embeddings | | 0
LLMCO2: Advancing Accurate Carbon Footprint Prediction for LLM Inferences | | 0
Replacement Learning: Training Vision Tasks with Fewer Learnable Parameters | | 0
A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts | | 0
VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings | | 0
FlashMask: Efficient and Rich Mask Extension of FlashAttention | | 0
Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling | | 0
ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving | | 0
ROK Defense M&S in the Age of Hyperscale AI: Concepts, Challenges, and Future Directions | | 0
Lotus: learning-based online thermal and latency variation management for two-stage detectors on edge devices | Code | 0
MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards | | 0
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference | | 0
HEADS-UP: Head-Mounted Egocentric Dataset for Trajectory Prediction in Blind Assistance Systems | | 0
Simulation-based inference with the Python Package sbijax | | 0
Gradient-free Decoder Inversion in Latent Diffusion Models | | 0
TensorSocket: Shared Data Loading for Deep Learning Training | | 0
Input-Dependent Power Usage in GPUs | Code | 0
Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores | | 0
DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning | | 0
Behaviour4All: in-the-wild Facial Behaviour Analysis Toolkit | | 0
CNN Mixture-of-Depths | | 0
Efficient and generalizable nested Fourier-DeepONet for three-dimensional geological carbon sequestration | Code | 0
FusionANNS: An Efficient CPU/GPU Cooperative Processing Architecture for Billion-scale Approximate Nearest Neighbor Search | | 0
A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation | | 0
dnaGrinder: a lightweight and high-capacity genomic foundation model | | 0
Textless NLP -- Zero Resource Challenge with Low Resource Compute | | 0
Efficient Tabular Data Preprocessing of ML Pipelines | | 0
PipeFill: Using GPUs During Bubbles in Pipeline-parallel LLM Training | | 0
Benchmarking Edge AI Platforms for High-Performance ML Inference | | 0
TextToon: Real-Time Text Toonify Head Avatar from Single Video | | 0
A Realistic Simulation Framework for Analog/Digital Neuromorphic Architectures | | 0
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs | | 0
Drift to Remember | | 0
ProTEA: Programmable Transformer Encoder Acceleration on FPGA | | 0
On Importance of Pruning and Distillation for Efficient Low Resource NLP | | 0
Optimizing RLHF Training for Large Language Models with Stage Fusion | | 0
Graph Convolutional Neural Networks as Surrogate Models for Climate Simulation | | 0
Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention | | 0

No leaderboard results yet.