SOTAVerified

GPU

Papers

Showing 29513000 of 5629 papers

TitleStatusHype
DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms in Vision TransformersCode0
Spin: An Efficient Secure Computation Framework with GPU Acceleration0
Scalable and Efficient Temporal Graph Representation Learning via Forward Recent SamplingCode0
PRIME: Protect Your Videos From Malicious EditingCode0
Faster Inference of Integer SWIN Transformer by Removing the GELU Activation0
Enriched Physics-informed Neural Networks for Dynamic Poisson-Nernst-Planck Systems0
An Accurate and Low-Parameter Machine Learning Architecture for Next Location Prediction0
Paramanu: A Family of Novel Efficient Generative Foundation Language Models for Indian Languages0
Efficient Subseasonal Weather Forecast using Teleconnection-informed Transformers0
SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond the Memory Budget0
GPU Cluster Scheduling for Network-Sensitive Deep Learning0
M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient PretrainingCode0
The Case for Co-Designing Model Architectures with Hardware0
CNN architecture extraction on edge GPU0
Automated Root Causing of Cloud Incidents using In-Context Learning with GPT-40
Edge-Enabled Real-time Railway Track Segmentation0
Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning0
Exact analytical algorithm for solvent accessible surface area and derivatives in implicit solvent molecular simulations on GPUs0
A Lightweight FPGA-based IDS-ECU Architecture for Automotive CAN0
Towards providing reliable job completion time predictions using PCSCode0
LoMA: Lossless Compressed Memory Attention0
TP-Aware Dequantization0
Efficient approximation of Earth Mover's Distance Based on Nearest Neighbor SearchCode0
Beyond Traditional Approaches: Multi-Task Network for Breast Ultrasound DiagnosisCode0
Parameter-Efficient Detoxification with Contrastive Decoding0
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models0
Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part I: Homogeneous Diffusion Inpainting0
Efficient Parallel Data Optimization for Homogeneous Diffusion Inpainting of 4K Images0
MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring0
PANDORA: A Parallel Dendrogram Construction Algorithm for Single Linkage Clustering on GPU0
Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning0
G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems0
A foundation for exact binarized morphological neural networksCode0
FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference0
Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification0
IntervalMDP.jl: Accelerated Value Iteration for Interval Markov Decision ProcessesCode0
FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs0
LLaMA Beyond English: An Empirical Study on Language Capability Transfer0
LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering0
LAMP: Learn A Motion Pattern for Few-Shot Video Generation0
Distraction is All You Need: Memory-Efficient Image Immunization against Diffusion-Based Image Editing0
Learning to Select Views for Efficient Multi-View Understanding0
Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction0
Scaling Laws for Data Filtering-- Data Curation cannot be Compute Agnostic0
Time- Memory- and Parameter-Efficient Visual Adaptation0
Discovery of Small Ultra-short-period Planets Orbiting KG Dwarfs in Kepler Survey Using GPU Phase Folding and Deep Learning Detection System0
FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNsCode0
Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval0
Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging0
A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering TasksCode0
Show:102550
← PrevPage 60 of 113Next →

No leaderboard results yet.