GPU

Papers

Showing 1451–1500 of 5629 papers

Title	Date	Tasks	Status	Hype
STAT: Shrinking Transformers After Training	May 29, 2024	Decoder, GPU	Unverified	0
MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models	May 29, 2024	Decoder, GPU	Unverified	0
Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification	May 29, 2024	Contrastive Learning, Denoising	Unverified	0
Spatio-Spectral Graph Neural Networks	May 29, 2024	GPU, Graph Classification	Code Available	1
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference	May 28, 2024	GPU, Text Generation	Code Available	2
Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-Mamba	May 28, 2024	Computed Tomography (CT), GPU	Code Available	1
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention	May 28, 2024	GPU, Mamba	Code Available	2
Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters	May 28, 2024	GPU, Language Modeling	Code Available	0
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations	May 28, 2024	GPU	Code Available	2
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention	May 28, 2024	GPU, Representation Learning	Code Available	2
Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection	May 28, 2024	GPU	Unverified	0
Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model	May 28, 2024	GPU, Mamba	Unverified	0
Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training	May 27, 2024	Decoder, GPU	Unverified	0
CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy	May 27, 2024	GPU, Simultaneous Localization and Mapping	Unverified	0
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention	May 27, 2024	GPU, Language Modeling	Code Available	3
SWAT: Scalable and Efficient Window Attention-based Transformers Acceleration on FPGAs	May 27, 2024	GPU	Unverified	0
TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models	May 27, 2024	Backdoor Attack, GPU	Code Available	0
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings	May 27, 2024	Domain Adaptation, GPU	Unverified	0
Transformers Can Do Arithmetic with the Right Embeddings	May 27, 2024	GPU, Position	Code Available	3
GPU Based Differential Evolution: New Insights and Comparative Study	May 26, 2024	GPU	Unverified	0
LoQT: Low-Rank Adapters for Quantized Pretraining	May 26, 2024	GPU, Language Modeling	Code Available	2
The devil is in discretization discrepancy. Robustifying Differentiable NAS with Single-Stage Searching Protocol	May 26, 2024	GPU, Neural Architecture Search	Unverified	0
vHeat: Building Vision Models upon Heat Conduction	May 26, 2024	Computational Efficiency, GPU	Code Available	3
Apply Distributed CNN on Genomics to accelerate Transcription-Factor TAL1 Motif Prediction	May 25, 2024	Deep Learning, GPU	Unverified	0
LUCIE: A Lightweight Uncoupled ClImate Emulator with long-term stability and physical consistency for O(1000)-member ensembles	May 25, 2024	GPU	Code Available	0
HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale Models	May 25, 2024	GPU	Unverified	0
MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects	May 25, 2024	CPU, Defect Detection	Code Available	1
A GPU-Accelerated Bi-linear ADMM Algorithm for Distributed Sparse Machine Learning	May 25, 2024	GPU, regression	Unverified	0
Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity	May 24, 2024	GPU	Unverified	0
Looking Backward: Streaming Video-to-Video Translation with Feature Banks	May 24, 2024	GPU, Translation	Code Available	4
DAGER: Exact Gradient Inversion for Large Language Models	May 24, 2024	Decoder, Federated Learning	Code Available	1
Sparse Matrix in Large Language Model Fine-tuning	May 24, 2024	GPU, Language Modeling	Code Available	1
ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning	May 24, 2024	GPU, Representation Learning	Unverified	0
Fast inference with Kronecker-sparse matrices	May 23, 2024	GPU, Management	Code Available	1
Fast Bayesian Inference for Neutrino Non-Standard Interactions at Dark Matter Direct Detection Experiments	May 23, 2024	Bayesian Inference, GPU	Code Available	0
ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution	May 23, 2024	GPU, Weather Forecasting	Code Available	1
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization	May 23, 2024	Code Generation, GPU	Code Available	0
Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras	May 23, 2024	2k, GPU	Unverified	0
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models	May 23, 2024	Action Recognition, Action Segmentation	Unverified	0
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models	May 23, 2024	Computational Efficiency, Decoder	Unverified	0
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification	May 23, 2024	GPU, GSM8K	Code Available	1
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference	May 23, 2024	GPU, parameter-efficient fine-tuning	Code Available	1
ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation	May 22, 2024	GPU	Unverified	0
Attention as an RNN	May 22, 2024	GPU, Time Series	Code Available	1
HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images	May 22, 2024	GPU, Knowledge Distillation	Unverified	0
Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex Optimization	May 22, 2024	GPU	Code Available	0
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions	May 22, 2024	Data Valuation, GPU	Code Available	2
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference	May 21, 2024	GPU	Code Available	1
Personalized Residuals for Concept-Driven Text-to-Image Generation	May 21, 2024	GPU, Image Generation	Unverified	0
Parallelization of the K-Means Algorithm with Applications to Big Data Clustering	May 20, 2024	Clustering, GPU	Unverified	0

