SOTAVerified

GPU

Papers

Showing 10511100 of 5629 papers

TitleStatusHype
Efficient and generalizable nested Fourier-DeepONet for three-dimensional geological carbon sequestrationCode0
CNN Mixture-of-Depths0
INT-FlashAttention: Enabling Flash Attention for INT8 QuantizationCode2
Textless NLP -- Zero Resource Challenge with Low Resource Compute0
CAD: Memory Efficient Convolutional Adapter for Segment AnythingCode1
A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation0
Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference SpeedCode1
dnaGrinder: a lightweight and high-capacity genomic foundation model0
PipeFill: Using GPUs During Bubbles in Pipeline-parallel LLM Training0
TextToon: Real-Time Text Toonify Head Avatar from Single Video0
Efficient Tabular Data Preprocessing of ML Pipelines0
Benchmarking Edge AI Platforms for High-Performance ML Inference0
FastGL: A GPU-Efficient Framework for Accelerating Sampling-Based GNN Training at Large ScaleCode1
A Realistic Simulation Framework for Analog/Digital Neuromorphic Architectures0
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video UnderstandingCode4
FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs0
ProTEA: Programmable Transformer Encoder Acceleration on FPGA0
Drift to Remember0
On Importance of Pruning and Distillation for Efficient Low Resource NLP0
Optimizing RLHF Training for Large Language Models with Stage Fusion0
Occupancy-Based Dual ContouringCode2
Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention0
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-MarquardtCode3
Graph Convolutional Neural Networks as Surrogate Models for Climate Simulation0
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model InitializationCode2
Impact of ML Optimization Tactics on Greener Pre-Trained ML Models0
CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMsCode1
Efficient Low-Resolution Face Recognition via Bridge Distillation0
User-friendly Foundation Model Adapters for Multivariate Time Series Classification0
Bundle Adjustment in the Eager Mode0
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resourcesCode1
Less Memory Means smaller GPUs: Backpropagation with Compressed Activations0
Mamba Fusion: Learning Actions Through QuestioningCode0
Can Graph Reordering Speed Up Graph Neural Network Training? An Experimental StudyCode0
RenderWorld: World Model with Self-Supervised 3D Label0
Early Detection of Coronary Heart Disease Using Hybrid Quantum Machine Learning Approach0
MARCA: Mamba Accelerator with ReConfigurable Architecture0
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector RetrievalCode2
One-Shot Learning for Pose-Guided Person Image Synthesis in the WildCode1
LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free ApproachCode0
Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution0
Accurate and Fast Estimation of Temporal Motifs using Path SamplingCode0
Using Convolutional Neural Networks for Denoising and Deblending of Marine Seismic Data0
SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length0
Super Monotonic Alignment SearchCode2
Self-Supervised Learning of Iterative Solvers for Constrained Optimization0
Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification models -- Part ICode0
Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU0
ENACT: Entropy-based Clustering of Attention Input for Improving the Computational Performance of Object Detection TransformersCode0
A Cost-Aware Approach to Adversarial Robustness in Neural Networks0
Show:102550
← PrevPage 22 of 113Next →

No leaderboard results yet.