SOTAVerified|Agents Browse Leaderboard About

GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2326–2350 of 5629 papers

Title	Date	Tasks	Status	Hype
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization	Nov 4, 2024	GPULarge Language Model	—Unverified	0
Context Parallelism for Scalable Million-Token Inference	Nov 4, 2024	GPULanguage Modeling	—Unverified	0
Stochastic Communication Avoidance for Recommendation Systems	Nov 3, 2024	Federated LearningGPU	—Unverified	0
NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference	Nov 2, 2024	Code GenerationCPU	CodeCode Available	0
Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models	Nov 2, 2024	GPU	—Unverified	0
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks	Nov 2, 2024	GPU	—Unverified	0
HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices	Nov 1, 2024	Autonomous DrivingGPU	CodeCode Available	0
Computation-Aware Gaussian Processes: Model Selection And Linear-Time Inference	Nov 1, 2024	Decision MakingGaussian Processes	—Unverified	0
A Novel Breast Ultrasound Image Augmentation Method Using Advanced Neural Style Transfer: An Efficient and Explainable Approach	Oct 31, 2024	GPUImage Augmentation	—Unverified	0
Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data	Oct 31, 2024	DenoisingGPU	—Unverified	0
Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware	Oct 31, 2024	GPUProgram Synthesis	CodeCode Available	0
Context-Aware Token Selection and Packing for Enhanced Vision Transformer	Oct 31, 2024	GPUobject-detection	—Unverified	0
ProMoE: Fast MoE-based LLM Serving using Proactive Caching	Oct 29, 2024	GPUMixture-of-Experts	—Unverified	0
Application of Audio Fingerprinting Techniques for Real-Time Scalable Speech Retrieval and Speech Clusterization	Oct 29, 2024	GPURetrieval	—Unverified	0
Memory-Efficient Point Cloud Registration via Overlapping Region Sampling	Oct 29, 2024	GPUPoint Cloud Registration	—Unverified	0
A Message Passing Neural Network Surrogate Model for Bond-Associated Peridynamic Material Correspondence Formulation	Oct 29, 2024	GPU	—Unverified	0
Revisiting Reliability in Large-Scale Machine Learning Research Clusters	Oct 29, 2024	GPU	—Unverified	0
AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks	Oct 29, 2024	Computational EfficiencyCPU	—Unverified	0
Motion Graph Unleashed: A Novel Approach to Video Prediction	Oct 29, 2024	GPUOptical Flow Estimation	CodeCode Available	0
Pushing the Performance Envelope of DNN-based Recommendation Systems Inference on GPUs	Oct 29, 2024	GPURecommendation Systems	CodeCode Available	0
VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration	Oct 29, 2024	GPULanguage Modeling	—Unverified	0
Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows	Oct 28, 2024	CPUGPU	—Unverified	0
FusedInf: Efficient Swapping of DNN Models for On-Demand Serverless Inference Services on the Edge	Oct 28, 2024	GPU	CodeCode Available	0
Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading	Oct 26, 2024	CPUGPU	CodeCode Available	0
Computational Bottlenecks of Training Small-scale Large Language Models	Oct 25, 2024	GPULanguage Modeling	—Unverified	0

Show:10 25 50

← PrevPage 94 of 226Next →

No leaderboard results yet.