SOTAVerified|Agents Browse Leaderboard About

GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 576–600 of 5629 papers

Title	Date	Tasks	Status	Hype
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations	Feb 7, 2025	GPUQuantization	CodeCode Available	2
WaferLLM: Large Language Model Inference at Wafer Scale	Feb 6, 2025	GPULanguage Modeling	CodeCode Available	2
InfiniteHBD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers	Feb 6, 2025	GPULarge Language Model	—Unverified	0
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache	Feb 5, 2025	GPU	—Unverified	0
Kozax: Flexible and Scalable Genetic Programming in JAX	Feb 5, 2025	GPU	CodeCode Available	1
SyMANTIC: An Efficient Symbolic Regression Method for Interpretable and Parsimonious Model Discovery in Science and Beyond	Feb 5, 2025	feature selectionGPU	CodeCode Available	1
Fast Sampling of Cosmological Initial Conditions with Gaussian Neural Posterior Estimation	Feb 5, 2025	GPU	—Unverified	0
Robust Autonomy Emerges from Self-Play	Feb 5, 2025	Autonomous DrivingGPU	—Unverified	0
Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set	Feb 5, 2025	Combinatorial OptimizationCPU	—Unverified	0
Transolver++: An Accurate Neural Solver for PDEs on Million-Scale Geometries	Feb 4, 2025	GPU	CodeCode Available	3
EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization	Feb 4, 2025	GPULarge Language Model	—Unverified	0
Brief analysis of DeepSeek R1 and it's implications for Generative AI	Feb 4, 2025	GPUMixture-of-Experts	—Unverified	0
LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models	Feb 4, 2025	GPUVideo Understanding	—Unverified	0
Ilargi: a GPU Compatible Factorized ML Model Training Framework	Feb 4, 2025	Computational EfficiencyCPU	—Unverified	0
Comparative Analysis of FPGA and GPU Performance for Machine Learning-Based Track Reconstruction at LHCb	Feb 4, 2025	GPUGraph Neural Network	CodeCode Available	0
Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity	Feb 3, 2025	Audio DenoisingDenoising	—Unverified	0
ModServe: Scalable and Resource-Efficient Large Multimodal Model Serving	Feb 2, 2025	DecoderGPU	—Unverified	0
Recursive generalized type-2 fuzzy radial basis function neural networks for joint position estimation and adaptive EMG-based impedance control of lower limb exoskeletons	Feb 1, 2025	Electromyography (EMG)GPU	CodeCode Available	0
M+: Extending MemoryLLM with Scalable Long-Term Memory	Feb 1, 2025	16kGPU	CodeCode Available	3
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference	Feb 1, 2025	GPUGSM8K	—Unverified	0
Work-Efficient Parallel Non-Maximum Suppression Kernels	Feb 1, 2025	GPUobject-detection	CodeCode Available	1
Longer Attention Span: Increasing Transformer Context Length with Sparse Graph Processing Techniques	Jan 31, 2025	GPU	CodeCode Available	0
TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs	Jan 31, 2025	GPU	—Unverified	0
LLM-based Affective Text Generation Quality Based on Different Quantization Values	Jan 31, 2025	GPUQuantization	—Unverified	0
Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models	Jan 31, 2025	GPUModel Compression	—Unverified	0

Show:10 25 50

← PrevPage 24 of 226Next →

No leaderboard results yet.