SOTAVerified

GPU

Papers

Showing 2101–2125 of 5629 papers

Title	Date	Tasks	Status	Hype
Fast Sampling of Cosmological Initial Conditions with Gaussian Neural Posterior Estimation	Feb 5, 2025	GPU	Unverified	0
Robust Autonomy Emerges from Self-Play	Feb 5, 2025	Autonomous Driving, GPU	Unverified	0
Comparative Analysis of FPGA and GPU Performance for Machine Learning-Based Track Reconstruction at LHCb	Feb 4, 2025	GPU, Graph Neural Network	Code Available	0
Brief analysis of DeepSeek R1 and it's implications for Generative AI	Feb 4, 2025	GPU, Mixture-of-Experts	Unverified	0
EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization	Feb 4, 2025	GPU, Large Language Model	Unverified	0
Ilargi: a GPU Compatible Factorized ML Model Training Framework	Feb 4, 2025	Computational Efficiency, CPU	Unverified	0
LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models	Feb 4, 2025	GPU, Video Understanding	Unverified	0
Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity	Feb 3, 2025	Audio Denoising, Denoising	Unverified	0
ModServe: Scalable and Resource-Efficient Large Multimodal Model Serving	Feb 2, 2025	Decoder, GPU	Unverified	0
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference	Feb 1, 2025	GPU, GSM8K	Unverified	0
Recursive generalized type-2 fuzzy radial basis function neural networks for joint position estimation and adaptive EMG-based impedance control of lower limb exoskeletons	Feb 1, 2025	Electromyography (EMG), GPU	Code Available	0
TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs	Jan 31, 2025	GPU	Unverified	0
Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models	Jan 31, 2025	GPU, Model Compression	Unverified	0
Longer Attention Span: Increasing Transformer Context Length with Sparse Graph Processing Techniques	Jan 31, 2025	GPU	Code Available	0
Brain-inspired sparse training enables Transformers and LLMs to perform as fully connected	Jan 31, 2025	GPU, Language Modeling	Unverified	0
LLM-based Affective Text Generation Quality Based on Different Quantization Values	Jan 31, 2025	GPU, Quantization	Unverified	0
adabmDCA 2.0 -- a flexible but easy-to-use package for Direct Coupling Analysis	Jan 30, 2025	CPU, GPU	Code Available	0
Scaling Policy Gradient Quality-Diversity with Massive Parallelization via Behavioral Variations	Jan 30, 2025	Diversity, Evolutionary Algorithms	Unverified	0
CrowdSplat: Exploring Gaussian Splatting For Crowd Rendering	Jan 29, 2025	Computational Efficiency, GPU	Code Available	0
Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection	Jan 29, 2025	GPU	Unverified	0
One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning	Jan 28, 2025	Few-Shot Learning, GPU	Unverified	0
Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference	Jan 27, 2025	GPU, Mixture-of-Experts	Unverified	0
PISCO: Pretty Simple Compression for Retrieval-Augmented Generation	Jan 27, 2025	GPU, Knowledge Distillation	Unverified	0
Towards Scalable Topological Regularizers	Jan 24, 2025	Domain Adaptation, GPU	Unverified	0
GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models	Jan 22, 2025	GPU, Quantization	Code Available	0

