GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 5629 papers

Title	Date	Tasks	Status	Hype
Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures	Apr 16, 2025	CPUGPU	—Unverified	0
ConvShareViT: Enhancing Vision Transformers with Convolutional Attention Mechanisms for Free-Space Optical Accelerators	Apr 15, 2025	GPU	—Unverified	0
Bringing together invertible UNets with invertible attention modules for memory-efficient diffusion models	Apr 15, 2025	DenoisingGPU	—Unverified	0
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float	Apr 15, 2025	CPUGPU	CodeCode Available	4
PatrolVision: Automated License Plate Recognition in the wild	Apr 15, 2025	Autonomous DrivingGPU	—Unverified	0
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints	Apr 15, 2025	GPUInference Optimization	CodeCode Available	4
CAT: A Conditional Adaptation Tailor for Efficient and Effective Instance-Specific Pansharpening on Real-World Data	Apr 14, 2025	Computational EfficiencyGPU	—Unverified	0
Anchors no more: Using peculiar velocities to constrain H_0 and the primordial Universe without calibrators	Apr 14, 2025	GPU	CodeCode Available	0
Frozen Layers: Memory-efficient Many-fidelity Hyperparameter Optimization	Apr 14, 2025	GPUHyperparameter Optimization	—Unverified	0
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images	Apr 13, 2025	GPU	CodeCode Available	2
aweSOM: a CPU/GPU-accelerated Self-organizing Map and Statistically Combined Ensemble Framework for Machine-learning Clustering Analysis	Apr 13, 2025	CPUGPU	—Unverified	0
Towards On-Device Learning and Reconfigurable Hardware Implementation for Encoded Single-Photon Signal Processing	Apr 12, 2025	CPUGPU	—Unverified	0
MoE-Lens: Towards the Hardware Limit of High-Throughput MoE LLM Serving Under Resource Constraints	Apr 12, 2025	CPUGPU	—Unverified	0
Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models	Apr 11, 2025	channel selectionGPU	—Unverified	0
TensorNEAT: A GPU-accelerated Library for NeuroEvolution of Augmenting Topologies	Apr 11, 2025	Computational EfficiencyGPU	CodeCode Available	3
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model	Apr 11, 2025	GPUVideo Generation	—Unverified	0
Spectral Normalization for Lipschitz-Constrained Policies on Learning Humanoid Locomotion	Apr 11, 2025	GPUReinforcement Learning (RL)	—Unverified	0
SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting	Apr 11, 2025	GPULanguage Modeling	—Unverified	0
MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications	Apr 11, 2025	GPU	CodeCode Available	3
EO-VLM: VLM-Guided Energy Overload Attacks on Vision Models	Apr 11, 2025	Autonomous DrivingGPU	—Unverified	0
TorchFX: A modern approach to Audio DSP with PyTorch and GPU acceleration	Apr 11, 2025	Audio Signal ProcessingBenchmarking	CodeCode Available	2
Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving	Apr 10, 2025	GPULarge Language Model	CodeCode Available	1
DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction	Apr 10, 2025	GPUPrediction	—Unverified	0
PoGO: A Scalable Proof of Useful Work via Quantized Gradient Descent and Merkle Proofs	Apr 10, 2025	GPUQuantization	—Unverified	0
Search-contempt: a hybrid MCTS algorithm for training AlphaZero-like engines with better computational efficiency	Apr 10, 2025	Computational EfficiencyGPU	—Unverified	0
GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable	Apr 10, 2025	GPUMath	—Unverified	0
A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology	Apr 9, 2025	Cell DetectionComputational Efficiency	CodeCode Available	0
CRYSIM: Prediction of Symmetric Structures of Large Crystals with GPU-based Ising Machines	Apr 9, 2025	Bayesian OptimizationGPU	CodeCode Available	0
Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching	Apr 8, 2025	GPUScheduling	—Unverified	0
GPU-accelerated Evolutionary Many-objective Optimization Using Tensorized NSGA-III	Apr 8, 2025	Computational EfficiencyCPU	CodeCode Available	3
Nonuniform-Tensor-Parallelism: Mitigating GPU failure impact for Scaled-up LLM Training	Apr 8, 2025	GPU	—Unverified	0
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference	Apr 8, 2025	CPUGPU	CodeCode Available	2
HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling	Apr 8, 2025	DecoderGPU	CodeCode Available	1
SmolVLM: Redefining small and efficient multimodal models	Apr 7, 2025	GPU	—Unverified	0
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters	Apr 7, 2025	CPUGPU	CodeCode Available	0
Leveraging State Space Models in Long Range Genomics	Apr 7, 2025	BenchmarkingGPU	—Unverified	0
Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors	Apr 7, 2025	GPU	CodeCode Available	2
Scaling Graph Neural Networks for Particle Track Reconstruction	Apr 7, 2025	Edge ClassificationGPU	CodeCode Available	1
Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and Semidensification	Apr 7, 2025	Depth EstimationGPU	CodeCode Available	0
Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models	Apr 6, 2025	Audio GenerationGPU	—Unverified	0
SLOs-Serve: Optimized Serving of Multi-SLO LLMs	Apr 5, 2025	ChatbotGPU	—Unverified	0
DeepOHeat-v1: Efficient Operator Learning for Fast and Trustworthy Thermal Simulation and Optimization in 3D-IC Design	Apr 4, 2025	GPUKolmogorov-Arnold Networks	CodeCode Available	0
Meta-DAN: towards an efficient prediction strategy for page-level handwritten text recognition	Apr 4, 2025	GPUHandwritten Text Recognition	CodeCode Available	1
HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs	Apr 4, 2025	GPUMixture-of-Experts	—Unverified	0
Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis	Apr 4, 2025	CPUGPU	—Unverified	0
Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation	Apr 3, 2025	Computational EfficiencyGPU	CodeCode Available	2
MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism	Apr 3, 2025	CPUGPU	—Unverified	0
Incorporating the ChEES Criterion into Sequential Monte Carlo Samplers	Apr 3, 2025	Bayesian InferenceGPU	—Unverified	0
GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric Calibration	Apr 3, 2025	GPUQuantization	CodeCode Available	2
A Truncated Newton Method for Optimal Transport	Apr 2, 2025	GPU	CodeCode Available	0

Show:10 25 50

← PrevPage 7 of 113Next →

No leaderboard results yet.