GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2551–2600 of 5629 papers

Title	Date	Tasks	Status
Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation	Aug 7, 2024	GPUQuestion Answering	—Unverified
Quantum Annealing based Power Grid Partitioning for Parallel Simulation	Aug 7, 2024	CPUGPU	—Unverified
PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training	Aug 7, 2024	GPUMamba	—Unverified
L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization	Aug 6, 2024	GPUQuantization	—Unverified
A Real-Time Adaptive Multi-Stream GPU System for Online Approximate Nearest Neighborhood Search	Aug 6, 2024	BlockingGPU	—Unverified
SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving	Aug 5, 2024	GPU	—Unverified
VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking	Aug 5, 2024	3D Single Object TrackingGPU	—Unverified
PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance	Aug 4, 2024	GPUImage Generation	—Unverified
FT K-means: A High-Performance K-means on GPU with Fault Tolerance	Aug 2, 2024	Code GenerationGPU	CodeCode Available
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines	Aug 2, 2024	GPUHyperparameter Optimization	—Unverified
Data-Driven Traffic Simulation for an Intersection in a Metropolis	Aug 1, 2024	GPUTrajectory Forecasting	—Unverified
Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research	Aug 1, 2024	CPUGPU	—Unverified
Towards Scalable GPU-Accelerated SNN Training via Temporal Fusion	Aug 1, 2024	GPUNavigate	CodeCode Available
Finch: Prompt-guided Key-Value Cache Compression	Jul 31, 2024	GPULanguage Modeling	—Unverified
ThinK: Thinner Key Cache by Query-Driven Pruning	Jul 30, 2024	GPUQuantization	—Unverified
NeuroSEM: A hybrid framework for simulating multiphysics problems by coupling PINNs and spectral elements	Jul 30, 2024	CPUGPU	CodeCode Available
GPU-based data processing for speeding-up correlation plenoptic imaging	Jul 30, 2024	GPU	—Unverified
Toward Efficient Permutation for Hierarchical N:M Sparsity on GPUs	Jul 30, 2024	GPU	—Unverified
ByteCheckpoint: A Unified Checkpointing System for Large Foundation Model Development	Jul 29, 2024	GPU	—Unverified
Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation	Jul 29, 2024	GPUtext-classification	—Unverified
Simply Trainable Nearest Neighbour Machine Translation with GPU Inference	Jul 29, 2024	Domain AdaptationGPU	—Unverified
SAPG: Split and Aggregate Policy Gradients	Jul 29, 2024	Decision MakingGPU	—Unverified
Mini-batch Coresets for Memory-efficient Training of Large Language Models	Jul 28, 2024	GPUNetwork Pruning	—Unverified
WindsorML: High-Fidelity Computational Fluid Dynamics Dataset For Automotive Aerodynamics	Jul 27, 2024	GPU	—Unverified
NARVis: Neural Accelerated Rendering for Real-Time Scientific Point Cloud Visualization	Jul 26, 2024	GPU	—Unverified
Textile Anomaly Detection: Evaluation of the State-of-the-Art for Automated Quality Inspection of Carpet	Jul 26, 2024	Anomaly DetectionCPU	—Unverified
HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline	Jul 25, 2024	GPU	—Unverified
Keep the Cost Down: A Review on Methods to Optimize LLM' s KV-Cache Consumption	Jul 25, 2024	GPU	CodeCode Available
SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention	Jul 23, 2024	Code GenerationGPU	—Unverified
A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism	Jul 22, 2024	GPUNeural Architecture Search	—Unverified
Automated Road Safety: Enhancing Sign and Surface Damage Detection with AI	Jul 22, 2024	Cloud ComputingGPU	—Unverified
LSM-GNN: Large-scale Storage-based Multi-GPU GNN Training by Optimizing Data Transfer Scheme	Jul 21, 2024	CPUFraud Detection	—Unverified
MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM	Jul 21, 2024	Few-Shot LearningGPU	—Unverified
GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation	Jul 20, 2024	GPUImage Generation	CodeCode Available
Neural topology optimization: the good, the bad, and the ugly	Jul 19, 2024	GPUMisconceptions	—Unverified
Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference	Jul 19, 2024	GPULanguage Modeling	—Unverified
Mixture of Experts with Mixture of Precisions for Tuning Quality of Service	Jul 19, 2024	CPUGPU	—Unverified
LiNR: Model Based Neural Retrieval on GPUs at LinkedIn	Jul 18, 2024	AttributeGPU	—Unverified
RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models	Jul 17, 2024	GPUNutrition	—Unverified
SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization	Jul 17, 2024	GPUQuantization	—Unverified
ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks	Jul 17, 2024	CPUGPU	—Unverified
Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors	Jul 16, 2024	GPUNeural Network Compression	—Unverified
Characterizing and Understanding HGNN Training on GPUs	Jul 16, 2024	GPURecommendation Systems	—Unverified
MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training	Jul 16, 2024	CPUGPU	—Unverified
Learning Multi-view Anomaly Detection	Jul 16, 2024	Anomaly DetectionGPU	—Unverified
PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer	Jul 16, 2024	2D Object DetectionComputational Efficiency	—Unverified
MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models	Jul 16, 2024	GPUMultiple-choice	—Unverified
SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation	Jul 15, 2024	GPUReinforcement Learning (RL)	—Unverified
Differentiable Neural-Integrated Meshfree Method for Forward and Inverse Modeling of Finite Strain Hyperelasticity	Jul 15, 2024	GPUPhysics-informed machine learning	CodeCode Available
NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis	Jul 15, 2024	GPUNeRF	—Unverified

Show:10 25 50

← PrevPage 52 of 113Next →

No leaderboard results yet.