GPU

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2301–2350 of 5629 papers

Title	Date	Tasks	Status
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction	Nov 19, 2024	GPUQuestion Answering	—Unverified
Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting	Nov 19, 2024	3D GenerationGPU	—Unverified
Modeling Multivariable High-resolution 3D Urban Microclimate Using Localized Fourier Neural Operator	Nov 18, 2024	GPU	—Unverified
Graph Retention Networks for Dynamic Graphs	Nov 18, 2024	GPUGraph Learning	CodeCode Available
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs	Nov 18, 2024	Computational EfficiencyCPU	—Unverified
LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models	Nov 18, 2024	GPU	—Unverified
Towards Accurate and Efficient Sub-8-Bit Integer Training	Nov 17, 2024	CPUGPU	—Unverified
NeuroNURBS: Learning Efficient Surface Representations for 3D Solids	Nov 16, 2024	GPURepresentation Learning	—Unverified
Improving training time and GPU utilization in geo-distributed language model training	Nov 16, 2024	GPULanguage Modeling	—Unverified
MDHP-Net: Detecting an Emerging Time-exciting Threat in IVN	Nov 15, 2024	DiagnosticGPU	—Unverified
TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models	Nov 15, 2024	GPULanguage Modeling	—Unverified
Pie: Pooling CPU Memory for LLM Inference	Nov 14, 2024	CPUGPU	—Unverified
SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate	Nov 13, 2024	Decision MakingGPU	CodeCode Available
Optimizing LLM Inference for Database Systems: Cost-Aware Scheduling for Concurrent Requests	Nov 12, 2024	Decision MakingGPU	—Unverified
On Adapting Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction	Nov 12, 2024	DeblurringGPU	—Unverified
FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training	Nov 12, 2024	GPU	CodeCode Available
OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model	Nov 11, 2024	GPULanguage Modeling	—Unverified
Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator	Nov 10, 2024	GPULanguage Modeling	—Unverified
KeyB2: Selecting Key Blocks is Also Important for Long Document Ranking with Large Language Models	Nov 9, 2024	Document RankingGPU	—Unverified
Benchmarking 3D multi-coil NC-PDNet MRI reconstruction	Nov 8, 2024	3D ReconstructionBenchmarking	—Unverified
Hardware and Software Platform Inference	Nov 7, 2024	GPULarge Language Model	—Unverified
PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks	Nov 6, 2024	Binary ClassificationGPU	—Unverified
Reducing Hyperparameter Tuning Costs in ML, Vision and Language Model Training Pipelines via Memoization-Awareness	Nov 6, 2024	Bayesian OptimizationGPU	CodeCode Available
LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration	Nov 6, 2024	GPUKnowledge Graphs	—Unverified
Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation	Nov 5, 2024	GPUparameter-efficient fine-tuning	—Unverified
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization	Nov 4, 2024	GPULarge Language Model	—Unverified
Context Parallelism for Scalable Million-Token Inference	Nov 4, 2024	GPULanguage Modeling	—Unverified
Stochastic Communication Avoidance for Recommendation Systems	Nov 3, 2024	Federated LearningGPU	—Unverified
NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference	Nov 2, 2024	Code GenerationCPU	CodeCode Available
Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models	Nov 2, 2024	GPU	—Unverified
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks	Nov 2, 2024	GPU	—Unverified
HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices	Nov 1, 2024	Autonomous DrivingGPU	CodeCode Available
Computation-Aware Gaussian Processes: Model Selection And Linear-Time Inference	Nov 1, 2024	Decision MakingGaussian Processes	—Unverified
A Novel Breast Ultrasound Image Augmentation Method Using Advanced Neural Style Transfer: An Efficient and Explainable Approach	Oct 31, 2024	GPUImage Augmentation	—Unverified
Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data	Oct 31, 2024	DenoisingGPU	—Unverified
Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware	Oct 31, 2024	GPUProgram Synthesis	CodeCode Available
Context-Aware Token Selection and Packing for Enhanced Vision Transformer	Oct 31, 2024	GPUobject-detection	—Unverified
ProMoE: Fast MoE-based LLM Serving using Proactive Caching	Oct 29, 2024	GPUMixture-of-Experts	—Unverified
Application of Audio Fingerprinting Techniques for Real-Time Scalable Speech Retrieval and Speech Clusterization	Oct 29, 2024	GPURetrieval	—Unverified
Memory-Efficient Point Cloud Registration via Overlapping Region Sampling	Oct 29, 2024	GPUPoint Cloud Registration	—Unverified
A Message Passing Neural Network Surrogate Model for Bond-Associated Peridynamic Material Correspondence Formulation	Oct 29, 2024	GPU	—Unverified
Revisiting Reliability in Large-Scale Machine Learning Research Clusters	Oct 29, 2024	GPU	—Unverified
AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks	Oct 29, 2024	Computational EfficiencyCPU	—Unverified
Motion Graph Unleashed: A Novel Approach to Video Prediction	Oct 29, 2024	GPUOptical Flow Estimation	CodeCode Available
Pushing the Performance Envelope of DNN-based Recommendation Systems Inference on GPUs	Oct 29, 2024	GPURecommendation Systems	CodeCode Available
VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration	Oct 29, 2024	GPULanguage Modeling	—Unverified
Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows	Oct 28, 2024	CPUGPU	—Unverified
FusedInf: Efficient Swapping of DNN Models for On-Demand Serverless Inference Services on the Edge	Oct 28, 2024	GPU	CodeCode Available
Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading	Oct 26, 2024	CPUGPU	CodeCode Available
Computational Bottlenecks of Training Small-scale Large Language Models	Oct 25, 2024	GPULanguage Modeling	—Unverified

Show:10 25 50

← PrevPage 47 of 113Next →

No leaderboard results yet.