GPU

Papers


Showing 501–550 of 5629 papers

Title	Date	Tasks	Status	Hype
SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations	Feb 24, 2025	CPU, GPU	Code Available	0
Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space	Feb 24, 2025	GPU	Unverified	0
LettuceDetect: A Hallucination Detection Framework for RAG Applications	Feb 24, 2025	8k, GPU	Code Available	4
A Split-Window Transformer for Multi-Model Sequence Spammer Detection using Multi-Model Variational Autoencoder	Feb 23, 2025	GPU, model	Unverified	0
SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition	Feb 23, 2025	Deep Hashing, GPU	Code Available	3
Fine-Tuning Qwen 2.5 3B for Realistic Movie Dialogue Generation	Feb 22, 2025	Dialogue Generation, GPU	Unverified	0
A Universal Framework for Compressing Embeddings in CTR Prediction	Feb 21, 2025	Click-Through Rate Prediction, Contrastive Learning	Code Available	0
Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference	Feb 21, 2025	GPU	Unverified	0
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation	Feb 21, 2025	Audio Generation, FAD	Code Available	2
Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective	Feb 20, 2025	CPU, GPU	Unverified	0
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators	Feb 20, 2025	Benchmarking, Code Generation	Code Available	2
Towards Efficient Automatic Self-Pruning of Large Language Models	Feb 20, 2025	GPU	Unverified	0
Distributed U-net model and Image Segmentation for Lung Cancer Detection	Feb 20, 2025	CPU, Federated Learning	Unverified	0
Dynamic Low-Rank Sparse Adaptation for Large Language Models	Feb 20, 2025	CPU, GPU	Code Available	1
Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity	Feb 20, 2025	GPU, Language Modeling	Code Available	0
Building reliable sim driving agents by scaling self-play	Feb 20, 2025	Autonomous Vehicles, Benchmarking	Code Available	4
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation	Feb 20, 2025	4k, 8k	Unverified	0
Multiscale Byte Language Models -- A Hierarchical Architecture for Causal Million-Length Sequence Modeling	Feb 20, 2025	Decoder, GPU	Code Available	0
Learning conformational ensembles of proteins based on backbone geometry	Feb 19, 2025	GPU	Unverified	0
FairKV: Balancing Per-Head KV Cache for Fast Multi-GPU Inference	Feb 19, 2025	GPU	Unverified	0
Slamming: Training a Speech Language Model on One GPU in a Day	Feb 19, 2025	GPU, Language Modeling	Code Available	3
LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation	Feb 19, 2025	GPU, parameter-efficient fine-tuning	Unverified	0
RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression	Feb 19, 2025	GPU	Unverified	0
SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin	Feb 19, 2025	GPU, Logical Reasoning	Unverified	0
MEX: Memory-efficient Approach to Referring Multi-Object Tracking	Feb 19, 2025	Autonomous Driving, GPU	Unverified	0
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Feb 19, 2025	GPU, Quantization	Code Available	2
Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference	Feb 19, 2025	GPU, Retrieval	Unverified	0
Astra: Efficient and Money-saving Automatic Parallel Strategies Search on Heterogeneous GPUs	Feb 19, 2025	GPU	Unverified	0
GPU-Friendly Laplacian Texture Blending	Feb 19, 2025	GPU	Unverified	0
YOLOv12: Attention-Centric Real-Time Object Detectors	Feb 18, 2025	GPU, Object	Code Available	7
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading	Feb 18, 2025	Computational Efficiency, CPU	Code Available	2
An Experimental Study of SOTA LiDAR Segmentation Models	Feb 18, 2025	GPU, Motion Compensation	Unverified	0
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation	Feb 18, 2025	Decoder, GPU	Code Available	2
BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference	Feb 18, 2025	GPU, Language Modeling	Unverified	0
GPU Memory Usage Optimization for Backward Propagation in Deep Network Training	Feb 18, 2025	GPU	Unverified	0
SEA: Low-Resource Safety Alignment for Multimodal Large Language Models via Synthetic Embeddings	Feb 18, 2025	GPU, Safety Alignment	Code Available	0
Myna: Masking-Based Contrastive Learning of Musical Representations	Feb 18, 2025	Contrastive Learning, Data Augmentation	Code Available	1
Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer	Feb 17, 2025	GPU, Quantization	Unverified	0
Fate: Fast Edge Inference of Mixture-of-Experts Models via Cross-Layer Gate	Feb 17, 2025	GPU, Mixture-of-Experts	Code Available	0
AdaSplash: Adaptive Sparse Flash Attention	Feb 17, 2025	GPU, Language Modeling	Code Available	1
Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption	Feb 17, 2025	Benchmarking, Code Summarization	Unverified	0
Massively Scaling Explicit Policy-conditioned Value Functions	Feb 17, 2025	continuous-control, Continuous Control	Unverified	0
Real-time Neural Rendering of LiDAR Point Clouds	Feb 17, 2025	GPU, Neural Rendering	Unverified	0
GPU-accelerated Multi-relational Parallel Graph Retrieval for Web-scale Recommendations	Feb 17, 2025	GPU, Metric Learning	Unverified	0
JExplore: Design Space Exploration Tool for Nvidia Jetson Boards	Feb 16, 2025	Benchmarking, GPU	Code Available	0
TPCap: Unlocking Zero-Shot Image Captioning with Trigger-Augmented and Multi-Modal Purification Modules	Feb 16, 2025	GPU, Image Captioning	Unverified	0
CalibQuant: 1-Bit KV Cache Quantization for Multimodal LLMs	Feb 15, 2025	Computational Efficiency, GPU	Code Available	1
An Efficient Large Recommendation Model: Towards a Resource-Optimal Scaling Law	Feb 14, 2025	Feature Compression, GPU	Unverified	0
KernelBench: Can LLMs Write Efficient GPU Kernels?	Feb 14, 2025	GPU	Code Available	4
Efficient solution validation of constraint satisfaction problems on neuromorphic hardware: the case of Sudoku puzzles	Feb 13, 2025	GPU	Code Available	0

