SOTAVerified

GPU

Papers

Showing 501525 of 5629 papers

TitleStatusHype
SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix OperationsCode0
Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space0
LettuceDetect: A Hallucination Detection Framework for RAG ApplicationsCode4
SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place RecognitionCode3
A Split-Window Transformer for Multi-Model Sequence Spammer Detection using Multi-Model Variational Autoencoder0
Fine-Tuning Qwen 2.5 3B for Realistic Movie Dialogue Generation0
A Universal Framework for Compressing Embeddings in CTR PredictionCode0
Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference0
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio GenerationCode2
Towards Efficient Automatic Self-Pruning of Large Language Models0
Dynamic Low-Rank Sparse Adaptation for Large Language ModelsCode1
Distributed U-net model and Image Segmentation for Lung Cancer Detection0
Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective0
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton OperatorsCode2
Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic SimilarityCode0
Multiscale Byte Language Models -- A Hierarchical Architecture for Causal Million-Length Sequence ModelingCode0
Building reliable sim driving agents by scaling self-playCode4
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation0
Learning conformational ensembles of proteins based on backbone geometry0
FairKV: Balancing Per-Head KV Cache for Fast Multi-GPU Inference0
Slamming: Training a Speech Language Model on One GPU in a DayCode3
RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression0
LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation0
SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin0
Astra: Efficient and Money-saving Automatic Parallel Strategies Search on Heterogeneous GPUs0
Show:102550
← PrevPage 21 of 226Next →

No leaderboard results yet.