SOTAVerified

GPU

Papers

Showing 776-800 of 5629 papers

| Title | Status | Hype |
| --- | --- | --- |
| HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models | Code | 1 |
| No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding | Code | 1 |
| Computation-Aware Kalman Filtering and Smoothing | Code | 1 |
| The Developing Human Connectome Project: A Fast Deep Learning-based Pipeline for Neonatal Cortical Surface Reconstruction | Code | 1 |
| Differentiable Model Scaling using Differentiable Topk | Code | 1 |
| CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Code | 1 |
| LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report | Code | 1 |
| CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method | Code | 1 |
| Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Code | 1 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Code | 1 |
| LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs | Code | 1 |
| Interpolating neural network: A novel unification of machine learning and interpolation theory | Code | 1 |
| CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models | Code | 1 |
| Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding | Code | 1 |
| FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models | Code | 1 |
| LIPT: Latency-aware Image Processing Transformer | Code | 1 |
| Tensorized Ant Colony Optimization for GPU Acceleration | Code | 1 |
| GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU | Code | 1 |
| IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT | Code | 1 |
| Taming Lookup Tables for Efficient Image Retouching | Code | 1 |
| Siamese Vision Transformers are Scalable Audio-visual Learners | Code | 1 |
| ModeTv2: GPU-accelerated Motion Decomposition Transformer for Pairwise Optimization in Medical Image Registration | Code | 1 |
| MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models | Code | 1 |
| MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline | Code | 1 |
| Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression | Code | 1 |

No leaderboard results yet.