SOTAVerified

GPU

Papers

Showing 551575 of 5629 papers

TitleStatusHype
BMInf: An Efficient Toolkit for Big Model Inference and TuningCode2
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid ManipulationCode2
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank GradientsCode2
Gradient Boosting Reinforcement LearningCode2
mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUsCode2
gCastle: A Python Toolbox for Causal DiscoveryCode2
Atom: Low-bit Quantization for Efficient and Accurate LLM ServingCode2
Characterization of Large Language Model Development in the DatacenterCode2
Birbal: An efficient 7B instruct-model fine-tuned with curated datasetsCode2
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV CacheCode2
HeadInfer: Memory-Efficient LLM Inference by Head-wise OffloadingCode2
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-FlowCode2
BiFormer: Vision Transformer with Bi-Level Routing AttentionCode2
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured SparsityCode2
Collaborative Decoding Makes Visual Auto-Regressive Modeling EfficientCode2
CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear AlgebraCode2
FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative FinanceCode2
FeNNol: an Efficient and Flexible Library for Building Force-field-enhanced Neural Network PotentialsCode2
CoLLiE: Collaborative Training of Large Language Models in an Efficient WayCode2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image PriorsCode2
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion InferenceCode2
FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural NetworkCode2
AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space modelsCode2
A Tensor Compiler for Unified Machine Learning Prediction ServingCode2
Feature Pyramid Networks for Object DetectionCode2
Show:102550
← PrevPage 23 of 226Next →

No leaderboard results yet.