SOTAVerified

GPU

Papers

Showing 18511875 of 5629 papers

TitleStatusHype
Pruner: A Speculative Exploration Mechanism to Accelerate Tensor Program TuningCode1
Scalable and Efficient Temporal Graph Representation Learning via Forward Recent SamplingCode0
Structure-Aware E(3)-Invariant Molecular Conformer Aggregation NetworksCode1
InferCept: Efficient Intercept Support for Augmented Large Language Model InferenceCode1
PRIME: Protect Your Videos From Malicious EditingCode0
Faster Inference of Integer SWIN Transformer by Removing the GELU Activation0
Enriched Physics-informed Neural Networks for Dynamic Poisson-Nernst-Planck Systems0
An Accurate and Low-Parameter Machine Learning Architecture for Next Location Prediction0
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State SpacesCode3
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache QuantizationCode3
Efficient Subseasonal Weather Forecast using Teleconnection-informed Transformers0
Paramanu: A Family of Novel Efficient Generative Foundation Language Models for Indian Languages0
SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond the Memory Budget0
GPU Cluster Scheduling for Network-Sensitive Deep Learning0
SHViT: Single-Head Vision Transformer with Memory Efficient Macro DesignCode2
M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient PretrainingCode0
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote SensingCode2
HiFT: A Hierarchical Full Parameter Fine-Tuning StrategyCode1
The Case for Co-Designing Model Architectures with Hardware0
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-DesignCode3
MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert CacheCode3
ServerlessLLM: Low-Latency Serverless Inference for Large Language ModelsCode4
CNN architecture extraction on edge GPU0
Automated Root Causing of Cloud Incidents using In-Context Learning with GPT-40
InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy PredictionCode1
Show:102550
← PrevPage 75 of 226Next →

No leaderboard results yet.