SOTAVerified

GPU

Papers

Showing 1251–1300 of 5629 papers

Title | Status | Hype
Deep Architectures for Neural Machine Translation | Code | 1
Deep CNNs Meet Global Covariance Pooling: Better Representation and Generalization | Code | 1
EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention Fusion | Code | 1
CPU- and GPU-based Distributed Sampling in Dirichlet Process Mixtures for Large-scale Analysis | Code | 1
A GPU-accelerated Large-scale Simulator for Transportation System Optimization Benchmarking | Code | 1
Effective Batching for Recurrent Neural Network Grammars | Code | 1
Evaluation and Optimization of Gradient Compression for Distributed Deep Learning | Code | 1
CPM-2: Large-scale Cost-effective Pre-trained Language Models | Code | 1
AutoDNNchip: An Automated DNN Chip Predictor and Builder for Both FPGAs and ASICs | Code | 1
Evaluating Retrieval Quality in Retrieval-Augmented Generation | Code | 1
Aerial Single-View Depth Completion with Image-Guided Uncertainty Estimation | Code | 1
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts | Code | 1
LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models | Code | 1
LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Code | 1
CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU | Code | 1
Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes | Code | 1
LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services | Code | 1
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization | Code | 1
Microscopy Image Restoration using Deep Learning on W2S | Code | 1
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing | Code | 1
Accelerating Large Scale Real-Time GNN Inference using Channel Pruning | Code | 1
A Streaming Approach For Efficient Batched Beam Search | Code | 1
Counterfactual Generative Networks | Code | 1
eWaSR -- an embedded-compute-ready maritime obstacle detection network | Code | 1
Auto Learning Attention | Code | 1
Easy and Efficient Transformer : Scalable Inference Solution For large NLP model | Code | 1
Deep learning approach to left ventricular non-compaction measurement | Code | 1
EXODUS: Stable and Efficient Training of Spiking Neural Networks | Code | 1
AE-OT: A NEW GENERATIVE MODEL BASED ON EXTENDED SEMI-DISCRETE OPTIMAL TRANSPORT | Code | 1
Edge and Identity Preserving Network for Face Super-Resolution | Code | 1
LLMSTEP: LLM proofstep suggestions in Lean | Code | 1
Fast and accurate learned multiresolution dynamical downscaling for precipitation | Code | 1
FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation | Code | 1
EZ-CLIP: Efficient Zeroshot Video Action Recognition | Code | 1
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices | Code | 1
Dynamic Structure Pruning for Compressing CNNs | Code | 1
LL-GNN: Low Latency Graph Neural Networks on FPGAs for High Energy Physics | Code | 1
Dynamic Sparse Training with Structured Sparsity | Code | 1
LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models | Code | 1
Farseer: A Refined Scaling Law in Large Language Models | Code | 1
Dynamic Pooling Improves Nanopore Base Calling Accuracy | Code | 1
Dynamic Perceiver for Efficient Visual Recognition | Code | 1
LiVOS: Light Video Object Segmentation with Gated Linear Matching | Code | 1
CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Code | 1
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity | Code | 1
Fast and Accurate Neural CRF Constituency Parsing | Code | 1
Fast and Accurate Retrieval of Methane Concentration from Imaging Spectrometer Data Using Sparsity Prior | Code | 1
Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers | Code | 1
Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms | Code | 1
LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Code | 1

No leaderboard results yet.