SOTAVerified

GPU

Papers

Showing 1101–1150 of 5629 papers

Title | Status | Hype
FreeRide: Harvesting Bubbles in Pipeline Parallelism | | 0
InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself | | 0
GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction | | 0
Enhancing Sequential Recommendations through Multi-Perspective Reflections and Iteration | | 0
CoDiCast: Conditional Diffusion Model for Global Weather Prediction with Uncertainty Quantification | Code | 0
Scalable Multitask Learning Using Gradient-based Estimation of Task Affinity | Code | 0
TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency | | 0
Optimizing VarLiNGAM for Scalable and Efficient Time Series Causal Discovery | | 0
Resource-Efficient Generative AI Model Deployment in Mobile Edge Networks | | 0
ELMS: Elasticized Large Language Models On Mobile Devices | | 0
InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference | | 0
From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems | | 0
MultiCounter: Multiple Action Agnostic Repetition Counting in Untrimmed Videos | | 0
Confidential Computing on NVIDIA Hopper GPUs: A Performance Benchmark Study | | 0
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding | | 0
Hardware Acceleration of LLMs: A comprehensive survey and comparison | | 0
Differentiable Discrete Event Simulation for Queuing Network Control | | 0
LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Code | 1
LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones | Code | 1
ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | | 0
Hallucination Detection in LLMs: Fast and Memory-Efficient Fine-Tuned Models | Code | 0
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture | Code | 3
AdvSecureNet: A Python Toolkit for Adversarial Machine Learning | Code | 0
Accelerating Large Language Model Training with Hybrid GPU-based Compression | | 0
Toward Capturing Genetic Epistasis From Multivariate Genome-Wide Association Studies Using Mixed-Precision Kernel Ridge Regression | | 0
LinFusion: 1 GPU, 1 Minute, 16K Image | Code | 3
GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | | 0
Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment | | 0
TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval | Code | 1
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation | Code | 2
Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare Applications | Code | 2
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges | | 0
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model | | 0
Accelerating Hybrid Agent-Based Models and Fuzzy Cognitive Maps: How to Combine Agents who Think Alike? | | 0
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models | Code | 2
ContextVLM: Zero-Shot and Few-Shot Context Understanding for Autonomous Driving using Vision Language Models | | 0
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers | | 0
Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer | | 0
MemLong: Memory-Augmented Retrieval for Long Text Modeling | Code | 2
H-SGANet: Hybrid Sparse Graph Attention Network for Deformable Medical Image Registration | | 0
TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification | Code | 1
3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability | Code | 0
Conan-embedding: General Text Embedding with More and Better Negative Samples | | 0
microYOLO: Towards Single-Shot Object Detection on Microcontrollers | | 0
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Code | 3
SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search | | 0
GPU-Accelerated Counterfactual Regret Minimization | Code | 1
OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Code | 3
Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification | | 0
The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Code | 3
Page 23 of 113

No leaderboard results yet.