SOTAVerified

GPU

Papers

Showing 34513500 of 5629 papers

TitleStatusHype
Bespoke Large Language Models for Digital Triage Assistance in Mental Health Care0
Bespoke Solvers for Generative Flow Models0
Best of Both Worlds: AutoML Codesign of a CNN and its Hardware Accelerator0
BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures0
Betty: An Automatic Differentiation Library for Multilevel Optimization0
Beyond Blur: A Fluid Perspective on Generative Diffusion Models0
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection0
Beyond Desktop Computation: Challenges in Scaling a GPU Infrastructure0
Beyond Fine-tuning: Classifying High Resolution Mammograms using Function-Preserving Transformations0
Beyond Terabit/s Integrated Neuromorphic Photonic Processor for DSP-Free Optical Interconnects0
BGL: GPU-Efficient GNN Training by Optimizing Graph Data I/O and Preprocessing0
Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey0
BigGraphVis: Leveraging Streaming Algorithms and GPU Acceleration for Visualizing Big Graphs0
Bi-Level Optimization Augmented with Conditional Variational Autoencoder for Autonomous Driving in Dense Traffic0
Bilinear CNN Models for Fine-Grained Visual Recognition0
Binarized Convolutional Neural Networks for Efficient Inference on GPUs0
BioGrad: Biologically Plausible Gradient-Based Learning for Spiking Neural Networks0
Biological Evolution and Genetic Algorithms: Exploring the Space of Abstract Tile Self-Assembly0
BiPMAP: A Toolbox for Predictions of Perceived Motion Artifacts on Modern Displays0
BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery0
BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge0
Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks0
BitNet b1.58 2B4T Technical Report0
Bit-Parallel Vector Composability for Neural Acceleration0
BitSplit-Net: Multi-bit Deep Neural Network with Bitwise Activation Function0
Block based Singular Value Decomposition approach to matrix factorization for recommender systems0
Blockchain For Mobile Health Applications: Acceleration With GPU Computing0
Block-Parallel IDA* for GPUs (Extended Manuscript)0
BMF: Block matrix approach to factorization of large scale data0
BML: A High-performance, Low-cost Gradient Synchronization Algorithm for DML Training0
Bolt3D: Generating 3D Scenes in Seconds0
Boosting Distributed Full-graph GNN Training with Asynchronous One-bit Communication0
Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization0
Boosting Mobile CNN Inference through Semantic Memory0
Boosting Performance on ARC is a Matter of Perspective0
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization0
BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects0
Brain Cancer Segmentation Using YOLOv5 Deep Neural Network0
BrainFrame: A node-level heterogeneous accelerator platform for neuron simulations0
Brain-inspired sparse training enables Transformers and LLMs to perform as fully connected0
Breadth-First Pipeline Parallelism0
Breaking the Bank with ChatGPT: Few-Shot Text Classification for Finance0
Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts0
Breaking the Memory Barrier of Contrastive Loss via Tile-Based Strategy0
Bregman Alternating Direction Method of Multipliers0
Brief analysis of DeepSeek R1 and it's implications for Generative AI0
Brief Announcement: On the Limits of Parallelizing Convolutional Neural Networks on GPUs0
Bringing regularized optimal transport to lightspeed: a splitting method adapted for GPUs0
Bringing together invertible UNets with invertible attention modules for memory-efficient diffusion models0
Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition0
Show:102550
← PrevPage 70 of 113Next →

No leaderboard results yet.