SOTAVerified

GPU

Papers

Showing 851900 of 5629 papers

TitleStatusHype
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity AllocationCode1
BERTVision -- A Parameter-Efficient Approach for Question AnsweringCode1
Im2win: An Efficient Convolution Paradigm on GPUCode1
BenchPress: A Deep Active Benchmark GeneratorCode1
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human SupervisionCode1
ByteTrack: Multi-Object Tracking by Associating Every Detection BoxCode1
FlexModel: A Framework for Interpretability of Distributed Large Language ModelsCode1
Improved Visual-Spatial Reasoning via R1-Zero-Like TrainingCode1
Cache Me If You Must: Adaptive Key-Value Quantization for Large Language ModelsCode1
CAD2Render: A Modular Toolkit for GPU-accelerated Photorealistic Synthetic Data Generation for the Manufacturing IndustryCode1
CAD: Memory Efficient Convolutional Adapter for Segment AnythingCode1
An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object DetectionCode1
Collapsible Linear Blocks for Super-Efficient Super ResolutionCode1
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text TranslationCode1
Neural Architecture Search using Deep Neural Networks and Monte Carlo Tree SearchCode1
COLO: A Contrastive Learning based Re-ranking Framework for One-Stage SummarizationCode1
Flash Invariant Point AttentionCode1
Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative DrivingCode1
AlphaGAN: Fully Differentiable Architecture Search for Generative Adversarial NetworksCode1
Interactive Volume Visualization via Multi-Resolution Hash Encoding based Neural RepresentationCode1
FlashMLA-ETAP: Efficient Transpose Attention Pipeline for Accelerating MLA Inference on NVIDIA H20 GPUsCode1
Fourier-MIONet: Fourier-enhanced multiple-input neural operators for multiphase modeling of geological carbon sequestrationCode1
InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy PredictionCode1
Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion ModelsCode1
Fine-tuning Quantized Neural Networks with Zeroth-order OptimizationCode1
Fine-tuning giant neural networks on commodity hardware with automatic pipeline model parallelismCode1
ISTA-NAS: Efficient and Consistent Neural Architecture Search by Sparse CodingCode1
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource LanguagesCode1
Fine-tuning of sign language recognition models: a technical reportCode1
Collaborative Distillation for Ultra-Resolution Universal Style TransferCode1
A Batch Noise Contrastive Estimation Approach for Training Large Vocabulary Language ModelsCode1
FindVehicle and VehicleFinder: A NER dataset for natural language-based vehicle retrieval and a keyword-based cross-modal vehicle retrieval systemCode1
An Experimental Evaluation of Machine Learning Training on a Real Processing-in-Memory SystemCode1
Fine-Tuning Pre-trained Transformers into Decaying Fast WeightsCode1
Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry LocalityCode1
Active Sampling for Pairwise Comparisons via Approximate Message Passing and Information Gain MaximizationCode1
JIT-Masker: Efficient Online Distillation for Background MattingCode1
FFHNet : Generating Multi-Fingered Robotic Grasps for Unknown Objects in Real-timeCode1
FG-Net: Fast Large-Scale LiDAR Point Clouds Understanding Network Leveraging Correlated Feature Mining and Geometric-Aware ModellingCode1
FFB: A Fair Fairness Benchmark for In-Processing Group Fairness MethodsCode1
CARTO: Category and Joint Agnostic Reconstruction of ARTiculated ObjectsCode1
Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo MatchingCode1
KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflowCode1
FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical ImagingCode1
Allegro-Legato: Scalable, Fast, and Robust Neural-Network Quantum Molecular Dynamics via Sharpness-Aware MinimizationCode1
F-coref: Fast, Accurate and Easy to Use Coreference ResolutionCode1
An In-depth Study of Stochastic BackpropagationCode1
CatBoost: gradient boosting with categorical features supportCode1
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlappingCode1
FELARE: Fair Scheduling of Machine Learning Tasks on Heterogeneous Edge SystemsCode1
Show:102550
← PrevPage 18 of 113Next →

No leaderboard results yet.