SOTAVerified

GPU

Papers

Showing 26012650 of 5629 papers

TitleStatusHype
Disrupting Diffusion-based Inpainters with Semantic Digression0
LeanQuant: Accurate Large Language Model Quantization with Loss-Error-Aware Grid0
Enhancing Training Efficiency Using Packing with Flash Attention0
Weight Block Sparsity: Training, Compilation, and AI Engine Accelerators0
Analyzing Machine Learning Performance in a Hybrid Quantum Computing and HPC Environment0
Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object SearchCode0
HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic SegmentationCode0
Parameter Efficient Fine Tuning for Multi-scanner PET to PET Reconstruction0
INSIGHT: Universal Neural Simulator for Analog Circuits Harnessing Autoregressive Transformers0
3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes0
Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation TaskCode0
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training0
Accelerating MRI Uncertainty Estimation with Mask-based Bayesian Neural Network0
The Solution for the AIGC Inference Performance Optimization Competition0
Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement0
Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive TuningCode0
Autoverse: An Evolvable Game Language for Learning Robust Embodied Agents0
PatchEX: High-Quality Real-Time Temporal Supersampling through Patch-based Parallel Extrapolation0
GOALPlace: Begin with the End in Mind0
LoCo: Low-Bit Communication Adaptor for Large-scale Model TrainingCode0
Green Multigrid Network0
Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training StrategyCode0
Implementation and Analysis of GPU Algorithms for Vecchia ApproximationCode0
Achieving High Throughput with a Trainable Neural-Network-Based Equalizer for Communications on FPGA0
Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms0
M5: A Whole Genome Bacterial Encoder at Single Nucleotide Resolution0
Automated Text Scoring in the Age of Generative AI for the GPU-poor0
SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light ImagesCode0
M^2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension0
Needle in the Haystack for Memory Based Large Language Models0
PQCache: Product Quantization-based KVCache for Long Context LLM Inference0
SpectralKAN: Kolmogorov-Arnold Network for Hyperspectral Images Change DetectionCode0
Badllama 3: removing safety finetuning from Llama 3 in minutes0
Parm: Efficient Training of Large Sparsely-Activated Models with Dedicated SchedulesCode0
Hierarchical Memory for Long Video QA0
LASSI: An LLM-based Automated Self-Correcting Pipeline for Translating Parallel Scientific Codes0
Explore as a Storm, Exploit as a Raindrop: On the Benefit of Fine-Tuning Kernel Schedulers with Coordinate DescentCode0
Meta Large Language Model Compiler: Foundation Models of Compiler Optimization0
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data0
Real-time Structure Flow0
DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image0
Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM)0
The Overcooked Generalisation ChallengeCode0
BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate BlocksCode0
Video-Infinity: Distributed Long Video Generation0
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism0
MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary NetworkCode0
Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGACode0
LaneSegNet Design Study0
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-TuningCode0
Show:102550
← PrevPage 53 of 113Next →

No leaderboard results yet.