SOTAVerified

GPU

Papers

Showing 1901-1950 of 5629 papers

Title | Status | Hype
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Code | 3
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models | Code | 3
Low-resource finetuning of foundation models beats state-of-the-art in histopathology | Code | 2
G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems |  | 0
IntervalMDP.jl: Accelerated Value Iteration for Interval Markov Decision Processes | Code | 0
FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference |  | 0
Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification |  | 0
FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs |  | 0
A foundation for exact binarized morphological neural networks | Code | 0
WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Code | 2
CAVIAR: Co-simulation of 6G Communications, 3D Scenarios and AI for Digital Twins | Code | 1
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Code | 2
CoMoSVC: Consistency Model-based Singing Voice Conversion | Code | 2
LLaMA Beyond English: An Empirical Study on Language Capability Transfer |  | 0
Scaling Laws for Data Filtering-- Data Curation cannot be Compute Agnostic |  | 0
Resource-Efficient Transformer Pruning for Finetuning of Large Models | Code | 1
LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering |  | 0
Distraction is All You Need: Memory-Efficient Image Immunization against Diffusion-Based Image Editing |  | 0
Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction |  | 0
LAMP: Learn A Motion Pattern for Few-Shot Video Generation |  | 0
Time-, Memory- and Parameter-Efficient Visual Adaptation |  | 0
Learning to Select Views for Efficient Multi-View Understanding |  | 0
TinyPredNet: A Lightweight Framework for Satellite Image Sequence Prediction | Code | 1
MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining | Code | 2
Discovery of Small Ultra-short-period Planets Orbiting KG Dwarfs in Kepler Survey Using GPU Phase Folding and Deep Learning Detection System |  | 0
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | Code | 3
Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis | Code | 2
City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Code | 1
FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNs | Code | 0
Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval |  | 0
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Code | 3
Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging |  | 0
A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering Tasks | Code | 0
BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge |  | 0
CARSS: Cooperative Attention-guided Reinforcement Subpath Synthesis for Solving Traveling Salesman Problem |  | 0
PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs | Code | 0
Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference | Code | 2
ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-order Optimization | Code | 1
Emage: Non-Autoregressive Text-to-Image Generation |  | 0
BSS-Bench: Towards Reproducible and Effective Band Selection Search |  | 0
CRD: Collaborative Representation Distance for Practical Anomaly Detection |  | 0
NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields |  | 0
PointeNet: A Lightweight Framework for Effective and Efficient Point Cloud Analysis |  | 0
Optimizing Distributed Training on Frontier for Large Language Models |  | 0
Splatter Image: Ultra-Fast Single-View 3D Reconstruction | Code | 3
Efficient LLM inference solution on Intel GPU |  | 0
Enhancing predictive capabilities in fusion burning plasmas through surrogate-based optimization in core transport solvers | Code | 1
Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models | Code | 1
IS-DARTS: Stabilizing DARTS through Precise Measurement on Candidate Importance | Code | 0
A Case Study in CUDA Kernel Fusion: Implementing FlashAttention-2 on NVIDIA Hopper Architecture using the CUTLASS Library | Code | 2
Page 39 of 113

No leaderboard results yet.