SOTAVerified

GPU

Papers

Showing 2351–2400 of 5629 papers

| Title | Status | Hype |
|---|---|---|
| Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies | | 0 |
| Sort-free Gaussian Splatting via Weighted Sum Rendering | | 0 |
| ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | | 0 |
| Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs | | 0 |
| POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inference | Code | 0 |
| Trajectory Optimization for Spatial Microstructure Control in Electron Beam Metal Additive Manufacturing | | 0 |
| CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation | | 0 |
| FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs | | 0 |
| Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication Scheduling | | 0 |
| Semantic-guided Search for Efficient Program Repair with Large Language Models | | 0 |
| AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower Cost | | 0 |
| Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small | | 0 |
| Mean-Field Simulation-Based Inference for Cosmological Initial Conditions | | 0 |
| Fully Explicit Dynamic Gaussian Splatting | | 0 |
| CompAct: Compressed Activations for Memory-Efficient LLM Training | | 0 |
| A Remedy to Compute-in-Memory with Dynamic Random Access Memory: 1FeFET-1C Technology for Neuro-Symbolic AI | | 0 |
| SemiHVision: Enhancing Medical Multimodal Models with a Semi-Human Annotated Dataset and Fine-Tuned Instruction Generation | Code | 0 |
| Accelerate Coastal Ocean Circulation Model with AI Surrogate | | 0 |
| Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language Benchmarks | Code | 0 |
| Parallel Backpropagation for Inverse of a Convolution with Application to Normalizing Flows | Code | 0 |
| AC-Mix: Self-Supervised Adaptation for Low-Resource Automatic Speech Recognition using Agnostic Contrastive Mixup | | 0 |
| Takin-ADA: Emotion Controllable Audio-Driven Animation with Canonical and Landmark Loss Optimization | | 0 |
| Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching | | 0 |
| Shavette: Low Power Neural Network Acceleration via Algorithm-level Error Detection and Undervolting | Code | 0 |
| MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes | | 0 |
| FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial Modeling | Code | 0 |
| Optimization and Application of Cloud-based Deep Learning Architecture for Multi-Source Data Prediction | | 0 |
| RapidDock: Unlocking Proteome-scale Molecular Docking | | 0 |
| Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats | | 0 |
| CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment | | 0 |
| Learning Representations for Reasoning: Generalizing Across Diverse Structures | | 0 |
| LR-SQL: A Supervised Fine-Tuning Method for Text2SQL Tasks under Low-Resource Scenarios | Code | 0 |
| Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Code | 0 |
| ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera | | 0 |
| Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models | Code | 0 |
| Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models | | 0 |
| PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs | | 0 |
| MoIN: Mixture of Introvert Experts to Upcycle an LLM | | 0 |
| VIBES -- Vision Backbone Efficient Selection | | 0 |
| ActNAS : Generating Efficient YOLO Models using Activation NAS | | 0 |
| Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models | Code | 0 |
| Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation | | 0 |
| CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features | | 0 |
| HM-DF SNN: Transcending Conventional Online Learning with Advanced Training and Deployment | | 0 |
| Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models | | 0 |
| TinyClick: Single-Turn Agent for Empowering GUI Automation | | 0 |
| Do better language models have crisper vision? | | 0 |
| QuAILoRA: Quantization-Aware Initialization for LoRA | | 0 |
| PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches | | 0 |
| Automated Quality Control System for Canned Tuna Production using Artificial Vision | | 0 |
Page 48 of 113

No leaderboard results yet.