SOTAVerified

GPU

Papers

Showing 20012050 of 5629 papers

TitleStatusHype
Short-Term Load Forecasting for AI-Data Center0
AttFC: Attention Fully-Connected Layer for Large-Scale Face Recognition with One GPU0
Fine-Tuning LLMs for Report Summarization: Analysis on Supervised and Unsupervised Data0
Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices0
Global Context Is All You Need for Parallel Efficient Tractography Parcellation0
A Mesh Is Worth 512 Numbers: Spectral-domain Diffusion Modeling for High-dimension Shape Generation0
Training and Inference Efficiency of Encoder-Decoder Speech Models0
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning0
Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows0
Wanda++: Pruning Large Language Models via Regional GradientsCode0
Eventprop training for efficient neuromorphic applications0
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach0
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining0
JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba0
Partial Convolution Meets Visual Attention0
Memory and Bandwidth are All You Need for Fully Sharded Data Parallel0
CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory0
OceanSim: A GPU-Accelerated Underwater Robot Perception Simulation Framework0
Category-level Meta-learned NeRF Priors for Efficient Object Mapping0
KurTail : Kurtosis-based LLM Quantization0
Open-source framework for detecting bias and overfitting for large pathology imagesCode0
A Reconfigurable Stream-Based FPGA Accelerator for Bayesian Confidence Propagation Neural Networks0
Cauchy Random Features for Operator Learning in Sobolev SpaceCode0
Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction0
Timing-Driven Global Placement by Efficient Critical Path Extraction0
Efficient Jailbreaking of Large Models by Freeze Training: Lower Layers Exhibit Greater Sensitivity to Harmful Content0
AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks0
Supporting the development of Machine Learning for fundamental science in a federated Cloud with the AI_INFN platform0
S4ConvD: Adaptive Scaling and Frequency Adjustment for Energy-Efficient Sensor Networks in Smart BuildingsCode0
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval0
Striving for Faster and Better: A One-Layer Architecture with Auto Re-parameterization for Low-Light Image EnhancementCode0
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects0
Accurate and Scalable Graph Neural Networks via Message InvarianceCode0
SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models0
WaveGAS: Waveform Relaxation for Scaling Graph Neural Networks0
AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs0
FPGA-Accelerated SpeckleNN with SNL for Real-time X-ray Single-Particle Imaging0
LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis0
Mechanistic PDE Networks for Discovery of Governing Equations0
Software implemented fault diagnosis of natural gas pumping unit based on feedforward neural network0
Accelerated Training on Low-Power Edge Devices0
The Power of Graph Signal Processing for Chip Placement Acceleration0
Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance0
SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix OperationsCode0
Low-distortion and GPU-compatible Tree Embeddings in Hyperbolic Space0
A Split-Window Transformer for Multi-Model Sequence Spammer Detection using Multi-Model Variational Autoencoder0
Fine-Tuning Qwen 2.5 3B for Realistic Movie Dialogue Generation0
A Universal Framework for Compressing Embeddings in CTR PredictionCode0
Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference0
Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic SimilarityCode0
Show:102550
← PrevPage 41 of 113Next →

No leaderboard results yet.