SOTAVerified

GPU

Papers

Showing 19011950 of 5629 papers

TitleStatusHype
CAT: A Conditional Adaptation Tailor for Efficient and Effective Instance-Specific Pansharpening on Real-World Data0
Anchors no more: Using peculiar velocities to constrain H_0 and the primordial Universe without calibratorsCode0
Frozen Layers: Memory-efficient Many-fidelity Hyperparameter Optimization0
aweSOM: a CPU/GPU-accelerated Self-organizing Map and Statistically Combined Ensemble Framework for Machine-learning Clustering Analysis0
MoE-Lens: Towards the Hardware Limit of High-Throughput MoE LLM Serving Under Resource Constraints0
Towards On-Device Learning and Reconfigurable Hardware Implementation for Encoded Single-Photon Signal Processing0
EO-VLM: VLM-Guided Energy Overload Attacks on Vision Models0
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model0
SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting0
Spectral Normalization for Lipschitz-Constrained Policies on Learning Humanoid Locomotion0
Parameter-Free Fine-tuning via Redundancy Elimination for Vision Foundation Models0
DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction0
GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable0
PoGO: A Scalable Proof of Useful Work via Quantized Gradient Descent and Merkle Proofs0
Search-contempt: a hybrid MCTS algorithm for training AlphaZero-like engines with better computational efficiency0
A Comparison of Deep Learning Methods for Cell Detection in Digital CytologyCode0
CRYSIM: Prediction of Symmetric Structures of Large Crystals with GPU-based Ising MachinesCode0
Nonuniform-Tensor-Parallelism: Mitigating GPU failure impact for Scaled-up LLM Training0
Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching0
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home ClustersCode0
SmolVLM: Redefining small and efficient multimodal models0
Leveraging State Space Models in Long Range Genomics0
Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and SemidensificationCode0
Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models0
SLOs-Serve: Optimized Serving of Multi-SLO LLMs0
HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs0
Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis0
DeepOHeat-v1: Efficient Operator Learning for Fast and Trustworthy Thermal Simulation and Optimization in 3D-IC DesignCode0
MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism0
Incorporating the ChEES Criterion into Sequential Monte Carlo Samplers0
A Truncated Newton Method for Optimal TransportCode0
Accelerating IoV Intrusion Detection: Benchmarking GPU-Accelerated vs CPU-Based ML Libraries0
FlowR: Flowing from Sparse to Dense 3D Reconstructions0
SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching0
SCRec: A Scalable Computational Storage System with Statistical Sharding and Tensor-train Decomposition for Recommendation Models0
Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources0
Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation0
GPU-centric Communication Schemes for HPC and ML Applications0
StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting0
Deep Learning Model Deployment in Multiple Cloud Providers: an Exploratory Study Using Low Computing Power Environments0
Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables0
Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training0
Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference0
CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction0
PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference0
Disentangled 4D Gaussian Splatting: Towards Faster and More Efficient Dynamic Scene Rendering0
Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments0
ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model0
FACETS: Efficient Once-for-all Object Detection via Constrained Iterative Search0
Lobster: A GPU-Accelerated Framework for Neurosymbolic Programming0
Show:102550
← PrevPage 39 of 113Next →

No leaderboard results yet.