| Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs | Sep 27, 2024 | GPU, Recommendation Systems | Code Available | 1 |
| MALPOLON: A Framework for Deep Species Distribution Modeling | Sep 26, 2024 | Benchmarking, GPU | Code Available | 1 |
| LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Sep 26, 2024 | GPU, NeRF | Code Available | 1 |
| Search for Efficient Large Language Models | Sep 25, 2024 | GPU, Model Compression | Code Available | 1 |
| CAD: Memory Efficient Convolutional Adapter for Segment Anything | Sep 24, 2024 | Decoder, GPU | Code Available | 1 |
| Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed | Sep 24, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 |
| FastGL: A GPU-Efficient Framework for Accelerating Sampling-Based GNN Training at Large Scale | Sep 23, 2024 | GPU | Code Available | 1 |
| CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs | Sep 19, 2024 | GPU | Code Available | 1 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPU, Language Modeling | Code Available | 1 |
| One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild | Sep 15, 2024 | GPU, Image Generation | Code Available | 1 |
| LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Sep 5, 2024 | GPU, Image Super-Resolution | Code Available | 1 |
| LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones | Sep 5, 2024 | CPU, GPU | Code Available | 1 |
| TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval | Sep 2, 2024 | GPU, Retrieval | Code Available | 1 |
| TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification | Aug 29, 2024 | CPU, Diagnostic | Code Available | 1 |
| GPU-Accelerated Counterfactual Regret Minimization | Aug 27, 2024 | counterfactual, GPU | Code Available | 1 |
| Efficient fine-tuning of 37-level GraphCast with the Canadian global deterministic analysis | Aug 26, 2024 | GPU | Code Available | 1 |
| S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points | Aug 23, 2024 | 3D Reconstruction, 4D reconstruction | Code Available | 1 |
| EdgeNAT: Transformer for Efficient Edge Detection | Aug 20, 2024 | Edge Detection, GPU | Code Available | 1 |
| HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models | Aug 20, 2024 | GPU, Language Modelling | Code Available | 1 |
| Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration | Aug 7, 2024 | GPU, Intrusion Detection | Code Available | 1 |
| Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation | Aug 7, 2024 | GPU, Quantization | Code Available | 1 |
| Image-to-LaTeX Converter for Mathematical Formulas and Text | Aug 7, 2024 | Decoder, GPU | Code Available | 1 |
| RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders | Aug 5, 2024 | GPU, Recommendation Systems | Code Available | 1 |
| Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making | Aug 2, 2024 | Cloud Computing, Decision Making | Code Available | 1 |
| Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative Driving | Aug 1, 2024 | Conformal Prediction, Data Integration | Code Available | 1 |
| OmniBal: Towards Fast Instruct-tuning for Vision-Language Models via Omniverse Computation Balance | Jul 30, 2024 | GPU | Code Available | 1 |
| CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Jul 30, 2024 | Contrastive Learning, Diagnostic | Code Available | 1 |
| Pruning Large Language Models with Semi-Structural Adaptive Sparse Training | Jul 30, 2024 | GPU, Knowledge Distillation | Code Available | 1 |
| Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark | Jul 18, 2024 | GPU, Image Retrieval | Code Available | 1 |
| Attention in SRAM on Tenstorrent Grayskull | Jul 18, 2024 | CPU, GPU | Code Available | 1 |
| WiNet: Wavelet-based Incremental Learning for Efficient Medical Image Registration | Jul 18, 2024 | GPU, Image Registration | Code Available | 1 |
| FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification | Jul 17, 2024 | CPU, Domain Adaptation | Code Available | 1 |
| Separable Operator Networks | Jul 15, 2024 | Benchmarking, GPU | Code Available | 1 |
| LeRF: Learning Resampling Function for Adaptive and Efficient Image Interpolation | Jul 13, 2024 | GPU | Code Available | 1 |
| FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging | Jul 11, 2024 | Diversity, Federated Learning | Code Available | 1 |
| HPFF: Hierarchical Locally Supervised Learning with Patch Feature Fusion | Jul 8, 2024 | GPU | Code Available | 1 |
| Momentum Auxiliary Network for Supervised Local Learning | Jul 8, 2024 | GPU, image-classification | Code Available | 1 |
| SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Jul 6, 2024 | Dynamic Reconstruction, GPU | Code Available | 1 |
| Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization | Jul 5, 2024 | GPU, Image-to-Image Translation | Code Available | 1 |
| HRSAM: Efficient Interactive Segmentation in High-Resolution Images | Jul 2, 2024 | Data Augmentation, GPU | Code Available | 1 |
| QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices | Jul 2, 2024 | GPU, Quantization | Code Available | 1 |
| Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs | Jul 1, 2024 | GPU, Mixture-of-Experts | Code Available | 1 |
| LLMEasyQuant: Scalable Quantization for Parallel and Distributed LLM Inference | Jun 28, 2024 | GPU, Quantization | Code Available | 1 |
| ConStyle v2: A Strong Prompter for All-in-One Image Restoration | Jun 26, 2024 | All, GPU | Code Available | 1 |
| SEED: Accelerating Reasoning Tree Construction via Scheduled Speculative Decoding | Jun 26, 2024 | GPU, Management | Code Available | 1 |
| Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Jun 25, 2024 | GPU, image-classification | Code Available | 1 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPU, Language Modeling | Code Available | 1 |
| Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Jun 20, 2024 | Autonomous Driving, CPU | Code Available | 1 |
| CE-SSL: Computation-Efficient Semi-Supervised Learning for ECG-based Cardiovascular Diseases Detection | Jun 20, 2024 | Computational Efficiency, Electrocardiography (ECG) | Code Available | 1 |
| LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation | Jun 18, 2024 | GPU, Natural Language Understanding | Code Available | 1 |