| Development and Application of a Monte Carlo Tree Search Algorithm for Simulating Da Vinci Code Game Strategies | Mar 15, 2024 | CPUDecision Making | —Unverified | 0 |
| BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects | Mar 14, 2024 | 6D Pose Estimation using RGBGPU | —Unverified | 0 |
| MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation | Mar 14, 2024 | 3D ReconstructionAutonomous Navigation | CodeCode Available | 0 |
| FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images | Mar 14, 2024 | 3D Medical Imaging SegmentationGPU | CodeCode Available | 1 |
| Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference | Mar 14, 2024 | GPU | —Unverified | 0 |
| Optimistic Verifiable Training by Controlling Hardware Nondeterminism | Mar 14, 2024 | Data PoisoningGPU | CodeCode Available | 1 |
| SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models | Mar 14, 2024 | BlockingGPU | CodeCode Available | 4 |
| Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting | Mar 14, 2024 | 3D Multi-Person Human Pose EstimationGPU | —Unverified | 0 |
| A Novel Implicit Neural Representation for Volume Data | Mar 13, 2024 | GPU | —Unverified | 0 |
| Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Mar 13, 2024 | 3D Semantic Occupancy Prediction3D Semantic Segmentation | —Unverified | 0 |
| Measuring the Energy Consumption and Efficiency of Deep Neural Networks: An Empirical Analysis and Design Recommendations | Mar 13, 2024 | CPUGPU | CodeCode Available | 0 |
| GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Mar 13, 2024 | GPUQuantization | CodeCode Available | 3 |
| SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model | Mar 13, 2024 | Depth EstimationGPU | CodeCode Available | 1 |
| EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech | Mar 13, 2024 | GPUSpeech Synthesis | —Unverified | 0 |
| METER: a mobile vision transformer architecture for monocular depth estimation | Mar 13, 2024 | CPUData Augmentation | CodeCode Available | 0 |
| Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation | Mar 12, 2024 | Cross-Modal RetrievalGPU | CodeCode Available | 2 |
| Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Mar 12, 2024 | GPUPoint Tracking | CodeCode Available | 1 |
| xMLP: Revolutionizing Private Inference with Exclusive Square Activation | Mar 12, 2024 | GPU | —Unverified | 0 |
| Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality | Mar 12, 2024 | Bayesian OptimizationGPU | —Unverified | 0 |
| Communication Optimization for Distributed Training: Architecture, Advances, and Opportunities | Mar 12, 2024 | GPU | —Unverified | 0 |
| Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference | Mar 12, 2024 | GPUobject-detection | —Unverified | 0 |
| Characterization of Large Language Model Development in the Datacenter | Mar 12, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces | Mar 12, 2024 | GPUImage Generation | CodeCode Available | 1 |
| LookupFFN: Making Transformers Compute-lite for CPU inference | Mar 12, 2024 | CPUGPU | CodeCode Available | 1 |
| Scalable Spatiotemporal Prediction with Bayesian Neural Fields | Mar 12, 2024 | Bayesian InferenceDemand Forecasting | CodeCode Available | 2 |
| Multiple Population Alternate Evolution Neural Architecture Search | Mar 11, 2024 | DiversityGPU | —Unverified | 0 |
| A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge | Mar 11, 2024 | GPUimage-classification | —Unverified | 0 |
| Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System | Mar 11, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| Ensemble Quadratic Assignment Network for Graph Matching | Mar 11, 2024 | 3D Shape ClassificationGPU | —Unverified | 0 |
| SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations | Mar 10, 2024 | Automatic Speech RecognitionData Augmentation | CodeCode Available | 0 |
| UniSparse: An Intermediate Language for General Sparse Format Customization | Mar 9, 2024 | AttributeCode Generation | CodeCode Available | 1 |
| Optimizing LLM Queries in Relational Data Analytics Workloads | Mar 9, 2024 | GPU | —Unverified | 0 |
| HDReason: Algorithm-Hardware Codesign for Hyperdimensional Knowledge Graph Reasoning | Mar 9, 2024 | GPUGraph Classification | —Unverified | 0 |
| Multi-GPU-Enabled Hybrid Quantum-Classical Workflow in Quantum-HPC Middleware: Applications in Quantum Simulations | Mar 9, 2024 | BenchmarkingCPU | CodeCode Available | 0 |
| Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance | Mar 8, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 2 |
| SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting | Mar 8, 2024 | GPU | CodeCode Available | 4 |
| RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction | Mar 8, 2024 | Audio GenerationComputational Efficiency | CodeCode Available | 2 |
| Looking Ahead to Avoid Being Late: Solving Hard-Constrained Traveling Salesman Problem | Mar 8, 2024 | GPUTraveling Salesman Problem | —Unverified | 0 |
| Divide and Conquer: High-Resolution Industrial Anomaly Detection via Memory Efficient Tiled Ensemble | Mar 7, 2024 | Anomaly DetectionGPU | CodeCode Available | 9 |
| Benchmarking News Recommendation in the Era of Green AI | Mar 7, 2024 | BenchmarkingGPU | —Unverified | 0 |
| SWAP-NAS: Sample-Wise Activation Patterns for Ultra-fast NAS | Mar 7, 2024 | GPUNeural Architecture Search | CodeCode Available | 0 |
| Tensor Power Flow Formulations for Multidimensional Analyses in Distribution Systems | Mar 7, 2024 | CPUGPU | —Unverified | 0 |
| AcceleratedLiNGAM: Learning Causal DAGs at the speed of GPUs | Mar 6, 2024 | Causal DiscoveryCausal Inference | CodeCode Available | 0 |
| Fast, nonlocal and neural: a lightweight high quality solution to image denoising | Mar 6, 2024 | DenoisingGPU | —Unverified | 0 |
| SPEAR:Exact Gradient Inversion of Batches in Federated Learning | Mar 6, 2024 | Federated LearningGPU | —Unverified | 0 |
| CenterDisks: Real-time instance segmentation with disk covering | Mar 5, 2024 | GPUInstance Segmentation | CodeCode Available | 0 |
| G-EvoNAS: Evolutionary Neural Architecture Search Based on Network Growth | Mar 5, 2024 | GPUimage-classification | —Unverified | 0 |
| Birbal: An efficient 7B instruct-model fine-tuned with curated datasets | Mar 4, 2024 | GPU | CodeCode Available | 2 |
| MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection | Mar 4, 2024 | GPUMamba | CodeCode Available | 2 |
| Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve | Mar 4, 2024 | GPUScheduling | CodeCode Available | 3 |