| Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing | Mar 15, 2024 | DisentanglementGPU | —Unverified | 0 |
| BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects | Mar 14, 2024 | 6D Pose Estimation using RGBGPU | —Unverified | 0 |
| FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images | Mar 14, 2024 | 3D Medical Imaging SegmentationGPU | CodeCode Available | 1 |
| MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation | Mar 14, 2024 | 3D ReconstructionAutonomous Navigation | CodeCode Available | 0 |
| SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models | Mar 14, 2024 | BlockingGPU | CodeCode Available | 4 |
| Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference | Mar 14, 2024 | GPU | —Unverified | 0 |
| Optimistic Verifiable Training by Controlling Hardware Nondeterminism | Mar 14, 2024 | Data PoisoningGPU | CodeCode Available | 1 |
| Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting | Mar 14, 2024 | 3D Multi-Person Human Pose EstimationGPU | —Unverified | 0 |
| A Novel Implicit Neural Representation for Volume Data | Mar 13, 2024 | GPU | —Unverified | 0 |
| Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution | Mar 13, 2024 | 3D Semantic Occupancy Prediction3D Semantic Segmentation | —Unverified | 0 |
| EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech | Mar 13, 2024 | GPUSpeech Synthesis | —Unverified | 0 |
| GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Mar 13, 2024 | GPUQuantization | CodeCode Available | 3 |
| SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model | Mar 13, 2024 | Depth EstimationGPU | CodeCode Available | 1 |
| METER: a mobile vision transformer architecture for monocular depth estimation | Mar 13, 2024 | CPUData Augmentation | CodeCode Available | 0 |
| Measuring the Energy Consumption and Efficiency of Deep Neural Networks: An Empirical Analysis and Design Recommendations | Mar 13, 2024 | CPUGPU | CodeCode Available | 0 |
| Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation | Mar 12, 2024 | Cross-Modal RetrievalGPU | CodeCode Available | 2 |
| Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality | Mar 12, 2024 | Bayesian OptimizationGPU | —Unverified | 0 |
| Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Mar 12, 2024 | GPUPoint Tracking | CodeCode Available | 1 |
| xMLP: Revolutionizing Private Inference with Exclusive Square Activation | Mar 12, 2024 | GPU | —Unverified | 0 |
| Communication Optimization for Distributed Training: Architecture, Advances, and Opportunities | Mar 12, 2024 | GPU | —Unverified | 0 |
| Characterization of Large Language Model Development in the Datacenter | Mar 12, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference | Mar 12, 2024 | GPUobject-detection | —Unverified | 0 |
| Scalable Spatiotemporal Prediction with Bayesian Neural Fields | Mar 12, 2024 | Bayesian InferenceDemand Forecasting | CodeCode Available | 2 |
| LookupFFN: Making Transformers Compute-lite for CPU inference | Mar 12, 2024 | CPUGPU | CodeCode Available | 1 |
| SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces | Mar 12, 2024 | GPUImage Generation | CodeCode Available | 1 |