| A 2D Sinogram-Based Approach to Defect Localization in Computed Tomography | Jan 29, 2024 | Deep Learning, Defect Detection | Unverified | 0 |
| You Only Look Bottom-Up for Monocular 3D Object Detection | Jan 27, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech | Jan 25, 2024 | Decoder, Hallucination | Unverified | 0 |
| CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning | Jan 25, 2024 | Multiple-choice, Position | Code Available | 1 |
| Position: AI/ML Influencers Have a Place in the Academic Process | Jan 24, 2024 | Causal Inference, Diversity | Unverified | 0 |
| Perception-latency aware distributed target tracking | Jan 24, 2024 | Position | Unverified | 0 |
| Collaborative Position Reasoning Network for Referring Image Segmentation | Jan 22, 2024 | Image Segmentation, Position | Unverified | 0 |
| Coevolving Artistic Images Using OMNIREP | Jan 20, 2024 | Position | Code Available | 0 |
| When Large Language Models Meet Evolutionary Algorithms: Potential Enhancements and Challenges | Jan 19, 2024 | Evolutionary Algorithms, Multi-Task Learning | Unverified | 0 |
| Learning Position-Aware Implicit Neural Network for Real-World Face Inpainting | Jan 19, 2024 | Decoder, Facial Inpainting | Unverified | 0 |
| Mitigating Position Bias with Regularization for Recommender Systems | Jan 18, 2024 | Fairness, Position | Unverified | 0 |
| An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement | Jan 18, 2024 | POS, Position | Unverified | 0 |
| QoS-Aware 3D Coverage Deployment of UAVs for Internet of Vehicles in Intelligent Transportation | Jan 18, 2024 | Diversity, Position | Unverified | 0 |
| CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition | Jan 18, 2024 | Position, Scene Text Recognition | Unverified | 0 |
| Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation | Jan 18, 2024 | Denoising, Position | Unverified | 0 |
| Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Jan 17, 2024 | GPU, Image Classification | Code Available | 2 |
| Solution of the Probabilistic Lambert Problem: Connections with Optimal Mass Transport, Schrödinger Bridge and Reaction-Diffusion PDEs | Jan 15, 2024 | Position | Unverified | 0 |
| Carrying over algorithm in transformers | Jan 15, 2024 | Decoder, Position | Code Available | 0 |
| Extending LLMs' Context Window with 100 Samples | Jan 13, 2024 | Position | Code Available | 2 |
| E^2-LLM: Efficient and Extreme Length Extension of Large Language Models | Jan 13, 2024 | 4k, GPU | Unverified | 0 |
| Full-State Prescribed Performance-Based Consensus of Double-Integrator Multi-Agent Systems with Jointly Connected Topologies | Jan 11, 2024 | Position | Unverified | 0 |
| UAV-enabled Integrated Sensing and Communication: Tracking Design and Optimization | Jan 8, 2024 | Integrated sensing and communication, ISAC | Unverified | 0 |
| PosDiffNet: Positional Neural Diffusion for Point Cloud Registration in a Large Field of View with Perturbations | Jan 6, 2024 | Point Cloud Registration, Position | Code Available | 0 |
| Multimodal Data Curation via Object Detection and Filter Ensembles | Jan 5, 2024 | Object, object-detection | Unverified | 0 |
| Robot-Assisted Deep Venous Thrombosis Ultrasound Examination using Virtual Fixture | Jan 4, 2024 | ARC, Position | Code Available | 0 |