| Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation | Jan 29, 2024 | Disentanglement, Position | Code Available | 1 |
| You Only Look Bottom-Up for Monocular 3D Object Detection | Jan 27, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning | Jan 25, 2024 | Multiple-choice, Position | Code Available | 1 |
| VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech | Jan 25, 2024 | Decoder, Hallucination | Unverified | 0 |
| Perception-latency aware distributed target tracking | Jan 24, 2024 | Position | Unverified | 0 |
| Position: AI/ML Influencers Have a Place in the Academic Process | Jan 24, 2024 | Causal Inference, Diversity | Unverified | 0 |
| Collaborative Position Reasoning Network for Referring Image Segmentation | Jan 22, 2024 | Image Segmentation, Position | Unverified | 0 |
| Coevolving Artistic Images Using OMNIREP | Jan 20, 2024 | Position | Code Available | 0 |
| Learning Position-Aware Implicit Neural Network for Real-World Face Inpainting | Jan 19, 2024 | Decoder, Facial Inpainting | Unverified | 0 |
| When Large Language Models Meet Evolutionary Algorithms: Potential Enhancements and Challenges | Jan 19, 2024 | Evolutionary Algorithms, Multi-Task Learning | Unverified | 0 |