| Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding | Nov 12, 2023 | Object, Position | Code Available | 1 |
| MultiSPANS: A Multi-range Spatial-Temporal Transformer Network for Traffic Forecast via Structural Entropy Optimization | Nov 6, 2023 | Management, Position | Code Available | 1 |
| Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio | Nov 1, 2023 | Position | Code Available | 1 |
| Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models | Oct 30, 2023 | Position, Theory of Mind Modeling | Code Available | 1 |
| NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark | Oct 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| CLEX: Continuous Length Extrapolation for Large Language Models | Oct 25, 2023 | 4k, Position | Code Available | 1 |
| Semi-Supervised End-to-End Learning for Integrated Sensing and Communications | Oct 15, 2023 | ISAC, Position | Code Available | 1 |
| Generative Modeling with Phase Stochastic Bridges | Oct 11, 2023 | Image Generation, Position | Code Available | 1 |
| Fast, Expressive SE(n) Equivariant Networks through Weight-Sharing in Position-Orientation Space | Oct 4, 2023 | Computational Efficiency, Position | Code Available | 1 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2k, Position | Code Available | 1 |
| Mutation-based Fault Localization of Deep Neural Networks | Sep 10, 2023 | Fault localization, Position | Code Available | 1 |
| DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions | Sep 7, 2023 | Position, Spatial Reasoning | Code Available | 1 |
| Mask-Attention-Free Transformer for 3D Instance Segmentation | Sep 4, 2023 | 3D Instance Segmentation, Instance Segmentation | Code Available | 1 |
| A lightweight 3D dense facial landmark estimation model from position map data | Aug 29, 2023 | Keypoint Detection, Position | Code Available | 1 |
| Relighting Neural Radiance Fields with Shadow and Highlight Hints | Aug 25, 2023 | Position | Code Available | 1 |
| Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models | Aug 25, 2023 | cross-modal alignment, Position | Code Available | 1 |
| Instruction Position Matters in Sequence Generation with Large Language Models | Aug 23, 2023 | Instruction Following, Position | Code Available | 1 |
| DALNet: A Rail Detection Network Based on Dynamic Anchor Line | Aug 22, 2023 | Diversity, Lane Detection | Code Available | 1 |
| Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning | Aug 18, 2023 | 8k, Position | Code Available | 1 |
| DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting | Aug 16, 2023 | Graph Neural Network, Graph Regression | Code Available | 1 |
| Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking | Aug 14, 2023 | Position, Visual Tracking | Code Available | 1 |
| V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection | Aug 8, 2023 | 3D Object Detection, Decoder | Code Available | 1 |
| Point Anywhere: Directed Object Estimation from Omnidirectional Images | Aug 2, 2023 | Object, object-detection | Code Available | 1 |
| Advancing Beyond Identification: Multi-bit Watermark for Large Language Models | Aug 1, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Differentiable short-time Fourier transform with respect to the hop length | Jul 26, 2023 | Position | Code Available | 1 |