| Beyond Utility: Evaluating LLM as Recommender | Nov 1, 2024 | PositionRe-Ranking | CodeCode Available | 1 |
| VECTOR: Velocity-Enhanced GRU Neural Network for Real-Time 3D UAV Trajectory Prediction | Oct 24, 2024 | PositionPrediction | CodeCode Available | 1 |
| ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos | Oct 21, 2024 | 3D Human Pose EstimationDisentanglement | CodeCode Available | 1 |
| Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count | Oct 21, 2024 | Position | CodeCode Available | 1 |
| TULIP: Token-length Upgraded CLIP | Oct 13, 2024 | Image GenerationPosition | CodeCode Available | 1 |
| PuzzleBoard: A New Camera Calibration Pattern with Position Encoding | Sep 30, 2024 | Camera CalibrationCamera Pose Estimation | CodeCode Available | 1 |
| OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images | Sep 29, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| FlashMix: Fast Map-Free LiDAR Localization via Feature Mixing and Contrastive-Constrained Accelerated Training | Sep 27, 2024 | Metric LearningPosition | CodeCode Available | 1 |
| Mastering Chess with a Transformer Model | Sep 18, 2024 | Decision Makingmodel | CodeCode Available | 1 |
| TrackSSM: A General Motion Predictor by State-Space Model | Aug 31, 2024 | DecoderMamba | CodeCode Available | 1 |
| Positional Prompt Tuning for Efficient 3D Representation Learning | Aug 21, 2024 | 3D Parameter-Efficient Fine-Tuning for Classification3D Point Cloud Classification | CodeCode Available | 1 |
| Recurrent Neural Networks Learn to Store and Generate Sequences using Non-Linear Representations | Aug 20, 2024 | Position | CodeCode Available | 1 |
| GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution | Aug 14, 2024 | Image Super-ResolutionPosition | CodeCode Available | 1 |
| PIR: Photometric Inverse Rendering with Shading Cues Modeling and Surface Reflectance Regularization | Aug 13, 2024 | Inverse RenderingPosition | CodeCode Available | 1 |
| DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training | Aug 1, 2024 | DenoisingGraph Matching | CodeCode Available | 1 |
| Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing | Jul 25, 2024 | ObjectPosition | CodeCode Available | 1 |
| PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction | Jul 24, 2024 | DecoderOnline Vectorized HD Map Construction | CodeCode Available | 1 |
| Improving Visual Place Recognition Based Robot Navigation By Verifying Localization Estimates | Jul 11, 2024 | PositionRobot Navigation | CodeCode Available | 1 |
| Pan-cancer Histopathology WSI Pre-training with Position-aware Masked Autoencoder | Jul 10, 2024 | Cancer ClassificationPosition | CodeCode Available | 1 |
| Eliminating Position Bias of Language Models: A Mechanistic Approach | Jul 1, 2024 | Mathobject-detection | CodeCode Available | 1 |
| Consensus Learning with Deep Sets for Essential Matrix Estimation | Jun 25, 2024 | Position | CodeCode Available | 1 |
| Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell | Jun 20, 2024 | Information RetrievalPosition | CodeCode Available | 1 |
| LieRE: Generalizing Rotary Position Encodings | Jun 14, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Comment on paper: Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems | Jun 11, 2024 | AllPosition | CodeCode Available | 1 |
| P2PFormer: A Primitive-to-polygon Method for Regular Building Contour Extraction from Remote Sensing Images | Jun 5, 2024 | Position | CodeCode Available | 1 |
| Mitigate Position Bias in Large Language Models via Scaling a Single Dimension | Jun 4, 2024 | Position | CodeCode Available | 1 |
| Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems | Jun 2, 2024 | Position | CodeCode Available | 1 |
| Learning to Play Air Hockey with Model-Based Deep Reinforcement Learning | Jun 1, 2024 | Deep Reinforcement LearningPosition | CodeCode Available | 1 |
| Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure | May 31, 2024 | DecoderPosition | CodeCode Available | 1 |
| Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays | May 20, 2024 | Anomaly DetectionPosition | CodeCode Available | 1 |
| NeRO: Neural Road Surface Reconstruction | May 17, 2024 | Autonomous DrivingPosition | CodeCode Available | 1 |
| Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning | May 15, 2024 | AllOperator learning | CodeCode Available | 1 |
| Position: Quo Vadis, Unsupervised Time Series Anomaly Detection? | May 4, 2024 | Anomaly DetectionBenchmarking | CodeCode Available | 1 |
| Towards Consistent Object Detection via LiDAR-Camera Synergy | May 2, 2024 | Objectobject-detection | CodeCode Available | 1 |
| GIST: Gibbs self-tuning for locally adaptive Hamiltonian Monte Carlo | Apr 23, 2024 | Position | CodeCode Available | 1 |
| Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks | Apr 20, 2024 | Pose PredictionPosition | CodeCode Available | 1 |
| Length Generalization of Causal Transformers without Position Encoding | Apr 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions | Apr 17, 2024 | Position | CodeCode Available | 1 |
| Leveraging edge detection and neural networks for better UAV localization | Apr 9, 2024 | Edge DetectionPosition | CodeCode Available | 1 |
| Resonance RoPE: Improving Context Length Generalization of Large Language Models | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Feb 28, 2024 | 3D Pose EstimationEgocentric Pose Estimation | CodeCode Available | 1 |
| Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning | Feb 28, 2024 | Position | CodeCode Available | 1 |
| Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation | Jan 29, 2024 | DisentanglementPosition | CodeCode Available | 1 |
| CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning | Jan 25, 2024 | Multiple-choicePosition | CodeCode Available | 1 |
| 3D Feature Tracking via Event Camera | Jan 1, 2024 | Motion CompensationPatch Matching | CodeCode Available | 1 |
| RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation | Dec 26, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Sample-Efficient Learning to Solve a Real-World Labyrinth Game Using Data-Augmented Model-Based Reinforcement Learning | Dec 15, 2023 | Model-based Reinforcement LearningNavigate | CodeCode Available | 1 |
| ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation | Dec 11, 2023 | Instance SegmentationPosition | CodeCode Available | 1 |
| Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes | Dec 7, 2023 | DecoderPosition | CodeCode Available | 1 |
| Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion | Dec 1, 2023 | Depth CompletionDepth Estimation | CodeCode Available | 1 |