| Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks | Jan 6, 2022 | Audio Classification, Classification | Code Available | 1 |
| Position-Guided Point Cloud Panoptic Segmentation Transformer | Mar 23, 2023 | Instance Segmentation, Panoptic Segmentation | Code Available | 1 |
| CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition | Nov 22, 2021 | Position, Scene Text Recognition | Code Available | 1 |
| CDGNet: Class Distribution Guided Network for Human Parsing | Nov 28, 2021 | Human Parsing, Position | Code Available | 1 |
| Position-prior Clustering-based Self-attention Module for Knee Cartilage Segmentation | Jun 21, 2022 | Clustering, Position | Code Available | 1 |
| Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems | Jun 2, 2024 | Position | Code Available | 1 |
| A Transformer-based Approach for Source Code Summarization | May 1, 2020 | Code Summarization, Position | Code Available | 1 |
| PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction | Jul 24, 2024 | Decoder, Online Vectorized HD Map Construction | Code Available | 1 |
| CentripetalNet: Pursuing High-quality Keypoint Pairs for Object Detection | Mar 20, 2020 | Instance Segmentation, object-detection | Code Available | 1 |
| CLEX: Continuous Length Extrapolation for Large Language Models | Oct 25, 2023 | 4k, Position | Code Available | 1 |
| PTRAIL -- A python package for parallel trajectory data preprocessing | Aug 26, 2021 | Feature Engineering, Position | Code Available | 1 |
| Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Feb 28, 2024 | 3D Pose Estimation, Egocentric Pose Estimation | Code Available | 1 |
| PTT: Point-Track-Transformer Module for 3D Single Object Tracking in Point Clouds | Aug 14, 2021 | 3D Single Object Tracking, GPU | Code Available | 1 |
| Benchmarking TinyML Systems: Challenges and Direction | Mar 10, 2020 | Benchmarking, Position | Code Available | 1 |
| Can an AI Win Ghana's National Science and Maths Quiz? An AI Grand Challenge for Education | Jan 30, 2023 | Math, Position | Code Available | 1 |
| Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration | Mar 4, 2021 | 3D Hand Pose Estimation, Position | Code Available | 1 |
| CAPE: Camera View Position Embedding for Multi-View 3D Object Detection | Mar 17, 2023 | 3D Object Detection, object-detection | Code Available | 1 |
| Region-based Non-local Operation for Video Classification | Jul 17, 2020 | Action Classification, Action Recognition | Code Available | 1 |
| Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers | May 25, 2022 | Federated Learning, Position | Code Available | 1 |
| Relation-aware Graph Attention Networks with Relational Position Encodings for Emotion Recognition in Conversations | Nov 1, 2020 | Emotion Recognition, Emotion Recognition in Conversation | Code Available | 1 |
| Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols | Sep 29, 2020 | Position | Code Available | 1 |
| Unified Learning Approach for Egocentric Hand Gesture Recognition and Fingertip Detection | Jan 6, 2021 | Fingertip Detection, Gesture Recognition | Code Available | 1 |
| Brain over Brawn: Using a Stereo Camera to Detect, Track, and Intercept a Faster UAV by Reconstructing the Intruder's Trajectory | Jul 2, 2021 | Position | Code Available | 1 |
| Rethinking Spatial Invariance of Convolutional Networks for Object Counting | Jun 10, 2022 | Crowd Counting, Object | Code Available | 1 |
| Camera Pose Auto-Encoders for Improving Pose Regression | Jul 12, 2022 | Position, regression | Code Available | 1 |
| RGB-Event Fusion with Self-Attention for Collision Prediction | May 7, 2025 | Benchmarking, Computational Efficiency | Code Available | 1 |
| Causal Imitative Model for Autonomous Driving | Dec 7, 2021 | Autonomous Driving, Imitation Learning | Code Available | 1 |
| Robustness Verification for Transformers | Feb 16, 2020 | Position, Sentiment Analysis | Code Available | 1 |
| CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning | Jan 25, 2024 | Multiple-choice, Position | Code Available | 1 |
| RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases | Apr 7, 2020 | Position, slot-filling | Code Available | 1 |
| Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Sep 3, 2018 | Face Hallucination, Hallucination | Code Available | 1 |
| Scene-Aware 3D Multi-Human Motion Capture from a Single Camera | Jan 12, 2023 | Position | Code Available | 1 |
| Searching for long faint astronomical high energy transients: a data driven approach | Mar 28, 2023 | Anomaly Detection, Pathfinder | Code Available | 1 |
| Segatron: Segment-Aware Transformer for Language Modeling and Understanding | Apr 30, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| Semi-Supervised End-to-End Learning for Integrated Sensing and Communications | Oct 15, 2023 | ISAC, Position | Code Available | 1 |
| Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors | Mar 1, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting | Aug 16, 2023 | Graph Neural Network, Graph Regression | Code Available | 1 |
| Shortformer: Better Language Modeling using Shorter Inputs | Dec 31, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| Generative Modeling with Phase Stochastic Bridges | Oct 11, 2023 | Image Generation, Position | Code Available | 1 |
| SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition | Oct 30, 2019 | Position | Code Available | 1 |
| Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio | Nov 1, 2023 | Position | Code Available | 1 |
| An Energy Management System Approach for Power System Cyber-Physical Resilience | Oct 7, 2021 | energy management, Management | Unverified | 0 |
| An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement | Jan 18, 2024 | POS, Position | Unverified | 0 |
| An Empirical Study on Position of the Batch Normalization Layer in Convolutional Neural Networks | Dec 9, 2019 | Position | Unverified | 0 |
| Accurate Entrance Position Detection Based on Wi-Fi and GPS Signals Using Machine Learning | Dec 10, 2019 | BIG-bench Machine Learning, Position | Unverified | 0 |
| MonoVisual3DFilter: 3D tomatoes' localisation with monocular cameras using histogram filters | Oct 9, 2023 | Position | Unverified | 0 |
| A Differential Evolution-Enhanced Latent Factor Analysis Model for High-dimensional and Sparse Data | Apr 2, 2022 | Position | Unverified | 0 |
| An Empirical Study on Display Ad Impression Viewability Measurements | May 21, 2015 | Position | Unverified | 0 |
| Accurate and Robust Neural Networks for Security Related Applications Exampled by Face Morphing Attacks | Jun 11, 2018 | Decision Making, Position | Unverified | 0 |
| An Element Sensitive Saliency Model with Position Prior Learning for Web Pages | Apr 27, 2018 | Position, Prediction | Unverified | 0 |