| PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders | Aug 16, 2024 | 3D Object Classification3D Point Cloud Classification | CodeCode Available | 2 | 5 |
| Point Transformer V2: Grouped Vector Attention and Partition-based Pooling | Oct 11, 2022 | 3D Point Cloud Classification3D Semantic Segmentation | CodeCode Available | 2 | 5 |
| Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Nov 6, 2024 | 3DGSNeRF | CodeCode Available | 2 | 5 |
| Machine Learning in Asset Management—Part 1: Portfolio Construction—Trading Strategies | Feb 10, 2020 | Algorithmic TradingAsset Management | CodeCode Available | 2 | 5 |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Apr 18, 2024 | 4k8k | CodeCode Available | 2 | 5 |
| Lost in the Middle: How Language Models Use Long Contexts | Jul 6, 2023 | Language ModellingPosition | CodeCode Available | 2 | 5 |
| GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats | Jun 5, 2024 | 3D-Aware Image Synthesis3D Generation | CodeCode Available | 2 | 5 |
| An End-to-End Structure with Novel Position Mechanism and Improved EMD for Stock Forecasting | Mar 25, 2024 | PositionTime Series | CodeCode Available | 2 | 5 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Aug 23, 2022 | 3D Object DetectionDecoder | CodeCode Available | 2 | 5 |
| Position: Foundation Agents as the Paradigm Shift for Decision Making | May 27, 2024 | Decision MakingPosition | CodeCode Available | 2 | 5 |
| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Feb 5, 2024 | In-Context LearningMetric Learning | CodeCode Available | 2 | 5 |
| LayoutDM: Discrete Diffusion Model for Controllable Layout Generation | Mar 14, 2023 | Layout Generationmodel | CodeCode Available | 2 | 5 |
| MPNet: Masked and Permuted Pre-training for Language Understanding | Apr 20, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| FLAT: Chinese NER Using Flat-Lattice Transformer | Apr 24, 2020 | Chinese Named Entity Recognitionnamed-entity-recognition | CodeCode Available | 2 | 5 |
| Extending LLMs' Context Window with 100 Samples | Jan 13, 2024 | Position | CodeCode Available | 2 | 5 |
| Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster | Nov 14, 2023 | GPUPosition | CodeCode Available | 2 | 5 |
| Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization | Dec 23, 2024 | Position | CodeCode Available | 2 | 5 |
| Detection Transformer with Stable Matching | Apr 10, 2023 | DecoderPosition | CodeCode Available | 2 | 5 |
| FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization | Jan 17, 2025 | Anomaly DetectionImage-text matching | CodeCode Available | 2 | 5 |
| FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization | Apr 21, 2024 | Anomaly DetectionPosition | CodeCode Available | 2 | 5 |
| GLACE: Global Local Accelerated Coordinate Encoding | Jun 6, 2024 | Camera Pose EstimationPose Estimation | CodeCode Available | 2 | 5 |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Mar 8, 2022 | Image Classificationobject-detection | CodeCode Available | 2 | 5 |
| A Length-Extrapolatable Transformer | Dec 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow | Nov 18, 2022 | Optical Flow EstimationPosition | CodeCode Available | 2 | 5 |
| Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation | Mar 17, 2020 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training | Nov 15, 2023 | Passage RetrievalPosition | CodeCode Available | 2 | 5 |
| Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | May 10, 2023 | Autonomous DrivingBench2Drive | CodeCode Available | 2 | 5 |
| Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Feb 28, 2024 | 3D Pose EstimationEgocentric Pose Estimation | CodeCode Available | 1 | 5 |
| Deep Domain Confusion: Maximizing for Domain Invariance | Dec 10, 2014 | Domain AdaptationModel Selection | CodeCode Available | 1 | 5 |
| DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function | Jan 2, 2020 | Position | CodeCode Available | 1 | 5 |
| DeepBall: Deep Neural-Network Ball Detector | Feb 19, 2019 | General ClassificationObject | CodeCode Available | 1 | 5 |
| 3D Feature Tracking via Event Camera | Jan 1, 2024 | Motion CompensationPatch Matching | CodeCode Available | 1 | 5 |
| Deep Deformable 3D Caricatures with Learned Shape Control | Jul 29, 2022 | CaricaturePosition | CodeCode Available | 1 | 5 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2kPosition | CodeCode Available | 1 | 5 |
| A Case for Rejection in Low Resource ML Deployment | Aug 12, 2022 | DiversityPosition | CodeCode Available | 1 | 5 |
| DALNet: A Rail Detection Network Based on Dynamic Anchor Line | Aug 22, 2023 | DiversityLane Detection | CodeCode Available | 1 | 5 |
| Asynchronous Trajectory Matching-Based Multimodal Maritime Data Fusion for Vessel Traffic Surveillance in Inland Waterways | Feb 22, 2023 | PositionVessel Detection | CodeCode Available | 1 | 5 |
| A Transformer-based Approach for Source Code Summarization | May 1, 2020 | Code SummarizationPosition | CodeCode Available | 1 | 5 |
| Audio-Conditioned U-Net for Position Estimation in Full Sheet Images | Oct 16, 2019 | Multimodal Deep LearningPosition | CodeCode Available | 1 | 5 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | ObjectPosition | CodeCode Available | 1 | 5 |
| Deep Momentum Multi-Marginal Schrödinger Bridge | Mar 3, 2023 | Position | CodeCode Available | 1 | 5 |
| A Skull-Adaptive Framework for AI-Based 3D Transcranial Focused Ultrasound Simulation | May 19, 2025 | Position | CodeCode Available | 1 | 5 |
| Assigning personality/identity to a chatting machine for coherent conversation generation | Jun 9, 2017 | ChatbotDecoder | CodeCode Available | 1 | 5 |
| ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance Segmentation | Dec 11, 2023 | Instance SegmentationPosition | CodeCode Available | 1 | 5 |
| ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos | Oct 21, 2024 | 3D Human Pose EstimationDisentanglement | CodeCode Available | 1 | 5 |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Jul 31, 2021 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings | Dec 16, 2024 | Disaster Responsegeo-localization | CodeCode Available | 1 | 5 |
| Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count | Oct 21, 2024 | Position | CodeCode Available | 1 | 5 |
| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image GenerationPosition | CodeCode Available | 1 | 5 |
| Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Sep 3, 2018 | Face HallucinationHallucination | CodeCode Available | 1 | 5 |