| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Feb 5, 2024 | In-Context LearningMetric Learning | CodeCode Available | 2 |
| Position: What Can Large Language Models Tell Us about Time Series Analysis | Feb 5, 2024 | Decision MakingPosition | CodeCode Available | 2 |
| Robot Trajectron: Trajectory Prediction-based Shared Control for Robot Manipulation | Feb 4, 2024 | PositionRobot Manipulation | CodeCode Available | 2 |
| Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Jan 17, 2024 | GPUImage Classification | CodeCode Available | 2 |
| Extending LLMs' Context Window with 100 Samples | Jan 13, 2024 | Position | CodeCode Available | 2 |
| Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training | Nov 15, 2023 | Passage RetrievalPosition | CodeCode Available | 2 |
| Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster | Nov 14, 2023 | GPUPosition | CodeCode Available | 2 |
| Position Interpolation Improves ALiBi Extrapolation | Oct 18, 2023 | Language ModellingPosition | CodeCode Available | 2 |
| ProbTS: Benchmarking Point and Distributional Forecasting across Diverse Prediction Horizons | Oct 11, 2023 | BenchmarkingPosition | CodeCode Available | 2 |
| PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training | Sep 19, 2023 | 2kPosition | CodeCode Available | 2 |
| Lost in the Middle: How Language Models Use Long Contexts | Jul 6, 2023 | Language ModellingPosition | CodeCode Available | 2 |
| Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | May 10, 2023 | Autonomous DrivingBench2Drive | CodeCode Available | 2 |
| Detection Transformer with Stable Matching | Apr 10, 2023 | DecoderPosition | CodeCode Available | 2 |
| LayoutDM: Discrete Diffusion Model for Controllable Layout Generation | Mar 14, 2023 | Layout Generationmodel | CodeCode Available | 2 |
| A Length-Extrapolatable Transformer | Dec 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow | Nov 18, 2022 | Optical Flow EstimationPosition | CodeCode Available | 2 |
| Point Transformer V2: Grouped Vector Attention and Partition-based Pooling | Oct 11, 2022 | 3D Point Cloud Classification3D Semantic Segmentation | CodeCode Available | 2 |
| Mega: Moving Average Equipped Gated Attention | Sep 21, 2022 | Image ClassificationInductive Bias | CodeCode Available | 2 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Aug 23, 2022 | 3D Object DetectionDecoder | CodeCode Available | 2 |
| Stratified Transformer for 3D Point Cloud Segmentation | Mar 28, 2022 | Point Cloud SegmentationPosition | CodeCode Available | 2 |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Mar 8, 2022 | Image Classificationobject-detection | CodeCode Available | 2 |
| Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation | Aug 27, 2021 | Inductive BiasPlaying the Game of 2048 | CodeCode Available | 2 |
| FLAT: Chinese NER Using Flat-Lattice Transformer | Apr 24, 2020 | Chinese Named Entity Recognitionnamed-entity-recognition | CodeCode Available | 2 |
| MPNet: Masked and Permuted Pre-training for Language Understanding | Apr 20, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation | Mar 17, 2020 | image-classificationImage Classification | CodeCode Available | 2 |
| Machine Learning in Asset Management—Part 1: Portfolio Construction—Trading Strategies | Feb 10, 2020 | Algorithmic TradingAsset Management | CodeCode Available | 2 |
| R-FCN-3000 at 30fps: Decoupling Detection and Classification | Dec 5, 2017 | ClassificationGeneral Classification | CodeCode Available | 2 |
| SeqPE: Transformer with Sequential Position Encoding | Jun 16, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| POSS: Position Specialist Generates Better Draft for Speculative Decoding | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices | Jun 4, 2025 | Position | CodeCode Available | 1 |
| LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding | May 22, 2025 | Position | CodeCode Available | 1 |
| A Skull-Adaptive Framework for AI-Based 3D Transcranial Focused Ultrasound Simulation | May 19, 2025 | Position | CodeCode Available | 1 |
| PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation | May 16, 2025 | 3D Human Pose EstimationPose Estimation | CodeCode Available | 1 |
| RGB-Event Fusion with Self-Attention for Collision Prediction | May 7, 2025 | BenchmarkingComputational Efficiency | CodeCode Available | 1 |
| TC-GS: Tri-plane based compression for 3D Gaussian Splatting | Mar 26, 2025 | 3DGSDecoder | CodeCode Available | 1 |
| LookAhead Tuning: Safer Language Models via Partial Answer Previews | Mar 24, 2025 | PositionSafety Alignment | CodeCode Available | 1 |
| Visual Position Prompt for MLLM based Visual Grounding | Mar 19, 2025 | PositionVisual Grounding | CodeCode Available | 1 |
| GNNs as Predictors of Agentic Workflow Performances | Mar 14, 2025 | BenchmarkingPosition | CodeCode Available | 1 |
| VRoPE: Rotary Position Embedding for Video Large Language Models | Feb 17, 2025 | PositionVideo Understanding | CodeCode Available | 1 |
| A Contextual-Aware Position Encoding for Sequential Recommendation | Feb 13, 2025 | PositionRecommendation Systems | CodeCode Available | 1 |
| WyckoffDiff -- A Generative Diffusion Model for Crystal Symmetry | Feb 10, 2025 | modelPosition | CodeCode Available | 1 |
| Position-aware Automatic Circuit Discovery | Feb 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ADIFF: Explaining audio difference using natural language | Feb 6, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 |
| Learning Efficient Positional Encodings with Graph Neural Networks | Feb 3, 2025 | Graph RegressionGraph Representation Learning | CodeCode Available | 1 |
| AlphaPre: Amplitude-Phase Disentanglement Model for Precipitation Nowcasting | Jan 1, 2025 | Disentanglementmodel | CodeCode Available | 1 |
| Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding | Jan 1, 2025 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Dec 27, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 1 |
| Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings | Dec 16, 2024 | Disaster Responsegeo-localization | CodeCode Available | 1 |
| Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture | Dec 16, 2024 | Mixture-of-ExpertsPosition | CodeCode Available | 1 |
| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image GenerationPosition | CodeCode Available | 1 |