| Training Compute-Optimal Large Language Models | Mar 29, 2022 | Anachronisms, Analogical Similarity | Code Available | 6 |
| Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding | Feb 9, 2025 | Image Captioning, Image-text Retrieval | Code Available | 3 |
| SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery | Dec 15, 2023 | Contrastive Learning, Earth Observation | Code Available | 3 |
| Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation | Apr 3, 2025 | Computational Efficiency, GPU | Code Available | 2 |
| xLSTM-Mixer: Multivariate Time Series Forecasting by Mixing via Scalar Memories | Oct 22, 2024 | Multivariate Time Series Forecasting, Temporal Sequences | Code Available | 2 |
| TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data | Oct 8, 2024 | Change Detection, Earth Observation | Code Available | 2 |
| RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments | Aug 28, 2024 | Autonomous Driving, Autonomous Navigation | Code Available | 2 |
| PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series Forecasting | Aug 20, 2024 | Multivariate Time Series Forecasting, Temporal Sequences | Code Available | 2 |
| Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning | Feb 7, 2024 | Contrastive Learning, Prediction | Code Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract Algebra, Anachronisms | Code Available | 2 |
| TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series | May 13, 2025 | Temporal Sequences, Time Series | Code Available | 1 |
| Grid: Omni Visual Generation | Dec 14, 2024 | Image Generation, Scheduling | Code Available | 1 |
| KAN-AD: Time Series Anomaly Detection with Kolmogorov-Arnold Networks | Nov 1, 2024 | Anomaly Detection, Kolmogorov-Arnold Networks | Code Available | 1 |
| Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training | May 30, 2024 | Temporal Sequences | Code Available | 1 |
| Semantically-correlated memories in a dense associative model | Apr 10, 2024 | Image Retrieval, model | Code Available | 1 |
| Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models | Mar 20, 2024 | Decision Making, Image Generation | Code Available | 1 |
| SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams | Mar 14, 2024 | Deblurring, Knowledge Distillation | Code Available | 1 |
| Remaining Useful Life Prediction for Aircraft Engines using LSTM | Jan 15, 2024 | Prediction, Temporal Sequences | Code Available | 1 |
| Short-term Precipitation Forecasting in The Netherlands: An Application of Convolutional LSTM neural networks to weather radar data | Dec 2, 2023 | Precipitation Forecasting, Temporal Sequences | Code Available | 1 |
| Cross-attention Spatio-temporal Context Transformer for Semantic Segmentation of Historical Maps | Oct 19, 2023 | Earth Observation, Semantic Segmentation | Code Available | 1 |
| NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series Pretraining | Oct 11, 2023 | Anomaly Detection, Few-Shot Learning | Code Available | 1 |
| Reducing ANN-SNN Conversion Error through Residual Membrane Potential | Feb 4, 2023 | Temporal Sequences | Code Available | 1 |
| Scalable Spatiotemporal Graph Neural Networks | Sep 14, 2022 | Temporal Sequences, Time Series | Code Available | 1 |
| Leveraging Language Foundation Models for Human Mobility Forecasting | Sep 11, 2022 | Decoder, Sequential Pattern Mining | Code Available | 1 |
| Causal Representation Learning for Instantaneous and Temporal Effects in Interactive Systems | Jun 13, 2022 | Causal Discovery, Representation Learning | Code Available | 1 |
| Continual Spatio-Temporal Graph Convolutional Networks | Mar 21, 2022 | Action Recognition, Skeleton Based Action Recognition | Code Available | 1 |
| CITRIS: Causal Identifiability from Temporal Intervened Sequences | Feb 7, 2022 | Representation Learning, Temporal Sequences | Code Available | 1 |
| M2A: Motion Aware Attention for Accurate Video Action Recognition | Nov 18, 2021 | Action Recognition, Temporal Action Localization | Code Available | 1 |
| Panoptic Segmentation of Satellite Image Time Series with Convolutional Temporal Attention Networks | Jul 16, 2021 | Cloud Removal, Earth Observation | Code Available | 1 |
| Predicting Human Scanpaths in Visual Question Answering | Jun 19, 2021 | Deep Reinforcement Learning, Question Answering | Code Available | 1 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action Classification, Dynamic Time Warping | Code Available | 1 |
| Learning future terrorist targets through temporal meta-graphs | Apr 21, 2021 | Humanitarian, Temporal Sequences | Code Available | 1 |
| Detecting Invisible People | Dec 15, 2020 | Depth Estimation, Monocular Depth Estimation | Code Available | 1 |
| A Graph Attention Spatio-temporal Convolutional Network for 3D Human Pose Estimation in Video | Mar 11, 2020 | 3D Human Pose Estimation, 3D Pose Estimation | Code Available | 1 |
| Graph WaveNet for Deep Spatial-Temporal Graph Modeling | May 31, 2019 | Graph Neural Network, Relation | Code Available | 1 |
| Functional Map of the World | Nov 21, 2017 | Temporal Sequences | Code Available | 1 |
| Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition | Jun 20, 2025 | Temporal Sequences, Video Generation | Unverified | 0 |
| TrajSceneLLM: A Multimodal Perspective on Semantic GPS Trajectory Analysis | Jun 19, 2025 | Temporal Sequences | Code Available | 0 |
| APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval | Jun 5, 2025 | Information Retrieval, Retrieval | Unverified | 0 |
| Time Blindness: Why Video-Language Models Can't See What Humans Can? | May 30, 2025 | Temporal Sequences, Video Understanding | Unverified | 0 |
| Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | May 23, 2025 | Motion Planning, Sequential Decision Making | Unverified | 0 |
| LLM4FTS: Enhancing Large Language Models for Financial Time Series Prediction | May 5, 2025 | Temporal Sequences, Time Series | Unverified | 0 |
| OT-Talk: Animating 3D Talking Head with Optimal Transportation | May 3, 2025 | Temporal Sequences | Unverified | 0 |
| Accelerated 3D-3D rigid registration of echocardiographic images obtained from apical window using particle filter | Apr 28, 2025 | CPU, Temporal Sequences | Unverified | 0 |
| TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | Apr 24, 2025 | Caption Generation, Dense Video Captioning | Unverified | 0 |
| Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation | Apr 7, 2025 | All, Music Generation | Unverified | 0 |
| Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles | Apr 2, 2025 | Autonomous Navigation, Autonomous Vehicles | Unverified | 0 |
| When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks | Apr 2, 2025 | Benchmarking, Language Modeling | Unverified | 0 |
| ASD Classification on Dynamic Brain Connectome using Temporal Random Walk with Transformer-based Dynamic Network Embedding | Mar 16, 2025 | Network Embedding, Temporal Sequences | Code Available | 0 |
| A Deep Learning Architecture for Land Cover Mapping Using Spatio-Temporal Sentinel-1 Features | Mar 10, 2025 | Management, Temporal Sequences | Unverified | 0 |