| Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition | Jun 20, 2025 | Temporal SequencesVideo Generation | —Unverified | 0 |
| TrajSceneLLM: A Multimodal Perspective on Semantic GPS Trajectory Analysis | Jun 19, 2025 | Temporal Sequences | CodeCode Available | 0 |
| APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval | Jun 5, 2025 | Information RetrievalRetrieval | —Unverified | 0 |
| Time Blindness: Why Video-Language Models Can't See What Humans Can? | May 30, 2025 | Temporal SequencesVideo Understanding | —Unverified | 0 |
| Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | May 23, 2025 | Motion PlanningSequential Decision Making | —Unverified | 0 |
| TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series | May 13, 2025 | Temporal SequencesTime Series | CodeCode Available | 1 |
| LLM4FTS: Enhancing Large Language Models for Financial Time Series Prediction | May 5, 2025 | Temporal SequencesTime Series | —Unverified | 0 |
| OT-Talk: Animating 3D Talking Head with Optimal Transportation | May 3, 2025 | Temporal Sequences | —Unverified | 0 |
| Accelerated 3D-3D rigid registration of echocardiographic images obtained from apical window using particle filter | Apr 28, 2025 | CPUTemporal Sequences | —Unverified | 0 |
| TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | Apr 24, 2025 | Caption GenerationDense Video Captioning | —Unverified | 0 |
| Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation | Apr 7, 2025 | AllMusic Generation | —Unverified | 0 |
| Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation | Apr 3, 2025 | Computational EfficiencyGPU | CodeCode Available | 2 |
| When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks | Apr 2, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles | Apr 2, 2025 | Autonomous NavigationAutonomous Vehicles | —Unverified | 0 |
| ASD Classification on Dynamic Brain Connectome using Temporal Random Walk with Transformer-based Dynamic Network Embedding | Mar 16, 2025 | Network EmbeddingTemporal Sequences | CodeCode Available | 0 |
| A Deep Learning Architecture for Land Cover Mapping Using Spatio-Temporal Sentinel-1 Features | Mar 10, 2025 | ManagementTemporal Sequences | —Unverified | 0 |
| Towards Patient-Specific Surgical Planning for Bicuspid Aortic Valve Repair: Fully Automated Segmentation of the Aortic Valve in 4D CT | Feb 13, 2025 | Computed Tomography (CT)Segmentation | —Unverified | 0 |
| Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding | Feb 9, 2025 | Image CaptioningImage-text Retrieval | CodeCode Available | 3 |
| MV-GMN: State Space Model for Multi-View Action Recognition | Jan 23, 2025 | Action RecognitionMamba | —Unverified | 0 |
| IMSSA: Deploying modern state-space models on memristive in-memory compute hardware | Dec 28, 2024 | GPUQuantization | —Unverified | 0 |
| A Staged Deep Learning Approach to Spatial Refinement in 3D Temporal Atmospheric Transport | Dec 14, 2024 | Super-ResolutionTemporal Sequences | —Unverified | 0 |
| Grid: Omni Visual Generation | Dec 14, 2024 | Image GenerationScheduling | CodeCode Available | 1 |
| DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models | Dec 5, 2024 | Temporal SequencesVideo Generation | —Unverified | 0 |
| LiDAR-based End-to-end Temporal Perception for Vehicle-Infrastructure Cooperation | Nov 22, 2024 | Autonomous DrivingTemporal Sequences | —Unverified | 0 |
| KAN-AD: Time Series Anomaly Detection with Kolmogorov-Arnold Networks | Nov 1, 2024 | Anomaly DetectionKolmogorov-Arnold Networks | CodeCode Available | 1 |
| xLSTM-Mixer: Multivariate Time Series Forecasting by Mixing via Scalar Memories | Oct 22, 2024 | Multivariate Time Series ForecastingTemporal Sequences | CodeCode Available | 2 |
| MarineFormer: A Spatio-Temporal Attention Model for USV Navigation in Dynamic Marine Environments | Oct 17, 2024 | Collision AvoidanceGraph Attention | —Unverified | 0 |
| Scalable Mechanistic Neural Networks for Differential Equations and Machine Learning | Oct 8, 2024 | Temporal Sequences | CodeCode Available | 0 |
| TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data | Oct 8, 2024 | Change DetectionEarth Observation | CodeCode Available | 2 |
| SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition | Sep 30, 2024 | Action RecognitionSurgical phase recognition | —Unverified | 0 |
| Causal Temporal Representation Learning with Nonstationary Sparse Transition | Sep 5, 2024 | Representation LearningTemporal Sequences | CodeCode Available | 0 |
| RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments | Aug 28, 2024 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 2 |
| PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series Forecasting | Aug 20, 2024 | Multivariate Time Series ForecastingTemporal Sequences | CodeCode Available | 2 |
| A Comprehensive Review of Few-shot Action Recognition | Jul 20, 2024 | Action RecognitionFew-Shot action recognition | —Unverified | 0 |
| Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning | Jul 11, 2024 | Temporal SequencesZero-Shot Learning | —Unverified | 0 |
| Spatiotemporal Predictions of Toxic Urban Plumes Using Deep Learning | May 30, 2024 | Deep LearningTemporal Sequences | —Unverified | 0 |
| Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training | May 30, 2024 | Temporal Sequences | CodeCode Available | 1 |
| Untangling Lariats: Subgradient Following of Variationally Penalized Objectives | May 7, 2024 | Temporal Sequences | —Unverified | 0 |
| F5C-finder: An Explainable and Ensemble Biological Language Model for Predicting 5-Formylcytidine Modifications on mRNA | Apr 20, 2024 | Ensemble LearningLanguage Modeling | CodeCode Available | 0 |
| Learning Object Semantic Similarity with Self-Supervision | Apr 19, 2024 | ObjectSemantic Similarity | —Unverified | 0 |
| Semantically-correlated memories in a dense associative model | Apr 10, 2024 | Image Retrievalmodel | CodeCode Available | 1 |
| Interpretable Neural Temporal Point Processes for Modelling Electronic Health Records | Apr 9, 2024 | Point ProcessesTemporal Sequences | —Unverified | 0 |
| Mitigating LLM Hallucinations via Conformal Abstention | Apr 4, 2024 | Conformal PredictionGenerative Question Answering | —Unverified | 0 |
| Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models | Mar 20, 2024 | Decision MakingImage Generation | CodeCode Available | 1 |
| Towards the Reusability and Compositionality of Causal Representations | Mar 14, 2024 | Representation LearningTemporal Sequences | —Unverified | 0 |
| SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams | Mar 14, 2024 | DeblurringKnowledge Distillation | CodeCode Available | 1 |
| Structural Positional Encoding for knowledge integration in transformer-based medical process monitoring | Mar 13, 2024 | Knowledge Graph EmbeddingManagement | CodeCode Available | 0 |
| Deep Learning Approaches for Human Action Recognition in Video Data | Mar 11, 2024 | Action RecognitionAction Recognition In Videos | —Unverified | 0 |
| DeepSRGM -- Sequence Classification and Ranking in Indian Classical Music with Deep Learning | Feb 15, 2024 | Information RetrievalMusic Information Retrieval | —Unverified | 0 |
| Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning | Feb 7, 2024 | Contrastive LearningPrediction | CodeCode Available | 2 |