| UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis | Mar 20, 2025 | Document Layout AnalysisDocument Summarization | CodeCode Available | 2 |
| Multi-Modal Mamba Modeling for Survival Prediction (M4Survive): Adapting Joint Foundation Model Representations | Mar 13, 2025 | Computational EfficiencyMamba | CodeCode Available | 2 |
| FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction | Feb 27, 2025 | Image GenerationPrediction | CodeCode Available | 2 |
| Electron flow matching for generative reaction mechanism prediction obeying conservation laws | Feb 18, 2025 | Prediction | CodeCode Available | 2 |
| PyMOLfold: Interactive Protein and Ligand Structure Prediction in PyMOL | Feb 1, 2025 | PredictionProtein Folding | CodeCode Available | 2 |
| Deep Learning and Foundation Models for Weather Prediction: A Survey | Jan 12, 2025 | Deep LearningPrediction | CodeCode Available | 2 |
| Next Patch Prediction for Autoregressive Visual Generation | Dec 19, 2024 | Image GenerationPrediction | CodeCode Available | 2 |
| FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching | Dec 19, 2024 | Image GenerationPrediction | CodeCode Available | 2 |
| Joint Perception and Prediction for Autonomous Driving: A Survey | Dec 18, 2024 | Autonomous Drivingmotion prediction | CodeCode Available | 2 |
| NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models | Dec 14, 2024 | BenchmarkingDrug Design | CodeCode Available | 2 |
| GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction | Dec 13, 2024 | Autonomous DrivingPrediction | CodeCode Available | 2 |
| Mr. DETR: Instructive Multi-Route Training for Detection Transformers | Dec 13, 2024 | DecoderObject Detection | CodeCode Available | 2 |
| Financial Fine-tuning a Large Time Series Model | Dec 13, 2024 | Image GenerationPrediction | CodeCode Available | 2 |
| Federated Learning in Mobile Networks: A Comprehensive Case Study on Traffic Forecasting | Dec 5, 2024 | Federated LearningManagement | CodeCode Available | 2 |
| EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding | Dec 5, 2024 | PredictionScene Understanding | CodeCode Available | 2 |
| V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction | Dec 2, 2024 | Prediction | CodeCode Available | 2 |
| Brain Tumour Removing and Missing Modality Generation using 3D WDM | Nov 7, 2024 | GPUPrediction | CodeCode Available | 2 |
| Training on test proteins improves fitness, structure, and function prediction | Nov 4, 2024 | PredictionProtein Structure Prediction | CodeCode Available | 2 |
| Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy | Oct 13, 2024 | DenoisingPrediction | CodeCode Available | 2 |
| Video Prediction Transformers without Recurrence or Convolution | Oct 7, 2024 | DecoderPrediction | CodeCode Available | 2 |
| A Survey on Graph Neural Networks for Remaining Useful Life Prediction: Methodologies, Evaluation and Future Trends | Sep 29, 2024 | Benchmarkinggraph construction | CodeCode Available | 2 |
| Mamba Meets Financial Markets: A Graph-Mamba Approach for Stock Price Prediction | Sep 26, 2024 | MambaPrediction | CodeCode Available | 2 |
| SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory Prediction | Sep 23, 2024 | counterfactualPedestrian Trajectory Prediction | CodeCode Available | 2 |
| Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis | Sep 21, 2024 | Model EditingPrediction | CodeCode Available | 2 |
| CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction | Sep 20, 2024 | Depth EstimationPrediction | CodeCode Available | 2 |
| OPUS: Occupancy Prediction Using a Sparse Set | Sep 14, 2024 | Autonomous DrivingPrediction | CodeCode Available | 2 |
| In-Context Imitation Learning via Next-Token Prediction | Aug 28, 2024 | Imitation LearningPrediction | CodeCode Available | 2 |
| Efficient Autoregressive Audio Modeling via Next-Scale Prediction | Aug 16, 2024 | Audio GenerationFAD | CodeCode Available | 2 |
| OpenCity: Open Spatio-Temporal Foundation Models for Traffic Prediction | Aug 16, 2024 | PredictionTraffic Prediction | CodeCode Available | 2 |
| MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Jul 31, 2024 | Autonomous DrivingPrediction | CodeCode Available | 2 |
| Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network | Jul 26, 2024 | Autonomous DrivingDecoder | CodeCode Available | 2 |
| Progressive Pretext Task Learning for Human Trajectory Prediction | Jul 16, 2024 | Knowledge DistillationPrediction | CodeCode Available | 2 |
| Monocular Occupancy Prediction for Scalable Indoor Scenes | Jul 16, 2024 | 3D Semantic Scene Completion from a single RGB imagePrediction | CodeCode Available | 2 |
| Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data | Jul 11, 2024 | Autonomous NavigationPrediction | CodeCode Available | 2 |
| ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Jul 2, 2024 | PredictionText to 3D | CodeCode Available | 2 |
| Diving Deeper Into Pedestrian Behavior Understanding: Intention Estimation, Action Prediction, and Event Risk Assessment | Jun 29, 2024 | Prediction | CodeCode Available | 2 |
| Multimodal Prototyping for cancer survival prediction | Jun 28, 2024 | PredictionSurvival Prediction | CodeCode Available | 2 |
| Large Scale Transfer Learning for Tabular Data via Language Modeling | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ABodyBuilder3: Improved and scalable antibody structure predictions | May 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Autonomous Driving with Spiking Neural Networks | May 30, 2024 | Autonomous DrivingPrediction | CodeCode Available | 2 |
| FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction | May 28, 2024 | In-Context LearningPrediction | CodeCode Available | 2 |
| Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning | May 27, 2024 | Gym halfcheetah-mediumGym halfcheetah-medium-expert | CodeCode Available | 2 |
| TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction | May 27, 2024 | MambaPrediction | CodeCode Available | 2 |
| REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation | May 25, 2024 | Graph GenerationObject | CodeCode Available | 2 |
| Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond | May 23, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar | May 22, 2024 | Autonomous DrivingPrediction | CodeCode Available | 2 |
| CityLLaVA: Efficient Fine-Tuning for VLMs in City Scenario | May 6, 2024 | PositionPrediction | CodeCode Available | 2 |
| Understanding the Ranking Loss for Recommendation with Sparse User Feedback | Mar 21, 2024 | Binary ClassificationClick-Through Rate Prediction | CodeCode Available | 2 |
| Certified Human Trajectory Prediction | Mar 20, 2024 | Autonomous VehiclesPrediction | CodeCode Available | 2 |
| SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction | Mar 18, 2024 | Autonomous Vehiclesmotion prediction | CodeCode Available | 2 |