| K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| GenSim: A General Social Simulation Platform with Large Language Model based Agents | Oct 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Metric Flow Matching for Smooth Interpolations on the Data Manifold | May 23, 2024 | Trajectory Prediction | CodeCode Available | 2 | 5 |
| Harmonizer: Learning to Perform White-Box Image and Video Harmonization | Jul 4, 2022 | Image HarmonizationVideo Harmonization | CodeCode Available | 2 | 5 |
| Android in the Zoo: Chain-of-Action-Thought for GUI Agents | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Knowledge Circuits in Pretrained Transformers | May 28, 2024 | In-Context Learningknowledge editing | CodeCode Available | 2 | 5 |
| PyMIC: A deep learning toolkit for annotation-efficient medical image segmentation | Aug 19, 2022 | Deep LearningImage Segmentation | CodeCode Available | 2 | 5 |
| PHemoNet: A Multimodal Network for Physiological Signals | Sep 13, 2024 | Brain Computer InterfaceEEG | CodeCode Available | 2 | 5 |
| From Sparse to Soft Mixtures of Experts | Aug 2, 2023 | | CodeCode Available | 2 | 5 |
| ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text | Jan 2, 2024 | ColorizationSketch Colorization | CodeCode Available | 2 | 5 |
| Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Jun 10, 2024 | Instance SegmentationSalient Object Detection | CodeCode Available | 2 | 5 |
| DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution | Mar 3, 2025 | Autonomous DrivingImage Super-Resolution | CodeCode Available | 2 | 5 |
| nuScenes: A multimodal dataset for autonomous driving | Mar 26, 2019 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection | Jun 10, 2024 | Backdoor AttackCode Completion | CodeCode Available | 2 | 5 |
| Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and Denoising | Jun 7, 2022 | 3D ReconstructionDenoising | CodeCode Available | 2 | 5 |
| Video Prediction Transformers without Recurrence or Convolution | Oct 7, 2024 | DecoderPrediction | CodeCode Available | 2 | 5 |
| TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning | Apr 13, 2025 | Question Answeringreinforcement-learning | CodeCode Available | 2 | 5 |
| DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio based on Deep Filtering | Oct 11, 2021 | Speech Enhancement | CodeCode Available | 2 | 5 |
| PoseScript: Linking 3D Human Poses and Natural Language | Oct 21, 2022 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 2 | 5 |
| SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations | Aug 2, 2021 | DenoisingImage Generation | CodeCode Available | 2 | 5 |
| Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift | Jul 10, 2024 | Change DetectionDisaster Response | CodeCode Available | 2 | 5 |
| LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics | May 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Unsupervised Universal Image Segmentation | Dec 28, 2023 | Image SegmentationInstance Segmentation | CodeCode Available | 2 | 5 |
| VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models | May 29, 2025 | Self-Supervised LearningVideo Generation | CodeCode Available | 2 | 5 |
| Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration | Dec 20, 2024 | Human Agent Collaboration | CodeCode Available | 2 | 5 |