| Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation | Feb 26, 2025 | Code Generation, HumanEval | Code Available | 2 |
| Advanced Millimeter-Wave Radar System for Real-Time Multiple-Human Tracking and Fall Detection | Mar 8, 2024 | Clustering | Code Available | 2 |
| DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training | Oct 5, 2023 | GPU | Code Available | 2 |
| VideoSAGE: Video Summarization with Graph Representation Learning | Apr 14, 2024 | Graph Representation Learning, Node Classification | Code Available | 2 |
| DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services | Sep 20, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss | Jan 30, 2025 | Denoising, Motion Generation | Code Available | 2 |
| Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation | Mar 27, 2024 | Mamba, Speech Separation | Code Available | 2 |
| Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion | Apr 9, 2024 | 3D Generation | Code Available | 2 |
| Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Apr 8, 2025 | Domain Adaptation, Domain Generalization | Code Available | 2 |
| LaSagnA: Language-based Segmentation Assistant for Complex Queries | Apr 12, 2024 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training | May 11, 2024 | | Code Available | 2 |
| An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios | Jun 13, 2024 | Language Identification, Self-Supervised Learning | Code Available | 2 |
| Recipe for a General, Powerful, Scalable Graph Transformer | May 25, 2022 | Graph Classification, Graph Property Prediction | Code Available | 2 |
| WATT: Weight Average Test-Time Adaptation of CLIP | Jun 19, 2024 | image-classification, Image Classification | Code Available | 2 |
| Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Jul 17, 2024 | Autonomous Driving | Code Available | 2 |
| CoverNet: Multimodal Behavior Prediction using Trajectory Sets | Nov 23, 2019 | Prediction, regression | Code Available | 2 |
| VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling | Aug 2, 2024 | Image Generation | Code Available | 2 |
| MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming | May 29, 2025 | Diversity, Efficient Exploration | Code Available | 2 |
| Fused Gromov-Wasserstein distance for structured objects: theoretical foundations and mathematical properties | Nov 7, 2018 | BIG-bench Machine Learning | Code Available | 2 |
| RSL-SQL: Robust Schema Linking in Text-to-SQL Generation | Oct 31, 2024 | Text to SQL, Text-To-SQL | Code Available | 2 |
| The Geometry of Categorical and Hierarchical Concepts in Large Language Models | Jun 3, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Multi-modal Time Series Analysis: A Tutorial and Survey | Mar 17, 2025 | Survey, Time Series | Code Available | 2 |
| Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner | May 16, 2025 | Cross-Modal Retrieval, Diagnostic | Code Available | 2 |
| Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models | Apr 17, 2023 | Bokeh Effect Rendering, Denoising | Code Available | 2 |
| Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Apr 15, 2025 | Autonomous Driving, Computational Efficiency | Code Available | 2 |
| Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation | Dec 12, 2024 | Image Augmentation, Image Generation | Code Available | 2 |
| Revisiting Unreasonable Effectiveness of Data in Deep Learning Era | Jul 10, 2017 | Deep Learning, image-classification | Code Available | 2 |
| Generating Long Videos of Dynamic Scenes | Jun 7, 2022 | MORPH, Video Generation | Code Available | 2 |
| Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation | Dec 27, 2024 | Image Segmentation, Semantic Segmentation | Code Available | 2 |
| QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos | Dec 5, 2024 | Attribute, Quantization | Code Available | 2 |
| Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models | Dec 24, 2024 | Question Answering, Video Question Answering | Code Available | 2 |
| TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On | Apr 1, 2024 | Virtual Try-on | Code Available | 2 |
| Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges | Apr 27, 2021 | Deep Learning, Protein Folding | Code Available | 2 |
| Large Scale Transfer Learning for Tabular Data via Language Modeling | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| An end-to-end attention-based approach for learning on graphs | Feb 16, 2024 | Graph Classification, Graph Regression | Code Available | 2 |
| TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement | Feb 26, 2024 | Machine Translation, Translation | Code Available | 2 |
| CLAIMED, a visual and scalable component library for Trusted AI | Mar 4, 2021 | Adversarial Robustness, Fairness | Code Available | 2 |
| ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery | Oct 7, 2024 | scientific discovery | Code Available | 2 |
| FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Jun 5, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |
| A DeNoising FPN With Transformer R-CNN for Tiny Object Detection | Jun 9, 2024 | Contrastive Learning, Denoising | Code Available | 2 |
| Mapping the Increasing Use of LLMs in Scientific Papers | Apr 1, 2024 | | Code Available | 2 |
| Efficient Training of Deep Equilibrium Models | Apr 23, 2023 | | Code Available | 2 |
| BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training | Mar 24, 2022 | Object, object-detection | Code Available | 2 |
| Frugal Optimization for Cost-related Hyperparameters | May 4, 2020 | AutoML, BIG-bench Machine Learning | Code Available | 2 |
| Point-SLAM: Dense Neural Point Cloud-based SLAM | Apr 9, 2023 | Simultaneous Localization and Mapping | Code Available | 2 |
| V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians | Sep 20, 2024 | 3DGS | Code Available | 2 |
| BirdNET: A deep learning solution for avian diversity monitoring | Jan 27, 2021 | Data Augmentation, Deep Learning | Code Available | 2 |
| Secure & Private Federated Neuroimaging | May 11, 2022 | Federated Learning | Code Available | 2 |
| SPot-the-Difference Self-Supervised Pre-training for Anomaly Detection and Segmentation | Jul 28, 2022 | Anomaly Detection, Anomaly Segmentation | Code Available | 2 |
| LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models | Mar 22, 2024 | Language Modelling, Large Language Model | Code Available | 2 |