| HILCodec: High-Fidelity and Lightweight Neural Audio Codec | May 8, 2024 | | CodeCode Available | 2 |
| Full Page Handwriting Recognition via Image to Sequence Extraction | Mar 11, 2021 | Handwriting RecognitionHandwritten Text Recognition | CodeCode Available | 2 |
| F-LMM: Grounding Frozen Large Multimodal Models | Jun 9, 2024 | General KnowledgeInstruction Following | CodeCode Available | 2 |
| DreamDiffusion: Generating High-Quality Images from Brain EEG Signals | Jun 29, 2023 | EEGElectroencephalogram (EEG) | CodeCode Available | 2 |
| Improved Multi-Task Brain Tumour Segmentation with Synthetic Data Augmentation | Nov 7, 2024 | Data AugmentationSynthetic Data Generation | CodeCode Available | 2 |
| moolib: A Platform for Distributed RL | Jan 26, 2022 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts | Jul 24, 2023 | Autonomous DrivingObject | CodeCode Available | 2 |
| Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation | Jul 15, 2024 | Information RetrievalKnowledge Graphs | CodeCode Available | 2 |
| DiffLoc: Diffusion Model for Outdoor LiDAR Localization | Jan 1, 2024 | Denoisingmodel | CodeCode Available | 2 |
| PropertyGPT: LLM-driven Formal Verification of Smart Contracts through Retrieval-Augmented Property Generation | May 4, 2024 | In-Context LearningRetrieval | CodeCode Available | 2 |
| SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Aug 11, 2024 | HallucinationImage Super-Resolution | CodeCode Available | 2 |
| SOLO: A Single Transformer for Scalable Vision-Language Modeling | Jul 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images | Aug 21, 2024 | MambaSegmentation | CodeCode Available | 2 |
| LLaSM: Large Language and Speech Model | Aug 30, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Parallel Speculative Decoding with Adaptive Draft Length | Aug 13, 2024 | Text Generation | CodeCode Available | 2 |
| Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models? | Mar 8, 2025 | Mathematical ReasoningMultimodal Reasoning | CodeCode Available | 2 |
| RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Feb 16, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| How Do Vision Transformers Work? | Feb 14, 2022 | Specificity | CodeCode Available | 2 |
| ACE: A fast, skillful learned global atmospheric model for climate prediction | Oct 3, 2023 | | CodeCode Available | 2 |
| Brainchop: Next Generation Web-Based Neuroimaging Application | Oct 24, 2023 | | CodeCode Available | 2 |
| Solving Quantitative Reasoning Problems with Language Models | Jun 29, 2022 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 2 |
| Towards Foundation Models for Knowledge Graph Reasoning | Oct 6, 2023 | Knowledge GraphsLink Prediction | CodeCode Available | 2 |
| UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization | Jan 11, 2024 | Synthetic Data GenerationVisual Localization | CodeCode Available | 2 |
| PeRFception: Perception using Radiance Fields | Aug 24, 2022 | 3D ReconstructionNeRF | CodeCode Available | 2 |
| Context is Key: A Benchmark for Forecasting with Essential Textual Information | Oct 24, 2024 | Decision MakingTime Series | CodeCode Available | 2 |
| InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks | Jan 10, 2024 | Benchmarking | CodeCode Available | 2 |
| A Survey on 3D Gaussian Splatting | Jan 8, 2024 | 3D ReconstructionSurvey | CodeCode Available | 2 |
| SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents | Dec 17, 2024 | Task Planning | CodeCode Available | 2 |
| Efficient Parallel Genetic Algorithm for Perturbed Substructure Optimization in Complex Network | Dec 30, 2024 | Combinatorial OptimizationGraph Mining | CodeCode Available | 2 |
| A Survey on Hardware Accelerators for Large Language Models | Jan 18, 2024 | Survey | CodeCode Available | 2 |
| PSAvatar: A Point-based Shape Model for Real-Time Head Avatar Animation with 3D Gaussian Splatting | Jan 23, 2024 | | CodeCode Available | 2 |
| EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models | Dec 11, 2023 | BenchmarkingEmotional Intelligence | CodeCode Available | 2 |
| Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset | Dec 9, 2024 | Computational EfficiencyMixture-of-Experts | CodeCode Available | 2 |
| Machine Unlearning of Pre-trained Large Language Models | Feb 23, 2024 | Machine Unlearning | CodeCode Available | 2 |
| Segment Any Anomaly without Training via Hybrid Prompt Regularization | May 18, 2023 | Anomaly DetectionAnomaly Localization | CodeCode Available | 2 |
| Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation | Feb 26, 2025 | Code GenerationHumanEval | CodeCode Available | 2 |
| Advanced Millimeter-Wave Radar System for Real-Time Multiple-Human Tracking and Fall Detection | Mar 8, 2024 | Clustering | CodeCode Available | 2 |
| DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training | Oct 5, 2023 | GPU | CodeCode Available | 2 |
| VideoSAGE: Video Summarization with Graph Representation Learning | Apr 14, 2024 | Graph Representation LearningNode Classification | CodeCode Available | 2 |
| DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services | Sep 20, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss | Jan 30, 2025 | DenoisingMotion Generation | CodeCode Available | 2 |
| Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation | Mar 27, 2024 | MambaSpeech Separation | CodeCode Available | 2 |
| Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion | Apr 9, 2024 | 3D Generation | CodeCode Available | 2 |
| Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Apr 8, 2025 | Domain AdaptationDomain Generalization | CodeCode Available | 2 |
| LaSagnA: Language-based Segmentation Assistant for Complex Queries | Apr 12, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training | May 11, 2024 | | CodeCode Available | 2 |
| An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios | Jun 13, 2024 | Language IdentificationSelf-Supervised Learning | CodeCode Available | 2 |
| Recipe for a General, Powerful, Scalable Graph Transformer | May 25, 2022 | Graph ClassificationGraph Property Prediction | CodeCode Available | 2 |
| WATT: Weight Average Test-Time Adaptation of CLIP | Jun 19, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Jul 17, 2024 | Autonomous Driving | CodeCode Available | 2 |