| TableQuery: Querying tabular data with natural language | Jan 27, 2022 | Deep Learning, Natural Language Queries | Code Available | 2 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | Sentence, Speech-to-Speech Translation | Code Available | 2 |
| CrossFuse: A Novel Cross Attention Mechanism based Infrared and Visible Image Fusion Approach | Jun 15, 2024 | Decoder, Infrared And Visible Image Fusion | Code Available | 2 |
| Low-Rank Quantization-Aware Training for LLMs | Jun 10, 2024 | GPU, parameter-efficient fine-tuning | Code Available | 2 |
| EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Towards Large-Scale EEG Model | Sep 24, 2024 | EEG, Electroencephalogram (EEG) | Code Available | 2 |
| MolFM: A Multimodal Molecular Foundation Model | Jun 6, 2023 | Cross-Modal Retrieval, Knowledge Graphs | Code Available | 2 |
| Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds | Mar 3, 2022 | 3D Single Object Tracking, Autonomous Driving | Code Available | 2 |
| Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Feb 13, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Improving Causal Reasoning in Large Language Models: A Survey | Oct 22, 2024 | Decision Making, Survey | Code Available | 2 |
| Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding | Jan 14, 2025 | image-classification, Image Classification | Code Available | 2 |
| The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World | Aug 3, 2023 | All, Question Answering | Code Available | 2 |
| LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis | Aug 18, 2023 | Facial Expression Recognition, Knowledge Distillation | Code Available | 2 |
| RelationField: Relate Anything in Radiance Fields | Dec 18, 2024 | 3d scene graph generation, Graph Generation | Code Available | 2 |
| Effector: A Python package for regional explanations | Apr 3, 2024 | | Code Available | 2 |
| FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis | Apr 21, 2022 | Denoising, GPU | Code Available | 2 |
| Structure-informed Language Models Are Protein Designers | Feb 3, 2023 | | Code Available | 2 |
| RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Jun 8, 2023 | Answer Generation, Fact Checking | Code Available | 2 |
| Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting | Apr 10, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Reviving The Classics: Active Reward Modeling in Large Language Model Alignment | Feb 4, 2025 | Computational Efficiency, Experimental Design | Code Available | 2 |
| Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning | May 29, 2022 | Few-Shot Text Classification, Memorization | Code Available | 2 |
| TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability | Nov 27, 2024 | Temporal Localization, Video Understanding | Code Available | 2 |
| Vox-Fusion: Dense Tracking and Mapping with Voxel-based Neural Implicit Representation | Oct 28, 2022 | | Code Available | 2 |
| LeanVec: Searching vectors faster by making them fit | Dec 26, 2023 | Cross-Modal Retrieval, Dimensionality Reduction | Code Available | 2 |
| BIGCity: A Universal Spatiotemporal Model for Unified Trajectory and Traffic State Data Analysis | Dec 1, 2024 | | Code Available | 2 |
| CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal Covariance Design | Jan 14, 2024 | Model-based Reinforcement Learning, Model Predictive Control | Code Available | 2 |
| Incremental Sequence Labeling: A Tale of Two Shifts | Feb 16, 2024 | Incremental Learning, Knowledge Distillation | Code Available | 2 |
| Comprehensive Verilog Design Problems: A Next-Generation Benchmark Dataset for Evaluating Large Language Models and Agents on RTL Design and Verification | Jun 17, 2025 | Code Generation | Code Available | 2 |
| mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs | Dec 5, 2023 | GPU, Large Language Model | Code Available | 2 |
| OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering | Mar 26, 2023 | | Code Available | 2 |
| Graphic Design with Large Multimodal Model | Apr 22, 2024 | Layout Generation, model | Code Available | 2 |
| Humanoid Agents: Platform for Simulating Human-like Generative Agents | Oct 9, 2023 | Unity | Code Available | 2 |
| What Are Expected Queries in End-to-End Object Detection? | Jun 2, 2022 | Instance Segmentation, object-detection | Code Available | 2 |
| Woodpecker: Hallucination Correction for Multimodal Large Language Models | Oct 24, 2023 | Hallucination | Code Available | 2 |
| Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning | Jun 6, 2024 | Multi-agent Reinforcement Learning | Code Available | 2 |
| radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction | Oct 11, 2024 | Multi-Task Learning | Code Available | 2 |
| A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning | May 26, 2022 | class-incremental learning, Class Incremental Learning | Code Available | 2 |
| SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks | Jan 31, 2024 | Sentence | Code Available | 2 |
| SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution | May 27, 2025 | Reinforcement Learning (RL) | Code Available | 2 |
| Off-Policy Evaluation for Large Action Spaces via Embeddings | Feb 13, 2022 | Multi-Armed Bandits, Off-policy evaluation | Code Available | 2 |
| Interaction2Code: Benchmarking MLLM-based Interactive Webpage Code Generation from Interactive Prototyping | Nov 5, 2024 | Benchmarking, Code Generation | Code Available | 2 |
| Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Mar 18, 2024 | Instance Segmentation, NeRF | Code Available | 2 |
| DPoser: Diffusion Model as Robust 3D Human Pose Prior | Dec 9, 2023 | Denoising, Human Mesh Recovery | Code Available | 2 |
| Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Jun 20, 2024 | Autonomous Driving, Language Modeling | Code Available | 2 |
| BanditPAM++: Faster k-medoids Clustering | Sep 21, 2023 | | Code Available | 2 |
| TransST: Transfer Learning Embedded Spatial Factor Modeling of Spatial Transcriptomics Data | Apr 15, 2025 | Transfer Learning | Code Available | 2 |
| Recent Advances in Medical Imaging Segmentation: A Survey | May 14, 2025 | Domain Adaptation, Few-Shot Learning | Code Available | 2 |
| Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion | Feb 6, 2025 | image-classification, Image Classification | Code Available | 2 |
| Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings | Oct 23, 2022 | Cross-Lingual NER, Cross-Lingual Transfer | Code Available | 2 |
| EarthLoc: Astronaut Photography Localization by Indexing Earth from Space | Mar 11, 2024 | Data Augmentation, Disaster Response | Code Available | 2 |
| Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective | Dec 2, 2024 | Density Estimation, Offline RL | Code Available | 2 |