| DiffusionTrack: Diffusion Model For Multi-Object Tracking | Aug 19, 2023 | Denoising, model | Code Available | 2 |
| IT3D: Improved Text-to-3D Generation with Explicit View Synthesis | Aug 22, 2023 | 3D Generation, Text to 3D | Code Available | 2 |
| Knowledge Graph Prompting for Multi-Document Question Answering | Aug 22, 2023 | graph construction, Open-Domain Question Answering | Code Available | 2 |
| PromptIR: Prompting for All-in-One Image Restoration | Sep 21, 2023 | | Code Available | 2 |
| Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Aug 23, 2023 | Segmentation, Semantic Segmentation | Code Available | 2 |
| FastSurfer-HypVINN: Automated sub-segmentation of the hypothalamus and adjacent structures on high-resolutional brain MRI | Aug 24, 2023 | GPU, Segmentation | Code Available | 2 |
| DARWIN Series: Domain Specific Large Language Models for Natural Science | Aug 25, 2023 | Knowledge Graphs | Code Available | 2 |
| Selective Prompt Anchoring for Code Generation | Aug 17, 2024 | Code Generation | Code Available | 2 |
| Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution Shift | Oct 13, 2024 | Time Series, Time Series Forecasting | Code Available | 2 |
| Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning | Sep 11, 2023 | Mixture-of-Experts, parameter-efficient fine-tuning | Code Available | 2 |
| Temporal Action Localization with Enhanced Instant Discriminability | Sep 11, 2023 | Action Detection, Action Localization | Code Available | 2 |
| PyMOLfold: Interactive Protein and Ligand Structure Prediction in PyMOL | Feb 1, 2025 | Prediction, Protein Folding | Code Available | 2 |
| MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction | Apr 23, 2022 | Grammatical Error Correction, Sentence | Code Available | 2 |
| Commands as AI Conversations | Sep 12, 2023 | | Code Available | 2 |
| HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform | Sep 18, 2023 | Speech Synthesis | Code Available | 2 |
| Grasp-Anything: Large-scale Grasp Dataset from Foundation Models | Sep 18, 2023 | Diversity, Robotic Grasping | Code Available | 2 |
| Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity | Sep 19, 2023 | GPU | Code Available | 2 |
| RMT: Retentive Networks Meet Vision Transformers | Sep 20, 2023 | Instance Segmentation, object-detection | Code Available | 2 |
| Detect Everything with Few Examples | Sep 22, 2023 | Binary Classification, Cross-Domain Few-Shot Object Detection | Code Available | 2 |
| Detecting and Grounding Multi-Modal Media Manipulation and Beyond | Sep 25, 2023 | Binary Classification, Contrastive Learning | Code Available | 2 |
| Improving CLIP Fine-tuning Performance | Jan 1, 2023 | Diagnostic, object-detection | Code Available | 2 |
| PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks | May 20, 2025 | LLM Jailbreak, Safety Alignment | Code Available | 2 |
| GenSim: Generating Robotic Simulation Tasks via Large Language Models | Oct 2, 2023 | Code Generation, Diversity | Code Available | 2 |
| Interpreting CLIP's Image Representation via Text-Based Decomposition | Oct 9, 2023 | | Code Available | 2 |
| Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration | Oct 9, 2023 | Image to Point Cloud Registration, Point Cloud Registration | Code Available | 2 |
| TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning | Oct 10, 2023 | 3D Lane Detection, Autonomous Driving | Code Available | 2 |
| A Semantic Invariant Robust Watermark for Large Language Models | Oct 10, 2023 | | Code Available | 2 |
| Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity | Oct 11, 2023 | Retrieval, Specificity | Code Available | 2 |
| Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes | Oct 12, 2023 | GPU, Novel View Synthesis | Code Available | 2 |
| OmniControl: Control Any Joint at Any Time for Human Motion Generation | Oct 12, 2023 | Motion Generation | Code Available | 2 |
| Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling | Oct 14, 2023 | Speech Synthesis, text-to-speech | Code Available | 2 |
| DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing | Oct 12, 2023 | text-guided-image-editing | Code Available | 2 |
| Character-LLM: A Trainable Agent for Role-Playing | Oct 16, 2023 | | Code Available | 2 |
| Few-Shot Learning Patterns in Financial Time-Series for Trend-Following Strategies | Oct 16, 2023 | Few-Shot Learning, Time Series | Code Available | 2 |
| Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning | Oct 18, 2023 | Natural Language Understanding | Code Available | 2 |
| PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain | Oct 22, 2023 | Dialogue Generation, Dialogue Understanding | Code Available | 2 |
| CapsFusion: Rethinking Image-Text Data at Scale | Oct 31, 2023 | World Knowledge | Code Available | 2 |
| TopicGPT: A Prompt-based Topic Modeling Framework | Nov 2, 2023 | Specificity, Topic Models | Code Available | 2 |
| Simplifying Transformer Blocks | Nov 3, 2023 | Decoder | Code Available | 2 |
| Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers | Nov 2, 2023 | Prompt Engineering | Code Available | 2 |
| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question Answering, Image Captioning | Code Available | 2 |
| Neuro-GPT: Towards A Foundation Model for EEG | Nov 7, 2023 | Brain Computer Interface, EEG | Code Available | 2 |
| A Survey of Large Language Models Attribution | Nov 7, 2023 | Survey | Code Available | 2 |
| NExT-Chat: An LMM for Chat, Detection and Segmentation | Nov 8, 2023 | Referring Expression, Referring Expression Segmentation | Code Available | 2 |
| On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Nov 9, 2023 | Autonomous Driving, Common Sense Reasoning | Code Available | 2 |
| Semi-Supervised Domain Generalizable Person Re-Identification | Aug 11, 2021 | Generalizable Person Re-identification, Knowledge Distillation | Code Available | 2 |
| Ant Colony Sampling with GFlowNets for Combinatorial Optimization | Mar 11, 2024 | Combinatorial Optimization | Code Available | 2 |
| To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning | Nov 13, 2023 | Instruction Following, MM-Vet | Code Available | 2 |
| Spyx: A Library for Just-In-Time Compiled Optimization of Spiking Neural Networks | Feb 29, 2024 | | Code Available | 2 |
| Neural General Circulation Models for Weather and Climate | Nov 13, 2023 | Physical Simulations, Weather Forecasting | Code Available | 2 |