| Planting a SEED of Vision in Large Language Model | Jul 16, 2023 | Image GenerationImage to text | CodeCode Available | 2 |
| Hypothesis Generation with Large Language Models | Apr 5, 2024 | Multi-Armed Bandits | CodeCode Available | 2 |
| Single Motion Diffusion | Feb 12, 2023 | DenoisingStyle Transfer | CodeCode Available | 2 |
| Generating Holistic 3D Human Motion from Speech | Dec 8, 2022 | 3D Face AnimationGesture Generation | CodeCode Available | 2 |
| Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization | May 21, 2025 | Vision-Language-ActionZero-shot Generalization | CodeCode Available | 2 |
| Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models | Jun 20, 2025 | | CodeCode Available | 2 |
| MO-Gym: A Library of Multi-Objective Reinforcement Learning Environments | Nov 30, 2022 | Multi-Objective Reinforcement LearningOpenAI Gym | CodeCode Available | 2 |
| OpenICL: An Open-Source Framework for In-context Learning | Mar 6, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| tinyBenchmarks: evaluating LLMs with fewer examples | Feb 22, 2024 | MMLUMultiple-choice | CodeCode Available | 2 |
| A Closer Look at Hardware-Friendly Weight Quantization | Oct 7, 2022 | Quantization | CodeCode Available | 2 |
| STP: Self-play LLM Theorem Provers with Iterative Conjecturing and Proving | Jan 31, 2025 | Automated Theorem Proving | CodeCode Available | 2 |
| Unified Human-Scene Interaction via Prompted Chain-of-Contacts | Sep 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Human-in-the-Loop Large-Scale Predictive Maintenance of Workstations | Jun 23, 2022 | Active LearningScheduling | CodeCode Available | 2 |
| OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning | Dec 22, 2024 | | CodeCode Available | 2 |
| GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models | Oct 12, 2023 | GPUText to 3D | CodeCode Available | 2 |
| Body Part-Based Representation Learning for Occluded Person Re-Identification | Nov 7, 2022 | Human ParsingOccluded Person Re-Identification | CodeCode Available | 2 |
| REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment | May 28, 2024 | Image to 3DObject | CodeCode Available | 2 |
| Blended Latent Diffusion | Jun 6, 2022 | Image GenerationImage Inpainting | CodeCode Available | 2 |
| Binarized Neural Machine Translation | Feb 9, 2023 | BinarizationMachine Translation | CodeCode Available | 2 |
| Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning | Mar 1, 2025 | Scene Understanding | CodeCode Available | 2 |
| Context-Guided Spatio-Temporal Video Grounding | Jan 3, 2024 | ObjectSpatio-Temporal Video Grounding | CodeCode Available | 2 |
| MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation | Mar 26, 2024 | Cross-Lingual TransferLanguage Modelling | CodeCode Available | 2 |
| LION: Latent Point Diffusion Models for 3D Shape Generation | Oct 12, 2022 | 3D Generation3D Shape Generation | CodeCode Available | 2 |
| Lightning IR: Straightforward Fine-tuning and Inference of Transformer-based Language Models for Information Retrieval | Nov 7, 2024 | Information RetrievalRe-Ranking | CodeCode Available | 2 |
| Rethinking Unsupervised Domain Adaptation for Semantic Segmentation | Jun 30, 2022 | Domain AdaptationSemantic Segmentation | CodeCode Available | 2 |
| Latent Video Diffusion Models for High-Fidelity Long Video Generation | Nov 23, 2022 | DenoisingImage Generation | CodeCode Available | 2 |
| VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching | Sep 10, 2023 | text-to-speechText to Speech | CodeCode Available | 2 |
| FaceScore: Benchmarking and Enhancing Face Quality in Human Generation | Jun 24, 2024 | BenchmarkingDenoising | CodeCode Available | 2 |
| Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation | Nov 4, 2024 | Earth ObservationObject | CodeCode Available | 2 |
| Wasserstein Quantum Monte Carlo: A Novel Approach for Solving the Quantum Many-Body Schrödinger Equation | Sep 21, 2023 | | CodeCode Available | 2 |
| PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm | Oct 12, 2023 | 3D Object Detection3D Reconstruction | CodeCode Available | 2 |
| Holistic Fusion: Task- and Setup-Agnostic Robot Localization and State Estimation with Factor Graphs | Apr 8, 2025 | Motion EstimationSensor Fusion | CodeCode Available | 2 |
| Hierarchical Fine-Grained Image Forgery Detection and Localization | Mar 30, 2023 | AttributeClassification | CodeCode Available | 2 |
| 3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction | Mar 6, 2023 | Drug Design | CodeCode Available | 2 |
| Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts | Feb 24, 2025 | BenchmarkingFact Verification | CodeCode Available | 2 |
| Motion Forecasting in Continuous Driving | Oct 8, 2024 | Autonomous DrivingMotion Forecasting | CodeCode Available | 2 |
| Re-parameterizing Your Optimizers rather than Architectures | May 30, 2022 | Quantization | CodeCode Available | 2 |
| Stratified Transformer for 3D Point Cloud Segmentation | Mar 28, 2022 | Point Cloud SegmentationPosition | CodeCode Available | 2 |
| InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning | Mar 30, 2024 | Continual Learningparameter-efficient fine-tuning | CodeCode Available | 2 |
| PromptBERT: Improving BERT Sentence Embeddings with Prompts | Jan 12, 2022 | Contrastive LearningDenoising | CodeCode Available | 2 |
| Human Activity Recognition using RGB-Event based Sensors: A Multi-modal Heat Conduction Model and A Benchmark Dataset | Apr 8, 2025 | Activity RecognitionHuman Activity Recognition | CodeCode Available | 2 |
| Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models | Nov 22, 2023 | DenoisingImage Generation | CodeCode Available | 2 |
| Advances in 4D Generation: A Survey | Mar 18, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 |
| OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain | Dec 17, 2024 | RAGRetrieval | CodeCode Available | 2 |
| SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training | Aug 15, 2024 | Continual Learningimage-classification | CodeCode Available | 2 |
| LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent | Sep 21, 2023 | 3D visual groundingLanguage Modeling | CodeCode Available | 2 |
| Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents | Nov 20, 2023 | | CodeCode Available | 2 |
| BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View | Dec 22, 2021 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| In-Context Learning Unlocked for Diffusion Models | May 1, 2023 | In-Context Learningtext-guided-image-editing | CodeCode Available | 2 |
| NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement | Apr 8, 2024 | BinarizationDocument Enhancement | CodeCode Available | 2 |