| Unpaired Image-to-Image Translation via Neural Schrödinger Bridge | May 24, 2023 | Image-to-Image TranslationTranslation | CodeCode Available | 2 | 5 |
| Lawyer LLaMA Technical Report | May 24, 2023 | ArticlesHallucination | CodeCode Available | 2 | 5 |
| Adapting Language Models to Compress Contexts | May 24, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 2 | 5 |
| Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference | May 27, 2023 | GPUImage Generation | CodeCode Available | 2 | 5 |
| APRIL-GAN: A Zero-/Few-Shot Anomaly Classification and Segmentation Method for CVPR 2023 VAND Workshop Challenge Tracks 1&2: 1st Place on Zero-shot AD and 4th Place on Few-shot AD | May 27, 2023 | Anomaly ClassificationAnomaly Detection | CodeCode Available | 2 | 5 |
| HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance | May 30, 2023 | 3D Generation3D geometry | CodeCode Available | 2 | 5 |
| WAVES: Benchmarking the Robustness of Image Watermarks | Jan 16, 2024 | Benchmarking | CodeCode Available | 2 | 5 |
| Calib-Anything: Zero-training LiDAR-Camera Extrinsic Calibration Method Using Segment Anything | Jun 5, 2023 | Camera Calibration | CodeCode Available | 2 | 5 |
| SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model | Jun 4, 2023 | 3D Object DetectionImage Segmentation | CodeCode Available | 2 | 5 |
| ModuleFormer: Modularity Emerges from Mixture-of-Experts | Jun 7, 2023 | Language ModellingLightweight Deployment | CodeCode Available | 2 | 5 |
| On the Reliability of Watermarks for Large Language Models | Jun 7, 2023 | | CodeCode Available | 2 | 5 |
| DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Sep 4, 2023 | Image ClassificationInstance Segmentation | CodeCode Available | 2 | 5 |
| Valley: Video Assistant with Large Language model Enhanced abilitY | Jun 12, 2023 | Action RecognitionInstruction Following | CodeCode Available | 2 | 5 |
| The Devil is in the Details: On the Pitfalls of Event Extraction Evaluation | Jun 12, 2023 | Event Argument ExtractionEvent Detection | CodeCode Available | 2 | 5 |
| NodeFormer: A Scalable Graph Structure Learning Transformer for Node Classification | Jun 14, 2023 | Graph structure learningimage-classification | CodeCode Available | 2 | 5 |
| PyKoopman: A Python Package for Data-Driven Approximation of the Koopman Operator | Jun 22, 2023 | | CodeCode Available | 2 | 5 |
| When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism | Jan 26, 2022 | Image ClassificationObject Detection | CodeCode Available | 2 | 5 |
| RED^ FM: a Filtered and Multilingual Relation Extraction Dataset | Jun 16, 2023 | RelationRelation Extraction | CodeCode Available | 2 | 5 |
| Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects | Jun 16, 2023 | Anomaly DetectionSelf-Supervised Learning | CodeCode Available | 2 | 5 |
| Omnidirectional Multi-Object Tracking | Mar 6, 2025 | Multi-Object TrackingObject | CodeCode Available | 2 | 5 |
| OpenMask3D: Open-Vocabulary 3D Instance Segmentation | Jun 23, 2023 | 3D Instance Segmentation3D Open-Vocabulary Instance Segmentation | CodeCode Available | 2 | 5 |
| PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Feb 14, 2024 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 2 | 5 |
| One missing piece in Vision and Language: A Survey on Comics Understanding | Sep 14, 2024 | document understandingimage-classification | CodeCode Available | 2 | 5 |
| RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model | Jun 28, 2023 | Image SegmentationInstance Segmentation | CodeCode Available | 2 | 5 |
| Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train | Jun 29, 2023 | SegmentationTransfer Learning | CodeCode Available | 2 | 5 |