| DiffuVolume: Diffusion Model for Volume based Stereo Matching | Aug 30, 2023 | modelStereo Matching | —Unverified | 0 |
| Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval | Aug 29, 2023 | Cross-Modal Retrievalimage-classification | —Unverified | 0 |
| Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars | Aug 27, 2023 | Brain Tumor SegmentationImage Segmentation | —Unverified | 0 |
| SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation | Aug 14, 2023 | Semantic SegmentationZero-shot Generalization | —Unverified | 0 |
| EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce | Aug 14, 2023 | DiversityInstruction Following | CodeCode Available | 2 |
| TongueSAM: An Universal Tongue Segmentation Model Based on SAM with Zero-Shot | Aug 12, 2023 | DiagnosticInteractive Segmentation | CodeCode Available | 1 |
| Separate Anything You Describe | Aug 9, 2023 | Audio Source SeparationNatural Language Queries | CodeCode Available | 3 |
| ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs | Jul 31, 2023 | Trajectory PlanningZero-shot Generalization | CodeCode Available | 5 |
| Model Synthesis for Zero-Shot Model Attribution | Jul 29, 2023 | Attributemodel | CodeCode Available | 0 |
| Towards Generalist Biomedical AI | Jul 26, 2023 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| Improving existing segmentators performance with zero-shot segmentators | Jul 26, 2023 | Camouflaged Object SegmentationSegmentation | CodeCode Available | 0 |
| Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTV | Jul 20, 2023 | Depth EstimationDiversity | CodeCode Available | 1 |
| Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image | Jul 20, 2023 | Depth EstimationImage Reconstruction | CodeCode Available | 4 |
| Improving Zero-Shot Generalization for CLIP with Synthesized Prompts | Jul 14, 2023 | Generalized Zero-Shot LearningTransfer Learning | CodeCode Available | 1 |
| SAM^Med: A medical image annotation framework based on large vision model | Jul 11, 2023 | Image SegmentationLiver Segmentation | —Unverified | 0 |
| Objaverse-XL: A Universe of 10M+ 3D Objects | Jul 11, 2023 | DiversityNovel View Synthesis | CodeCode Available | 3 |
| SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation | Jul 3, 2023 | Domain AdaptationTransfer Learning | CodeCode Available | 1 |
| PhD Thesis: Exploring the role of (self-)attention in cognitive and computer vision architecture | Jun 26, 2023 | Visual ReasoningZero-shot Generalization | —Unverified | 0 |
| Habitat Synthetic Scenes Dataset (HSSD-200): An Analysis of 3D Scene Scale and Realism Tradeoffs for ObjectGoal Navigation | Jun 20, 2023 | NavigateObjectGoal Navigation | —Unverified | 0 |
| 2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection | Jun 15, 2023 | Anomaly DetectionAnomaly Localization | CodeCode Available | 2 |
| Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings | Jun 14, 2023 | DiversityFederated Learning | —Unverified | 0 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement Learning | Jun 11, 2023 | Navigatereinforcement-learning | CodeCode Available | 1 |
| Explore to Generalize in Zero-Shot RL | Jun 5, 2023 | Zero-shot Generalization | CodeCode Available | 0 |
| Improving day-ahead Solar Irradiance Time Series Forecasting by Leveraging Spatio-Temporal Context | Jun 1, 2023 | Solar Irradiance ForecastingTime Series | CodeCode Available | 1 |