| A Simple yet Efficient Ensemble Approach for AI-generated Text Detection | Nov 6, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE | Nov 5, 2023 | DecoderMixture-of-Experts | CodeCode Available | 0 |
| What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning | Nov 2, 2023 | MMEVisual Reasoning | CodeCode Available | 1 |
| Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization | Nov 2, 2023 | Domain GeneralizationPrompt Learning | —Unverified | 0 |
| Neural Field Dynamics Model for Granular Object Piles Manipulation | Nov 1, 2023 | ObjectZero-shot Generalization | —Unverified | 0 |
| Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions | Nov 1, 2023 | Few-Shot NLIInstruction Following | CodeCode Available | 1 |
| Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks | Nov 1, 2023 | InformativenessOut-of-Distribution Generalization | CodeCode Available | 1 |
| ZGUL: Zero-shot Generalization to Unseen Languages using Multi-source Ensembling of Language Adapters | Oct 25, 2023 | Cross-Lingual TransferLanguage Modelling | CodeCode Available | 0 |
| Matryoshka Diffusion Models | Oct 23, 2023 | Image GenerationZero-shot Generalization | CodeCode Available | 2 |
| Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models | Oct 23, 2023 | Skill GeneralizationZero-shot Generalization | —Unverified | 0 |
| Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting | Oct 12, 2023 | DecoderProbabilistic Time Series Forecasting | CodeCode Available | 3 |
| What Matters to You? Towards Visual Representation Alignment for Robot Learning | Oct 11, 2023 | Zero-shot Generalization | —Unverified | 0 |
| InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining | Oct 11, 2023 | 4kDecoder | —Unverified | 0 |
| From Supervised to Generative: A Novel Paradigm for Tabular Deep Learning with Large Language Models | Oct 11, 2023 | In-Context LearningInstruction Following | CodeCode Available | 0 |
| On the Zero-Shot Generalization of Machine-Generated Text Detectors | Oct 8, 2023 | Zero-shot Generalization | —Unverified | 0 |
| On the Performance of Multimodal Language Models | Oct 4, 2023 | BenchmarkingBinary Classification | —Unverified | 0 |
| PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning | Oct 2, 2023 | Instruction FollowingZero-shot Generalization | CodeCode Available | 0 |
| Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning | Sep 30, 2023 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 1 |
| MediViSTA: Medical Video Segmentation via Temporal Fusion SAM Adaptation for Echocardiography | Sep 24, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| BELT:Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision | Sep 21, 2023 | Brain DecodingContrastive Learning | —Unverified | 0 |
| Light Field Diffusion for Single-View Novel View Synthesis | Sep 20, 2023 | DenoisingNovel View Synthesis | —Unverified | 0 |
| DEUX: Active Exploration for Learning Unsupervised Depth Perception | Sep 16, 2023 | Depth CompletionDepth Estimation | —Unverified | 0 |
| DePT: Decoupled Prompt Tuning | Sep 14, 2023 | Prompt EngineeringZero-shot Generalization | CodeCode Available | 1 |
| Compositional Learning of Visually-Grounded Concepts Using Reinforcement | Sep 8, 2023 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation | Sep 1, 2023 | Autonomous DrivingComputational Efficiency | CodeCode Available | 1 |
| DiffuVolume: Diffusion Model for Volume based Stereo Matching | Aug 30, 2023 | modelStereo Matching | —Unverified | 0 |
| Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval | Aug 29, 2023 | Cross-Modal Retrievalimage-classification | —Unverified | 0 |
| Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars | Aug 27, 2023 | Brain Tumor SegmentationImage Segmentation | —Unverified | 0 |
| SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation | Aug 14, 2023 | Semantic SegmentationZero-shot Generalization | —Unverified | 0 |
| EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce | Aug 14, 2023 | DiversityInstruction Following | CodeCode Available | 2 |
| TongueSAM: An Universal Tongue Segmentation Model Based on SAM with Zero-Shot | Aug 12, 2023 | DiagnosticInteractive Segmentation | CodeCode Available | 1 |
| Separate Anything You Describe | Aug 9, 2023 | Audio Source SeparationNatural Language Queries | CodeCode Available | 3 |
| ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs | Jul 31, 2023 | Trajectory PlanningZero-shot Generalization | CodeCode Available | 5 |
| Model Synthesis for Zero-Shot Model Attribution | Jul 29, 2023 | Attributemodel | CodeCode Available | 0 |
| Towards Generalist Biomedical AI | Jul 26, 2023 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| Improving existing segmentators performance with zero-shot segmentators | Jul 26, 2023 | Camouflaged Object SegmentationSegmentation | CodeCode Available | 0 |
| Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTV | Jul 20, 2023 | Depth EstimationDiversity | CodeCode Available | 1 |
| Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image | Jul 20, 2023 | Depth EstimationImage Reconstruction | CodeCode Available | 4 |
| Improving Zero-Shot Generalization for CLIP with Synthesized Prompts | Jul 14, 2023 | Generalized Zero-Shot LearningTransfer Learning | CodeCode Available | 1 |
| SAM^Med: A medical image annotation framework based on large vision model | Jul 11, 2023 | Image SegmentationLiver Segmentation | —Unverified | 0 |
| Objaverse-XL: A Universe of 10M+ 3D Objects | Jul 11, 2023 | DiversityNovel View Synthesis | CodeCode Available | 3 |
| SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation | Jul 3, 2023 | Domain AdaptationTransfer Learning | CodeCode Available | 1 |
| PhD Thesis: Exploring the role of (self-)attention in cognitive and computer vision architecture | Jun 26, 2023 | Visual ReasoningZero-shot Generalization | —Unverified | 0 |
| Habitat Synthetic Scenes Dataset (HSSD-200): An Analysis of 3D Scene Scale and Realism Tradeoffs for ObjectGoal Navigation | Jun 20, 2023 | NavigateObjectGoal Navigation | —Unverified | 0 |
| 2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection | Jun 15, 2023 | Anomaly DetectionAnomaly Localization | CodeCode Available | 2 |
| Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings | Jun 14, 2023 | DiversityFederated Learning | —Unverified | 0 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement Learning | Jun 11, 2023 | Navigatereinforcement-learning | CodeCode Available | 1 |
| Explore to Generalize in Zero-Shot RL | Jun 5, 2023 | Zero-shot Generalization | CodeCode Available | 0 |
| Improving day-ahead Solar Irradiance Time Series Forecasting by Leveraging Spatio-Temporal Context | Jun 1, 2023 | Solar Irradiance ForecastingTime Series | CodeCode Available | 1 |