| Zero-Shot Fact Verification via Natural Logic and Large Language Models | Oct 4, 2024 | Fact Verification, Zero-shot Generalization | Code Available | 0 |
| What Matters for Model Merging at Scale? | Oct 4, 2024 | model, Task Arithmetic | Unverified | 0 |
| Learning Diverse Bimanual Dexterous Manipulation Skills from Human Demonstrations | Oct 3, 2024 | Zero-shot Generalization | Unverified | 0 |
| Cross-Embodiment Dexterous Grasping with Reinforcement Learning | Oct 3, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generation | Sep 29, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation | Sep 24, 2024 | Anatomy, object-detection | Code Available | 0 |
| From Goal-Conditioned to Language-Conditioned Agents via Vision-Language Models | Sep 24, 2024 | Reinforcement Learning (RL), Zero-shot Generalization | Unverified | 0 |
| Deep Generative Adversarial Network for Occlusion Removal from a Single Image | Sep 20, 2024 | Generative Adversarial Network, Segmentation | Unverified | 0 |
| Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring | Sep 20, 2024 | Image Super-Resolution, SSIM | Unverified | 0 |
| IMRL: Integrating Visual, Physical, Temporal, and Geometric Representations for Enhanced Food Acquisition | Sep 18, 2024 | Imitation Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Benchmarking VLMs' Reasoning About Persuasive Atypical Images | Sep 16, 2024 | Benchmarking, Object Recognition | Unverified | 0 |
| AnySkin: Plug-and-play Skin Sensing for Robotic Touch | Sep 12, 2024 | Zero-shot Generalization | Unverified | 0 |
| TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs | Sep 8, 2024 | Depth Estimation, Monocular Depth Estimation | Unverified | 0 |
| Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment | Aug 22, 2024 | Multi-Task Learning, Retrieval | Unverified | 0 |
| Segment Anything Model for Grain Characterization in Hard Drive Design | Aug 22, 2024 | Zero-shot Generalization | Unverified | 0 |
| Zero-Shot Object-Centric Representation Learning | Aug 17, 2024 | Object, Object Discovery | Unverified | 0 |
| One Shot is Enough for Sequential Infrared Small Target Segmentation | Aug 9, 2024 | One-Shot Segmentation, Segmentation | Code Available | 0 |
| Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Aug 7, 2024 | Adversarial Robustness, Image Segmentation | Unverified | 0 |
| HeteroMorpheus: Universal Control Based on Morphological Heterogeneity Modeling | Aug 2, 2024 | Diversity, Zero-shot Generalization | Code Available | 0 |
| HDL-GPT: High-Quality HDL is All You Need | Jul 25, 2024 | All, Code Generation | Unverified | 0 |
| SSTD: Stripe-Like Space Target Detection Using Single-Point Weak Supervision | Jul 25, 2024 | Pseudo Label, Zero-shot Generalization | Unverified | 0 |
| OpenSU3D: Open World 3D Scene Understanding using Foundation Models | Jul 19, 2024 | Scene Understanding, Spatial Reasoning | Unverified | 0 |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | Jul 18, 2024 | Hallucination, Language Modelling | Unverified | 0 |
| Disentangling Representations through Multi-task Learning | Jul 15, 2024 | Decision Making, Multi-Task Learning | Unverified | 0 |
| Adaptive Prediction Ensemble: Improving Out-of-Distribution Generalization of Motion Forecasting | Jul 12, 2024 | Autonomous Driving, Deep Learning | Unverified | 0 |
| Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Self-Regularization | Jul 11, 2024 | Data Augmentation, Domain Generalization | Unverified | 0 |
| Real-Time Anomaly Detection and Reactive Planning with Large Language Models | Jul 11, 2024 | Anomaly Detection, Autonomous Vehicles | Unverified | 0 |
| Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object Search | Jul 10, 2024 | Few-Shot Learning, GPU | Code Available | 0 |
| Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval | Jul 1, 2024 | cross-modal alignment, Image Retrieval | Unverified | 0 |
| NeuralSCF: Neural network self-consistent fields for density functional theory | Jun 22, 2024 | Zero-shot Generalization | Unverified | 0 |
| Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity | Jun 17, 2024 | Continual Learning, Zero-shot Generalization | Code Available | 0 |
| Words in Motion: Extracting Interpretable Control Vectors for Motion Transformers | Jun 17, 2024 | Motion Forecasting, Zero-shot Generalization | Unverified | 0 |
| Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning | Jun 13, 2024 | Zero-shot Generalization | Code Available | 0 |
| Prompt-based Visual Alignment for Zero-shot Policy Transfer | Jun 5, 2024 | Autonomous Driving, Language Modelling | Unverified | 0 |
| OLIVE: Object Level In-Context Visual Embeddings | Jun 2, 2024 | Object, Zero-shot Generalization | Code Available | 0 |
| Text-only Synthesis for Image Captioning | May 28, 2024 | Image Captioning, Language Modelling | Unverified | 0 |
| TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability | May 27, 2024 | Adversarial Robustness, Knowledge Distillation | Unverified | 0 |
| Benchmarking General-Purpose In-Context Learning | May 27, 2024 | Benchmarking, Decision Making | Unverified | 0 |
| Amortized Active Causal Induction with Deep Reinforcement Learning | May 26, 2024 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Gradient Projection For Continual Parameter-Efficient Tuning | May 22, 2024 | Continual Learning, Hallucination | Unverified | 0 |
| Prompt Learning for Generalized Vehicle Routing | May 20, 2024 | Combinatorial Optimization, Prompt Learning | Code Available | 0 |
| Revisiting the Robust Generalization of Adversarial Prompt Tuning | May 18, 2024 | Adversarial Robustness, Prompt Learning | Unverified | 0 |
| A Minimalist Prompt for Zero-Shot Policy Learning | May 9, 2024 | Zero-shot Generalization | Unverified | 0 |
| Enhancing Vision-Language Models Generalization via Diversity-Driven Novel Feature Synthesis | May 4, 2024 | Diversity, Zero-shot Generalization | Unverified | 0 |
| Instruction Matters: A Simple yet Effective Task Selection for Optimized Instruction Tuning of Specific Tasks | Apr 25, 2024 | Zero-shot Generalization | Code Available | 0 |
| The Third Monocular Depth Estimation Challenge | Apr 25, 2024 | Depth Estimation, Monocular Depth Estimation | Unverified | 0 |
| Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning | Apr 19, 2024 | Diversity, Zero-shot Generalization | Unverified | 0 |
| Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning | Apr 15, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0 |
| PromptSync: Bridging Domain Gaps in Vision-Language Models through Class-Aware Prototype Alignment and Discrimination | Apr 11, 2024 | Contrastive Learning, Domain Generalization | Unverified | 0 |
| DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation | Apr 8, 2024 | Zero-shot Generalization | Unverified | 0 |