| Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models | Mar 4, 2024 | Image RetrievalRetrieval | CodeCode Available | 2 |
| Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection | Mar 4, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 2 |
| One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models | Mar 4, 2024 | Adversarial AttackAdversarial Robustness | CodeCode Available | 2 |
| UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control | Mar 4, 2024 | DiversityVideo Generation | CodeCode Available | 2 |
| SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis | Mar 4, 2024 | BenchmarkingDrug Discovery | CodeCode Available | 2 |
| A Simple Baseline for Efficient Hand Mesh Reconstruction | Mar 4, 2024 | 3D Hand Pose EstimationComputational Efficiency | CodeCode Available | 2 |
| Applied Causal Inference Powered by ML and AI | Mar 4, 2024 | Causal Inference | CodeCode Available | 2 |
| AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Mar 4, 2024 | Semantic SegmentationSemi-Supervised Semantic Segmentation | CodeCode Available | 2 |
| Differentially Private Synthetic Data via Foundation Model APIs 2: Text | Mar 4, 2024 | Privacy Preserving | CodeCode Available | 2 |
| REAL-Colon: A dataset for developing real-world AI applications in colonoscopy | Mar 4, 2024 | Benchmarking | CodeCode Available | 2 |
| Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis | Mar 3, 2024 | 3D Parameter-Efficient Fine-Tuning for ClassificationGPU | CodeCode Available | 2 |
| In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation | Mar 3, 2024 | HallucinationTruthfulQA | CodeCode Available | 2 |
| OccFusion: Multi-Sensor Fusion Framework for 3D Semantic Occupancy Prediction | Mar 3, 2024 | 3D Semantic Occupancy PredictionAutonomous Driving | CodeCode Available | 2 |
| EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Mar 3, 2024 | ObjectRepresentation Learning | CodeCode Available | 2 |
| Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Mar 3, 2024 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 2 |
| Face Swap via Diffusion Model | Mar 2, 2024 | Face AlignmentFace Detection | CodeCode Available | 2 |
| Dynamic 3D Point Cloud Sequences as 2D Videos | Mar 2, 2024 | Action RecognitionSelf-Supervised Learning | CodeCode Available | 2 |
| Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning | Mar 2, 2024 | DecoderMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving | Mar 2, 2024 | Autonomous DrivingKnowledge Distillation | CodeCode Available | 2 |
| AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks | Mar 2, 2024 | Instruction FollowingLLM real-life tasks | CodeCode Available | 2 |
| VNLP: Turkish NLP Package | Mar 2, 2024 | Morphological Analysisnamed-entity-recognition | CodeCode Available | 2 |
| Depth Information Assisted Collaborative Mutual Promotion Network for Single Image Dehazing | Mar 2, 2024 | Depth EstimationImage Dehazing | CodeCode Available | 2 |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Mar 1, 2024 | Objectobject-detection | CodeCode Available | 2 |
| EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data | Mar 1, 2024 | continuous-controlContinuous Control | CodeCode Available | 2 |
| Point Cloud Mamba: Point Cloud Learning via State Space Model | Mar 1, 2024 | MambaState Space Models | CodeCode Available | 2 |
| HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding | Mar 1, 2024 | HallucinationObject | CodeCode Available | 2 |
| Data Science Education in Undergraduate Physics: Lessons Learned from a Community of Practice | Mar 1, 2024 | | CodeCode Available | 2 |
| Deformable One-shot Face Stylization via DINO Semantic Guidance | Mar 1, 2024 | One-Shot Face Stylization | CodeCode Available | 2 |
| SURE: SUrvey REcipes for building reliable and robust deep networks | Mar 1, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| A Modular and Robust Physics-Based Approach for Lensless Image Reconstruction | Mar 1, 2024 | Image Reconstruction | CodeCode Available | 2 |
| Dual-domain strip attention for image restoration | Mar 1, 2024 | DeblurringDenoising | CodeCode Available | 2 |
| Rethinking Few-shot 3D Point Cloud Semantic Segmentation | Mar 1, 2024 | Few-shot 3D Point Cloud Semantic SegmentationSegmentation | CodeCode Available | 2 |
| Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching | Mar 1, 2024 | Stereo Matching | CodeCode Available | 2 |
| TempCompass: Do Video LLMs Really Understand Videos? | Mar 1, 2024 | Diversity | CodeCode Available | 2 |
| PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Feb 29, 2024 | Image SegmentationPanoptic Segmentation | CodeCode Available | 2 |
| Spyx: A Library for Just-In-Time Compiled Optimization of Spiking Neural Networks | Feb 29, 2024 | | CodeCode Available | 2 |
| Learning Commonality, Divergence and Variety for Unsupervised Visible-Infrared Person Re-identification | Feb 29, 2024 | Contrastive LearningPerson Re-Identification | CodeCode Available | 2 |
| Curiosity-driven Red-teaming for Large Language Models | Feb 29, 2024 | Red TeamingReinforcement Learning (RL) | CodeCode Available | 2 |
| FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Feb 29, 2024 | 3D Object ReconstructionInstance Segmentation | CodeCode Available | 2 |
| A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Feb 29, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| How do Large Language Models Handle Multilingualism? | Feb 29, 2024 | | CodeCode Available | 2 |
| CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition | Feb 29, 2024 | Representation LearningVisual Place Recognition | CodeCode Available | 2 |
| NARUTO: Neural Active Reconstruction from Uncertain Target Observations | Feb 29, 2024 | Surface Reconstruction | CodeCode Available | 2 |
| Deep learning for 3D human pose estimation and mesh recovery: A survey | Feb 29, 2024 | 3D Human Pose EstimationAutonomous Driving | CodeCode Available | 2 |
| Global and Local Prompts Cooperation via Optimal Transport for Federated Learning | Feb 29, 2024 | Federated LearningPrompt Learning | CodeCode Available | 2 |
| Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Feb 29, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap | Feb 29, 2024 | Math | CodeCode Available | 2 |
| GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers | Feb 29, 2024 | GSM8KMath | CodeCode Available | 2 |
| DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Feb 29, 2024 | DenoisingGraph Neural Network | CodeCode Available | 2 |
| A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation | Feb 29, 2024 | Anomaly DetectionDecoder | CodeCode Available | 2 |