| Probabilistic Answer Set Programming with Discrete and Continuous Random Variables | Sep 30, 2024 | | Code Available | 1 |
| Law of the Weakest Link: Cross Capabilities of Large Language Models | Sep 30, 2024 | | Code Available | 1 |
| Text Clustering as Classification with LLMs | Sep 30, 2024 | Classification, Clustering | Code Available | 1 |
| On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability | Sep 30, 2024 | Decision Making, Management | Code Available | 1 |
| Volumetric Conditional Score-based Residual Diffusion Model for PET/MR Denoising | Sep 30, 2024 | Denoising | Code Available | 1 |
| Camera Calibration using a Collimator System | Sep 30, 2024 | Camera Calibration | Code Available | 1 |
| EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction | Sep 30, 2024 | Depth Estimation, Depth Prediction | Code Available | 1 |
| PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling | Sep 30, 2024 | | Code Available | 1 |
| Delving Deep into Engagement Prediction of Short Videos | Sep 30, 2024 | Prediction, Recommendation Systems | Code Available | 1 |
| ASQuery: A Query-based Model for Action Segmentation | Sep 30, 2024 | Action Segmentation, Decoder | Code Available | 1 |
| Physics-Regularized Multi-Modal Image Assimilation for Brain Tumor Localization | Sep 30, 2024 | Anatomy | Code Available | 1 |
| ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification | Sep 30, 2024 | Decoder, Occluded Person Re-Identification | Code Available | 1 |
| SWIM: Short-Window CNN Integrated with Mamba for EEG-Based Auditory Spatial Attention Decoding | Sep 30, 2024 | Data Augmentation, EEG | Code Available | 1 |
| IRFusionFormer: Enhancing Pavement Crack Segmentation with RGB-T Fusion and Topological-Based Loss | Sep 30, 2024 | Crack Segmentation, Segmentation | Code Available | 1 |
| Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration | Sep 30, 2024 | knowledge editing | Code Available | 1 |
| Enhancing High-order Interaction Awareness in LLM-based Recommender Model | Sep 30, 2024 | Knowledge Graphs, Reranking | Code Available | 1 |
| Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function | Sep 30, 2024 | Attribute, Disentanglement | Code Available | 1 |
| Basis-to-Basis Operator Learning Using Function Encoders | Sep 30, 2024 | Operator learning | Code Available | 1 |
| Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement | Sep 29, 2024 | Machine Unlearning | Code Available | 1 |
| T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition | Sep 29, 2024 | In-Context Learning, Question Answering | Code Available | 1 |
| MASKDROID: Robust Android Malware Detection with Masked Graph Representations | Sep 29, 2024 | Android Malware Detection, Graph Neural Network | Code Available | 1 |
| BuildingView: Constructing Urban Building Exteriors Databases with Street View Imagery and Multimodal Large Language Mode | Sep 29, 2024 | Decision Making, Systematic Literature Review | Code Available | 1 |
| Crafting Distribution Shifts for Validation and Training in Single Source Domain Generalization | Sep 29, 2024 | Domain Generalization, Image to sketch recognition | Code Available | 1 |
| Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues | Sep 29, 2024 | Personality Trait Recognition | Code Available | 1 |
| Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models | Sep 29, 2024 | Recommendation Systems | Code Available | 1 |
| Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method | Sep 29, 2024 | Federated Learning, Learning Theory | Code Available | 1 |
| DATransNet: Dynamic Attention Transformer Network for Infrared Small Target Detection | Sep 29, 2024 | | Code Available | 1 |
| LoRKD: Low-Rank Knowledge Decomposition for Medical Foundation Models | Sep 29, 2024 | 3D Medical Imaging Segmentation, Medical Image Classification | Code Available | 1 |
| Evolving Multi-Scale Normalization for Time Series Forecasting under Distribution Shifts | Sep 29, 2024 | Time Series, Time Series Forecasting | Code Available | 1 |
| 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models | Sep 29, 2024 | Computational Efficiency | Code Available | 1 |
| Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding | Sep 29, 2024 | document understanding, Entity Linking | Code Available | 1 |
| Hybrid Mamba for Few-Shot Segmentation | Sep 29, 2024 | Few-Shot Semantic Segmentation, Mamba | Code Available | 1 |
| All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation | Sep 29, 2024 | All, Data Compression | Code Available | 1 |
| Gradient descent with adaptive stepsize converges (nearly) linearly under fourth-order growth | Sep 29, 2024 | | Code Available | 1 |
| Vision-Language Models are Strong Noisy Label Detectors | Sep 29, 2024 | Denoising, image-classification | Code Available | 1 |
| MCDDPM: Multichannel Conditional Denoising Diffusion Model for Unsupervised Anomaly Detection in Brain MRI | Sep 29, 2024 | Anomaly Detection, Denoising | Code Available | 1 |
| OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images | Sep 29, 2024 | object-detection, Object Detection | Code Available | 1 |
| CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering | Sep 29, 2024 | Graph Question Answering, Question Answering | Code Available | 1 |
| VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition | Sep 28, 2024 | Image Retrieval, Visual Localization | Code Available | 1 |
| GS-EVT: Cross-Modal Event Camera Tracking based on Gaussian Splatting | Sep 28, 2024 | Camera Pose Estimation, Event-based vision | Code Available | 1 |
| X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation | Sep 28, 2024 | Semantic Segmentation, Video Object Segmentation | Code Available | 1 |
| Summit Vitals: Multi-Camera and Multi-Signal Biosensing at High Altitudes | Sep 28, 2024 | SpO2 estimation | Code Available | 1 |
| SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models | Sep 28, 2024 | Drone navigation, Robot Manipulation | Code Available | 1 |
| Analog In-Memory Computing Attention Mechanism for Fast and Energy-Efficient Large Language Models | Sep 28, 2024 | GPU | Code Available | 1 |
| RMLR: Extending Multinomial Logistic Regression into General Geometries | Sep 28, 2024 | regression | Code Available | 1 |
| A Confidence-Aware Matching Strategy For Generalized Multi-Object Tracking | Sep 27, 2024 | Multi-Object Tracking, object-detection | Code Available | 1 |
| MECG-E: Mamba-based ECG Enhancer for Baseline Wander Removal | Sep 27, 2024 | Denoising, Diagnostic | Code Available | 1 |
| Off to new Shores: A Dataset & Benchmark for (near-)coastal Flood Inundation Forecasting | Sep 27, 2024 | Flood extent forecasting | Code Available | 1 |
| Mixture of Multicenter Experts in Multimodal Generative AI for Advanced Radiotherapy Target Delineation | Sep 27, 2024 | | Code Available | 1 |
| Relighting from a Single Image: Datasets and Deep Intrinsic-based Architecture | Sep 27, 2024 | | Code Available | 1 |