| A toolbox for calculating objective image properties in aesthetics research | Aug 20, 2024 | | CodeCode Available | 1 |
| V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard? | Aug 20, 2024 | Few-Shot LearningIn-Context Learning | CodeCode Available | 1 |
| TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer Learning | Aug 20, 2024 | Action Recognitionparameter-efficient fine-tuning | CodeCode Available | 1 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| AIR: Analytic Imbalance Rectifier for Continual Learning | Aug 19, 2024 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video | Aug 19, 2024 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 1 |
| SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition | Aug 19, 2024 | 3D Hand Pose EstimationAction Recognition | CodeCode Available | 1 |
| PolypDB: A Curated Multi-Center Dataset for Development of AI Algorithms in Colonoscopy | Aug 19, 2024 | Federated LearningSegmentation | CodeCode Available | 1 |
| Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms | Aug 19, 2024 | Action RecognitionMamba | CodeCode Available | 1 |
| CLIPCleaner: Cleaning Noisy Labels with CLIP | Aug 19, 2024 | Learning with noisy labels | CodeCode Available | 1 |
| TDNetGen: Empowering Complex Network Resilience Prediction with Generative Augmentation of Topology and Dynamics | Aug 19, 2024 | Data AugmentationPrediction | CodeCode Available | 1 |
| CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Aug 19, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Contextual Importance and Utility in Python: New Functionality and Insights with the py-ciu Package | Aug 19, 2024 | | CodeCode Available | 1 |
| FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Aug 19, 2024 | DescriptiveFace Swapping | CodeCode Available | 1 |
| ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement | Aug 19, 2024 | Computational EfficiencyImage Enhancement | CodeCode Available | 1 |
| Unsupervised Composable Representations for Audio | Aug 19, 2024 | Audio Source Separationblind source separation | CodeCode Available | 1 |
| A Dataset for Mechanical Mechanisms | Aug 19, 2024 | | CodeCode Available | 1 |
| Deep-MacroFin: Informed Equilibrium Neural Network for Continuous Time Economic Models | Aug 19, 2024 | Kolmogorov-Arnold Networks | CodeCode Available | 1 |
| PinnDE: Physics-Informed Neural Networks for Solving Differential Equations | Aug 19, 2024 | | CodeCode Available | 1 |
| Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement | Aug 19, 2024 | Full reference image quality assessmentFull-Reference Image Quality Assessment | CodeCode Available | 1 |
| Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models | Aug 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| OccMamba: Semantic Occupancy Prediction with State Space Models | Aug 19, 2024 | MambaPrediction | CodeCode Available | 1 |
| TaSL: Continual Dialog State Tracking via Task Skill Localization and Consolidation | Aug 19, 2024 | dialog state trackingDialogue State Tracking | CodeCode Available | 1 |
| "Image, Tell me your story!" Predicting the original meta-context of visual misinformation | Aug 19, 2024 | Fact CheckingMisinformation | CodeCode Available | 1 |
| SAM-UNet:Enhancing Zero-Shot Segmentation of SAM for Universal Medical Images | Aug 19, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit Adjustment | Aug 19, 2024 | Action SegmentationSegmentation | CodeCode Available | 1 |
| Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Aug 19, 2024 | Mixture-of-ExpertsMulti-Task Learning | CodeCode Available | 1 |
| Goldfish: Monolingual Language Models for 350 Languages | Aug 19, 2024 | Text Generation | CodeCode Available | 1 |
| Implicit Grid Convolution for Multi-Scale Image Super-Resolution | Aug 19, 2024 | Computational EfficiencyImage Super-Resolution | CodeCode Available | 1 |
| Facial Wrinkle Segmentation for Cosmetic Dermatology: Pretraining with Texture Map-Based Weak Supervision | Aug 19, 2024 | DecoderSegmentation | CodeCode Available | 1 |
| Uncertainty Quantification of Surrogate Models using Conformal Prediction | Aug 19, 2024 | Conformal PredictionPrediction | CodeCode Available | 1 |
| Parkinson's Disease Classification via EEG: All You Need is a Single Convolutional Layer | Aug 19, 2024 | AllEEG | CodeCode Available | 1 |
| AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference | Aug 19, 2024 | ManagementMixture-of-Experts | CodeCode Available | 1 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Harnessing Multimodal Large Language Models for Multimodal Sequential Recommendation | Aug 19, 2024 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| Dynamic Label Injection for Imbalanced Industrial Defect Segmentation | Aug 19, 2024 | Semantic Segmentation | CodeCode Available | 1 |
| GNN-Empowered Effective Partial Observation MARL Method for AoI Management in Multi-UAV Network | Aug 18, 2024 | Distributed OptimizationEfficient Neural Network | CodeCode Available | 1 |
| HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model | Aug 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors | Aug 18, 2024 | DecoderFace Anonymization | CodeCode Available | 1 |
| Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models | Aug 18, 2024 | AttributeHallucination | CodeCode Available | 1 |
| Distinguish Confusion in Legal Judgment Prediction via Revised Relation Knowledge | Aug 18, 2024 | ArticlesInductive Bias | CodeCode Available | 1 |
| VrdONE: One-stage Video Visual Relation Detection | Aug 18, 2024 | Predicate DetectionRelation | CodeCode Available | 1 |
| Unsupervised Change Detection Based on Image Reconstruction Loss with Segment Anything | Aug 18, 2024 | Change DetectionImage Reconstruction | CodeCode Available | 1 |
| Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition | Aug 18, 2024 | Contrastive LearningEmotion Recognition | CodeCode Available | 1 |
| GitHub is an effective platform for collaborative and reproducible laboratory research | Aug 18, 2024 | Experimental DesignTransfer Learning | CodeCode Available | 1 |
| Flemme: A Flexible and Modular Learning Platform for Medical Images | Aug 18, 2024 | Image SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning | Aug 18, 2024 | PhilosophySafety Alignment | CodeCode Available | 1 |
| Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration | Aug 17, 2024 | Image RestorationPrompt Learning | CodeCode Available | 1 |
| PADetBench: Towards Benchmarking Physical Attacks against Object Detection | Aug 17, 2024 | Adversarial RobustnessBenchmarking | CodeCode Available | 1 |
| Are CLIP features all you need for Universal Synthetic Image Origin Attribution? | Aug 17, 2024 | All | CodeCode Available | 1 |