| Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models | Mar 2, 2025 | Time Series ForecastingVideo Prediction | CodeCode Available | 1 |
| Bridging Spectral-wise and Multi-spectral Depth Estimation via Geometry-guided Contrastive Learning | Mar 2, 2025 | Autonomous VehiclesContrastive Learning | CodeCode Available | 1 |
| LightEndoStereo: A Real-time Lightweight Stereo Matching Method for Endoscopy Images | Mar 2, 2025 | MambaStereo Matching | CodeCode Available | 1 |
| CAGN-GAT Fusion: A Hybrid Contrastive Attentive Graph Neural Network for Network Intrusion Detection | Mar 2, 2025 | FairnessGraph Attention | CodeCode Available | 1 |
| Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning | Mar 2, 2025 | Large Language ModelMulti-Instance Retrieval | CodeCode Available | 1 |
| On Generalization Across Environments In Multi-Objective Reinforcement Learning | Mar 2, 2025 | Decision MakingMulti-Objective Reinforcement Learning | CodeCode Available | 1 |
| Data Unlearning in Diffusion Models | Mar 2, 2025 | Machine UnlearningMemorization | CodeCode Available | 1 |
| Speculative Ad-hoc Querying | Mar 2, 2025 | | CodeCode Available | 1 |
| Shazam: Unifying Multiple Foundation Models for Advanced Computational Pathology | Mar 2, 2025 | | CodeCode Available | 1 |
| Advancing Prompt-Based Methods for Replay-Independent General Continual Learning | Mar 2, 2025 | Continual Learning | CodeCode Available | 1 |
| Enhancing Monocular 3D Scene Completion with Diffusion Model | Mar 2, 2025 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 1 |
| Task-Agnostic Guided Feature Expansion for Class-Incremental Learning | Mar 2, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions | Mar 2, 2025 | Adversarial RobustnessComputational Efficiency | CodeCode Available | 1 |
| Dynamic Gradient Sparsification Training for Few-Shot Fine-tuning of CT Lymph Node Segmentation Foundation Model | Mar 2, 2025 | PrognosisSegmentation | CodeCode Available | 1 |
| IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis | Mar 2, 2025 | Image SegmentationImage-text matching | CodeCode Available | 1 |
| T-cell receptor specificity landscape revealed through de novo peptide design | Mar 1, 2025 | Specificity | CodeCode Available | 1 |
| Space-Time Graphs of Convex Sets for Multi-Robot Motion Planning | Mar 1, 2025 | Motion Planning | CodeCode Available | 1 |
| Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable | Mar 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Decoupling Content and Expression: Two-Dimensional Detection of AI-Generated Text | Mar 1, 2025 | | CodeCode Available | 1 |
| Discrete Codebook World Models for Continuous Control | Mar 1, 2025 | continuous-controlContinuous Control | CodeCode Available | 1 |
| High Dynamic Range Video Compression: A Large-Scale Benchmark Dataset and A Learned Bit-depth Scalable Compression Algorithm | Mar 1, 2025 | Video Compression | CodeCode Available | 1 |
| CADRef: Robust Out-of-Distribution Detection via Class-Aware Decoupled Relative Feature Leveraging | Mar 1, 2025 | Out-of-Distribution Detection | CodeCode Available | 1 |
| ToolDial: Multi-turn Dialogue Generation Method for Tool-Augmented Language Models | Mar 1, 2025 | Dialogue Generation | CodeCode Available | 1 |
| SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection | Mar 1, 2025 | Human-Object Interaction DetectionLarge Language Model | CodeCode Available | 1 |
| MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention | Mar 1, 2025 | ClusteringRepresentation Learning | CodeCode Available | 1 |
| dyAb: Flow Matching for Flexible Antibody Design with AlphaFold-driven Pre-binding Antigen | Mar 1, 2025 | | CodeCode Available | 1 |
| Reinforcement learning with combinatorial actions for coupled restless bandits | Mar 1, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking | Mar 1, 2025 | CPUGPU | CodeCode Available | 1 |
| Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning | Feb 28, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| SSL4EO-S12 v1.1: A Multimodal, Multiseasonal Dataset for Pretraining, Updated | Feb 28, 2025 | Earth ObservationSelf-Supervised Learning | CodeCode Available | 1 |
| Dynamic Markov Blanket Detection for Macroscopic Physics Discovery | Feb 28, 2025 | Object | CodeCode Available | 1 |
| LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation | Feb 28, 2025 | ArticlesBenchmarking | CodeCode Available | 1 |
| A novel Fourier Adjacency Transformer for advanced EEG emotion recognition | Feb 28, 2025 | EEGEEG Emotion Recognition | CodeCode Available | 1 |
| Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Feb 28, 2025 | HallucinationObject | CodeCode Available | 1 |
| UDora: A Unified Red Teaming Framework against LLM Agents by Dynamically Hijacking Their Own Reasoning | Feb 28, 2025 | Large Language ModelRed Teaming | CodeCode Available | 1 |
| MESC-3D:Mining Effective Semantic Cues for 3D Reconstruction from a Single Image | Feb 28, 2025 | 3D Reconstruction | CodeCode Available | 1 |
| BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports | Feb 28, 2025 | Action RecognitionLine Detection | CodeCode Available | 1 |
| Contextualizing biological perturbation experiments through language | Feb 28, 2025 | Efficient Exploration | CodeCode Available | 1 |
| EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration | Feb 28, 2025 | Camera Pose EstimationOptical Flow Estimation | CodeCode Available | 1 |
| Towards General Visual-Linguistic Face Forgery Detection(V2) | Feb 28, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior | Feb 28, 2025 | Adversarial Attack | CodeCode Available | 1 |
| Oscillation-Reduced MXFP4 Training for Vision Transformers | Feb 28, 2025 | GPUQuantization | CodeCode Available | 1 |
| FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Feb 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport | Feb 28, 2025 | Anatomy | CodeCode Available | 1 |
| FedConv: A Learning-on-Model Paradigm for Heterogeneous Federated Clients | Feb 28, 2025 | Federated Learning | CodeCode Available | 1 |
| Towards Zero Touch Networks: Cross-Layer Automated Security Solutions for 6G Wireless Networks | Feb 28, 2025 | AutoMLIntrusion Detection | CodeCode Available | 1 |
| DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking | Feb 28, 2025 | RAGRetrieval | CodeCode Available | 1 |
| Protein Structure Tokenization: Benchmarking and New Recipe | Feb 28, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Towards Lossless Implicit Neural Representation via Bit Plane Decomposition | Feb 28, 2025 | Image CompressionQuantization | CodeCode Available | 1 |
| SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Feb 28, 2025 | AttributeAutonomous Driving | CodeCode Available | 1 |