| Effective LLM-Driven Code Generation with Pythoness | Jan 3, 2025 | Code Generation | CodeCode Available | 1 |
| CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis | Jan 3, 2025 | Math | CodeCode Available | 1 |
| BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction | Jan 3, 2025 | | CodeCode Available | 1 |
| MADGEN: Mass-Spec attends to De Novo Molecular generation | Jan 3, 2025 | Contrastive Learning | CodeCode Available | 1 |
| Balancing Accuracy and Efficiency for Large-Scale SLAM: A Minimal Subset Approach for Scalable Loop Closures | Jan 3, 2025 | global-optimizationLoop Closure Detection | CodeCode Available | 1 |
| MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments | Jan 3, 2025 | Decision Making | CodeCode Available | 1 |
| ACE: Anti-Editing Concept Erasure in Text-to-Image Models | Jan 3, 2025 | | CodeCode Available | 1 |
| HSTforU: anomaly detection in aerial and ground-based videos with hierarchical spatio-temporal transformer for U-net | Jan 3, 2025 | Anomaly DetectionDecoder | CodeCode Available | 1 |
| Ingredients: Blending Custom Photos with Video Diffusion Transformers | Jan 3, 2025 | | CodeCode Available | 1 |
| Universal Online Temporal Calibration for Optimization-based Visual-Inertial Navigation Systems | Jan 3, 2025 | Motion Estimation | CodeCode Available | 1 |
| Architecture for Trajectory-Based Fishing Ship Classification with AIS Data | Jan 3, 2025 | Binary ClassificationClassification | CodeCode Available | 1 |
| Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent | Jan 2, 2025 | Multi-Task Learning | CodeCode Available | 1 |
| CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models | Jan 2, 2025 | BenchmarkingComputer Security | CodeCode Available | 1 |
| Learning 3D Garment Animation from Trajectories of A Piece of Cloth | Jan 2, 2025 | | CodeCode Available | 1 |
| Conditional Consistency Guided Image Translation and Enhancement | Jan 2, 2025 | DenoisingImage Enhancement | CodeCode Available | 1 |
| Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer | Jan 2, 2025 | Stereo Matching | CodeCode Available | 1 |
| MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification | Jan 2, 2025 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function | Jan 2, 2025 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| Detail Matters: Mamba-Inspired Joint Unfolding Network for Snapshot Spectral Compressive Imaging | Jan 2, 2025 | Mamba | CodeCode Available | 1 |
| Long-range Brain Graph Transformer | Jan 2, 2025 | Graph Learning | CodeCode Available | 1 |
| SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization | Jan 2, 2025 | Action RecognitionAction Understanding | CodeCode Available | 1 |
| ORACLE: A Real-Time, Hierarchical, Deep-Learning Photometric Classifier for the LSST | Jan 2, 2025 | ClassificationTime Series Classification | CodeCode Available | 1 |
| Predicting the Performance of Black-box LLMs through Self-Queries | Jan 2, 2025 | Question Answering | CodeCode Available | 1 |
| HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking | Jan 2, 2025 | 3D Multi-Object TrackingMulti-Object Tracking | CodeCode Available | 1 |
| Unifying Specialized Visual Encoders for Video Language Models | Jan 2, 2025 | Multiple-choiceVideo Understanding | CodeCode Available | 1 |
| CSC-PA: Cross-image Semantic Correlation via Prototype Attentions for Single-network Semi-supervised Breast Tumor Segmentation | Jan 1, 2025 | Image SegmentationLesion Segmentation | CodeCode Available | 1 |
| Less is More: Token Context-aware Learning for Object Tracking | Jan 1, 2025 | Object TrackingVisual Tracking | CodeCode Available | 1 |
| PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers | Jan 1, 2025 | Autonomous DrivingPose Estimation | CodeCode Available | 1 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Large Model Enhancement | Jan 1, 2025 | cross-modal alignmentKnowledge Distillation | CodeCode Available | 1 |
| Multimodal Large Models Are Effective Action Anticipators | Jan 1, 2025 | Action AnticipationLong Term Action Anticipation | CodeCode Available | 1 |
| Free Lunch Enhancements for Multi-modal Crowd Counting | Jan 1, 2025 | cross-modal alignmentCrowd Counting | CodeCode Available | 1 |
| Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAV Target Detection | Jan 1, 2025 | parameter estimation | CodeCode Available | 1 |
| UHD-processer: Unified UHD Image Restoration with Progressive Frequency Learning and Degradation-aware Prompts | Jan 1, 2025 | DeblurringDenoising | CodeCode Available | 1 |
| OW-OVD: Unified Open World and Open Vocabulary Object Detection | Jan 1, 2025 | AttributeIncremental Learning | CodeCode Available | 1 |
| Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation | Jan 1, 2025 | | CodeCode Available | 1 |
| Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding | Jan 1, 2025 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| AVF-MAE++: Scaling Affective Video Facial Masked Autoencoders via Efficient Audio-Visual Self-Supervised Learning | Jan 1, 2025 | Self-Supervised Learning | CodeCode Available | 1 |
| VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification | Jan 1, 2025 | Hallucination | CodeCode Available | 1 |
| VolFormer: Explore More Comprehensive Cube Interaction for Hyperspectral Image Restoration and Beyond | Jan 1, 2025 | Hyperspectral Image Super-ResolutionImage Restoration | CodeCode Available | 1 |
| T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting | Jan 1, 2025 | DenoisingObject Counting | CodeCode Available | 1 |
| Fuzzy Multimodal Learning for Trusted Cross-modal Retrieval | Jan 1, 2025 | Cross-Modal RetrievalRetrieval | CodeCode Available | 1 |
| Docopilot: Improving Multimodal Models for Document-Level Understanding | Jan 1, 2025 | document understandingRAG | CodeCode Available | 1 |
| PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention | Jan 1, 2025 | Intrinsic Image Decomposition | CodeCode Available | 1 |
| Population Aware Diffusion for Time Series Generation | Jan 1, 2025 | Time SeriesTime Series Generation | CodeCode Available | 1 |
| FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation | Jan 1, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs | Jan 1, 2025 | In-Context LearningMeta-Learning | CodeCode Available | 1 |
| Relation3D : Enhancing Relation Modeling for Point Cloud Instance Segmentation | Jan 1, 2025 | 3D Instance SegmentationContrastive Learning | CodeCode Available | 1 |
| LC-Mamba: Local and Continuous Mamba with Shifted Windows for Frame Interpolation | Jan 1, 2025 | Mamba | CodeCode Available | 1 |
| Making Old Film Great Again: Degradation-aware State Space Model for Old Film Restoration | Jan 1, 2025 | MambaVideo Restoration | CodeCode Available | 1 |
| HCMA-UNet: A Hybrid CNN-Mamba UNet with Axial Self-Attention for Efficient Breast Cancer Segmentation | Jan 1, 2025 | Computational EfficiencyLesion Segmentation | CodeCode Available | 1 |