| SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts Aggregation | Jan 20, 2025 | Speaker VerificationSpeech Enhancement | CodeCode Available | 1 |
| Technical Report for the Forgotten-by-Design Project: Targeted Obfuscation for Machine Learning | Jan 20, 2025 | Inference AttackMachine Unlearning | CodeCode Available | 1 |
| A Survey of World Models for Autonomous Driving | Jan 20, 2025 | Anomaly DetectionAutonomous Driving | CodeCode Available | 1 |
| UniTrans: A Unified Vertical Federated Knowledge Transfer Framework for Enhancing Cross-Hospital Collaboration | Jan 20, 2025 | Federated LearningPrivacy Preserving | CodeCode Available | 1 |
| Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image Classification | Jan 20, 2025 | Federated Learningimage-classification | CodeCode Available | 1 |
| Curiosity-Driven Reinforcement Learning from Human Feedback | Jan 20, 2025 | DiversityInstruction Following | CodeCode Available | 1 |
| Automatic Labelling & Semantic Segmentation with 4D Radar Tensors | Jan 20, 2025 | Semantic Segmentationvehicle detection | CodeCode Available | 1 |
| MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought Thinking | Jan 20, 2025 | Decision MakingGSM8K | CodeCode Available | 1 |
| Chat3GPP: An Open-Source Retrieval-Augmented Generation Framework for 3GPP Documents | Jan 20, 2025 | ChunkingRAG | CodeCode Available | 1 |
| MedicoSAM: Towards foundation models for medical image segmentation | Jan 20, 2025 | Image SegmentationInteractive Segmentation | CodeCode Available | 1 |
| Glinthawk: A Two-Tiered Architecture for Offline LLM Inference | Jan 20, 2025 | CPULanguage Modeling | CodeCode Available | 1 |
| Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation | Jan 20, 2025 | Computational Efficiency | CodeCode Available | 1 |
| PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth Cues | Jan 20, 2025 | Motion CompensationMulti-Object Tracking | CodeCode Available | 1 |
| Control LLM: Controlled Evolution for Intelligence Retention in LLM | Jan 19, 2025 | MathMathematical Reasoning | CodeCode Available | 1 |
| Synthetic Data Generation by Supervised Neural Gas Network for Physiological Emotion Recognition Data | Jan 19, 2025 | EEGEmotion Recognition | CodeCode Available | 1 |
| InsQABench: Benchmarking Chinese Insurance Domain Question Answering with Large Language Models | Jan 19, 2025 | BenchmarkingQuestion Answering | CodeCode Available | 1 |
| ChaosEater: Fully Automating Chaos Engineering with Large Language Models | Jan 19, 2025 | Code Generation | CodeCode Available | 1 |
| AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model | Jan 19, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Tell me about yourself: LLMs are aware of their learned behaviors | Jan 19, 2025 | | CodeCode Available | 1 |
| BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution | Jan 19, 2025 | Optical Flow EstimationSSIM | CodeCode Available | 1 |
| GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human | Jan 19, 2025 | Text Detection | CodeCode Available | 1 |
| A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial Differences | Jan 19, 2025 | Change DetectionEarth Observation | CodeCode Available | 1 |
| The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs | Jan 19, 2025 | | CodeCode Available | 1 |
| Simultaneous Computation with Multiple Prioritizations in Multi-Agent Motion Planning | Jan 18, 2025 | Motion PlanningMulti-Agent Path Finding | CodeCode Available | 1 |
| Graph Coloring to Reduce Computation Time in Prioritized Planning | Jan 18, 2025 | Motion PlanningMulti-Agent Path Finding | CodeCode Available | 1 |
| LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection | Jan 18, 2025 | Contrastive LearningDecoder | CodeCode Available | 1 |
| Dynamic Trend Fusion Module for Traffic Flow Prediction | Jan 18, 2025 | PredictionTraffic Prediction | CodeCode Available | 1 |
| Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention | Jan 18, 2025 | Image SegmentationSegmentation | CodeCode Available | 1 |
| MedFILIP: Medical Fine-grained Language-Image Pre-training | Jan 18, 2025 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor | Jan 17, 2025 | | CodeCode Available | 1 |
| Evaluation and Efficiency Comparison of Evolutionary Algorithms for Service Placement Optimization in Fog Architectures | Jan 17, 2025 | DiversityEvolutionary Algorithms | CodeCode Available | 1 |
| GenSC-6G: A Prototype Testbed for Integrated Generative AI, Quantum, and Semantic Communication | Jan 17, 2025 | Semantic Communication | CodeCode Available | 1 |
| Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks | Jan 17, 2025 | Few-Shot Semantic SegmentationSegmentation | CodeCode Available | 1 |
| MSTS: A Multimodal Safety Test Suite for Vision-Language Models | Jan 17, 2025 | | CodeCode Available | 1 |
| When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis | Jan 17, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| Agent-as-Judge for Factual Summarization of Long Narratives | Jan 17, 2025 | Long-Form Narrative Summarization | CodeCode Available | 1 |
| The R-Vessel-X Project | Jan 17, 2025 | AnatomySegmentation | CodeCode Available | 1 |
| Aneumo: A Large-Scale Comprehensive Synthetic Dataset of Aneurysm Hemodynamics | Jan 17, 2025 | | CodeCode Available | 1 |
| PandaSkill -- Player Performance and Skill Rating in Esports: Application to League of Legends | Jan 17, 2025 | | CodeCode Available | 1 |
| landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images | Jan 17, 2025 | Pose Estimation | CodeCode Available | 1 |
| AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations | Jan 17, 2025 | Contrastive LearningNavigate | CodeCode Available | 1 |
| Surrogate-based multiscale analysis of experiments on thermoplastic composites under off-axis loading | Jan 17, 2025 | Transfer Learning | CodeCode Available | 1 |
| FaceXBench: Evaluating Multimodal LLMs on Face Understanding | Jan 17, 2025 | FairnessMultiple-choice | CodeCode Available | 1 |
| MechIR: A Mechanistic Interpretability Framework for Information Retrieval | Jan 17, 2025 | DiagnosticInformation Retrieval | CodeCode Available | 1 |
| A Unified Comparative Study with Generalized Conformity Scores for Multi-Output Conformal Regression | Jan 17, 2025 | Conformal PredictionPrediction | CodeCode Available | 1 |
| OpticFusion: Multi-Modal Neural Implicit 3D Reconstruction of Microstructures by Fusing White Light Interferometry and Optical Microscopy | Jan 16, 2025 | 3D geometry3D Reconstruction | CodeCode Available | 1 |
| HSPFormer: Hierarchical Spatial Perception Transformer for Semantic Segmentation | Jan 16, 2025 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 1 |
| DSTIGCN: Deformable Spatial-Temporal Interaction Graph Convolution Network for Pedestrian Trajectory Prediction | Jan 16, 2025 | Autonomous DrivingPedestrian Trajectory Prediction | CodeCode Available | 1 |
| Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging | Jan 16, 2025 | | CodeCode Available | 1 |
| Lossy Compression with Pretrained Diffusion Models | Jan 16, 2025 | | CodeCode Available | 1 |