| Geometry-Informed Neural Operator Transformer | Apr 28, 2025 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text | Apr 28, 2025 | Benchmarking | CodeCode Available | 1 |
| DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Apr 28, 2025 | Generative Adversarial Networkparameter-efficient fine-tuning | CodeCode Available | 1 |
| Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer | Apr 28, 2025 | Monocular 3D Object LocalizationSports Analytics | CodeCode Available | 1 |
| Simplified and Secure MCP Gateways for Enterprise AI Integration | Apr 28, 2025 | Intrusion Detection | CodeCode Available | 1 |
| AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers | Apr 28, 2025 | Code Generation | CodeCode Available | 1 |
| mrCAD: Multimodal Refinement of Computer-aided Designs | Apr 28, 2025 | | CodeCode Available | 1 |
| Efficient Reasoning for LLMs through Speculative Chain-of-Thought | Apr 27, 2025 | GSM8KMath | CodeCode Available | 1 |
| Relative Contrastive Learning for Sequential Recommendation with Similarity-based Positive Pair Selection | Apr 27, 2025 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| Neurosymbolic Association Rule Mining from Tabular Data | Apr 27, 2025 | Interpretable Machine Learning | CodeCode Available | 1 |
| LRFusionPR: A Polar BEV-Based LiDAR-Radar Fusion Network for Place Recognition | Apr 27, 2025 | Autonomous Driving | CodeCode Available | 1 |
| AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings | Apr 27, 2025 | Sequential Recommendation | CodeCode Available | 1 |
| AndroidGen: Building an Android Language Agent under Data Scarcity | Apr 27, 2025 | | CodeCode Available | 1 |
| Semantic-Aligned Learning with Collaborative Refinement for Unsupervised VI-ReID | Apr 27, 2025 | Contrastive LearningPerson Re-Identification | CodeCode Available | 1 |
| ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development | Apr 27, 2025 | Code GenerationDomain Adaptation | CodeCode Available | 1 |
| Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation | Apr 27, 2025 | RAGRetrieval | CodeCode Available | 1 |
| R-Sparse R-CNN: SAR Ship Detection Based on Background-Aware Sparse Learnable Proposals | Apr 26, 2025 | SAR Ship Detection | CodeCode Available | 1 |
| TSRM: A Lightweight Temporal Feature Encoding Architecture for Time Series Forecasting and Imputation | Apr 26, 2025 | ImputationMultivariate Time Series Forecasting | CodeCode Available | 1 |
| Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation | Apr 26, 2025 | Survival Analysiswhole slide images | CodeCode Available | 1 |
| Neurophysiologically Realistic Environment for Comparing Adaptive Deep Brain Stimulation Algorithms in Parkinson Disease | Apr 26, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Clinical knowledge in LLMs does not translate to human interactions | Apr 26, 2025 | Clinical Knowledge | CodeCode Available | 1 |
| CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval | Apr 26, 2025 | Meta-LearningPerson Retrieval | CodeCode Available | 1 |
| Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization | Apr 25, 2025 | Spatial Reasoning | CodeCode Available | 1 |
| Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers | Apr 25, 2025 | Large Language Model | CodeCode Available | 1 |
| Task-Oriented Communications for Visual Navigation with Edge-Aerial Collaboration in Low Altitude Economy | Apr 25, 2025 | Visual Navigation | CodeCode Available | 1 |
| PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models | Apr 25, 2025 | 3D ReconstructionObject Tracking | CodeCode Available | 1 |
| Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional | Apr 25, 2025 | Denoising | CodeCode Available | 1 |
| What is the Added Value of UDA in the VFM Era? | Apr 25, 2025 | Autonomous DrivingDomain Adaptation | CodeCode Available | 1 |
| DOSE : Drum One-Shot Extraction from Music Mixture | Apr 25, 2025 | FAD | CodeCode Available | 1 |
| A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Apr 25, 2025 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method | Apr 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS | Apr 25, 2025 | Clinical Language TranslationMachine Translation | CodeCode Available | 1 |
| Action Flow Matching for Continual Robot Learning | Apr 25, 2025 | Continual Learning | CodeCode Available | 1 |
| E-InMeMo: Enhanced Prompting for Visual In-Context Learning | Apr 25, 2025 | Foreground SegmentationIn-Context Learning | CodeCode Available | 1 |
| VideoMultiAgents: A Multi-Agent Framework for Video Question Answering | Apr 25, 2025 | Caption GenerationEgoSchema | CodeCode Available | 1 |
| Mamba-Sea: A Mamba-based Framework with Global-to-Local Sequence Augmentation for Generalizable Medical Image Segmentation | Apr 24, 2025 | Domain GeneralizationImage Segmentation | CodeCode Available | 1 |
| TableCenterNet: A one-stage network for table structure recognition | Apr 24, 2025 | Computational Efficiency | CodeCode Available | 1 |
| PhysioSync: Temporal and Cross-Modal Contrastive Learning Inspired by Physiological Synchronization for EEG-Based Emotion Recognition | Apr 24, 2025 | Contrastive LearningEEG | CodeCode Available | 1 |
| iVR-GS: Inverse Volume Rendering for Explorable Visualization via Editable 3D Gaussian Splatting | Apr 24, 2025 | 3DGSNovel View Synthesis | CodeCode Available | 1 |
| Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency | Apr 24, 2025 | BenchmarkingMath | CodeCode Available | 1 |
| FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding | Apr 24, 2025 | document understandingMME | CodeCode Available | 1 |
| Beyond Cox Models: Assessing the Performance of Machine-Learning Methods in Non-Proportional Hazards and Non-Linear Survival Analysis | Apr 24, 2025 | Survival AnalysisSurvival Prediction | CodeCode Available | 1 |
| Quadratic Interest Network for Multimodal Click-Through Rate Prediction | Apr 24, 2025 | Click-Through Rate PredictionMultimodal Recommendation | CodeCode Available | 1 |
| A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation | Apr 24, 2025 | Decision MakingRAG | CodeCode Available | 1 |
| LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams | Apr 24, 2025 | Long-Context UnderstandingSpoken Language Understanding | CodeCode Available | 1 |
| CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos | Apr 24, 2025 | 3DGSNeRF | CodeCode Available | 1 |
| A Comprehensive Survey of Synthetic Tabular Data Generation | Apr 23, 2025 | Privacy PreservingSurvey | CodeCode Available | 1 |
| IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery | Apr 23, 2025 | scientific discovery | CodeCode Available | 1 |
| Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution | Apr 23, 2025 | Task Planning | CodeCode Available | 1 |
| VideoVista-CulturalLingo: 360^ Horizons-Bridging Cultures, Languages, and Domains in Video Comprehension | Apr 23, 2025 | | CodeCode Available | 1 |