| Terrier: A Deep Learning Repeat Classifier | Mar 12, 2025 | Deep Learning | CodeCode Available | 1 |
| Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching | Mar 12, 2025 | FairnessFederated Learning | CodeCode Available | 1 |
| CoRe^2: Collect, Reflect and Refine to Generate Better and Faster | Mar 12, 2025 | | CodeCode Available | 1 |
| BIMBA: Selective-Scan Compression for Long-Range Video Question Answering | Mar 12, 2025 | Video Question AnsweringZero-Shot Video Question Answer | CodeCode Available | 1 |
| Revisiting semi-supervised learning in the era of foundation models | Mar 12, 2025 | parameter-efficient fine-tuningPseudo Label | CodeCode Available | 1 |
| Robust Multimodal Survival Prediction with the Latent Differentiation Conditional Variational AutoEncoder | Mar 12, 2025 | Survival Predictionwhole slide images | CodeCode Available | 1 |
| RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling | Mar 12, 2025 | 3D GenerationText to 3D | CodeCode Available | 1 |
| Prompt to Restore, Restore to Prompt: Cyclic Prompting for Universal Adverse Weather Removal | Mar 12, 2025 | Image RestorationPrompt Learning | CodeCode Available | 1 |
| AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks | Mar 12, 2025 | DenoisingSSIM | CodeCode Available | 1 |
| AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents | Mar 12, 2025 | | CodeCode Available | 1 |
| CyberLLMInstruct: A New Dataset for Analysing Safety of Fine-Tuned LLMs Using Cyber Security Data | Mar 12, 2025 | Adversarial AttackMalware Analysis | CodeCode Available | 1 |
| CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection | Mar 12, 2025 | BenchmarkingCode Classification | CodeCode Available | 1 |
| Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with LLMs | Mar 12, 2025 | Recommendation Systems | CodeCode Available | 1 |
| Motion Blender Gaussian Splatting for Dynamic Scene Reconstruction | Mar 12, 2025 | Dynamic ReconstructionSimulated Gaussian Manipulation | CodeCode Available | 1 |
| How Well Does Your Tabular Generator Learn the Structure of Tabular Data? | Mar 12, 2025 | | CodeCode Available | 1 |
| MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration | Mar 12, 2025 | Image RestorationSpectral Reconstruction | CodeCode Available | 1 |
| PerCoV2: Improved Ultra-Low Bit-Rate Perceptual Image Compression with Implicit Hierarchical Masked Image Modeling | Mar 12, 2025 | Image Compression | CodeCode Available | 1 |
| MOAT: Evaluating LMMs for Capability Integration and Instruction Grounding | Mar 12, 2025 | | CodeCode Available | 1 |
| AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models | Mar 11, 2025 | Motion Generationmotion in-betweening | CodeCode Available | 1 |
| Regulatory DNA sequence Design with Reinforcement Learning | Mar 11, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels | Mar 11, 2025 | 3D Object DetectionObject | CodeCode Available | 1 |
| CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving | Mar 11, 2025 | Autonomous Driving | CodeCode Available | 1 |
| Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection | Mar 11, 2025 | Federated Learning | CodeCode Available | 1 |
| EgoBlind: Towards Egocentric Visual Assistance for the Blind | Mar 11, 2025 | | CodeCode Available | 1 |
| NullFace: Training-Free Localized Face Anonymization | Mar 11, 2025 | AttributeFace Anonymization | CodeCode Available | 1 |
| X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction | Mar 11, 2025 | 3D ReconstructionComputed Tomography (CT) | CodeCode Available | 1 |
| VFM-UDA++: Improving Network Architectures and Data Strategies for Unsupervised Domain Adaptive Semantic Segmentation | Mar 11, 2025 | Domain AdaptationSemantic Segmentation | CodeCode Available | 1 |
| BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Mar 11, 2025 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation | Mar 11, 2025 | | CodeCode Available | 1 |
| Rethinking Diffusion Model in High Dimension | Mar 11, 2025 | model | CodeCode Available | 1 |
| PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability | Mar 11, 2025 | Visual Reasoning | CodeCode Available | 1 |
| Aligning Text to Image in Diffusion Models is Easier Than You Think | Mar 11, 2025 | Contrastive LearningImage Generation | CodeCode Available | 1 |
| MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution | Mar 11, 2025 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 1 |
| SAS: Segment Any 3D Scene with Integrated 2D Priors | Mar 11, 2025 | Instance SegmentationSemantic Segmentation | CodeCode Available | 1 |
| ^RFLAV: Rolling Flow matching for infinite Audio Video generation | Mar 11, 2025 | Video Generation | CodeCode Available | 1 |
| Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention | Mar 11, 2025 | In-Context LearningRetrieval | CodeCode Available | 1 |
| Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing | Mar 11, 2025 | Compressive SensingImage Compressed Sensing | CodeCode Available | 1 |
| VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion | Mar 11, 2025 | Image MattingVideo Alignment | CodeCode Available | 1 |
| Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies | Mar 11, 2025 | Conformal PredictionImitation Learning | CodeCode Available | 1 |
| Enhancing Large Language Models for Hardware Verification: A Novel SystemVerilog Assertion Dataset | Mar 11, 2025 | | CodeCode Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| Towards Interpretable Protein Structure Prediction with Sparse Autoencoders | Mar 11, 2025 | PredictionProtein Structure Prediction | CodeCode Available | 1 |
| EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments | Mar 11, 2025 | | CodeCode Available | 1 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | DenoisingDescriptive | CodeCode Available | 1 |
| Chain-of-Thought Reasoning In The Wild Is Not Always Faithful | Mar 11, 2025 | | CodeCode Available | 1 |
| CFNet: Optimizing Remote Sensing Change Detection through Content-Aware Enhancement | Mar 11, 2025 | Change Detection | CodeCode Available | 1 |
| AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification | Mar 11, 2025 | Person Re-IdentificationVideo-Based Person Re-Identification | CodeCode Available | 1 |
| Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis | Mar 11, 2025 | AllDataset Generation | CodeCode Available | 1 |
| STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive Applications | Mar 11, 2025 | Anomaly DetectionAnomaly Detection In Surveillance Videos | CodeCode Available | 1 |
| Source-free domain adaptation based on label reliability for cross-domain bearing fault diagnosis | Mar 11, 2025 | Data AugmentationDomain Adaptation | CodeCode Available | 1 |