Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models Jun 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection Jun 16, 2025 Data Augmentation Large Language Model
— Unverified 0HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration Jun 12, 2025 CPU Data Augmentation
— Unverified 0Self-Adapting Language Models Jun 12, 2025 Data Augmentation
— Unverified 0DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers Jun 12, 2025 Data Augmentation Marketing
— Unverified 0Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models Jun 11, 2025 Data Augmentation Decision Making
— Unverified 0CINeMA: Conditional Implicit Neural Multi-Modal Atlas for a Spatio-Temporal Representation of the Perinatal Brain Jun 11, 2025 Data Augmentation Image Registration
Code Code Available 0ScoreMix: Improving Face Recognition via Score Composition in Diffusion Generators Jun 11, 2025 Data Augmentation Face Recognition
— Unverified 0An Explainable Deep Learning Framework for Brain Stroke and Tumor Progression via MRI Interpretation Jun 10, 2025 Anomaly Detection Data Augmentation
— Unverified 0SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research Jun 10, 2025 Automatic Speech Recognition Data Augmentation
— Unverified 0scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell Data Jun 10, 2025 Benchmarking Data Augmentation
Code Code Available 1GFRIEND: Generative Few-shot Reward Inference through EfficieNt DPO Jun 10, 2025 Data Augmentation Model Optimization
Code Code Available 0SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging Segmentation Jun 10, 2025 Data Augmentation Image Segmentation
Code Code Available 0MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning Jun 10, 2025 Data Augmentation model
Code Code Available 0Data-Efficient Challenges in Visual Inductive Priors: A Retrospective Jun 10, 2025 Data Augmentation Deep Learning
— Unverified 0Data Augmentation For Small Object using Fast AutoAugment Jun 10, 2025 Data Augmentation Object
— Unverified 0Learning to Hear Broken Motors: Signature-Guided Data Augmentation for Induction-Motor Diagnostics Jun 10, 2025 Data Augmentation Diagnostic
— Unverified 0Spatiotemporal deep learning models for detection of rapid intensification in cyclones Jun 10, 2025 Data Augmentation Deep Learning
— Unverified 0Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholding Jun 9, 2025 Data Augmentation
Code Code Available 0Scaling Human Activity Recognition: A Comparative Evaluation of Synthetic Data Generation and Augmentation Techniques Jun 9, 2025 Activity Recognition Data Augmentation
— Unverified 0DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO Jun 9, 2025 Data Augmentation Large Language Model
— Unverified 0Dealing with the Evil Twins: Improving Random Augmentation by Addressing Catastrophic Forgetting of Diverse Augmentations Jun 9, 2025 Data Augmentation Domain Generalization
— Unverified 0Deep Inertial Pose: A deep learning approach for human pose estimation Jun 7, 2025 Data Augmentation Pose Estimation
— Unverified 0Robust sensor fusion against on-vehicle sensor staleness Jun 6, 2025 Autonomous Vehicles Data Augmentation
— Unverified 0Securing Traffic Sign Recognition Systems in Autonomous Vehicles Jun 6, 2025 Autonomous Vehicles Data Augmentation
— Unverified 0Geometric and Physical Constraints Synergistically Enhance Neural PDE Surrogates Jun 5, 2025 Data Augmentation
— Unverified 0Model-based Neural Data Augmentation for sub-wavelength Radio Localization Jun 5, 2025 Data Augmentation
— Unverified 0PixCell: A generative foundation model for digital histopathology images Jun 5, 2025 Cell Segmentation Data Augmentation
— Unverified 0IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation Jun 5, 2025 Data Augmentation Translation
— Unverified 0Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models Jun 5, 2025 counterfactual Data Augmentation
Code Code Available 0LLM-based phoneme-to-grapheme for phoneme-based speech recognition Jun 5, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation Jun 5, 2025 Code Generation Code Translation
— Unverified 0Person Re-Identification System at Semantic Level based on Pedestrian Attributes Ontology Jun 4, 2025 Attribute Data Augmentation
— Unverified 0Fine-Tuning Video Transformers for Word-Level Bangla Sign Language: A Comparative Analysis for Classification Tasks Jun 4, 2025 Data Augmentation Model Selection
— Unverified 0A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions Jun 4, 2025 Data Augmentation Diversity
— Unverified 0MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching Jun 3, 2025 Data Augmentation Instruction Following
— Unverified 0Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness Jun 3, 2025 Data Augmentation Object Recognition
— Unverified 0Simple, Good, Fast: Self-Supervised World Models Free of Baggage Jun 3, 2025 Data Augmentation Representation Learning
Code Code Available 1MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models Jun 3, 2025 Bilevel Optimization Data Augmentation
Code Code Available 0How Explanations Leak the Decision Logic: Stealing Graph Neural Networks via Explanation Alignment Jun 3, 2025 Data Augmentation Drug Discovery
Code Code Available 0OmniV2V: Versatile Video Generation and Editing via Dynamic Content Manipulation Jun 2, 2025 Data Augmentation Human Animation
Code Code Available 5Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation Jun 2, 2025 Data Augmentation Retinal Vessel Segmentation
Code Code Available 03D Skeleton-Based Action Recognition: A Review Jun 1, 2025 Action Recognition Data Augmentation
— Unverified 0Lightweight Convolutional Neural Networks for Retinal Disease Classification May 30, 2025 Classification Data Augmentation
— Unverified 0Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation May 30, 2025 Data Augmentation Domain Adaptation
Code Code Available 0Leveraging Intermediate Features of Vision Transformer for Face Anti-Spoofing May 30, 2025 Data Augmentation Face Anti-Spoofing
— Unverified 0Reinforcing Video Reasoning with Focused Thinking May 30, 2025 Data Augmentation Visual Reasoning
Code Code Available 1Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation May 30, 2025 Autonomous Driving Contrastive Learning
Code Code Available 0SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds May 30, 2025 Data Augmentation Instance Segmentation
— Unverified 0Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC May 30, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0