Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal Feb 26, 2025 Dataset Generation Knowledge Graphs
— Unverified 0SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations Feb 24, 2025 Change Detection Dataset Generation
— Unverified 0Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking Feb 21, 2025 Dataset Generation Fact Checking
Code Code Available 0Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios Feb 20, 2025 3D Object Detection Autonomous Driving
Code Code Available 0TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation Feb 19, 2025 Dataset Generation GSM8K
Code Code Available 0One-Shot Federated Learning with Classifier-Free Diffusion Models Feb 12, 2025 Benchmarking Dataset Generation
— Unverified 0MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation Feb 6, 2025 Dataset Generation Image to 3D
— Unverified 0Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning Feb 6, 2025 Dataset Generation MuJoCo
— Unverified 0Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes Jan 31, 2025 Anomaly Detection Dataset Generation
— Unverified 0iTRI-QA: a Toolset for Customized Question-Answer Dataset Generation Using Language Models for Enhanced Scientific Research Jan 27, 2025 Dataset Generation
— Unverified 0Measuring and Mitigating Hallucinations in Vision-Language Dataset Generation for Remote Sensing Jan 24, 2025 Caption Generation Dataset Generation
— Unverified 0E-Gen: Leveraging E-Graphs to Improve Continuous Representations of Symbolic Expressions Jan 24, 2025 Contrastive Learning Dataset Generation
Code Code Available 0A Dataset Generation Toolbox for Dynamic Security Assessment: On the Role of the Security Boundary Jan 16, 2025 Dataset Generation
Code Code Available 0The Invisible Hand: Unveiling Provider Bias in Large Language Models for Code Generation Jan 14, 2025 Code Generation Dataset Generation
— Unverified 0CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models Jan 9, 2025 Cell Segmentation Dataset Generation
Code Code Available 2Neural Error Covariance Estimation for Precise LiDAR Localization Jan 5, 2025 Autonomous Vehicles Dataset Generation
— Unverified 0CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models Jan 2, 2025 Benchmarking Computer Security
Code Code Available 1Low-Biased General Annotated Dataset Generation Jan 1, 2025 Dataset Generation Image Generation
— Unverified 0DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI Jan 1, 2025 Dataset Generation Diversity
— Unverified 0ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation Dec 24, 2024 Dataset Generation
Code Code Available 1Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner Dec 24, 2024 Autonomous Driving Dataset Generation
Code Code Available 1Movie2Story: A framework for understanding videos and telling stories in the form of novel text Dec 19, 2024 Dataset Generation Fairness
— Unverified 0Cognition Chain for Explainable Psychological Stress Detection on Social Media Dec 18, 2024 Dataset Generation
Code Code Available 0SciFaultyQA: Benchmarking LLMs on Faulty Science Question Detection with a GAN-Inspired Approach to Synthetic Dataset Generation Dec 16, 2024 Benchmarking Dataset Generation
Code Code Available 0Unbiased General Annotated Dataset Generation Dec 14, 2024 Dataset Generation Image Generation
— Unverified 0VariFace: Fair and Diverse Synthetic Dataset Generation for Face Recognition Dec 9, 2024 Dataset Generation Diversity
— Unverified 0JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM Dec 9, 2024 Dataset Generation Zero-Shot Learning
Code Code Available 0SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction Dec 5, 2024 Articles Dataset Generation
Code Code Available 0An Evolutionary Large Language Model for Hallucination Mitigation Dec 3, 2024 Dataset Generation Hallucination
— Unverified 0SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models Dec 3, 2024 Dataset Generation Image-to-Image Translation
Code Code Available 1Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems Nov 29, 2024 Dataset Generation RAG
— Unverified 0Global Tensor Motion Planning Nov 28, 2024 Dataset Generation Diversity
Code Code Available 1OpenLS-DGF: An Adaptive Open-Source Dataset Generation Framework for Machine Learning Tasks in Logic Synthesis Nov 14, 2024 Dataset Generation
Code Code Available 1HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere Nov 13, 2024 Benchmarking Dataset Generation
— Unverified 0Drone Detection using Deep Neural Networks Trained on Pure Synthetic Data Nov 13, 2024 Dataset Generation
Code Code Available 0Physics Informed Distillation for Diffusion Models Nov 13, 2024 Dataset Generation Image Generation
Code Code Available 2CorrSynth -- A Correlated Sampling Method for Diverse Dataset Generation from LLMs Nov 13, 2024 Dataset Generation Diversity
— Unverified 0Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models Nov 10, 2024 Dataset Generation Machine Translation
— Unverified 0Fairness-Utilization Trade-off in Wireless Networks with Explainable Kolmogorov-Arnold Networks Nov 4, 2024 Dataset Generation Fairness
— Unverified 0Simulating User Agents for Embodied Conversational-AI Oct 31, 2024 Dataset Generation Large Language Model
— Unverified 0SYNOSIS: Image synthesis pipeline for machine vision in metal surface inspection Oct 18, 2024 Dataset Generation Diversity
— Unverified 0FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs Oct 17, 2024 Dataset Generation Knowledge Distillation
— Unverified 0Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation Oct 17, 2024 Dataset Generation Decision Making
— Unverified 0Anchored Alignment for Self-Explanations Enhancement Oct 17, 2024 Dataset Generation
— Unverified 0Autonomous Self-Trained Channel State Prediction Method for mmWave Vehicular Communications Oct 3, 2024 Dataset Generation Prediction
— Unverified 0HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations Sep 28, 2024 Dataset Generation Informativeness
— Unverified 0EarthquakeNPP: Benchmark Datasets for Earthquake Forecasting with Neural Point Processes Sep 27, 2024 Benchmarking Dataset Generation
— Unverified 0Towards Synthetic Data Generation for Improved Pain Recognition in Videos under Patient Constraints Sep 24, 2024 Dataset Generation Privacy Preserving
Code Code Available 0Harnessing LLMs for API Interactions: A Framework for Classification and Synthetic Data Generation Sep 18, 2024 Dataset Generation Management
— Unverified 0Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors Sep 4, 2024 Attribute Dataset Generation
— Unverified 0