| Theorem Prover as a Judge for Synthetic Data Generation | Feb 18, 2025 | Mathematical Proofs, Mathematical Reasoning | —Unverified | 0 | 0 |
| The Prompt is Mightier than the Example | May 24, 2025 | In-Context Learning, Synthetic Data Generation | —Unverified | 0 | 0 |
| The Synthetic Mirror -- Synthetic Data at the Age of Agentic AI | Jun 15, 2025 | Synthetic Data Generation | —Unverified | 0 | 0 |
| The Volctrans Machine Translation System for WMT20 | Oct 28, 2020 | Machine Translation, Synthetic Data Generation | —Unverified | 0 | 0 |
| Time Series Language Model for Descriptive Caption Generation | Jan 3, 2025 | Caption Generation, Denoising | —Unverified | 0 | 0 |
| Towards a framework on tabular synthetic data generation: a minimalist approach: theory, use cases, and limitations | Nov 17, 2024 | Decoder, Synthetic Data Generation | —Unverified | 0 | 0 |
| Towards Synthetic Multivariate Time Series Generation for Flare Forecasting | May 16, 2021 | Generative Adversarial Network, Synthetic Data Generation | —Unverified | 0 | 0 |
| Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought | Jan 8, 2025 | Synthetic Data Generation | —Unverified | 0 | 0 |
| Transitioning from Real to Synthetic data: Quantifying the bias in model | May 10, 2021 | Fairness, Synthetic Data Generation | —Unverified | 0 | 0 |
| Trustable and Automated Machine Learning Running with Blockchain and Its Applications | Aug 14, 2019 | BIG-bench Machine Learning, Fraud Detection | —Unverified | 0 | 0 |
| TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification | Jun 25, 2024 | image-classification, Image Classification | —Unverified | 0 | 0 |
| Tubular Shape Aware Data Generation for Semantic Segmentation in Medical Imaging | Oct 2, 2020 | Generative Adversarial Network, Semantic Segmentation | —Unverified | 0 | 0 |
| TUTOR: Training Neural Networks Using Decision Rules as Model Priors | Oct 12, 2020 | Synthetic Data Generation | —Unverified | 0 | 0 |
| Typhoon T1: An Open Thai Reasoning Model | Feb 13, 2025 | model, Synthetic Data Generation | —Unverified | 0 | 0 |
| UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception | Oct 25, 2023 | Data Augmentation, Image Generation | —Unverified | 0 | 0 |
| Uni-AIMS: AI-Powered Microscopy Image Analysis | May 11, 2025 | Synthetic Data Generation | —Unverified | 0 | 0 |
| Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing | Apr 27, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Unlocking Spatial Comprehension in Text-to-Image Diffusion Models | Nov 28, 2023 | Attribute, Image Generation | —Unverified | 0 | 0 |
| Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data | Jun 10, 2025 | Decision Making, Information Retrieval | —Unverified | 0 | 0 |
| Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components | Oct 29, 2024 | Disentanglement, Robust classification | —Unverified | 0 | 0 |
| Unsupervised and Interpretable Synthesizing for Electrical Time Series Based on Information Maximizing Generative Adversarial Nets | Jul 18, 2024 | Descriptive, Synthetic Data Generation | —Unverified | 0 | 0 |
| Unsupervised Data Validation Methods for Efficient Model Training | Oct 10, 2024 | Data Augmentation, model | —Unverified | 0 | 0 |
| Unsupervised Domain Transfer with Conditional Invertible Neural Networks | Mar 17, 2023 | Image Generation, Medical Image Generation | —Unverified | 0 | 0 |
| Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition | Sep 18, 2019 | Data Augmentation, Handwritten Text Recognition | —Unverified | 0 | 0 |
| Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities | Oct 31, 2024 | Face Recognition, Inference Attack | —Unverified | 0 | 0 |
| UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues | Apr 23, 2024 | 3D Human Pose Estimation, Multi-view 3D Human Pose Estimation | —Unverified | 0 | 0 |
| User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation | Jan 8, 2025 | Synthetic Data Generation, User Simulation | —Unverified | 0 | 0 |
| Utility Theory of Synthetic Data Generation | May 17, 2023 | Synthetic Data Generation | —Unverified | 0 | 0 |
| Value Alignment from Unstructured Text | Aug 19, 2024 | Synthetic Data Generation | —Unverified | 0 | 0 |
| Variational Autoencoder Generative Adversarial Network for Synthetic Data Generation in Smart Home | Jan 19, 2022 | Generative Adversarial Network, Synthetic Data Generation | —Unverified | 0 | 0 |
| Variational Autoencoders for Generative Modelling of Water Cherenkov Detectors | Nov 1, 2019 | Synthetic Data Generation | —Unverified | 0 | 0 |
| VietMix: A Naturally Occurring Vietnamese-English Code-Mixed Corpus with Iterative Augmentation for Machine Translation | May 30, 2025 | Machine Translation, Synthetic Data Generation | —Unverified | 0 | 0 |
| ViFu: Multiple 360° Objects Reconstruction with Clean Background via Visible Part Fusion | Apr 15, 2024 | Novel View Synthesis, Synthetic Data Generation | —Unverified | 0 | 0 |
| Virtual passengers for real car solutions: synthetic datasets | May 13, 2022 | Synthetic Data Generation | —Unverified | 0 | 0 |
| Virtual Temporal Samples for Recurrent Neural Networks: applied to semantic segmentation in agriculture | Jun 18, 2021 | Data Augmentation, Segmentation | —Unverified | 0 | 0 |
| Watermarking Generative Categorical Data | Nov 16, 2024 | Synthetic Data Generation | —Unverified | 0 | 0 |
| Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection | Jul 21, 2024 | Contrastive Learning, object-detection | —Unverified | 0 | 0 |
| WeChat Neural Machine Translation Systems for WMT20 | Oct 1, 2020 | Knowledge Distillation, Machine Translation | —Unverified | 0 | 0 |
| WeChat Neural Machine Translation Systems for WMT21 | Aug 5, 2021 | Knowledge Distillation, Machine Translation | —Unverified | 0 | 0 |
| Well log data generation and imputation using sequence-based generative adversarial networks | Dec 1, 2024 | Imputation, Synthetic Data Generation | —Unverified | 0 | 0 |
| What Makes and Breaks Safety Fine-tuning? A Mechanistic Study | Jul 14, 2024 | Synthetic Data Generation | —Unverified | 0 | 0 |
| When in Doubt, Cascade: Towards Building Efficient and Capable Guardrails | Jul 8, 2024 | Synthetic Data Generation | —Unverified | 0 | 0 |
| Which is the best model for my data? | Oct 26, 2022 | Feature Importance, Meta-Learning | —Unverified | 0 | 0 |
| Winning Amazon KDD Cup'24 | Aug 5, 2024 | Data Augmentation, Multiple-choice | —Unverified | 0 | 0 |
| XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation | Mar 29, 2025 | 8k, Synthetic Data Generation | —Unverified | 0 | 0 |
| Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection | Jul 16, 2024 | Cross-Lingual Transfer, Grammatical Error Detection | —Unverified | 0 | 0 |
| zGAN: An Outlier-focused Generative Adversarial Network For Realistic Synthetic Data Generation | Oct 28, 2024 | Binary Classification, Generative Adversarial Network | —Unverified | 0 | 0 |
| Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation | Jun 25, 2024 | Clustering, Image Segmentation | —Unverified | 0 | 0 |
| Algorithms for Collaborative Machine Learning under Statistical Heterogeneity | Jul 31, 2024 | Federated Learning, Synthetic Data Generation | —Unverified | 0 | 0 |
| ABC Align: Large Language Model Alignment for Safety & Accuracy | Aug 1, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |