SOTAVerified

Diversity

Diversity in data sampling is crucial across various use cases, including search, recommendation systems, and more. Ensuring diverse samples means capturing a wide range of variations and perspectives, which leads to more robust, unbiased, and comprehensive models. In search use cases, for instance, diversity helps avoid redundancy, ensuring that users are exposed to a broader set of relevant information rather than repeated similar results.

Papers

Showing 501550 of 9051 papers

TitleStatusHype
Improving Geo-diversity of Generated Images with Contextualized Vendi Score GuidanceCode1
Rethinking Guidance Information to Utilize Unlabeled Samples:A Label Encoding PerspectiveCode1
CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake DetectionCode1
Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language ModelsCode1
Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion PriorCode1
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified FlowCode1
Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language ModelCode1
Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs DistillationCode1
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source DataCode1
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing ConstraintsCode1
DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI DataCode1
Learning diverse attacks on large language models for robust red-teaming and safety tuningCode1
Dataset GrowthCode1
Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word ExclusionCode1
Fair Federated Learning under Domain Skew with Local Consistency and Domain DiversityCode1
Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse GranularityCode1
Graph Neural PDE Solvers with Conservation and Similarity-EquivarianceCode1
USD: Unsupervised Soft Contrastive Learning for Fault Detection in Multivariate Time SeriesCode1
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign UsersCode1
Controlling Behavioral Diversity in Multi-Agent Reinforcement LearningCode1
Learning to Transform Dynamically for Better Adversarial TransferabilityCode1
DirectMultiStep: Direct Route Generation for Multi-Step RetrosynthesisCode1
Mosaic-IT: Free Compositional Data Augmentation Improves Instruction TuningCode1
Annotation-Efficient Preference Optimization for Language Model AlignmentCode1
Addressing the Elephant in the Room: Robust Animal Re-Identification with Unsupervised Part-Based Feature AlignmentCode1
G-DIG: Towards Gradient-based Diverse and High-quality Instruction Data Selection for Machine TranslationCode1
Goals as Reward-Producing ProgramsCode1
PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB ImagesCode1
SynthesizRR: Generating Diverse Datasets with Retrieval AugmentationCode1
Color Space Learning for Cross-Color Person Re-IdentificationCode1
Cross-Domain Feature Augmentation for Domain GeneralizationCode1
Treatment Effect Estimation for User Interest Exploration on Recommender SystemsCode1
TAI++: Text as Image for Multi-Label Image Classification by Co-Learning Transferable PromptCode1
BenthicNet: A global compilation of seafloor images for deep learning applicationsCode1
Pedestrian Attribute Recognition as Label-balanced Multi-label LearningCode1
Navigating Chemical Space with Latent FlowsCode1
Towards Geographic Inclusion in the Evaluation of Text-to-Image ModelsCode1
Argumentative Large Language Models for Explainable and Contestable Claim VerificationCode1
Inherent Trade-Offs between Diversity and Stability in Multi-Task BenchmarksCode1
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business DocumentsCode1
SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for RecommendationCode1
Modeling Caption Diversity in Contrastive Vision-Language PretrainingCode1
Soft Prompt Generation for Domain GeneralizationCode1
CompilerDream: Learning a Compiler World Model for General Code OptimizationCode1
Elucidating the Design Space of Dataset CondensationCode1
FineRec:Exploring Fine-grained Sequential RecommendationCode1
MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye trackingCode1
Forcing Diffuse Distributions out of Language ModelsCode1
Memory Sharing for Large Language Model based AgentsCode1
How Consistent are Clinicians? Evaluating the Predictability of Sepsis Disease Progression with Dynamics ModelsCode1
Show:102550
← PrevPage 11 of 182Next →

No leaderboard results yet.