SOTAVerified

Diversity

Diversity in data sampling matters across many use cases, including search and recommendation systems. A diverse sample captures a wide range of variations and perspectives, which tends to yield more robust, less biased, and more comprehensive models. In search, for instance, diversity reduces redundancy, so users see a broader set of relevant information rather than repeated near-duplicate results.

Papers

Showing 176–200 of 9051 papers

| Title | Status | Hype |
|---|---|---|
| Counterfactual Multi-player Bandits for Explainable Recommendation Diversification | Code | 0 |
| The NaijaVoices Dataset: Cultivating Large-Scale, High-Quality, Culturally-Rich Speech Data for African Languages | | 0 |
| Holes in Latent Space: Topological Signatures Under Adversarial Influence | | 0 |
| Token-Importance Guided Direct Preference Optimization | | 0 |
| Kuramoto-FedAvg: Using Synchronization Dynamics to Improve Federated Learning Optimization under Statistical Heterogeneity | | 0 |
| An Out-Of-Distribution Membership Inference Attack Approach for Cross-Domain Graph Attacks | | 0 |
| Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning | | 0 |
| We Need to Measure Data Diversity in NLP -- Better and Broader | | 0 |
| Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory | | 0 |
| Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals | | 0 |
| CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data Synthesis | Code | 0 |
| ReDDiT: Rehashing Noise for Discrete Visual Generation | | 0 |
| EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition | | 0 |
| The Role of Diversity in In-Context Learning for Large Language Models | | 0 |
| VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection | | 0 |
| DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving | | 0 |
| Pan-tropical plant functional trait variation from space | | 0 |
| PIGPVAE: Physics-Informed Gaussian Process Variational Autoencoders | | 0 |
| MMP-2K: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment Database | Code | 1 |
| Less is More: Efficient Point Cloud Reconstruction via Multi-Head Decoders | | 0 |
| The Price of Format: Diversity Collapse in LLMs | Code | 0 |
| MGD^3: Mode-Guided Dataset Distillation using Diffusion Models | | 0 |
| Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions | | 0 |
| SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs | | 0 |
| Voice of a Continent: Mapping Africa's Speech Technology Frontier | | 0 |

No leaderboard results yet.