SOTAVerified

Diversity

Diversity in data sampling matters across many use cases, including search and recommendation systems. A diverse sample captures a wide range of variations and perspectives, which helps produce more robust, less biased, and more comprehensive models. In search, for instance, diversity reduces redundancy, so users see a broader set of relevant results rather than repeated near-duplicates.

Papers

Showing 151-200 of 9051 papers

| Title | Status | Hype |
|---|---|---|
| Diversity-Aware Policy Optimization for Large Language Model Reasoning | | 0 |
| Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness | | 0 |
| DiCoFlex: Model-agnostic diverse counterfactuals with flexible control | | 0 |
| Generating Diverse Training Samples for Relation Extraction with Large Language Models | | 0 |
| ZIPA: A family of efficient models for multilingual phone recognition | Code | 2 |
| Interspeech 2025 URGENT Speech Enhancement Challenge | | 0 |
| MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming | Code | 2 |
| Single Domain Generalization for Alzheimer's Detection from 3D MRIs with Pseudo-Morphological Augmentations and Contrastive Learning | Code | 0 |
| Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory | | 0 |
| VME: A Satellite Imagery Dataset and Benchmark for Detecting Vehicles in the Middle East and Beyond | Code | 0 |
| Can LLMs Deceive CLIP? Benchmarking Adversarial Compositionality of Pre-trained Multimodal Representation via Text Updates | | 0 |
| Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection | Code | 1 |
| AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion | | 0 |
| From Failures to Fixes: LLM-Driven Scenario Repair for Self-Evolving Autonomous Driving | | 0 |
| Jailbreak Distillation: Renewable Safety Benchmarking | | 0 |
| Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation | | 0 |
| PoisonSwarm: Universal Harmful Information Synthesis via Model Crowdsourcing | | 0 |
| CNVSRC 2024: The Second Chinese Continuous Visual Speech Recognition Challenge | | 0 |
| LLM-Driven E-Commerce Marketing Content Optimization: Balancing Creativity and Conversion | | 0 |
| Response to comment on Mutualism weaken the latitudinal diversity gradient among oceanic islands | Code | 0 |
| Conditional Diffusion Models with Classifier-Free Gibbs-like Guidance | Code | 0 |
| Fundamental Limits of Game-Theoretic LLM Alignment: Smith Consistency and Preference Matching | | 0 |
| Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection | Code | 0 |
| PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts | | 0 |
| Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation | | 0 |
| Counterfactual Multi-player Bandits for Explainable Recommendation Diversification | Code | 0 |
| The NaijaVoices Dataset: Cultivating Large-Scale, High-Quality, Culturally-Rich Speech Data for African Languages | | 0 |
| Holes in Latent Space: Topological Signatures Under Adversarial Influence | | 0 |
| Token-Importance Guided Direct Preference Optimization | | 0 |
| Kuramoto-FedAvg: Using Synchronization Dynamics to Improve Federated Learning Optimization under Statistical Heterogeneity | | 0 |
| An Out-Of-Distribution Membership Inference Attack Approach for Cross-Domain Graph Attacks | | 0 |
| Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning | | 0 |
| We Need to Measure Data Diversity in NLP -- Better and Broader | | 0 |
| Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory | | 0 |
| Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals | | 0 |
| CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data Synthesis | Code | 0 |
| ReDDiT: Rehashing Noise for Discrete Visual Generation | | 0 |
| EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition | | 0 |
| The Role of Diversity in In-Context Learning for Large Language Models | | 0 |
| VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection | | 0 |
| DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving | | 0 |
| Pan-tropical plant functional trait variation from space | | 0 |
| PIGPVAE: Physics-Informed Gaussian Process Variational Autoencoders | | 0 |
| MMP-2K: A Benchmark Multi-Labeled Macro Photography Image Quality Assessment Database | Code | 1 |
| Less is More: Efficient Point Cloud Reconstruction via Multi-Head Decoders | | 0 |
| The Price of Format: Diversity Collapse in LLMs | Code | 0 |
| MGD^3: Mode-Guided Dataset Distillation using Diffusion Models | | 0 |
| Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions | | 0 |
| SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs | | 0 |
| Voice of a Continent: Mapping Africa's Speech Technology Frontier | | 0 |