SOTAVerified

Diversity

Diversity in data sampling is crucial across various use cases, including search, recommendation systems, and more. Ensuring diverse samples means capturing a wide range of variations and perspectives, which leads to more robust, unbiased, and comprehensive models. In search use cases, for instance, diversity helps avoid redundancy, ensuring that users are exposed to a broader set of relevant information rather than repeated similar results.

Papers

Showing 201225 of 9051 papers

TitleStatusHype
MoMBS: Mixed-order minibatch sampling enhances model training from diverse-quality images0
ToDRE: Visual Token Pruning via Diversity and Task Awareness for Efficient Large Vision-Language Models0
LiSTEN: Learning Soft Token Embeddings for Neural Audio LLMs0
Measuring diversity of synthetic prompts and data generated with fine-grained persona prompting0
Large language model as user daily behavior data generator: balancing population diversity and individual personality0
High-Fidelity Functional Ultrasound Reconstruction via A Visual Auto-Regressive Framework0
BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting ModelsCode5
JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language ModelsCode0
CrashAgent: Crash Scenario Generation via Multi-modal Reasoning0
LongMagpie: A Self-synthesis Method for Generating Large-scale Long-context Instructions0
Generative AI and Creativity: A Systematic Literature Review and Meta-AnalysisCode0
Can LLMs Simulate Human Behavioral Variability? A Case Study in the Phonemic Fluency Task0
Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models0
Robust Invariant Representation Learning by Distribution Extrapolation0
Diverse, not Short: A Length-Controlled Self-Learning Framework for Improving Response Diversity of Language Models0
Sudoku-Bench: Evaluating creative reasoning with Sudoku variantsCode0
AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners0
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory SynthesisCode4
Exploring the Relationship Between Diversity and Quality in Ad Text Generation0
Swarm Intelligence Enhanced Reasoning: A Density-Driven Framework for LLM-Based Multi-Agent Optimization0
Ensembling Sparse Autoencoders0
An Inclusive Foundation Model for Generalizable Cytogenetics in Precision Oncology0
Aug2Search: Enhancing Facebook Marketplace Search with LLM-Generated Synthetic Data Augmentation0
OpenEthics: A Comprehensive Ethical Evaluation of Open-Source Generative Large Language ModelsCode0
A Distributed Local Energy Market Clearing Framework Using a Two-Loop ADMM Method0
Show:102550
← PrevPage 9 of 363Next →

No leaderboard results yet.