SOTAVerified

Diversity

Diversity in data sampling is crucial across various use cases, including search, recommendation systems, and more. Ensuring diverse samples means capturing a wide range of variations and perspectives, which leads to more robust, unbiased, and comprehensive models. In search use cases, for instance, diversity helps avoid redundancy, ensuring that users are exposed to a broader set of relevant information rather than repeated similar results.

Papers

Showing 651700 of 9051 papers

TitleStatusHype
Non-Linear Flow Matching for Full-Atom Peptide Design0
MoMa: A Modular Deep Learning Framework for Material Property Prediction0
Reducing false positives in strong lens detection through effective augmentation and ensemble learning0
I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree SearchCode1
Affinity and Diversity: A Unified Metric for Demonstration Selection via Internal Representations0
Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison0
An Enhancement of Jiang, Z., et al.s Compression-Based Classification Algorithm Applied to News Article Categorization0
FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching0
Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration0
DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models0
A Large and Balanced Corpus for Fine-grained Arabic Readability Assessment0
Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder0
ETS: Efficient Tree Search for Inference-Time ScalingCode0
Cascading CMA-ES Instances for Generating Input-diverse Solution BatchesCode0
DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation0
Image compositing is all you need for data augmentation0
Poster: SpiderSim: Multi-Agent Driven Theoretical Cybersecurity Simulation for Industrial DigitalizationCode0
VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare0
Multi-Novelty: Improve the Diversity and Novelty of Contents Generated by Large Language Models via inference-time Multi-Views Brainstorming0
Understanding and Evaluating Hallucinations in 3D Visual Language Models0
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through OptionsCode0
Thinking Outside the (Gray) Box: A Context-Based Score for Assessing Value and Originality in Neural Text Generation0
On the Computational Tractability of the (Many) Shapley Values0
Diverse Topology Optimization using Modulated Neural FieldsCode1
Aligning Sentence Simplification with ESL Learner's Proficiency for Language AcquisitionCode0
InsBank: Evolving Instruction Subset for Ongoing AlignmentCode0
Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI0
Diversity-Oriented Data Augmentation with Large Language Models0
Demographic Attributes Prediction from Speech Using WavLM Embeddings0
Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption0
FairDiverse: A Comprehensive Toolkit for Fair and Diverse Information Retrieval AlgorithmsCode1
Leave No One Behind: Enhancing Diversity While Maintaining Accuracy in Social RecommendationCode0
Attention Mechanism for LLM-based Agents Dynamic Diffusion under Information Asymmetry0
Vendi-RAG: Adaptively Trading-Off Diversity And Quality Significantly Improves Retrieval Augmented Generation With LLMs0
The Shrinking Landscape of Linguistic Diversity in the Age of Large Language ModelsCode0
Diversified Sampling Improves Scaling LLM inference0
VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPSCode0
Is Depth All You Need? An Exploration of Iterative Reasoning in LLMsCode0
The Vendiscope: An Algorithmic Microscope For Data Collections0
To Bin or not to Bin: Alternative Representations of Mass Spectra0
FuncGenFoil: Airfoil Generation and Editing Model in Function SpaceCode0
Direct Preference Optimization-Enhanced Multi-Guided Diffusion Model for Traffic Scenario Generation0
Expert-Agnostic Learning to Defer0
Enhancing Age-Related Robustness in Children Speaker Verification0
Diversity Enhances an LLM's Performance in RAG and Long-context Task0
Matina: A Large-Scale 73B Token Persian Text Corpus0
Communication is All You Need: Persuasion Dataset Construction via Multi-LLM Communication0
Inverse problems with experiment-guided AlphaFold0
When and How Does CLIP Enable Domain and Compositional Generalization?0
Diffusion Models for Molecules: A Survey of Methods and TasksCode2
Show:102550
← PrevPage 14 of 182Next →

No leaderboard results yet.