SOTAVerified

Diversity

Diversity in data sampling is crucial across various use cases, including search, recommendation systems, and more. Ensuring diverse samples means capturing a wide range of variations and perspectives, which leads to more robust, unbiased, and comprehensive models. In search use cases, for instance, diversity helps avoid redundancy, ensuring that users are exposed to a broader set of relevant information rather than repeated similar results.

Papers

Showing 151200 of 9051 papers

TitleStatusHype
A Closer Look into Mixture-of-Experts in Large Language ModelsCode2
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language ModelsCode2
RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone DesignCode2
Can Go AIs be adversarially robust?Code2
AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image ClassificationCode2
Scaling Efficient Masked Image Modeling on Large Remote Sensing DatasetCode2
STAR: Scale-wise Text-to-image generation via Auto-Regressive representationsCode2
Consistency-diversity-realism Pareto fronts of conditional image generative modelsCode2
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian LanguagesCode2
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation ModelsCode2
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow UnderstandingCode2
Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal ExamplesCode2
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term ModelingCode2
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language ModelsCode2
Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2Code2
Diffusion Bridge Implicit ModelsCode2
Diff-BGM: A Diffusion Model for Video Background Music GenerationCode2
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in MammographyCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Grounded 3D-LLM with Referent TokensCode2
DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D GenerationCode2
CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-ResolutionCode2
Learnable Item Tokenization for Generative RecommendationCode2
Benchmarking Representations for Speech, Music, and Acoustic EventsCode2
SynFlowNet: Design of Diverse and Novel Molecules with Synthesis ConstraintsCode2
Multi-Space Alignments Towards Universal LiDAR SegmentationCode2
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic LanguagesCode2
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language ModelsCode2
MAexp: A Generic Platform for RL-based Multi-Agent ExplorationCode2
Token-level Direct Preference OptimizationCode2
VBR: A Vision Benchmark in RomeCode2
in2IN: Leveraging individual Information to Generate Human INteractionsCode2
OmniSat: Self-Supervised Modality Fusion for Earth ObservationCode2
Bridging Remote Sensors with Multisensor Geospatial Foundation ModelsCode2
Guide to k-mer approaches for genomics across the tree of lifeCode2
LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented DiffusionCode2
Protein Conformation Generation via Force-Guided SE(3) Diffusion ModelsCode2
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space ModelsCode2
Face2Diffusion for Fast and Editable Face PersonalizationCode2
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-TrainingCode2
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention ControlCode2
TempCompass: Do Video LLMs Really Understand Videos?Code2
WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image SynthesisCode2
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality EstimationCode2
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward EncodingsCode2
Measuring Multimodal Mathematical Reasoning with MATH-Vision DatasetCode2
Geometry-Informed Neural NetworksCode2
MultiMedEval: A Benchmark and a Toolkit for Evaluating Medical Vision-Language ModelsCode2
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language ModelsCode2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based PoliciesCode2
Show:102550
← PrevPage 4 of 182Next →

No leaderboard results yet.