SOTAVerified

Diversity

Diversity in data sampling is crucial across various use cases, including search, recommendation systems, and more. Ensuring diverse samples means capturing a wide range of variations and perspectives, which leads to more robust, unbiased, and comprehensive models. In search use cases, for instance, diversity helps avoid redundancy, ensuring that users are exposed to a broader set of relevant information rather than repeated similar results.

Papers

Showing 150 of 9051 papers

TitleStatusHype
MinerU: An Open-Source Solution for Precise Document Content ExtractionCode16
olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language ModelsCode11
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image AnimationCode9
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait AnimationCode9
Depth Anything V2Code9
Is Diversity All You Need for Scalable Robotic Manipulation?Code7
From Audio to Photoreal Embodiment: Synthesizing Humans in ConversationsCode7
Adaptive In-conversation Team Building for Language Model AgentsCode7
FoundationStereo: Zero-Shot Stereo MatchingCode7
PromptWizard: Task-Aware Prompt Optimization FrameworkCode7
Improving Sample Quality of Diffusion Models Using Self-Attention GuidanceCode7
MaskSketch: Unpaired Structure-guided Masked Image GenerationCode7
Better Synthetic Data by Retrieving and Transforming Existing DatasetsCode7
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation DatasetCode7
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsCode7
Flow-GRPO: Training Flow Matching Models via Online RLCode7
Automatic Chain of Thought Prompting in Large Language ModelsCode6
BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting ModelsCode5
Fake News Detection: It's All in the Data!Code5
GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object ManipulationCode5
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal ModelsCode5
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world VideosCode5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter ExpertsCode5
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge AggregationCode5
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive AnnotationsCode5
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-TuningCode5
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification BenchmarkCode5
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity PreservingCode5
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent ExplorationCode4
A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQLCode4
GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from ImagesCode4
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single ImageCode4
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy PredictionCode4
ActionStudio: A Lightweight Framework for Data and Training of Large Action ModelsCode4
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory SynthesisCode4
Quality-aware Masked Diffusion Transformer for Enhanced Music GenerationCode4
Efficient Part-level 3D Object Generation via Dual Volume PackingCode4
Expressive Whole-Body 3D Gaussian AvatarCode4
AlphaFold Meets Flow Matching for Generating Protein EnsemblesCode4
A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANsCode4
Enhancing Chat Language Models by Scaling High-quality Instructional ConversationsCode4
3D Scene Generation: A SurveyCode4
Distill Any Depth: Distillation Creates a Stronger Monocular Depth EstimatorCode4
Improving Text Embeddings with Large Language ModelsCode3
INTERS: Unlocking the Power of Large Language Models in Search with Instruction TuningCode3
LongAlign: A Recipe for Long Context Alignment of Large Language ModelsCode3
Improved motif-scaffolding with SE(3) flow matchingCode3
Hierarchical Text-Conditional Image Generation with CLIP LatentsCode3
Improving Model Evaluation using SMART Filtering of Benchmark DatasetsCode3
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative WarpingCode3
Show:102550
← PrevPage 1 of 182Next →

No leaderboard results yet.