Diversity

Diversity in data sampling is crucial across various use cases, including search, recommendation systems, and more. Ensuring diverse samples means capturing a wide range of variations and perspectives, which leads to more robust, unbiased, and comprehensive models. In search use cases, for instance, diversity helps avoid redundancy, ensuring that users are exposed to a broader set of relevant information rather than repeated similar results.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 9051 papers

Title	Date	Tasks	Status	Hype
Improving Model Evaluation using SMART Filtering of Benchmark Datasets	Oct 26, 2024	ChatbotDiversity	CodeCode Available	3
Results of the Big ANN: NeurIPS'23 competition	Sep 25, 2024	Diversity	CodeCode Available	3
Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models	Sep 16, 2024	DecoderDiversity	CodeCode Available	3
SkillMimic: Learning Basketball Interaction Skills from Demonstrations	Aug 12, 2024	DiversityHuman-Object Interaction Detection	CodeCode Available	3
Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2	Aug 3, 2024	DiversitySegmentation	CodeCode Available	3
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors	Jun 26, 2024	Diversity	CodeCode Available	3
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines	Jun 20, 2024	Diversityobject-detection	CodeCode Available	3
Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation	May 30, 2024	DiversityDrug Design	CodeCode Available	3
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping	May 27, 2024	Depth EstimationDiversity	CodeCode Available	3
FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes	May 7, 2024	3D Point Cloud Classification3D Semantic Segmentation	CodeCode Available	3
Taming Diffusion Probabilistic Models for Character Control	Apr 23, 2024	Computational EfficiencyDiversity	CodeCode Available	3
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition	Apr 23, 2024	DecoderDiversity	CodeCode Available	3
Addressing the Abstraction and Reasoning Corpus via Procedural Example Generation	Apr 10, 2024	ARCDiversity	CodeCode Available	3
UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction	Mar 22, 2024	DiversityPrediction	CodeCode Available	3
ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars	Mar 22, 2024	3D GenerationDiversity	CodeCode Available	3
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding	Feb 22, 2024	DiversityScene Understanding	CodeCode Available	3
LongAlign: A Recipe for Long Context Alignment of Large Language Models	Jan 31, 2024	DiversityInstruction Following	CodeCode Available	3
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning	Jan 12, 2024	Diversitydocument understanding	CodeCode Available	3
Improved motif-scaffolding with SE(3) flow matching	Jan 8, 2024	Data AugmentationDiversity	CodeCode Available	3
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling	Dec 31, 2023	3D Face AnimationDiversity	CodeCode Available	3
Improving Text Embeddings with Large Language Models	Dec 31, 2023	DecoderDiversity	CodeCode Available	3
Sequential Modeling Enables Scalable Learning for Large Vision Models	Dec 1, 2023	Diversity	CodeCode Available	3
Taiwan LLM: Bridging the Linguistic Divide with a Culturally Aligned Language Model	Nov 29, 2023	DiversityLanguage Modeling	CodeCode Available	3
SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks	Nov 20, 2023	DiversityImage Segmentation	CodeCode Available	3
CRITERIA: a New Benchmarking Paradigm for Evaluating Trajectory Prediction Models for Autonomous Driving	Oct 11, 2023	Autonomous DrivingBenchmarking	CodeCode Available	3
Objaverse-XL: A Universe of 10M+ 3D Objects	Jul 11, 2023	DiversityNovel View Synthesis	CodeCode Available	3
SVIT: Scaling up Visual Instruction Tuning	Jul 9, 2023	DiversityImage Captioning	CodeCode Available	3
Self-QA: Unsupervised Knowledge Guided Language Model Alignment	May 19, 2023	DiversityLanguage Modeling	CodeCode Available	3
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision	May 4, 2023	DiversityIn-Context Learning	CodeCode Available	3
Anything-3D: Towards Single-view Anything Reconstruction in the Wild	Apr 19, 2023	3D ReconstructionDiversity	CodeCode Available	3
RT-1: Robotics Transformer for Real-World Control at Scale	Dec 13, 2022	DiversityRobot Manipulation	CodeCode Available	3
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models	Oct 26, 2022	DiversityMisinformation	CodeCode Available	3
MiniViT: Compressing Vision Transformers with Weight Multiplexing	Apr 14, 2022	DiversityImage Classification	CodeCode Available	3
Hierarchical Text-Conditional Image Generation with CLIP Latents	Apr 13, 2022	Conditional Image GenerationDecoder	CodeCode Available	3
MNN: A Universal and Efficient Inference Engine	Feb 27, 2020	Deep LearningDiversity	CodeCode Available	3
Generating Long Sequences with Sparse Transformers	Apr 23, 2019	DiversityImage Generation	CodeCode Available	3
Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation	Jul 3, 2025	DiversityVideo Generation	CodeCode Available	2
MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based Agents	Jun 20, 2025	Diversity	CodeCode Available	2
Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs	Jun 12, 2025	Diversity	CodeCode Available	2
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting	Jun 11, 2025	DiversityRepresentation Learning	CodeCode Available	2
MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming	May 29, 2025	DiversityEfficient Exploration	CodeCode Available	2
ZIPA: A family of efficient models for multilingual phone recognition	May 29, 2025	Diversity	CodeCode Available	2
AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection	May 19, 2025	Anomaly DetectionCode Generation	CodeCode Available	2
HISTAI: An Open-Source, Large-Scale Whole Slide Image Dataset for Computational Pathology	May 17, 2025	DiagnosticDiversity	CodeCode Available	2
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation	Apr 17, 2025	Data AugmentationDiversity	CodeCode Available	2
SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users	Apr 14, 2025	DiversityFace Alignment	CodeCode Available	2
MegaMath: Pushing the Limits of Open Math Corpora	Apr 3, 2025	DiversityMath	CodeCode Available	2
Dereflection Any Image with Diffusion Priors and Diversified Data	Mar 21, 2025	DiversityReflection Removal	CodeCode Available	2
Modifying Large Language Model Post-Training for Diverse Creative Writing	Mar 21, 2025	DiversityLanguage Modeling	CodeCode Available	2
PET-MAD, a universal interatomic potential for advanced materials modeling	Mar 18, 2025	Diversity	CodeCode Available	2

Show:10 25 50

← PrevPage 2 of 182Next →

No leaderboard results yet.