SOTAVerified

Data Summarization

Data Summarization is a central problem in the area of machine learning, where we want to compute a small summary of the data.

Source: How to Solve Fair k-Center in Massive Data Models

Papers

Showing 1–50 of 97 papers

| Title | Status | Hype |
|---|---|---|
| Soft-Label Dataset Distillation and Text Dataset Distillation | Code | 1 |
| Sequential estimation of Spearman rank correlation using Hermite series estimators | Code | 1 |
| Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification? | Code | 1 |
| Submodlib: A Submodular Optimization Library | Code | 1 |
| ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries | Code | 1 |
| Very Fast Streaming Submodular Function Maximization | Code | 1 |
| CO-Optimal Transport | Code | 1 |
| Get Rid of Isolation: A Continuous Multi-task Spatio-Temporal Learning Framework | Code | 1 |
| Flexible Dataset Distillation: Learn Labels Instead of Images | Code | 1 |
| Sequential Quantiles via Hermite Series Density Estimation | Code | 1 |
| Semi-supervised Batch Active Learning via Bilevel Optimization | Code | 1 |
| Streaming Algorithms for Diversity Maximization with Fairness Constraints | Code | 0 |
| A Mixed Hierarchical Attention based Encoder-Decoder Approach for Standard Table Summarization | Code | 0 |
| An Online Algorithm for Nonparametric Correlations | Code | 0 |
| Understanding collections of related datasets using dependent MMD coresets | Code | 0 |
| apricot: Submodular selection for data summarization in Python | Code | 0 |
| Balancing Utility and Fairness in Submodular Maximization (Technical Report) | Code | 0 |
| β-Cores: Robust Large-Scale Bayesian Data Summarization in the Presence of Outliers | Code | 0 |
| Black-box Coreset Variational Inference | Code | 0 |
| Coverage-Based Designs Improve Sample Mining and Hyper-Parameter Optimization | Code | 0 |
| Deuteros 2.0: Peptide-level significance testing of data from hydrogen deuterium exchange mass spectrometry | Code | 0 |
| DiffRed: Dimensionality Reduction guided by stable rank | Code | 0 |
| Fair and Diverse DPP-based Data Summarization | Code | 0 |
| Fair k-Center Clustering for Data Summarization | Code | 0 |
| Fast and Accurate Least-Mean-Squares Solvers | Code | 0 |
| Group Equality in Adaptive Submodular Maximization | Code | 0 |
| Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision | Code | 0 |
| MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering | Code | 0 |
| Scalable k-Means Clustering via Lightweight Coresets | Code | 0 |
| Streaming Submodular Maximization under a k-Set System Constraint | Code | 0 |
| Fair and Representative Subset Selection from Data Streams | Code | 0 |
| Synthetic Dataset Generation of Driver Telematics | Code | 0 |
| Time-to-Pattern: Information-Theoretic Unsupervised Learning for Scalable Time Series Summarization | Code | 0 |
| Towards Neural Numeric-To-Text Generation From Temporal Personal Health Data | Code | 0 |
| Streaming Robust Submodular Maximization: A Partitioned Thresholding Approach | | 0 |
| Fair Clustering for Data Summarization: Improved Approximation Algorithms and Complexity Insights | | 0 |
| Towards General Robustness to Bad Training Data | | 0 |
| Fair k-Centers via Maximum Matching | | 0 |
| Streaming Submodular Maximization under a k-Set System Constraint | | 0 |
| Fast and Private Submodular and k-Submodular Functions Maximization with Matroid Constraints | | 0 |
| Fast determinantal point processes via distortion-free intermediate sampling | | 0 |
| Fast Distributed Submodular Cover: Public-Private Data Summarization | | 0 |
| Federated Combinatorial Multi-Agent Multi-Armed Bandits | | 0 |
| Data Summarization at Scale: A Two-Stage Submodular Approach | | 0 |
| Coresets for Vector Summarization with Applications to Network Graphs | | 0 |
| GIST: Greedy Independent Set Thresholding for Diverse Data Summarization | | 0 |
| Graph Summarization Methods and Applications: A Survey | | 0 |
| GreedyML: A Parallel Algorithm for Maximizing Constrained Submodular Functions | | 0 |
| Adaptive Sampling for Fast Constrained Maximization of Submodular Function | | 0 |
| Group Fairness in Non-monotone Submodular Maximization | | 0 |

No leaderboard results yet.