| Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End System | Apr 12, 2025 | Information RetrievalRAG | CodeCode Available | 1 |
| LEDD: Large Language Model-Empowered Data Discovery in Data Lakes | Feb 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes | Jun 28, 2024 | Representation LearningTable Search | CodeCode Available | 0 |
| Toward Conversational Agents with Context and Time Sensitive Long-term Memory | May 29, 2024 | FormInformation Retrieval | CodeCode Available | 1 |
| From Specific to Generic Learned Sorted Set Dictionaries: A Theoretically Sound Paradigm Yelding Competitive Data Structural Boosters in Practice | Sep 2, 2023 | Table Search | CodeCode Available | 0 |
| Generative Benchmark Creation for Table Union Search | Aug 7, 2023 | ManagementTable Search | CodeCode Available | 0 |
| StruBERT: Structure-aware BERT for Table Search and Matching | Mar 27, 2022 | RetrievalTable Retrieval | CodeCode Available | 1 |
| Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks | Jan 24, 2022 | DenoisingQuestion Answering | CodeCode Available | 1 |
| Learned Sorted Table Search and Static Indexes in Small Model Space | Jul 19, 2021 | BenchmarkingOpen-Ended Question Answering | CodeCode Available | 0 |
| Retrieving Complex Tables with Multi-Granular Graph Representation Learning | May 4, 2021 | Graph Representation LearningNatural Language Queries | CodeCode Available | 1 |