NormTab: Improving Symbolic Reasoning in LLMs Through Tabular Data Normalization

2024-06-25Code Available0· sign in to hype

Md Mahadi Hasan Nahid, Davood Rafiei

Code Available — Be the first to reproduce this paper.

Code

github.com/mahadi-nahid/NormTab
tf★ 6

Abstract

In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities in parsing textual data and generating code. However, their performance in tasks involving tabular data, especially those requiring symbolic reasoning, faces challenges due to the structural variance and inconsistency in table cell values often found in web tables. In this paper, we introduce NormTab, a novel framework aimed at enhancing the symbolic reasoning performance of LLMs by normalizing web tables. We study table normalization as a stand-alone, one-time preprocessing step using LLMs to support symbolic reasoning on tabular data. Our experimental evaluation, conducted on challenging web table datasets such as WikiTableQuestion and TabFact, demonstrates that leveraging NormTab significantly improves symbolic reasoning performance, showcasing the importance and effectiveness of web table normalization for enhancing LLM-based symbolic reasoning tasks.

Tasks

Semantic Parsing Table-based Fact Verification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
WikiTableQuestions	NormTab+TabSQLify	Accuracy (Test)	68.63	—	Unverified
WikiTableQuestions	NormTab (Targeted) + SQL	Accuracy (Test)	61.2	—	Unverified

NormTab: Improving Symbolic Reasoning in LLMs Through Tabular Data Normalization

Code

Abstract

Tasks

Benchmark Results

Reproductions