SOTAVerified

Not All Jokes Land: Evaluating Large Language Models Understanding of Workplace Humor

2025-06-02Unverified0· sign in to hype

Mohammadamin Shafiei, Hamidreza Saffari

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

With the recent advances in Artificial Intelligence (AI) and Large Language Models (LLMs), the automation of daily tasks, like automatic writing, is getting more and more attention. Hence, efforts have focused on aligning LLMs with human values, yet humor, particularly professional industrial humor used in workplaces, has been largely neglected. To address this, we develop a dataset of professional humor statements along with features that determine the appropriateness of each statement. Our evaluation of five LLMs shows that LLMs often struggle to judge the appropriateness of humor accurately.

Tasks

Reproductions