MvP: Multi-view Prompting Improves Aspect Sentiment Tuple Prediction
Zhibin Gou, Qingyan Guo, Yujiu Yang
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/ZubinGou/multi-view-promptingOfficialIn paperpytorch★ 94
Abstract
Generative methods greatly promote aspect-based sentiment analysis via generating a sequence of sentiment elements in a specified format. However, existing studies usually predict sentiment elements in a fixed order, which ignores the effect of the interdependence of the elements in a sentiment tuple and the diversity of language expression on the results. In this work, we propose Multi-view Prompting (MvP) that aggregates sentiment elements generated in different orders, leveraging the intuition of human-like problem-solving processes from different views. Specifically, MvP introduces element order prompts to guide the language model to generate multiple sentiment tuples, each with a different element order, and then selects the most reasonable tuples by voting. MvP can naturally model multi-view and multi-task as permutations and combinations of elements, respectively, outperforming previous task-specific designed methods on multiple ABSA tasks with a single model. Extensive experiments show that MvP significantly advances the state-of-the-art performance on 10 datasets of 4 benchmark tasks, and performs quite effectively in low-resource settings. Detailed evaluation verified the effectiveness, flexibility, and cross-task transferability of MvP.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| ACOS | MvP | F1 (Laptop) | 43.92 | — | Unverified |
| ACOS | ChatGPT (gpt-3.5-turbo, zero-shot) | F1 (Restaurant) | 27.11 | — | Unverified |
| ACOS | ChatGPT (gpt-3.5-turbo, few-shot) | F1 (Restaurant) | 37.71 | — | Unverified |
| ACOS | MvP (muilti-task) | F1 (Laptop) | 43.84 | — | Unverified |
| ASQP | ChatGPT (gpt-3.5-turbo, zero-shot) | F1 (R15) | 22.87 | — | Unverified |
| ASQP | MvP (multi-task) | F1 (R15) | 52.21 | — | Unverified |
| ASQP | MvP | F1 (R15) | 51.04 | — | Unverified |
| ASQP | ChatGPT (gpt-3.5-turbo, few-shot) | F1 (R15) | 34.27 | — | Unverified |
| ASTE | MvP (multi-task) | F1 (L14) | 65.3 | — | Unverified |
| ASTE | MvP | F1 (L14) | 63.33 | — | Unverified |
| ASTE | ChatGPT (gpt-3.5-turbo, few-shot) | F1 (L14) | 38.12 | — | Unverified |
| ASTE | ChatGPT (gpt-3.5-turbo, zero-shot) | F1 (L14) | 36.05 | — | Unverified |
| TASD | MvP (multi-task) | F1 (R15) | 64.74 | — | Unverified |
| TASD | ChatGPT (gpt-3.5-turbo, zero-shot) | F1 (R16) | 34.08 | — | Unverified |
| TASD | ChatGPT (gpt-3.5-turbo, few-shot) | F1 (R16) | 46.51 | — | Unverified |
| TASD | MvP | F1 (R15) | 64.53 | — | Unverified |