SOTAVerified

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

2019-08-27IJCNLP 2019Code Available1· sign in to hype

Nils Reimers, Iryna Gurevych

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

BERT (Devlin et al., 2018) and RoBERTa (Liu et al., 2019) has set a new state-of-the-art performance on sentence-pair regression tasks like semantic textual similarity (STS). However, it requires that both sentences are fed into the network, which causes a massive computational overhead: Finding the most similar pair in a collection of 10,000 sentences requires about 50 million inference computations (~65 hours) with BERT. The construction of BERT makes it unsuitable for semantic similarity search as well as for unsupervised tasks like clustering. In this publication, we present Sentence-BERT (SBERT), a modification of the pretrained BERT network that use siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine-similarity. This reduces the effort for finding the most similar pair from 65 hours with BERT / RoBERTa to about 5 seconds with SBERT, while maintaining the accuracy from BERT. We evaluate SBERT and SRoBERTa on common STS tasks and transfer learning tasks, where it outperforms other state-of-the-art sentence embeddings methods.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
SICKSBERT-NLI-largeSpearman Correlation0.74Unverified
SICKSBERT-NLI-baseSpearman Correlation0.73Unverified
SICKSentenceBERTSpearman Correlation0.75Unverified
SICKSRoBERTa-NLI-baseSpearman Correlation0.74Unverified
SICKSRoBERTa-NLI-largeSpearman Correlation0.74Unverified
STS12SRoBERTa-NLI-largeSpearman Correlation0.75Unverified
STS13SBERT-NLI-largeSpearman Correlation0.78Unverified
STS14SBERT-NLI-largeSpearman Correlation0.75Unverified
STS15SRoBERTa-NLI-largeSpearman Correlation0.82Unverified
STS16SRoBERTa-NLI-largeSpearman Correlation0.77Unverified
STS BenchmarkSBERT-NLI-baseSpearman Correlation0.77Unverified
STS BenchmarkSRoBERTa-NLI-baseSpearman Correlation0.78Unverified
STS BenchmarkSRoBERTa-NLI-STSb-largeSpearman Correlation0.86Unverified
STS BenchmarkSBERT-STSb-baseSpearman Correlation0.85Unverified
STS BenchmarkSBERT-NLI-largeSpearman Correlation0.79Unverified
STS BenchmarkSBERT-STSb-largeSpearman Correlation0.84Unverified

Reproductions