Table 4 Performance of different methods using BERT for relation extraction in the SLR 2 dataset

From: Evaluation of a prototype machine learning tool to semi-automate data extraction for systematic literature reviews

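| Model | Precision, % (three-sentence context window) | Recall, % (three-sentence context window) | F1 score, % (three-sentence context window) | Precision, % (five-sentence context window) | Recall, % (five-sentence context window) | F1 score, % (five-sentence context window) |
| --- | --- | --- | --- | --- | --- | --- |
| Role labelling | 61 | 54 | 57 | 60 | 52 | 56 |
| Relation classification | 99 | 89 | 93 | 99 | 87 | 92 |
| Pretrained role labelling | 64 | 59 | 62 | 62 | 55 | 59 |
| Pretrained relation classification | 98 | 91 | **95** | 98 | 90 | 94 |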

  1. The bold F1 score indicates the best-performing model. The 95% confidence intervals for the F1 scores lie within ±0.5 percentage points of the estimates given.
  2. BERT: Bidirectional Encoder Representations from Transformers; SLR: systematic literature review
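
The table does not state how the F1 score was computed. As a minimal sketch, assuming the standard definition (the harmonic mean of precision and recall, F1 = 2PR / (P + R)), the reported values can be cross-checked against the precision and recall columns. The names `f1_score`, `TABLE_4`, and `recomputed_gaps` are hypothetical and are not taken from the article.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given in percent)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (model, context window) -> (precision %, recall %, reported F1 %),
# copied from Table 4.
TABLE_4 = {
    ("Role labelling", 3): (61, 54, 57),
    ("Role labelling", 5): (60, 52, 56),
    ("Relation classification", 3): (99, 89, 93),
    ("Relation classification", 5): (99, 87, 92),
    ("Pretrained role labelling", 3): (64, 59, 62),
    ("Pretrained role labelling", 5): (62, 55, 59),
    ("Pretrained relation classification", 3): (98, 91, 95),
    ("Pretrained relation classification", 5): (98, 90, 94),
}


def recomputed_gaps(rows):
    """Return recomputed F1 minus reported F1, in percentage points."""
    return {
        key: round(f1_score(p, r) - reported, 2)
        for key, (p, r, reported) in rows.items()
    }


if __name__ == "__main__":
    for (model, window), gap in recomputed_gaps(TABLE_4).items():
        print(f"{model} ({window}-sentence window): gap = {gap:+.2f} points")
```

Under this assumption, every recomputed value falls within one percentage point of the reported F1 score. The small gaps may reflect that the published precision and recall are themselves rounded to whole percentages, or that a different averaging scheme was used; the table does not say which.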