Table 6

Contribution of optical character recognition (OCR) error correction and negated assertions on the performance of the system

YesOther
PPVSensitivityF-measurePPVSensitivityF-measure
Leiomyomas
 Baseline95.86%98.25%97.04%97.97%95.24%96.59%
 +Negated assertions98.10%*97.95%98.03%*97.71%97.87%*97.79%*
 +OCR correction98.11%*98.39%98.25%*98.19%97.87%*98.03%*
Endometriosis
 Baseline88.98%100.00%94.17%100.00%98.91%99.45%
 +Negated assertions91.15%98.10%94.50%99.83%99.16%99.49%
 +OCR correction91.15%98.10%94.50%99.83%99.16%99.49%
Adenomyosis
 Baseline93.59%95.87%94.72%97.06%95.40%96.22%
 +Negated assertions98.27%*95.87%97.06%*97.15%*98.82%*97.98%*
 +OCR correction98.28%*96.62%97.45%*97.66%*98.82%*98.23%*
  • Baseline configuration refers to the exact match of search terms.

  • *Performance difference against baseline is very significant at alpha = 0.01.

  • PPV, positive predictive value.