Skip to main content

Table 10 The Adjustment Rules of the Text Spans in the BioCreative IV CHEMDNER competition.

From: A CRF-based system for recognizing chemical entity mentions (CEMs) in biomedical literature

ID

IF Condition

Action

1

#(·, "(") == #(·, ")") + 1 ∧ right(·, 1) == ")"

offset(·, end) = offset(·, end) + 1

2

#(·, "(") == #(·, ")") - 1 ∧ left(·, 1) == "("

offset(·, start) = offset(·, start) - 1

3

#(·, "[") == #(·, "]") + 1 ∧ right(·, 1) == "]"

offset(·, end) = offset(·, end) + 1

4

#(·, "[") == #(·, "]") - 1 ∧ left(·, 1) == "["

offset(·, start) = offset(·, start) - 1

5

#(·, "{") == #(·, "}") + 1 ∧ right(·, 1) == "}"

offset(·, end) = offset(·, end) + 1

6

#(·, "{") == #(·, "}") - 1 ∧ left(·, 1) == "{"

offset(·, start) = offset(·, start) - 1

7

#(·, "") == #(·, "") + 1 ∧ right(·, 5) == ""

offset(·, end) = offset(·, end) + 5

8

#(·, "") == #(·, "") - 1 ∧ left(·, 4) == ""

offset(·, start) = offset(·, start) - 4

9

#(·, "") == #(·, "") + 1 ∧ right(·, 4) == ""

offset(·, end) = offset(·, end) + 4

10

#(·, "") == #(·, "") - 1 ∧ left(·, 3) == ""

offset(·, start) = offset(·, start) - 3

11

#(·, "") == #(·, "") + 1 ∧ right(·, 6) == ""

offset(·, end) = offset(·, end) + 6

12

#(·, "") == #(·, "") - 1 ∧ left(·, 5) == ""

offset(·, start) = offset(·, start) - 5

13

#(·, "") == #(·, "") + 1 ∧ right(·, 6) == ""

offset(·, end) = offset(·, end) + 6

14

#(·, "") == #(·, "") - 1 ∧ left(·, 5) == ""

offset(·, start) = offset(·, start) - 5

  1. #(·, str) means the number of occurrences of the string str in the interested CEM, right(·, n) and left(·, n) denote the substring with the length of n right or left to the interested CEM, and offset(·, start) and offset(·, left) indicate the start or end offset of the interested CEM.