Table 3 Statistical features

From: Improving chemical disease relation extraction with rich features and weakly labeled data

| No. | Feature | Type |
| --- | --- | --- |
| 1 | Number of chemical mentions | Numeric |
| 2 | Number of disease mentions | Numeric |
| 3 | Is chemical in title | Boolean |
| 4 | Is disease in title | Boolean |
| 5 | Is chemical in the 1st sentence of the abstract | Boolean |
| 6 | Is disease in the 1st sentence of the abstract | Boolean |
| 7 | Is chemical in the last sentence of the abstract | Boolean |
| 8 | Is disease in the last sentence of the abstract | Boolean |
| 9 | Are both chemical and disease in the same sentence | Boolean |
| 10 | Was the disease-chemical relation curated by CTD in the past | Boolean |
| 11 | Do both disease and chemical appear in past MeSH indexing | Boolean |
| 12 | Is any keyword (e.g., therapy, complicating, affect) around the disease | Boolean |
| 13 | Is any keyword (e.g., 3.0 mEq/L, mg) around the chemical | Boolean |
| 14 | Is “increase” or “decrease” around chemical | Boolean |
| 15 | Is “increase” or “decrease” around disease | Boolean |
| 16 | Is “p-value” around chemical | Boolean |
| 17 | Is “p-value” around disease | Boolean |
| 18 | Is “men”, “women”, or “patient” around chemical | Boolean |
| 19 | Is “men”, “women”, or “patient” around disease | Boolean |
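
As an illustration only, the following Python sketch shows how a subset of these features (1 to 9, 14, 15, 18, and 19) might be computed for one chemical-disease pair. It is not the authors' implementation. The function names, the exact-token matching of mentions, the window of five tokens used to interpret "around", and the inclusion of the plural "patients" are all assumptions made for this example. Features 10 to 13, 16, and 17 are omitted because they depend on external resources (CTD, MeSH) or on keyword lists that the table does not give in full.

```python
import re

WINDOW = 5  # assumed: number of tokens on each side that count as "around"


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def find_positions(tokens, mention):
    """Start indices where the (possibly multi-word) mention occurs."""
    m = tokenize(mention)
    if not m:
        return []
    n = len(m)
    return [i for i in range(len(tokens) - n + 1) if tokens[i:i + n] == m]


def contains(text, mention):
    return bool(find_positions(tokenize(text), mention))


def keyword_near(text, mention, keywords, window=WINDOW):
    """True if any keyword token lies within `window` tokens of the mention."""
    tokens = tokenize(text)
    n = len(tokenize(mention))
    for i in find_positions(tokens, mention):
        context = tokens[max(0, i - window):i] + tokens[i + n:i + n + window]
        if any(k in context for k in keywords):
            return True
    return False


def statistical_features(title, sentences, chemical, disease):
    """Compute an assumed subset of the Table 3 features for one pair."""
    texts = [title] + list(sentences)
    first = sentences[0] if sentences else ""
    last = sentences[-1] if sentences else ""
    change = {"increase", "decrease"}
    person = {"men", "women", "patient", "patients"}  # plural is an assumption

    def count(mention):
        return sum(len(find_positions(tokenize(t), mention)) for t in texts)

    def near(mention, keywords):
        # The window is applied per sentence so it never crosses a boundary.
        return any(keyword_near(t, mention, keywords) for t in texts)

    return {
        "n_chemical_mentions": count(chemical),            # feature 1
        "n_disease_mentions": count(disease),              # feature 2
        "chemical_in_title": contains(title, chemical),    # feature 3
        "disease_in_title": contains(title, disease),      # feature 4
        "chemical_in_first": contains(first, chemical),    # feature 5
        "disease_in_first": contains(first, disease),      # feature 6
        "chemical_in_last": contains(last, chemical),      # feature 7
        "disease_in_last": contains(last, disease),        # feature 8
        "same_sentence": any(                              # feature 9
            contains(t, chemical) and contains(t, disease) for t in texts
        ),
        "change_near_chemical": near(chemical, change),    # feature 14
        "change_near_disease": near(disease, change),      # feature 15
        "person_near_chemical": near(chemical, person),    # feature 18
        "person_near_disease": near(disease, person),      # feature 19
    }


# Hypothetical input, invented for this example.
example = statistical_features(
    title="Lithium induced tremor in elderly patients",
    sentences=[
        "Tremor is a common adverse effect of lithium.",
        "Serum lithium levels showed an increase in women with tremor.",
    ],
    chemical="lithium",
    disease="tremor",
)
```

Exact token matching keeps the sketch short, but it will miss inflected forms such as "increased"; a real system would more likely rely on the mention offsets produced by a named-entity recognizer and on lemmatized keywords.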