Table 2 CHEMDNER abstracts, split into chemical disciplines (subject categories, first column; MULTIDISCIPL. CHEM.: Multidisciplinary Chemistry).

From: The CHEMDNER corpus of chemicals and drugs and its annotation principles

| Chem. subject categories | Abstracts | Mentions | AB | FA | FO | ID | MU | NO | SY | TR |
|---|---|---|---|---|---|---|---|---|---|---|
| PHARMACOLOGY | 1,983 | 23,368 | 18.81 | 10.54 | 6.42 | 4.93 | 0.64 | 0.29 | 17.28 | 41.09 |
| MEDICINAL CHEMISTRY | 1,957 | 17,543 | 10.00 | 21.11 | 8.00 | 2.10 | 1.56 | 0.12 | 25.88 | 31.23 |
| ORGANIC CHEMISTRY | 1,893 | 22,622 | 18.77 | 10.56 | 6.56 | 5.00 | 0.63 | 0.30 | 17.43 | 40.74 |
| TOXICOLOGY | 1,664 | 21,608 | 20.82 | 10.59 | 14.16 | 1.35 | 0.46 | 0.13 | 22.68 | 29.81 |
| MULTIDISCIPL. CHEM. | 1,217 | 11,892 | 14.38 | 12.15 | 27.97 | 0.52 | 0.55 | 0.13 | 25.62 | 18.67 |
| PHYSICAL CHEMISTRY | 997 | 9,682 | 12.14 | 9.81 | 36.39 | 0.27 | 0.43 | 0.15 | 27.57 | 13.24 |
| BIOCHEMISTRY | 879 | 6,503 | 18.75 | 16.55 | 14.24 | 1.12 | 0.34 | 0.11 | 23.17 | 25.73 |
| APPLIED CHEMISTRY | 843 | 7,759 | 8.48 | 24.45 | 7.71 | 0.17 | 1.37 | 0.10 | 24.99 | 32.74 |
| ENDOCRINOLOGY | 652 | 5,484 | 14.66 | 16.01 | 9.87 | 1.33 | 0.15 | 0.15 | 20.13 | 37.71 |
| POLYMER SCIENCE | 232 | 1,999 | 33.82 | 17.26 | 6.50 | 0.05 | 0.10 | 0.00 | 25.86 | 16.41 |
| CHEMICAL ENGINEERING | 3 | 42 | 0.00 | 0.00 | 38.10 | 0.00 | 0.00 | 0.00 | 61.90 | 0.00 |
  1. Abstracts: the number of abstracts associated with that category in the CHEMDNER corpus. Mentions: the total number of chemical entity mentions in the abstracts of that category. Remaining columns: the percentage of that category's mentions that belong to each SACEM class, so each row sums to roughly 100; AB: ABBREVIATION, FA: FAMILY, FO: FORMULA, ID: IDENTIFIER, MU: MULTIPLE, NO: NO CLASS, SY: SYSTEMATIC, TR: TRIVIAL.
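Because the class columns are percentages of each category's mention total, approximate per-class mention counts can be recovered by multiplying back. The following is a minimal sketch, not part of the original article: the helper name `approx_counts` is hypothetical, and the results are only estimates, since the published percentages are rounded to two decimals.

```python
# SACEM class abbreviations in the column order used by Table 2.
CLASSES = ["AB", "FA", "FO", "ID", "MU", "NO", "SY", "TR"]


def approx_counts(mentions, percentages):
    """Estimate per-class mention counts from the rounded percentages.

    Hypothetical helper for illustration; the values are approximations
    because the table reports percentages rounded to two decimals.
    """
    return {c: round(mentions * p / 100) for c, p in zip(CLASSES, percentages)}


# CHEMICAL ENGINEERING row: 42 mentions in total.
chem_eng = approx_counts(42, [0.00, 0.00, 38.10, 0.00, 0.00, 0.00, 61.90, 0.00])
print(chem_eng["FO"], chem_eng["SY"])  # 16 26
```

For small categories such as CHEMICAL ENGINEERING the estimates add up to the mention total exactly; for larger categories, rounding may leave the sum off by a few mentions.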