Table 10 Example of a token sequence tagged with matches against chemical dictionaries.

From: Optimising chemical named entity recognition with pre-processing analytics, knowledge-rich features and heuristics

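| Token | Normal form | ChEBI | DrugBank | CTD | PubChem | Jochem |
|---|---|---|---|---|---|---|
| For | for | O | O | O | O | O |
| the | the | O | O | O | O | O |
| preparation | preparation | O | O | O | O | O |
| of | of | O | O | O | O | O |
| hydrogel | hydrogel | O | O | B | O | B |
| microspheres | microsphere | O | O | O | O | O |
| based | base | O | O | O | O | O |
| on | on | O | O | O | O | O |
| hydroxyethyl | hydroxyethyl | O | O | B | O | B |
| starch | starch | B | O | I | O | I |
| - | _ | B | O | O | O | O |
| hydroxyethyl | hydroxyethyl | I | O | B | O | B |
| methacrylate | methacrylate | I | O | I | B | I |
| ( | _ | O | O | O | O | O |
| HES-HEMA | hes_hema | O | O | O | O | O |
| ) | _ | O | O | O | O | O |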
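
The tags in the table above can be read as per-dictionary B/I/O labels: B marks the first token of a dictionary match, I a continuation token, and O a token outside any match. The following is a minimal sketch of how such a column might be produced, assuming greedy longest-match lookup over the normal forms. The function `bio_tag` and the entries in `toy_dictionary` are hypothetical and chosen for illustration; they are not taken from the article or from any of the listed dictionaries, and the article's actual matching procedure may differ.

```python
from typing import Iterable, List, Sequence, Set, Tuple


def bio_tag(tokens: Sequence[str], entries: Iterable[Sequence[str]]) -> List[str]:
    """Tag normalised tokens with B/I/O using greedy longest match.

    `entries` is a collection of dictionary names, each already split
    into normalised tokens (an assumption of this sketch).
    """
    entry_set: Set[Tuple[str, ...]] = {tuple(e) for e in entries}
    max_len = max((len(e) for e in entry_set), default=0)

    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = 0
        # Try the longest candidate span first, then shrink.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            if tuple(tokens[i:i + n]) in entry_set:
                matched = n
                break
        if matched:
            tags[i] = "B"
            for j in range(i + 1, i + matched):
                tags[j] = "I"
            i += matched  # continue after the matched span
        else:
            i += 1
    return tags


# Normal forms of a fragment of the example sentence.
normal_forms = ["hydroxyethyl", "starch", "_", "hydroxyethyl", "methacrylate"]

# Hypothetical dictionary content, for illustration only.
toy_dictionary = [
    ("hydroxyethyl", "starch"),
    ("hydroxyethyl", "methacrylate"),
    ("methacrylate",),
]

tags = bio_tag(normal_forms, toy_dictionary)
print(list(zip(normal_forms, tags)))
```

With this toy dictionary the fragment is tagged B, I, O, B, I, which is the pattern shown in the CTD and Jochem columns for the same tokens. Running the function once per dictionary would give one tag column per resource, as in the table.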