Skip to main content

Table 1 Number of unique SELFIES and IUPAC-name tokens for each dataset

From: STOUT: SMILES to IUPAC names using neural machine translation

Dataset size

Number of SELFIES tokens

Number of IUPAC tokens

30 Million

27

1190

60 Million

27

1190