Skip to main content

Table 1 Number of unique SELFIES and IUPAC-name tokens for each dataset

From: STOUT: SMILES to IUPAC names using neural machine translation

Dataset size Number of SELFIES tokens Number of IUPAC tokens
30 Million 27 1190
60 Million 27 1190