From: STOUT: SMILES to IUPAC names using neural machine translation
Dataset size
Number of SELFIES tokens
Number of IUPAC tokens
30 Million
27
1190
60 Million