Skip to main content

Table 4 Summary of REINVENT 4 priors. Mol2Mol comes with six different priors with pairs trained on different types of similarity

From: Reinvent 4: Modern AI–driven generative molecule design

Generator

Dataset

Notes

Reinvent

ChEMBL 25

Published in Ref. [19, 20]

Libinvent

ChEMBL 27

Published in Ref. [46]

Linkinvent

ChEMBL 27

Published in Ref. [47]

Mol2Mol

ChEMBL 28

Published in Ref. [22]

  

Similaritya

  

Medium similarityb

  

High similarityc

  

Scaffoldd

  

Generic scaffolde

  

Matched molecular pairsf

Mol2Mol

Pubchemg

Published in Ref. [54]

  

Similarityg

  1. aTanimoto similarity \(\ge\) 0.5.
  2. b.5 \(\le\) Tanimoto similarity < 0.7.
  3. cTanimoto similarity \(\ge\) 0.7.
  4. dMolecules sharing the same Murcko scaffold (RDKit).
  5. eMolecules sharing the same unlabelled Murcko scaffold.
  6. fMatched molecular pairs have been extracted with mmpdb [77].
  7. gPubchem was collected in December 2021.
  8. hTanimoto similarity \(\ge\) 0.5 on ECFP4 fingerprints with counts