Fig. 7
From: Molecular de-novo design through deep reinforcement learning
Evolution of generated structures during training. Structures sampled every 100 training steps during the training of the Agent towards similarity to Celecoxib with \(k = 0.7\) and \(\sigma = 15\).