Fig. 6From: Molecular de-novo design through deep reinforcement learningAverage similarity \({{{J}}}\) of generated structures as a function of training steps. Difference in learning dynamics for the Agents based on the canonical Prior, and those based on a reduced Prior where everything more similar than \(J=0.5\) to Celecoxib have been removedBack to article page