Skip to main content

Table 8 Test data results for subsets

From: DECIMER 1.0: deep learning for chemical image recognition using transformers

Metrics

Subset 1

Subset 2

Subset 3

Subset 4

Train data size

921,600

10,240,000

15,360,000

35,002,240

Test data size

102,400

1,024,000

1,536,000

3,929,093

Tanimoto

0.9371

0.9691

0.9779

0.9923

Tanimoto 1.0

74.57%

87.88%

91.02%

96.47%