Table 4 Misclassification Costs per primary screen dataset and mixed primary/confirmatory datasets

From: Virtual screening of bioassay data

| Dataset | Naive Bayes | SMO | Random Forest | J48 |
|---|---|---|---|---|
| AID362 (70) | 40 | 150 | 3000 | 285 |
| AID604 (281) | 40 | 250 | Out of memory | 650 |
| AID456 (369) | 18 | 200 | 100000 | 1000 |
| AID688 (109) | 34 | 78 | Out of memory | 220 |
| AID373 (963) | 20 | 2000 | Out of memory | 3000 |
| AID746 (162) | 25 | 100 | Out of memory | 450 |
| AID687 (351) | 50 | 250 | Out of memory | 680 |
| AID746&AID1284 (1048) | 100 | 1000 | Out of memory | 1900 |
| AID604&AID644 (891) | 70 | 750 | Out of memory | 1500 |
| AID373&AID439 (4599) | 70 | 9000 | Out of memory | 9500 |
| AID687&AID721 (351) | 700 | 6702 | Out of memory | 1900 |
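
The table itself does not say how these costs enter each classifier. As a minimal sketch, assume each value is the cost of a false negative (an active compound predicted inactive) relative to a false-positive cost of 1, and that the classifier outputs a probability of activity. Under those assumptions, the minimum-expected-cost rule lowers the probability threshold for calling a compound active. The function names below are hypothetical and are not taken from the article or from any library.

```python
def min_cost_threshold(fn_cost: float, fp_cost: float = 1.0) -> float:
    """Probability of 'active' above which predicting 'active' is cheaper.

    Assumption: a 2x2 cost matrix with zero cost for correct predictions,
    fp_cost for a false positive and fn_cost for a false negative.
    """
    return fp_cost / (fp_cost + fn_cost)


def predict_active(p_active: float, fn_cost: float, fp_cost: float = 1.0) -> bool:
    """Pick the class with the lower expected misclassification cost."""
    cost_if_predict_inactive = p_active * fn_cost        # risk of missing an active
    cost_if_predict_active = (1.0 - p_active) * fp_cost  # risk of a false alarm
    return cost_if_predict_inactive > cost_if_predict_active


if __name__ == "__main__":
    # Illustrative only: 40 is the Naive Bayes value listed for AID362.
    print(min_cost_threshold(40))
    print(predict_active(0.05, 40), predict_active(0.02, 40))
```

With a cost of 40, a compound with only a 5% predicted probability of activity is still labelled active, which is how a large false-negative cost can compensate for a heavily imbalanced screen.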