Skip to main content

Advertisement

Table 1 Comparison of the document types, lengths, and how many of each type of tag was applied to it by Semanti-Cat

From: Too many tags spoil the metadata: investigating the knowledge management of scientific research with semantic web technologies

Participant/document information Discipline Document type Document format Document makeup # Pages # Words # Chars # Open Calais tags # Ontology tags # ChemicalTagger tags # GATE tags
Participant L Chemistry Thesis section .DOC Mostly text, 1 figure 4 990 5120 12 17 16 4
Participant J Chemistry Blog post HTML Mostly structure diagrams with some text 2 441 1919 6 2 15 0
Participant S Chemistry ESI document .DOC ~ 60% text ~  40% figures 12 2216 14,108 12 75 92 3
Participant AI Chemistry Thesis section .DOC Mostly text, 1 figure 4 1501 9562 12 61 62 4
Participant AJ Chemistry Handwritten lab book page Handwritten Mostly text and some filenames 1 160 1229 3 5 10 0
Participant AK Chemistry Experimental writeup .DOC Roughly equal text/figures 14 1558 8453 7 32 26 5
Participant A Physics Research paper .PDF ~  60% text ~  40% figures/equations 8 5996 35,398 12 79 92 2
Participant B Physics Sample report .DOC Roughly equal text/figures 4 711 4396 8 13 8 0
Participant AL Physics Handwritten lab book page Handwritten Little text and sketched diagram 1 75 723 0 1 3 0
Participant AM Physics Handwritten lab book page Handwritten 100% text 1 173 1070 8 4 0 0
Participant Q Biology Thesis section .DOC Mostly words, some figures/graphs 1 378 2478 10 18 8 0
Participant R Biology Thesis section .DOC Mostly words, some figures/graphs 3 899 5754 12 25 12 0
Participant AN Biology Experiment writeup .DOC Mostly words, some figures/graphs 12 3160 21,276 7 34 60 0
Participant AO Biology Literature report .DOC Mostly words, some figures/graphs 10 3449 23,780 12 38 58 1
Participant AQ Biology Experiment writeup .DOC Mostly words, some figures/graphs 7 2492 13,861 12 19 20 1