Ontology type: schema:ScholarlyArticle Open Access: True
2018-06-25
AUTHORSJan P. Buchmann, Mathieu Fourment, Edward C. Holmes
ABSTRACTThe large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is being increasingly incorporated into existing data formats. Despite the existence of structured formats based on XML, biological data sets are mainly formatted using unstructured file formats, and the incorporation of metadata results in increasingly complex parsing routines such that they become more error prone. To overcome these problems, we present the “biological object notation” (BON) format, a new way to exchange and parse nearly all biological data sets more efficiently and with less error than other currently available formats. Based on JavaScript Object Notation (JSON), BON simplifies parsing by clearly separating the biological data from its metadata and reduces complexity compared to XML based formats. The ability to selectively compress data up to 87% compared to other file formats and the reduced complexity results in improved transfer times and less error prone applications. More... »
PAGES9644
http://scigraph.springernature.com/pub.10.1038/s41598-018-28016-6
DOIhttp://dx.doi.org/10.1038/s41598-018-28016-6
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1104998386
PUBMEDhttps://www.ncbi.nlm.nih.gov/pubmed/29941895
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0802",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Computation Theory and Mathematics",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, the University of Sydney, 2006, Sydney, New South Wales, Australia",
"id": "http://www.grid.ac/institutes/grid.1013.3",
"name": [
"Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, the University of Sydney, 2006, Sydney, New South Wales, Australia"
],
"type": "Organization"
},
"familyName": "Buchmann",
"givenName": "Jan P.",
"id": "sg:person.01236257003.79",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01236257003.79"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "ithree Institute, University of Technology Sydney, 2007, Ultimo, New South Wales, Australia",
"id": "http://www.grid.ac/institutes/grid.117476.2",
"name": [
"ithree Institute, University of Technology Sydney, 2007, Ultimo, New South Wales, Australia"
],
"type": "Organization"
},
"familyName": "Fourment",
"givenName": "Mathieu",
"id": "sg:person.01257106550.52",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01257106550.52"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, the University of Sydney, 2006, Sydney, New South Wales, Australia",
"id": "http://www.grid.ac/institutes/grid.1013.3",
"name": [
"Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, the University of Sydney, 2006, Sydney, New South Wales, Australia"
],
"type": "Organization"
},
"familyName": "Holmes",
"givenName": "Edward C.",
"id": "sg:person.012501436222.04",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012501436222.04"
],
"type": "Person"
}
],
"citation": [
{
"id": "sg:pub.10.1186/1471-2105-13-209",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015744304",
"https://doi.org/10.1186/1471-2105-13-209"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-10-356",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1050078012",
"https://doi.org/10.1186/1471-2105-10-356"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/3-540-45848-4_59",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1038863971",
"https://doi.org/10.1007/3-540-45848-4_59"
],
"type": "CreativeWork"
}
],
"datePublished": "2018-06-25",
"datePublishedReg": "2018-06-25",
"description": "The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is being increasingly incorporated into existing data formats. Despite the existence of structured formats based on XML, biological data sets are mainly formatted using unstructured file formats, and the incorporation of metadata results in increasingly complex parsing routines such that they become more error prone. To overcome these problems, we present the \u201cbiological object notation\u201d (BON) format, a new way to exchange and parse nearly all biological data sets more efficiently and with less error than other currently available formats. Based on JavaScript Object Notation (JSON), BON simplifies parsing by clearly separating the biological data from its metadata and reduces complexity compared to XML based formats. The ability to selectively compress data up to 87% compared to other file formats and the reduced complexity results in improved transfer times and less error prone applications.",
"genre": "article",
"id": "sg:pub.10.1038/s41598-018-28016-6",
"isAccessibleForFree": true,
"isFundedItemOf": [
{
"id": "sg:grant.6809668",
"type": "MonetaryGrant"
}
],
"isPartOf": [
{
"id": "sg:journal.1045337",
"issn": [
"2045-2322"
],
"name": "Scientific Reports",
"publisher": "Springer Nature",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "8"
}
],
"keywords": [
"JavaScript Object Notation",
"biological data sets",
"Object Notation",
"file format",
"data sets",
"error-prone applications",
"biological data",
"amount of metadata",
"data format",
"error prone",
"parsing routines",
"complexity results",
"structured format",
"high complexity",
"XML",
"metadata",
"more error prone",
"available formats",
"notation",
"format",
"transfer time",
"less error",
"complexity",
"set",
"new way",
"computer",
"applications",
"prone",
"simplifies",
"data",
"major methodological challenges",
"routines",
"challenges",
"large size",
"error",
"way",
"results",
"exchange",
"time",
"amount",
"ability",
"methodological challenges",
"analysis",
"size",
"substantial increase",
"existence",
"incorporation",
"increase",
"problem"
],
"name": "The Biological Object Notation (BON): a structured file format for biological data",
"pagination": "9644",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1104998386"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1038/s41598-018-28016-6"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"29941895"
]
}
],
"sameAs": [
"https://doi.org/10.1038/s41598-018-28016-6",
"https://app.dimensions.ai/details/publication/pub.1104998386"
],
"sdDataset": "articles",
"sdDatePublished": "2022-08-04T17:05",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220804/entities/gbq_results/article/article_785.jsonl",
"type": "ScholarlyArticle",
"url": "https://doi.org/10.1038/s41598-018-28016-6"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1038/s41598-018-28016-6'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1038/s41598-018-28016-6'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1038/s41598-018-28016-6'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1038/s41598-018-28016-6'
This table displays all metadata directly associated to this object as RDF triples.
140 TRIPLES
21 PREDICATES
77 URIs
66 LITERALS
7 BLANK NODES
Subject | Predicate | Object | |
---|---|---|---|
1 | sg:pub.10.1038/s41598-018-28016-6 | schema:about | anzsrc-for:08 |
2 | ″ | ″ | anzsrc-for:0802 |
3 | ″ | schema:author | Na68e8b62c6da470897720547d8e23b5d |
4 | ″ | schema:citation | sg:pub.10.1007/3-540-45848-4_59 |
5 | ″ | ″ | sg:pub.10.1186/1471-2105-10-356 |
6 | ″ | ″ | sg:pub.10.1186/1471-2105-13-209 |
7 | ″ | schema:datePublished | 2018-06-25 |
8 | ″ | schema:datePublishedReg | 2018-06-25 |
9 | ″ | schema:description | The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is being increasingly incorporated into existing data formats. Despite the existence of structured formats based on XML, biological data sets are mainly formatted using unstructured file formats, and the incorporation of metadata results in increasingly complex parsing routines such that they become more error prone. To overcome these problems, we present the “biological object notation” (BON) format, a new way to exchange and parse nearly all biological data sets more efficiently and with less error than other currently available formats. Based on JavaScript Object Notation (JSON), BON simplifies parsing by clearly separating the biological data from its metadata and reduces complexity compared to XML based formats. The ability to selectively compress data up to 87% compared to other file formats and the reduced complexity results in improved transfer times and less error prone applications. |
10 | ″ | schema:genre | article |
11 | ″ | schema:isAccessibleForFree | true |
12 | ″ | schema:isPartOf | N8c36fdd2f1184b029f01eb9421aeda85 |
13 | ″ | ″ | Nefaf712c83604f05b2c4a607dd17bdd1 |
14 | ″ | ″ | sg:journal.1045337 |
15 | ″ | schema:keywords | JavaScript Object Notation |
16 | ″ | ″ | Object Notation |
17 | ″ | ″ | XML |
18 | ″ | ″ | ability |
19 | ″ | ″ | amount |
20 | ″ | ″ | amount of metadata |
21 | ″ | ″ | analysis |
22 | ″ | ″ | applications |
23 | ″ | ″ | available formats |
24 | ″ | ″ | biological data |
25 | ″ | ″ | biological data sets |
26 | ″ | ″ | challenges |
27 | ″ | ″ | complexity |
28 | ″ | ″ | complexity results |
29 | ″ | ″ | computer |
30 | ″ | ″ | data |
31 | ″ | ″ | data format |
32 | ″ | ″ | data sets |
33 | ″ | ″ | error |
34 | ″ | ″ | error prone |
35 | ″ | ″ | error-prone applications |
36 | ″ | ″ | exchange |
37 | ″ | ″ | existence |
38 | ″ | ″ | file format |
39 | ″ | ″ | format |
40 | ″ | ″ | high complexity |
41 | ″ | ″ | incorporation |
42 | ″ | ″ | increase |
43 | ″ | ″ | large size |
44 | ″ | ″ | less error |
45 | ″ | ″ | major methodological challenges |
46 | ″ | ″ | metadata |
47 | ″ | ″ | methodological challenges |
48 | ″ | ″ | more error prone |
49 | ″ | ″ | new way |
50 | ″ | ″ | notation |
51 | ″ | ″ | parsing routines |
52 | ″ | ″ | problem |
53 | ″ | ″ | prone |
54 | ″ | ″ | results |
55 | ″ | ″ | routines |
56 | ″ | ″ | set |
57 | ″ | ″ | simplifies |
58 | ″ | ″ | size |
59 | ″ | ″ | structured format |
60 | ″ | ″ | substantial increase |
61 | ″ | ″ | time |
62 | ″ | ″ | transfer time |
63 | ″ | ″ | way |
64 | ″ | schema:name | The Biological Object Notation (BON): a structured file format for biological data |
65 | ″ | schema:pagination | 9644 |
66 | ″ | schema:productId | N0b02e869355d47349625abdc58be1260 |
67 | ″ | ″ | N93d79db5f3c64a4792d9b808c58a3cc3 |
68 | ″ | ″ | Ne5c8836a3ac847618db2346093558a9c |
69 | ″ | schema:sameAs | https://app.dimensions.ai/details/publication/pub.1104998386 |
70 | ″ | ″ | https://doi.org/10.1038/s41598-018-28016-6 |
71 | ″ | schema:sdDatePublished | 2022-08-04T17:05 |
72 | ″ | schema:sdLicense | https://scigraph.springernature.com/explorer/license/ |
73 | ″ | schema:sdPublisher | N2ed5158fbf1440a2ab2bcc38036dd5e8 |
74 | ″ | schema:url | https://doi.org/10.1038/s41598-018-28016-6 |
75 | ″ | sgo:license | sg:explorer/license/ |
76 | ″ | sgo:sdDataset | articles |
77 | ″ | rdf:type | schema:ScholarlyArticle |
78 | N0b02e869355d47349625abdc58be1260 | schema:name | pubmed_id |
79 | ″ | schema:value | 29941895 |
80 | ″ | rdf:type | schema:PropertyValue |
81 | N2ed5158fbf1440a2ab2bcc38036dd5e8 | schema:name | Springer Nature - SN SciGraph project |
82 | ″ | rdf:type | schema:Organization |
83 | N8c36fdd2f1184b029f01eb9421aeda85 | schema:issueNumber | 1 |
84 | ″ | rdf:type | schema:PublicationIssue |
85 | N93d79db5f3c64a4792d9b808c58a3cc3 | schema:name | dimensions_id |
86 | ″ | schema:value | pub.1104998386 |
87 | ″ | rdf:type | schema:PropertyValue |
88 | Na68e8b62c6da470897720547d8e23b5d | rdf:first | sg:person.01236257003.79 |
89 | ″ | rdf:rest | Nc41232b6879b4622abd9ec0f96b7c41d |
90 | Nbab18ad66aec4e95a3e5b7554c7b1baf | rdf:first | sg:person.012501436222.04 |
91 | ″ | rdf:rest | rdf:nil |
92 | Nc41232b6879b4622abd9ec0f96b7c41d | rdf:first | sg:person.01257106550.52 |
93 | ″ | rdf:rest | Nbab18ad66aec4e95a3e5b7554c7b1baf |
94 | Ne5c8836a3ac847618db2346093558a9c | schema:name | doi |
95 | ″ | schema:value | 10.1038/s41598-018-28016-6 |
96 | ″ | rdf:type | schema:PropertyValue |
97 | Nefaf712c83604f05b2c4a607dd17bdd1 | schema:volumeNumber | 8 |
98 | ″ | rdf:type | schema:PublicationVolume |
99 | anzsrc-for:08 | schema:inDefinedTermSet | anzsrc-for: |
100 | ″ | schema:name | Information and Computing Sciences |
101 | ″ | rdf:type | schema:DefinedTerm |
102 | anzsrc-for:0802 | schema:inDefinedTermSet | anzsrc-for: |
103 | ″ | schema:name | Computation Theory and Mathematics |
104 | ″ | rdf:type | schema:DefinedTerm |
105 | sg:grant.6809668 | http://pending.schema.org/fundedItem | sg:pub.10.1038/s41598-018-28016-6 |
106 | ″ | rdf:type | schema:MonetaryGrant |
107 | sg:journal.1045337 | schema:issn | 2045-2322 |
108 | ″ | schema:name | Scientific Reports |
109 | ″ | schema:publisher | Springer Nature |
110 | ″ | rdf:type | schema:Periodical |
111 | sg:person.01236257003.79 | schema:affiliation | grid-institutes:grid.1013.3 |
112 | ″ | schema:familyName | Buchmann |
113 | ″ | schema:givenName | Jan P. |
114 | ″ | schema:sameAs | https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01236257003.79 |
115 | ″ | rdf:type | schema:Person |
116 | sg:person.012501436222.04 | schema:affiliation | grid-institutes:grid.1013.3 |
117 | ″ | schema:familyName | Holmes |
118 | ″ | schema:givenName | Edward C. |
119 | ″ | schema:sameAs | https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.012501436222.04 |
120 | ″ | rdf:type | schema:Person |
121 | sg:person.01257106550.52 | schema:affiliation | grid-institutes:grid.117476.2 |
122 | ″ | schema:familyName | Fourment |
123 | ″ | schema:givenName | Mathieu |
124 | ″ | schema:sameAs | https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01257106550.52 |
125 | ″ | rdf:type | schema:Person |
126 | sg:pub.10.1007/3-540-45848-4_59 | schema:sameAs | https://app.dimensions.ai/details/publication/pub.1038863971 |
127 | ″ | ″ | https://doi.org/10.1007/3-540-45848-4_59 |
128 | ″ | rdf:type | schema:CreativeWork |
129 | sg:pub.10.1186/1471-2105-10-356 | schema:sameAs | https://app.dimensions.ai/details/publication/pub.1050078012 |
130 | ″ | ″ | https://doi.org/10.1186/1471-2105-10-356 |
131 | ″ | rdf:type | schema:CreativeWork |
132 | sg:pub.10.1186/1471-2105-13-209 | schema:sameAs | https://app.dimensions.ai/details/publication/pub.1015744304 |
133 | ″ | ″ | https://doi.org/10.1186/1471-2105-13-209 |
134 | ″ | rdf:type | schema:CreativeWork |
135 | grid-institutes:grid.1013.3 | schema:alternateName | Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, the University of Sydney, 2006, Sydney, New South Wales, Australia |
136 | ″ | schema:name | Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, the University of Sydney, 2006, Sydney, New South Wales, Australia |
137 | ″ | rdf:type | schema:Organization |
138 | grid-institutes:grid.117476.2 | schema:alternateName | ithree Institute, University of Technology Sydney, 2007, Ultimo, New South Wales, Australia |
139 | ″ | schema:name | ithree Institute, University of Technology Sydney, 2007, Ultimo, New South Wales, Australia |
140 | ″ | rdf:type | schema:Organization |