Learning to predict single-wall carbon nanotube-recognition DNA sequences View Full Text


Ontology type: schema:ScholarlyArticle      Open Access: True


Article Info

DATE

2019-12

AUTHORS

Yoona Yang, Ming Zheng, Anand Jagota

ABSTRACT

DNA/single-wall carbon nanotube (SWCNT) hybrids have enabled many applications because of their special ability to disperse and sort SWCNTs by their chirality and handedness. Much work has been done to discover sequences which recognize specific chiralities of SWCNT, and significant progress has been made in understanding the underlying structure and thermodynamics of these hybrids. Nevertheless, de novo prediction of recognition sequences remains essentially impossible and the success rate for their discovery by search of the vast single-stranded DNA library is very low. Here, we report an effective way of predicting recognition sequences based on machine learning analysis of existing experimental sequence data sets. Multiple input feature construction methods (position-specific, term-frequency, combined or segmented term frequency vector, and motif-based feature) were used and compared. The transformed features were used to train several classifier algorithms (logistic regression, support vector machine, and artificial neural network). Trained models were used to predict new sets of recognition sequences, and consensus among a number of models was used successfully to counteract the limited size of the data set. Predictions were tested using aqueous two-phase separation. New data thus acquired were used to retrain the models by adding an experimentally tested new set of predicted sequences to the original set. The frequency of finding correct recognition sequences by the trained model increased to >50% from the ~10% success rate in the original training data set. More... »

PAGES

3

Identifiers

URI

http://scigraph.springernature.com/pub.10.1038/s41524-018-0142-3

DOI

http://dx.doi.org/10.1038/s41524-018-0142-3

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1110955647


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Lehigh University", 
          "id": "https://www.grid.ac/institutes/grid.259029.5", 
          "name": [
            "Department of Chemical & Biomolecular Engineering, Lehigh University, 18015, Bethlehem, PA, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Yang", 
        "givenName": "Yoona", 
        "id": "sg:person.011626664174.12", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011626664174.12"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "National Institute of Standards and Technology", 
          "id": "https://www.grid.ac/institutes/grid.94225.38", 
          "name": [
            "Materials Science and Engineering Division, National Institute of Standards and Technology, 20899, Gaithersburg, MD, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Zheng", 
        "givenName": "Ming", 
        "id": "sg:person.0617564542.45", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0617564542.45"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Lehigh University", 
          "id": "https://www.grid.ac/institutes/grid.259029.5", 
          "name": [
            "Department of Chemical & Biomolecular Engineering, Lehigh University, 18015, Bethlehem, PA, USA", 
            "Department of Bioengineering, Lehigh University, 18015, Bethlehem, PA, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Jagota", 
        "givenName": "Anand", 
        "id": "sg:person.0616413355.14", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0616413355.14"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1039/c1an15179g", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1000710041"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nnano.2007.290", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1001853696", 
          "https://doi.org/10.1038/nnano.2007.290"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btg1058", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1003963728"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/j.snb.2014.10.040", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1008817454"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1471-2105-14-96", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1011961692", 
          "https://doi.org/10.1186/1471-2105-14-96"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1471-2105-14-96", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1011961692", 
          "https://doi.org/10.1186/1471-2105-14-96"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/la500013c", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1013413296"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1126/science.1060928", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1013559716"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1002/adma.201401402", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1013707546"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1471-2105-11-s1-s65", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1014920715", 
          "https://doi.org/10.1186/1471-2105-11-s1-s65"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/ja504078b", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1015988430"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1556-276x-9-393", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1017985419", 
          "https://doi.org/10.1186/1556-276x-9-393"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/0925-2312(91)90023-5", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1018872684"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1016/0925-2312(91)90023-5", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1018872684"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/s41061-016-0098-z", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1020123678", 
          "https://doi.org/10.1007/s41061-016-0098-z"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1007/s41061-016-0098-z", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1020123678", 
          "https://doi.org/10.1007/s41061-016-0098-z"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/nl071909j", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1022111912"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/nl071909j", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1022111912"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btr110", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1024363090"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1110/ps.0239403", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1024845437"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/ncomms1313", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1026419943", 
          "https://doi.org/10.1038/ncomms1313"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nnano.2006.52", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1028729458", 
          "https://doi.org/10.1038/nnano.2006.52"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nnano.2006.52", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1028729458", 
          "https://doi.org/10.1038/nnano.2006.52"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/jp309523a", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1028940478"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature08116", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1030619385", 
          "https://doi.org/10.1038/nature08116"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nature08116", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1030619385", 
          "https://doi.org/10.1038/nature08116"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/ja5055498", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1033675014"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1093/bioinformatics/btg005", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1043080454"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1471-2105-9-57", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1052408459", 
          "https://doi.org/10.1186/1471-2105-9-57"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1471-2105-9-57", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1052408459", 
          "https://doi.org/10.1186/1471-2105-9-57"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/la2007793", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1053622582"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/la2007793", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1053622582"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/ja1084942", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1055848659"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/ja1084942", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1055848659"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/jacs.6b09135", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1055876287"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/jp1051497", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1056077787"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/jp1051497", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1056077787"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/nl8022143", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1056221492"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/nl8022143", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1056221492"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/nl802645d", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1056221561"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/nl802645d", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1056221561"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1126/science.1144592", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1062456172"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nnano.2016.284", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1074189364", 
          "https://doi.org/10.1038/nnano.2016.284"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nnano.2016.284", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1074189364", 
          "https://doi.org/10.1038/nnano.2016.284"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/nnano.2016.284", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1074189364", 
          "https://doi.org/10.1038/nnano.2016.284"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/acs.jpclett.7b00583", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1084821146"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/acs.jpcc.7b05168", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1090780579"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1021/acs.langmuir.7b03186", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1100239536"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.2307/2347628", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1101983400"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.2307/2347628", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1101983400"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1038/s41524-018-0081-z", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1103889265", 
          "https://doi.org/10.1038/s41524-018-0081-z"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1111/j.2517-6161.1958.tb00292.x", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1110457190"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "https://doi.org/10.1111/j.2517-6161.1958.tb00292.x", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1110457190"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2019-12", 
    "datePublishedReg": "2019-12-01", 
    "description": "DNA/single-wall carbon nanotube (SWCNT) hybrids have enabled many applications because of their special ability to disperse and sort SWCNTs by their chirality and handedness. Much work has been done to discover sequences which recognize specific chiralities of SWCNT, and significant progress has been made in understanding the underlying structure and thermodynamics of these hybrids. Nevertheless, de novo prediction of recognition sequences remains essentially impossible and the success rate for their discovery by search of the vast single-stranded DNA library is very low. Here, we report an effective way of predicting recognition sequences based on machine learning analysis of existing experimental sequence data sets. Multiple input feature construction methods (position-specific, term-frequency, combined or segmented term frequency vector, and motif-based feature) were used and compared. The transformed features were used to train several classifier algorithms (logistic regression, support vector machine, and artificial neural network). Trained models were used to predict new sets of recognition sequences, and consensus among a number of models was used successfully to counteract the limited size of the data set. Predictions were tested using aqueous two-phase separation. New data thus acquired were used to retrain the models by adding an experimentally tested new set of predicted sequences to the original set. The frequency of finding correct recognition sequences by the trained model increased to >50% from the ~10% success rate in the original training data set.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1038/s41524-018-0142-3", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": true, 
    "isPartOf": [
      {
        "id": "sg:journal.1285194", 
        "issn": [
          "2057-3960"
        ], 
        "name": "npj Computational Materials", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "1", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "5"
      }
    ], 
    "name": "Learning to predict single-wall carbon nanotube-recognition DNA sequences", 
    "pagination": "3", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "213fee1e72a46f7173e2749e41a4c57e91e24a846c371a78c19db1fa50b01d7a"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1038/s41524-018-0142-3"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1110955647"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1038/s41524-018-0142-3", 
      "https://app.dimensions.ai/details/publication/pub.1110955647"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-11T08:37", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000315_0000000315/records_6302_00000000.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "https://www.nature.com/articles/s41524-018-0142-3"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1038/s41524-018-0142-3'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1038/s41524-018-0142-3'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1038/s41524-018-0142-3'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1038/s41524-018-0142-3'


 

This table displays all metadata directly associated to this object as RDF triples.

200 TRIPLES      21 PREDICATES      64 URIs      19 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1038/s41524-018-0142-3 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author N14cb29cc95244b22a33de73a8072981c
4 schema:citation sg:pub.10.1007/s41061-016-0098-z
5 sg:pub.10.1038/nature08116
6 sg:pub.10.1038/ncomms1313
7 sg:pub.10.1038/nnano.2006.52
8 sg:pub.10.1038/nnano.2007.290
9 sg:pub.10.1038/nnano.2016.284
10 sg:pub.10.1038/s41524-018-0081-z
11 sg:pub.10.1186/1471-2105-11-s1-s65
12 sg:pub.10.1186/1471-2105-14-96
13 sg:pub.10.1186/1471-2105-9-57
14 sg:pub.10.1186/1556-276x-9-393
15 https://doi.org/10.1002/adma.201401402
16 https://doi.org/10.1016/0925-2312(91)90023-5
17 https://doi.org/10.1016/j.snb.2014.10.040
18 https://doi.org/10.1021/acs.jpcc.7b05168
19 https://doi.org/10.1021/acs.jpclett.7b00583
20 https://doi.org/10.1021/acs.langmuir.7b03186
21 https://doi.org/10.1021/ja1084942
22 https://doi.org/10.1021/ja504078b
23 https://doi.org/10.1021/ja5055498
24 https://doi.org/10.1021/jacs.6b09135
25 https://doi.org/10.1021/jp1051497
26 https://doi.org/10.1021/jp309523a
27 https://doi.org/10.1021/la2007793
28 https://doi.org/10.1021/la500013c
29 https://doi.org/10.1021/nl071909j
30 https://doi.org/10.1021/nl8022143
31 https://doi.org/10.1021/nl802645d
32 https://doi.org/10.1039/c1an15179g
33 https://doi.org/10.1093/bioinformatics/btg005
34 https://doi.org/10.1093/bioinformatics/btg1058
35 https://doi.org/10.1093/bioinformatics/btr110
36 https://doi.org/10.1110/ps.0239403
37 https://doi.org/10.1111/j.2517-6161.1958.tb00292.x
38 https://doi.org/10.1126/science.1060928
39 https://doi.org/10.1126/science.1144592
40 https://doi.org/10.2307/2347628
41 schema:datePublished 2019-12
42 schema:datePublishedReg 2019-12-01
43 schema:description DNA/single-wall carbon nanotube (SWCNT) hybrids have enabled many applications because of their special ability to disperse and sort SWCNTs by their chirality and handedness. Much work has been done to discover sequences which recognize specific chiralities of SWCNT, and significant progress has been made in understanding the underlying structure and thermodynamics of these hybrids. Nevertheless, de novo prediction of recognition sequences remains essentially impossible and the success rate for their discovery by search of the vast single-stranded DNA library is very low. Here, we report an effective way of predicting recognition sequences based on machine learning analysis of existing experimental sequence data sets. Multiple input feature construction methods (position-specific, term-frequency, combined or segmented term frequency vector, and motif-based feature) were used and compared. The transformed features were used to train several classifier algorithms (logistic regression, support vector machine, and artificial neural network). Trained models were used to predict new sets of recognition sequences, and consensus among a number of models was used successfully to counteract the limited size of the data set. Predictions were tested using aqueous two-phase separation. New data thus acquired were used to retrain the models by adding an experimentally tested new set of predicted sequences to the original set. The frequency of finding correct recognition sequences by the trained model increased to >50% from the ~10% success rate in the original training data set.
44 schema:genre research_article
45 schema:inLanguage en
46 schema:isAccessibleForFree true
47 schema:isPartOf N1979a6f19c384f45999fa5664f0e87b1
48 N7493364b953147b0a435a14b239f4f3b
49 sg:journal.1285194
50 schema:name Learning to predict single-wall carbon nanotube-recognition DNA sequences
51 schema:pagination 3
52 schema:productId Nadc03328a631444699121051a7f2de84
53 Ndbdf870eea7c41c79ac674f6abf44d08
54 Nde7cc7f0cc634c5a838d7df069b266e4
55 schema:sameAs https://app.dimensions.ai/details/publication/pub.1110955647
56 https://doi.org/10.1038/s41524-018-0142-3
57 schema:sdDatePublished 2019-04-11T08:37
58 schema:sdLicense https://scigraph.springernature.com/explorer/license/
59 schema:sdPublisher N53f0bc4084db4d53a2e80f18913b85cc
60 schema:url https://www.nature.com/articles/s41524-018-0142-3
61 sgo:license sg:explorer/license/
62 sgo:sdDataset articles
63 rdf:type schema:ScholarlyArticle
64 N14cb29cc95244b22a33de73a8072981c rdf:first sg:person.011626664174.12
65 rdf:rest Nca21f37d46a84b299c83ca76c8d00cd8
66 N1979a6f19c384f45999fa5664f0e87b1 schema:volumeNumber 5
67 rdf:type schema:PublicationVolume
68 N53f0bc4084db4d53a2e80f18913b85cc schema:name Springer Nature - SN SciGraph project
69 rdf:type schema:Organization
70 N5b9673a837d0407f8e91a4c2e37d8a99 rdf:first sg:person.0616413355.14
71 rdf:rest rdf:nil
72 N7493364b953147b0a435a14b239f4f3b schema:issueNumber 1
73 rdf:type schema:PublicationIssue
74 Nadc03328a631444699121051a7f2de84 schema:name dimensions_id
75 schema:value pub.1110955647
76 rdf:type schema:PropertyValue
77 Nca21f37d46a84b299c83ca76c8d00cd8 rdf:first sg:person.0617564542.45
78 rdf:rest N5b9673a837d0407f8e91a4c2e37d8a99
79 Ndbdf870eea7c41c79ac674f6abf44d08 schema:name doi
80 schema:value 10.1038/s41524-018-0142-3
81 rdf:type schema:PropertyValue
82 Nde7cc7f0cc634c5a838d7df069b266e4 schema:name readcube_id
83 schema:value 213fee1e72a46f7173e2749e41a4c57e91e24a846c371a78c19db1fa50b01d7a
84 rdf:type schema:PropertyValue
85 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
86 schema:name Information and Computing Sciences
87 rdf:type schema:DefinedTerm
88 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
89 schema:name Artificial Intelligence and Image Processing
90 rdf:type schema:DefinedTerm
91 sg:journal.1285194 schema:issn 2057-3960
92 schema:name npj Computational Materials
93 rdf:type schema:Periodical
94 sg:person.011626664174.12 schema:affiliation https://www.grid.ac/institutes/grid.259029.5
95 schema:familyName Yang
96 schema:givenName Yoona
97 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.011626664174.12
98 rdf:type schema:Person
99 sg:person.0616413355.14 schema:affiliation https://www.grid.ac/institutes/grid.259029.5
100 schema:familyName Jagota
101 schema:givenName Anand
102 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0616413355.14
103 rdf:type schema:Person
104 sg:person.0617564542.45 schema:affiliation https://www.grid.ac/institutes/grid.94225.38
105 schema:familyName Zheng
106 schema:givenName Ming
107 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0617564542.45
108 rdf:type schema:Person
109 sg:pub.10.1007/s41061-016-0098-z schema:sameAs https://app.dimensions.ai/details/publication/pub.1020123678
110 https://doi.org/10.1007/s41061-016-0098-z
111 rdf:type schema:CreativeWork
112 sg:pub.10.1038/nature08116 schema:sameAs https://app.dimensions.ai/details/publication/pub.1030619385
113 https://doi.org/10.1038/nature08116
114 rdf:type schema:CreativeWork
115 sg:pub.10.1038/ncomms1313 schema:sameAs https://app.dimensions.ai/details/publication/pub.1026419943
116 https://doi.org/10.1038/ncomms1313
117 rdf:type schema:CreativeWork
118 sg:pub.10.1038/nnano.2006.52 schema:sameAs https://app.dimensions.ai/details/publication/pub.1028729458
119 https://doi.org/10.1038/nnano.2006.52
120 rdf:type schema:CreativeWork
121 sg:pub.10.1038/nnano.2007.290 schema:sameAs https://app.dimensions.ai/details/publication/pub.1001853696
122 https://doi.org/10.1038/nnano.2007.290
123 rdf:type schema:CreativeWork
124 sg:pub.10.1038/nnano.2016.284 schema:sameAs https://app.dimensions.ai/details/publication/pub.1074189364
125 https://doi.org/10.1038/nnano.2016.284
126 rdf:type schema:CreativeWork
127 sg:pub.10.1038/s41524-018-0081-z schema:sameAs https://app.dimensions.ai/details/publication/pub.1103889265
128 https://doi.org/10.1038/s41524-018-0081-z
129 rdf:type schema:CreativeWork
130 sg:pub.10.1186/1471-2105-11-s1-s65 schema:sameAs https://app.dimensions.ai/details/publication/pub.1014920715
131 https://doi.org/10.1186/1471-2105-11-s1-s65
132 rdf:type schema:CreativeWork
133 sg:pub.10.1186/1471-2105-14-96 schema:sameAs https://app.dimensions.ai/details/publication/pub.1011961692
134 https://doi.org/10.1186/1471-2105-14-96
135 rdf:type schema:CreativeWork
136 sg:pub.10.1186/1471-2105-9-57 schema:sameAs https://app.dimensions.ai/details/publication/pub.1052408459
137 https://doi.org/10.1186/1471-2105-9-57
138 rdf:type schema:CreativeWork
139 sg:pub.10.1186/1556-276x-9-393 schema:sameAs https://app.dimensions.ai/details/publication/pub.1017985419
140 https://doi.org/10.1186/1556-276x-9-393
141 rdf:type schema:CreativeWork
142 https://doi.org/10.1002/adma.201401402 schema:sameAs https://app.dimensions.ai/details/publication/pub.1013707546
143 rdf:type schema:CreativeWork
144 https://doi.org/10.1016/0925-2312(91)90023-5 schema:sameAs https://app.dimensions.ai/details/publication/pub.1018872684
145 rdf:type schema:CreativeWork
146 https://doi.org/10.1016/j.snb.2014.10.040 schema:sameAs https://app.dimensions.ai/details/publication/pub.1008817454
147 rdf:type schema:CreativeWork
148 https://doi.org/10.1021/acs.jpcc.7b05168 schema:sameAs https://app.dimensions.ai/details/publication/pub.1090780579
149 rdf:type schema:CreativeWork
150 https://doi.org/10.1021/acs.jpclett.7b00583 schema:sameAs https://app.dimensions.ai/details/publication/pub.1084821146
151 rdf:type schema:CreativeWork
152 https://doi.org/10.1021/acs.langmuir.7b03186 schema:sameAs https://app.dimensions.ai/details/publication/pub.1100239536
153 rdf:type schema:CreativeWork
154 https://doi.org/10.1021/ja1084942 schema:sameAs https://app.dimensions.ai/details/publication/pub.1055848659
155 rdf:type schema:CreativeWork
156 https://doi.org/10.1021/ja504078b schema:sameAs https://app.dimensions.ai/details/publication/pub.1015988430
157 rdf:type schema:CreativeWork
158 https://doi.org/10.1021/ja5055498 schema:sameAs https://app.dimensions.ai/details/publication/pub.1033675014
159 rdf:type schema:CreativeWork
160 https://doi.org/10.1021/jacs.6b09135 schema:sameAs https://app.dimensions.ai/details/publication/pub.1055876287
161 rdf:type schema:CreativeWork
162 https://doi.org/10.1021/jp1051497 schema:sameAs https://app.dimensions.ai/details/publication/pub.1056077787
163 rdf:type schema:CreativeWork
164 https://doi.org/10.1021/jp309523a schema:sameAs https://app.dimensions.ai/details/publication/pub.1028940478
165 rdf:type schema:CreativeWork
166 https://doi.org/10.1021/la2007793 schema:sameAs https://app.dimensions.ai/details/publication/pub.1053622582
167 rdf:type schema:CreativeWork
168 https://doi.org/10.1021/la500013c schema:sameAs https://app.dimensions.ai/details/publication/pub.1013413296
169 rdf:type schema:CreativeWork
170 https://doi.org/10.1021/nl071909j schema:sameAs https://app.dimensions.ai/details/publication/pub.1022111912
171 rdf:type schema:CreativeWork
172 https://doi.org/10.1021/nl8022143 schema:sameAs https://app.dimensions.ai/details/publication/pub.1056221492
173 rdf:type schema:CreativeWork
174 https://doi.org/10.1021/nl802645d schema:sameAs https://app.dimensions.ai/details/publication/pub.1056221561
175 rdf:type schema:CreativeWork
176 https://doi.org/10.1039/c1an15179g schema:sameAs https://app.dimensions.ai/details/publication/pub.1000710041
177 rdf:type schema:CreativeWork
178 https://doi.org/10.1093/bioinformatics/btg005 schema:sameAs https://app.dimensions.ai/details/publication/pub.1043080454
179 rdf:type schema:CreativeWork
180 https://doi.org/10.1093/bioinformatics/btg1058 schema:sameAs https://app.dimensions.ai/details/publication/pub.1003963728
181 rdf:type schema:CreativeWork
182 https://doi.org/10.1093/bioinformatics/btr110 schema:sameAs https://app.dimensions.ai/details/publication/pub.1024363090
183 rdf:type schema:CreativeWork
184 https://doi.org/10.1110/ps.0239403 schema:sameAs https://app.dimensions.ai/details/publication/pub.1024845437
185 rdf:type schema:CreativeWork
186 https://doi.org/10.1111/j.2517-6161.1958.tb00292.x schema:sameAs https://app.dimensions.ai/details/publication/pub.1110457190
187 rdf:type schema:CreativeWork
188 https://doi.org/10.1126/science.1060928 schema:sameAs https://app.dimensions.ai/details/publication/pub.1013559716
189 rdf:type schema:CreativeWork
190 https://doi.org/10.1126/science.1144592 schema:sameAs https://app.dimensions.ai/details/publication/pub.1062456172
191 rdf:type schema:CreativeWork
192 https://doi.org/10.2307/2347628 schema:sameAs https://app.dimensions.ai/details/publication/pub.1101983400
193 rdf:type schema:CreativeWork
194 https://www.grid.ac/institutes/grid.259029.5 schema:alternateName Lehigh University
195 schema:name Department of Bioengineering, Lehigh University, 18015, Bethlehem, PA, USA
196 Department of Chemical & Biomolecular Engineering, Lehigh University, 18015, Bethlehem, PA, USA
197 rdf:type schema:Organization
198 https://www.grid.ac/institutes/grid.94225.38 schema:alternateName National Institute of Standards and Technology
199 schema:name Materials Science and Engineering Division, National Institute of Standards and Technology, 20899, Gaithersburg, MD, USA
200 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...