Ontology type: schema:ScholarlyArticle
Open Access: True
2013-12
AUTHORS: Yongchao Liu, Adrianto Wirawan, Bertil Schmidt
ABSTRACT:
BACKGROUND: The maximal sensitivity for local alignments makes the Smith-Waterman algorithm a popular choice for protein sequence database search based on pairwise alignment. However, the algorithm is compute-intensive due to a quadratic time complexity. Corresponding runtimes are further compounded by the rapid growth of sequence databases.
RESULTS: We present CUDASW++ 3.0, a fast Smith-Waterman protein database search algorithm, which couples CPU and GPU SIMD instructions and carries out concurrent CPU and GPU computations. For the CPU computation, this algorithm employs SSE-based vector execution units as accelerators. For the GPU computation, we have investigated for the first time a GPU SIMD parallelization, which employs CUDA PTX SIMD video instructions to gain more data parallelism beyond the SIMT execution model. Moreover, sequence alignment workloads are automatically distributed over CPUs and GPUs based on their respective compute capabilities. Evaluation on the Swiss-Prot database shows that CUDASW++ 3.0 gains a performance improvement over CUDASW++ 2.0 up to 2.9 and 3.2, with a maximum performance of 119.0 and 185.6 GCUPS, on a single-GPU GeForce GTX 680 and a dual-GPU GeForce GTX 690 graphics card, respectively. In addition, our algorithm has demonstrated significant speedups over other top-performing tools: SWIPE and BLAST+.
CONCLUSIONS: CUDASW++ 3.0 is written in CUDA C++ and PTX assembly languages, targeting GPUs based on the Kepler architecture. This algorithm obtains significant speedups over its predecessor: CUDASW++ 2.0, by benefiting from the use of CPU and GPU SIMD instructions as well as the concurrent execution on CPUs and GPUs. The source code and the simulated data are available at http://cudasw.sourceforge.net.
PAGES: 117
http://scigraph.springernature.com/pub.10.1186/1471-2105-14-117
DOI: http://dx.doi.org/10.1186/1471-2105-14-117
DIMENSIONS: https://app.dimensions.ai/details/publication/pub.1032649695
PUBMED: https://www.ncbi.nlm.nih.gov/pubmed/23557111
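The abstract refers to the quadratic cost of Smith-Waterman and reports throughput in GCUPS (giga cell updates per second). As a rough illustration only, the sketch below computes a local alignment score with a plain dynamic-programming loop and converts a run time into GCUPS. It is not the CUDASW++ 3.0 implementation: the scoring values (match 2, mismatch -1, linear gap -2) and the function names are assumptions made for this example, whereas real protein searches typically use a substitution matrix and affine gap penalties.

```python
def smith_waterman_score(query, subject, match=2, mismatch=-1, gap=-2):
    """Best local alignment score of two strings.

    Toy scoring (assumed for illustration): fixed match/mismatch values and
    a linear gap penalty. Runs in O(len(query) * len(subject)) time, which
    is the quadratic cost the abstract mentions.
    """
    prev = [0] * (len(subject) + 1)   # previous row of the DP matrix
    best = 0
    for q in query:
        curr = [0]                    # column 0 is always 0 for local alignment
        for j, s in enumerate(subject, start=1):
            diag = prev[j - 1] + (match if q == s else mismatch)
            up = prev[j] + gap        # gap in the subject
            left = curr[j - 1] + gap  # gap in the query
            cell = max(0, diag, up, left)  # clamp at 0: alignment may restart
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best


def gcups(query_len, db_residues, seconds):
    """Giga cell updates per second: DP cells computed divided by run time."""
    return query_len * db_residues / (seconds * 1e9)
```

For instance, `gcups(100, 1_000_000_000, 1.0)` corresponds to filling 10^11 matrix cells in one second. The two-row layout keeps memory linear in the subject length, which is a common choice when only the score is needed.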
JSON-LD is the canonical representation for SciGraph data.
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information Systems",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Algorithms",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Databases, Protein",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Sequence Alignment",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Sequence Analysis, Protein",
"type": "DefinedTerm"
},
{
"inDefinedTermSet": "https://www.nlm.nih.gov/mesh/",
"name": "Software",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "Johannes Gutenberg University of Mainz",
"id": "https://www.grid.ac/institutes/grid.5802.f",
"name": [
"Institut f\u00fcr Informatik, Johannes Gutenberg Universit\u00e4t Mainz, Mainz, Germany"
],
"type": "Organization"
},
"familyName": "Liu",
"givenName": "Yongchao",
"id": "sg:person.01356237607.16",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01356237607.16"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Johannes Gutenberg University of Mainz",
"id": "https://www.grid.ac/institutes/grid.5802.f",
"name": [
"Institut f\u00fcr Informatik, Johannes Gutenberg Universit\u00e4t Mainz, Mainz, Germany"
],
"type": "Organization"
},
"familyName": "Wirawan",
"givenName": "Adrianto",
"id": "sg:person.010056145157.83",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010056145157.83"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "Johannes Gutenberg University of Mainz",
"id": "https://www.grid.ac/institutes/grid.5802.f",
"name": [
"Institut f\u00fcr Informatik, Johannes Gutenberg Universit\u00e4t Mainz, Mainz, Germany"
],
"type": "Organization"
},
"familyName": "Schmidt",
"givenName": "Bertil",
"id": "sg:person.01165760171.33",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01165760171.33"
],
"type": "Person"
}
],
"citation": [
{
"id": "https://doi.org/10.1093/bioinformatics/bts061",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1004966652"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/bts276",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1007749674"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/13.2.145",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1008319186"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-9-377",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1009826728",
"https://doi.org/10.1186/1471-2105-9-377"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-8-85",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1010432928",
"https://doi.org/10.1186/1471-2105-8-85"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-12-221",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1010513713",
"https://doi.org/10.1186/1471-2105-12-221"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/s0022-2836(05)80360-2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1013618994"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btl582",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1014155557"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-12-181",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1015805408",
"https://doi.org/10.1186/1471-2105-12-181"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1756-0500-2-73",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1016918082",
"https://doi.org/10.1186/1756-0500-2-73"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1756-0500-1-107",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1023265691",
"https://doi.org/10.1186/1756-0500-1-107"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-8-185",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1024279614",
"https://doi.org/10.1186/1471-2105-8-185"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/0022-2836(81)90087-5",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1024589839"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/0022-2836(82)90398-9",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1025042064"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/16.8.699",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1025315480"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1007/s10339-012-0496-2",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1034768061",
"https://doi.org/10.1007/s10339-012-0496-2"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/bti508",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1034858023"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1073/pnas.85.8.2444",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1035928070"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1016/j.jcp.2010.02.009",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1037434697"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-13-196",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1038214039",
"https://doi.org/10.1186/1471-2105-13-196"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/bioinformatics/btp324",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1038266369"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1756-0500-3-93",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1041106303",
"https://doi.org/10.1186/1756-0500-3-93"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/22.22.4673",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1042438223"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-11-s12-s3",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1043261293",
"https://doi.org/10.1186/1471-2105-11-s12-s3"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1093/nar/25.17.3389",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1047265454"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-10-421",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1050579230",
"https://doi.org/10.1186/1471-2105-10-421"
],
"type": "CreativeWork"
},
{
"id": "sg:pub.10.1186/1471-2105-9-s2-s10",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1053056668",
"https://doi.org/10.1186/1471-2105-9-s2-s10"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/mm.2008.31",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1061408600"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/tcsii.2005.853340",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1061569178"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/tpds.2007.1059",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1061753097"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/tpds.2007.1069",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1061753107"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/ipdps.2011.182",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1093377377"
],
"type": "CreativeWork"
},
{
"id": "https://doi.org/10.1109/ipdps.2009.5160931",
"sameAs": [
"https://app.dimensions.ai/details/publication/pub.1094780600"
],
"type": "CreativeWork"
}
],
"datePublished": "2013-12",
"datePublishedReg": "2013-12-01",
"description": "BACKGROUND: The maximal sensitivity for local alignments makes the Smith-Waterman algorithm a popular choice for protein sequence database search based on pairwise alignment. However, the algorithm is compute-intensive due to a quadratic time complexity. Corresponding runtimes are further compounded by the rapid growth of sequence databases.\nRESULTS: We present CUDASW++ 3.0, a fast Smith-Waterman protein database search algorithm, which couples CPU and GPU SIMD instructions and carries out concurrent CPU and GPU computations. For the CPU computation, this algorithm employs SSE-based vector execution units as accelerators. For the GPU computation, we have investigated for the first time a GPU SIMD parallelization, which employs CUDA PTX SIMD video instructions to gain more data parallelism beyond the SIMT execution model. Moreover, sequence alignment workloads are automatically distributed over CPUs and GPUs based on their respective compute capabilities. Evaluation on the Swiss-Prot database shows that CUDASW++ 3.0 gains a performance improvement over CUDASW++ 2.0 up to 2.9 and 3.2, with a maximum performance of 119.0 and 185.6 GCUPS, on a single-GPU GeForce GTX 680 and a dual-GPU GeForce GTX 690 graphics card, respectively. In addition, our algorithm has demonstrated significant speedups over other top-performing tools: SWIPE and BLAST+.\nCONCLUSIONS: CUDASW++ 3.0 is written in CUDA C++ and PTX assembly languages, targeting GPUs based on the Kepler architecture. This algorithm obtains significant speedups over its predecessor: CUDASW++ 2.0, by benefiting from the use of CPU and GPU SIMD instructions as well as the concurrent execution on CPUs and GPUs. The source code and the simulated data are available at http://cudasw.sourceforge.net.",
"genre": "research_article",
"id": "sg:pub.10.1186/1471-2105-14-117",
"inLanguage": [
"en"
],
"isAccessibleForFree": true,
"isPartOf": [
{
"id": "sg:journal.1023786",
"issn": [
"1471-2105"
],
"name": "BMC Bioinformatics",
"type": "Periodical"
},
{
"issueNumber": "1",
"type": "PublicationIssue"
},
{
"type": "PublicationVolume",
"volumeNumber": "14"
}
],
"name": "CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions",
"pagination": "117",
"productId": [
{
"name": "readcube_id",
"type": "PropertyValue",
"value": [
"9b8a4d7546cd41184e353c2514e48dd44bb9a61387f7f8356fac6ad0c8c8f889"
]
},
{
"name": "pubmed_id",
"type": "PropertyValue",
"value": [
"23557111"
]
},
{
"name": "nlm_unique_id",
"type": "PropertyValue",
"value": [
"100965194"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1186/1471-2105-14-117"
]
},
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1032649695"
]
}
],
"sameAs": [
"https://doi.org/10.1186/1471-2105-14-117",
"https://app.dimensions.ai/details/publication/pub.1032649695"
],
"sdDataset": "articles",
"sdDatePublished": "2019-04-10T13:14",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8659_00000506.jsonl",
"type": "ScholarlyArticle",
"url": "http://link.springer.com/10.1186/1471-2105-14-117"
}
]
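Because the record is plain JSON, it can be read with any JSON parser. The sketch below is one possible way to pull out author names, a product identifier, and the cited work ids. It runs on a trimmed copy of the record embedded as a string; the helper names are hypothetical, and the code assumes the field layout shown above (`author`, `citation`, `productId`) rather than any documented SciGraph client API.

```python
import json

# Trimmed excerpt of the record above, kept small so the example is self-contained.
RECORD_JSON = """
[
  {
    "author": [
      {"familyName": "Liu", "givenName": "Yongchao", "type": "Person"},
      {"familyName": "Wirawan", "givenName": "Adrianto", "type": "Person"},
      {"familyName": "Schmidt", "givenName": "Bertil", "type": "Person"}
    ],
    "citation": [
      {"id": "sg:pub.10.1186/1471-2105-9-377", "type": "CreativeWork"},
      {"id": "sg:pub.10.1186/1471-2105-9-377", "type": "CreativeWork"},
      {"id": "https://doi.org/10.1093/bioinformatics/bts061", "type": "CreativeWork"}
    ],
    "productId": [
      {"name": "pubmed_id", "type": "PropertyValue", "value": ["23557111"]},
      {"name": "doi", "type": "PropertyValue", "value": ["10.1186/1471-2105-14-117"]}
    ]
  }
]
"""


def author_names(record):
    """Return 'Given Family' strings in the order the record lists them."""
    return [f"{a['givenName']} {a['familyName']}" for a in record.get("author", [])]


def product_id(record, name):
    """Return the first value of the productId entry called `name`, or None."""
    for entry in record.get("productId", []):
        if entry.get("name") == name and entry.get("value"):
            return entry["value"][0]
    return None


def unique_citation_ids(record):
    """Return citation ids in first-seen order, skipping repeated entries."""
    seen = {}
    for c in record.get("citation", []):
        seen.setdefault(c["id"], None)  # dicts preserve insertion order
    return list(seen)


record = json.loads(RECORD_JSON)[0]  # the top-level value is a one-element list
```

With the excerpt above, `author_names(record)` yields the three authors and `product_id(record, "doi")` returns the DOI string. The de-duplication step is there in case a cited work is listed more than once.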
The RDF metadata can be downloaded as JSON-LD, N-Triples, Turtle, or RDF/XML:
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-14-117'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-14-117'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-14-117'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1186/1471-2105-14-117'
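The four curl commands differ only in the `Accept` header, which is ordinary HTTP content negotiation. A minimal Python sketch of the same idea follows, using only the standard library. The short format keys and the `build_request` helper are names chosen for this example, and the request is only constructed, not sent, since whether the endpoint still responds is not something this page confirms.

```python
from urllib.request import Request

RECORD_URL = "https://scigraph.springernature.com/pub.10.1186/1471-2105-14-117"

# Media types copied from the curl commands above; the keys are
# hypothetical labels used only in this sketch.
ACCEPT = {
    "json-ld": "application/ld+json",
    "nt": "application/n-triples",
    "turtle": "text/turtle",
    "xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Build (but do not send) a GET request asking for one RDF serialization."""
    try:
        accept = ACCEPT[fmt]
    except KeyError:
        raise ValueError(f"unknown format: {fmt!r}") from None
    return Request(url, headers={"Accept": accept})
```

Passing the result to `urllib.request.urlopen(build_request("turtle"))` would perform the same request as the Turtle curl line, assuming the server is reachable.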
This table displays all metadata directly associated with this object as RDF triples.
214 TRIPLES
21 PREDICATES
67 URIs
26 LITERALS
14 BLANK NODES
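Summary figures of this kind can be derived from an N-Triples export by counting terms. The sketch below shows one way to do so on a made-up four-line sample. It is a simplified line parser written for this example, not a full N-Triples implementation, and it counts distinct predicates, URIs, literals, and blank nodes; how the page computes its own numbers is not stated, so the counting rules here are an assumption.

```python
import re

# subject (IRI or blank node), predicate (IRI), object (anything up to the final dot)
TRIPLE = re.compile(r'^(<[^>]*>|_:\S+)\s+(<[^>]*>)\s+(.+?)\s*\.\s*$')


def summarize(ntriples_text):
    """Count triples and distinct predicates, URIs, literals, and blank nodes."""
    triples = 0
    predicates, uris, literals, blanks = set(), set(), set(), set()
    for line in ntriples_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blank lines and comments
            continue
        m = TRIPLE.match(line)
        if m is None:
            raise ValueError(f"not a triple: {line!r}")
        triples += 1
        subject, predicate, obj = m.groups()
        predicates.add(predicate)
        for term in (subject, predicate, obj):
            if term.startswith("<"):
                uris.add(term)
            elif term.startswith("_:"):
                blanks.add(term)
            else:
                # Literal, kept whole; a datatype IRI suffix is not counted as a URI.
                literals.add(term)
    return {
        "triples": triples,
        "predicates": len(predicates),
        "uris": len(uris),
        "literals": len(literals),
        "blank_nodes": len(blanks),
    }


# Made-up sample data (example.org URIs), not taken from the record.
SAMPLE = """
<http://example.org/a> <http://example.org/name> "CUDASW++ 3.0" .
<http://example.org/a> <http://example.org/author> _:b0 .
_:b0 <http://example.org/name> "Liu" .
_:b0 <http://example.org/type> <http://example.org/Person> .
"""
```

Calling `summarize(SAMPLE)` returns a dictionary with the five counts for the sample. For real data, a dedicated RDF library would be a safer choice than this regular expression.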