Ontology type: schema:Chapter Open Access: True
2010-11-30
AUTHORSElaine Angelino , Daniel Yamins , Margo Seltzer
ABSTRACTWe introduce StarFlow, a script-centric environment for data analysis. StarFlow has four main features: (1) extraction of control and data-flow dependencies through a novel combination of static analysis, dynamic runtime analysis, and user annotations, (2) command-line tools for exploring and propagating changes through the resulting dependency network, (3) support for workflow abstractions enabling robust parallel executions of complex analysis pipelines, and (4) a seamless interface with the Python scripting language. We describe real applications of StarFlow, including automatic parallelization of complex workflows in the cloud. More... »
PAGES236-250
Provenance and Annotation of Data and Processes
ISBN
978-3-642-17818-4
978-3-642-17819-1
http://scigraph.springernature.com/pub.10.1007/978-3-642-17819-1_27
DOIhttp://dx.doi.org/10.1007/978-3-642-17819-1_27
DIMENSIONShttps://app.dimensions.ai/details/publication/pub.1021418233
JSON-LD is the canonical representation for SciGraph data.
TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT
[
{
"@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json",
"about": [
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information and Computing Sciences",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Artificial Intelligence and Image Processing",
"type": "DefinedTerm"
},
{
"id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806",
"inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/",
"name": "Information Systems",
"type": "DefinedTerm"
}
],
"author": [
{
"affiliation": {
"alternateName": "School of Engineering and Applied Sciences, Harvard University, 33 Oxford St., 02138, Cambridge, MA, USA",
"id": "http://www.grid.ac/institutes/grid.38142.3c",
"name": [
"School of Engineering and Applied Sciences, Harvard University, 33 Oxford St., 02138, Cambridge, MA, USA"
],
"type": "Organization"
},
"familyName": "Angelino",
"givenName": "Elaine",
"id": "sg:person.013503020776.43",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013503020776.43"
],
"type": "Person"
},
{
"affiliation": {
"alternateName": "School of Engineering and Applied Sciences, Harvard University, 33 Oxford St., 02138, Cambridge, MA, USA",
"id": "http://www.grid.ac/institutes/grid.38142.3c",
"name": [
"School of Engineering and Applied Sciences, Harvard University, 33 Oxford St., 02138, Cambridge, MA, USA"
],
"type": "Organization"
},
"familyName": "Yamins",
"givenName": "Daniel",
"type": "Person"
},
{
"affiliation": {
"alternateName": "School of Engineering and Applied Sciences, Harvard University, 33 Oxford St., 02138, Cambridge, MA, USA",
"id": "http://www.grid.ac/institutes/grid.38142.3c",
"name": [
"School of Engineering and Applied Sciences, Harvard University, 33 Oxford St., 02138, Cambridge, MA, USA"
],
"type": "Organization"
},
"familyName": "Seltzer",
"givenName": "Margo",
"id": "sg:person.01344037330.14",
"sameAs": [
"https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01344037330.14"
],
"type": "Person"
}
],
"datePublished": "2010-11-30",
"datePublishedReg": "2010-11-30",
"description": "We introduce StarFlow, a script-centric environment for data analysis. StarFlow has four main features: (1) extraction of control and data-flow dependencies through a novel combination of static analysis, dynamic runtime analysis, and user annotations, (2) command-line tools for exploring and propagating changes through the resulting dependency network, (3) support for workflow abstractions enabling robust parallel executions of complex analysis pipelines, and (4) a seamless interface with the Python scripting language. We describe real applications of StarFlow, including automatic parallelization of complex workflows in the cloud.",
"editor": [
{
"familyName": "McGuinness",
"givenName": "Deborah L.",
"type": "Person"
},
{
"familyName": "Michaelis",
"givenName": "James R.",
"type": "Person"
},
{
"familyName": "Moreau",
"givenName": "Luc",
"type": "Person"
}
],
"genre": "chapter",
"id": "sg:pub.10.1007/978-3-642-17819-1_27",
"isAccessibleForFree": true,
"isPartOf": {
"isbn": [
"978-3-642-17818-4",
"978-3-642-17819-1"
],
"name": "Provenance and Annotation of Data and Processes",
"type": "Book"
},
"keywords": [
"data flow dependencies",
"data analysis environment",
"command-line tool",
"Python scripting language",
"complex analysis pipelines",
"workflow abstractions",
"user annotations",
"automatic parallelization",
"scripting language",
"complex workflows",
"parallel execution",
"analysis environment",
"dependency network",
"runtime analysis",
"real applications",
"seamless interface",
"analysis pipeline",
"static analysis",
"data analysis",
"StarFlow",
"parallelization",
"main features",
"novel combination",
"execution",
"workflow",
"environment",
"annotation",
"cloud",
"abstraction",
"network",
"language",
"pipeline",
"interface",
"tool",
"extraction",
"applications",
"dependency",
"features",
"support",
"analysis",
"control",
"combination",
"changes"
],
"name": "StarFlow: A Script-Centric Data Analysis Environment",
"pagination": "236-250",
"productId": [
{
"name": "dimensions_id",
"type": "PropertyValue",
"value": [
"pub.1021418233"
]
},
{
"name": "doi",
"type": "PropertyValue",
"value": [
"10.1007/978-3-642-17819-1_27"
]
}
],
"publisher": {
"name": "Springer Nature",
"type": "Organisation"
},
"sameAs": [
"https://doi.org/10.1007/978-3-642-17819-1_27",
"https://app.dimensions.ai/details/publication/pub.1021418233"
],
"sdDataset": "chapters",
"sdDatePublished": "2022-08-04T17:15",
"sdLicense": "https://scigraph.springernature.com/explorer/license/",
"sdPublisher": {
"name": "Springer Nature - SN SciGraph project",
"type": "Organization"
},
"sdSource": "s3://com-springernature-scigraph/baseset/20220804/entities/gbq_results/chapter/chapter_172.jsonl",
"type": "Chapter",
"url": "https://doi.org/10.1007/978-3-642-17819-1_27"
}
]
Download the RDF metadata as: json-ld nt turtle xml License info
JSON-LD is a popular format for linked data which is fully compatible with JSON.
curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-17819-1_27'
N-Triples is a line-based linked data format ideal for batch operations.
curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-17819-1_27'
Turtle is a human-readable linked data format.
curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-17819-1_27'
RDF/XML is a standard XML format for linked data.
curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-642-17819-1_27'
This table displays all metadata directly associated to this object as RDF triples.
129 TRIPLES
22 PREDICATES
68 URIs
60 LITERALS
7 BLANK NODES