noWorkflow: Capturing and Analyzing Provenance of Scripts


Ontology type: schema:Chapter
Open Access: True


Chapter Info

DATE

2015-03-21

AUTHORS

Leonardo Murta , Vanessa Braganholo , Fernando Chirigati , David Koop , Juliana Freire

ABSTRACT

We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work – users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, including detailed information about the underlying libraries. We describe how noWorkflow captures multiple kinds of provenance and the different classes of analyses it supports: graph-based visualization; differencing over provenance trails; and inference queries.

PAGES

71-83

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/978-3-319-16462-5_6

DOI

http://dx.doi.org/10.1007/978-3-319-16462-5_6

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1015501966



JSON-LD is the canonical representation for SciGraph data.


[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0803", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Computer Software", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Universidade Federal Fluminense, Niter\u00f3i, Brazil", 
          "id": "http://www.grid.ac/institutes/grid.411173.1", 
          "name": [
            "Universidade Federal Fluminense, Niter\u00f3i, Brazil"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Murta", 
        "givenName": "Leonardo", 
        "id": "sg:person.010453714753.09", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010453714753.09"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Universidade Federal Fluminense, Niter\u00f3i, Brazil", 
          "id": "http://www.grid.ac/institutes/grid.411173.1", 
          "name": [
            "Universidade Federal Fluminense, Niter\u00f3i, Brazil"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Braganholo", 
        "givenName": "Vanessa", 
        "id": "sg:person.010454345403.82", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010454345403.82"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "New York University, New York, USA", 
          "id": "http://www.grid.ac/institutes/grid.137628.9", 
          "name": [
            "New York University, New York, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Chirigati", 
        "givenName": "Fernando", 
        "id": "sg:person.013726616522.62", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013726616522.62"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "New York University, New York, USA", 
          "id": "http://www.grid.ac/institutes/grid.137628.9", 
          "name": [
            "New York University, New York, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Koop", 
        "givenName": "David", 
        "id": "sg:person.07476600303.53", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07476600303.53"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "New York University, New York, USA", 
          "id": "http://www.grid.ac/institutes/grid.137628.9", 
          "name": [
            "New York University, New York, USA"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Freire", 
        "givenName": "Juliana", 
        "id": "sg:person.0627437140.61", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0627437140.61"
        ], 
        "type": "Person"
      }
    ], 
    "datePublished": "2015-03-21", 
    "datePublishedReg": "2015-03-21", 
    "description": "We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work \u2013 users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, including detailed information about the underlying libraries. We describe how noWorkflow captures multiple kinds of provenance and the different classes of analyses it supports: graph-based visualization; differencing over provenance trails; and inference queries.", 
    "editor": [
      {
        "familyName": "Lud\u00e4scher", 
        "givenName": "Bertram", 
        "type": "Person"
      }, 
      {
        "familyName": "Plale", 
        "givenName": "Beth", 
        "type": "Person"
      }
    ], 
    "genre": "chapter", 
    "id": "sg:pub.10.1007/978-3-319-16462-5_6", 
    "isAccessibleForFree": true, 
    "isPartOf": {
      "isbn": [
        "978-3-319-16461-8", 
        "978-3-319-16462-5"
      ], 
      "name": "Provenance and Annotation of Data and Processes", 
      "type": "Book"
    }, 
    "keywords": [
      "software engineering techniques", 
      "scientific workflow systems", 
      "version control systems", 
      "graph-based visualization", 
      "inference queries", 
      "workflow systems", 
      "provenance trails", 
      "multiple kinds", 
      "noWorkflow", 
      "engineering techniques", 
      "control system", 
      "scripts", 
      "queries", 
      "different classes", 
      "users", 
      "tool", 
      "capturing", 
      "system", 
      "visualization", 
      "provenance", 
      "tree analysis", 
      "different types", 
      "information", 
      "library", 
      "detailed information", 
      "technique", 
      "kind", 
      "way", 
      "experiments", 
      "class", 
      "analysis", 
      "trails", 
      "types", 
      "profiling", 
      "reproducibility", 
      "reflection", 
      "approach"
    ], 
    "name": "noWorkflow: Capturing and Analyzing Provenance of Scripts", 
    "pagination": "71-83", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1015501966"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/978-3-319-16462-5_6"
        ]
      }
    ], 
    "publisher": {
      "name": "Springer Nature", 
      "type": "Organisation"
    }, 
    "sameAs": [
      "https://doi.org/10.1007/978-3-319-16462-5_6", 
      "https://app.dimensions.ai/details/publication/pub.1015501966"
    ], 
    "sdDataset": "chapters", 
    "sdDatePublished": "2022-08-04T17:18", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220804/entities/gbq_results/chapter/chapter_313.jsonl", 
    "type": "Chapter", 
    "url": "https://doi.org/10.1007/978-3-319-16462-5_6"
  }
]
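As a minimal sketch of how a record like the one above might be read in code, the following Python snippet parses an abbreviated copy of the JSON-LD and pulls out the author names and product identifiers. The constant `RECORD_JSON` and the helpers `author_names` and `product_ids` are hypothetical names introduced for illustration; only the field names (`author`, `givenName`, `familyName`, `productId`, `name`, `value`) come from the record itself.

```python
import json

# Abbreviated copy of the record above: only the fields this sketch reads.
RECORD_JSON = """
[
  {
    "name": "noWorkflow: Capturing and Analyzing Provenance of Scripts",
    "pagination": "71-83",
    "author": [
      {"familyName": "Murta", "givenName": "Leonardo"},
      {"familyName": "Braganholo", "givenName": "Vanessa"},
      {"familyName": "Chirigati", "givenName": "Fernando"},
      {"familyName": "Koop", "givenName": "David"},
      {"familyName": "Freire", "givenName": "Juliana"}
    ],
    "productId": [
      {"name": "dimensions_id", "value": ["pub.1015501966"]},
      {"name": "doi", "value": ["10.1007/978-3-319-16462-5_6"]}
    ]
  }
]
"""


def author_names(record):
    """Return 'Given Family' strings in the order the record lists them."""
    return [f"{a['givenName']} {a['familyName']}" for a in record.get("author", [])]


def product_ids(record):
    """Map each productId name (e.g. 'doi') to its first value."""
    return {p["name"]: p["value"][0] for p in record.get("productId", [])}


# The top-level JSON value is a list holding a single record object.
record = json.loads(RECORD_JSON)[0]

if __name__ == "__main__":
    print(author_names(record))
    print(product_ids(record))
```

Because JSON-LD is plain JSON, the standard `json` module is enough for this kind of field access; a dedicated RDF library would only be needed to resolve the `@context` or to work with the data as triples.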
 


HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular linked data format that is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-16462-5_6'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-16462-5_6'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-16462-5_6'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-16462-5_6'
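The four curl commands above differ only in the `Accept` header, so the same content negotiation can be sketched in Python with the standard library. This is an illustrative sketch, not an official client: `MEDIA_TYPES`, `build_request`, and `fetch` are hypothetical names, and `fetch` assumes the endpoint is reachable and responds as the curl examples suggest.

```python
import urllib.request

RECORD_URL = "https://scigraph.springernature.com/pub.10.1007/978-3-319-16462-5_6"

# Media types taken from the curl commands above, keyed by a short label.
MEDIA_TYPES = {
    "json-ld": "application/ld+json",
    "n-triples": "application/n-triples",
    "turtle": "text/turtle",
    "rdf-xml": "application/rdf+xml",
}


def build_request(fmt, url=RECORD_URL):
    """Return a GET request whose Accept header asks for the given format."""
    return urllib.request.Request(url, headers={"Accept": MEDIA_TYPES[fmt]})


def fetch(fmt, url=RECORD_URL, timeout=30):
    """Send the request and return the body as text (requires network access)."""
    with urllib.request.urlopen(build_request(fmt, url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `fetch("turtle")` would then correspond to the Turtle curl command; building the request separately from sending it keeps the header logic easy to check without touching the network.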


 

This table displays all metadata directly associated with this object as RDF triples.

132 TRIPLES      22 PREDICATES      61 URIs      54 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/978-3-319-16462-5_6 schema:about anzsrc-for:08
2 anzsrc-for:0803
3 schema:author N7c4871d2ab0643108aa4e9060cf7124e
4 schema:datePublished 2015-03-21
5 schema:datePublishedReg 2015-03-21
6 schema:description We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work – users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, including detailed information about the underlying libraries. We describe how noWorkflow captures multiple kinds of provenance and the different classes of analyses it supports: graph-based visualization; differencing over provenance trails; and inference queries.
7 schema:editor N73373ab951924c778e2b8506c9da8abb
8 schema:genre chapter
9 schema:isAccessibleForFree true
10 schema:isPartOf N89c0225d5c1c42039100c00ada3e2b1d
11 schema:keywords analysis
12 approach
13 capturing
14 class
15 control system
16 detailed information
17 different classes
18 different types
19 engineering techniques
20 experiments
21 graph-based visualization
22 inference queries
23 information
24 kind
25 library
26 multiple kinds
27 noWorkflow
28 profiling
29 provenance
30 provenance trails
31 queries
32 reflection
33 reproducibility
34 scientific workflow systems
35 scripts
36 software engineering techniques
37 system
38 technique
39 tool
40 trails
41 tree analysis
42 types
43 users
44 version control systems
45 visualization
46 way
47 workflow systems
48 schema:name noWorkflow: Capturing and Analyzing Provenance of Scripts
49 schema:pagination 71-83
50 schema:productId N2264e335edd74e0fb8558196339fda35
51 Nc1d346b2f8e944a7be3f46185cf206ad
52 schema:publisher N840842fe11404a2b9b3089e76fcd7355
53 schema:sameAs https://app.dimensions.ai/details/publication/pub.1015501966
54 https://doi.org/10.1007/978-3-319-16462-5_6
55 schema:sdDatePublished 2022-08-04T17:18
56 schema:sdLicense https://scigraph.springernature.com/explorer/license/
57 schema:sdPublisher N81c1cff83e4c4b8aa429a2926f13451c
58 schema:url https://doi.org/10.1007/978-3-319-16462-5_6
59 sgo:license sg:explorer/license/
60 sgo:sdDataset chapters
61 rdf:type schema:Chapter
62 N134bd00533404c6da3cf299638ae4a9f rdf:first sg:person.010454345403.82
63 rdf:rest N67d89ed3ed4b4361999ecbff25739eda
64 N2264e335edd74e0fb8558196339fda35 schema:name dimensions_id
65 schema:value pub.1015501966
66 rdf:type schema:PropertyValue
67 N4555ec2bae0644a7a172d4049a5ff25e rdf:first Nf7ff832ed0034dc99229cfe8ab223d3c
68 rdf:rest rdf:nil
69 N460e59afe41a4c67ba316af8c30d1c55 schema:familyName Ludäscher
70 schema:givenName Bertram
71 rdf:type schema:Person
72 N67d89ed3ed4b4361999ecbff25739eda rdf:first sg:person.013726616522.62
73 rdf:rest N7044395166db4e6fa219ab77ffa238de
74 N7044395166db4e6fa219ab77ffa238de rdf:first sg:person.07476600303.53
75 rdf:rest Ncba0de2d8b12484fb6fef06dd052c926
76 N73373ab951924c778e2b8506c9da8abb rdf:first N460e59afe41a4c67ba316af8c30d1c55
77 rdf:rest N4555ec2bae0644a7a172d4049a5ff25e
78 N7c4871d2ab0643108aa4e9060cf7124e rdf:first sg:person.010453714753.09
79 rdf:rest N134bd00533404c6da3cf299638ae4a9f
80 N81c1cff83e4c4b8aa429a2926f13451c schema:name Springer Nature - SN SciGraph project
81 rdf:type schema:Organization
82 N840842fe11404a2b9b3089e76fcd7355 schema:name Springer Nature
83 rdf:type schema:Organisation
84 N89c0225d5c1c42039100c00ada3e2b1d schema:isbn 978-3-319-16461-8
85 978-3-319-16462-5
86 schema:name Provenance and Annotation of Data and Processes
87 rdf:type schema:Book
88 Nc1d346b2f8e944a7be3f46185cf206ad schema:name doi
89 schema:value 10.1007/978-3-319-16462-5_6
90 rdf:type schema:PropertyValue
91 Ncba0de2d8b12484fb6fef06dd052c926 rdf:first sg:person.0627437140.61
92 rdf:rest rdf:nil
93 Nf7ff832ed0034dc99229cfe8ab223d3c schema:familyName Plale
94 schema:givenName Beth
95 rdf:type schema:Person
96 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
97 schema:name Information and Computing Sciences
98 rdf:type schema:DefinedTerm
99 anzsrc-for:0803 schema:inDefinedTermSet anzsrc-for:
100 schema:name Computer Software
101 rdf:type schema:DefinedTerm
102 sg:person.010453714753.09 schema:affiliation grid-institutes:grid.411173.1
103 schema:familyName Murta
104 schema:givenName Leonardo
105 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010453714753.09
106 rdf:type schema:Person
107 sg:person.010454345403.82 schema:affiliation grid-institutes:grid.411173.1
108 schema:familyName Braganholo
109 schema:givenName Vanessa
110 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.010454345403.82
111 rdf:type schema:Person
112 sg:person.013726616522.62 schema:affiliation grid-institutes:grid.137628.9
113 schema:familyName Chirigati
114 schema:givenName Fernando
115 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013726616522.62
116 rdf:type schema:Person
117 sg:person.0627437140.61 schema:affiliation grid-institutes:grid.137628.9
118 schema:familyName Freire
119 schema:givenName Juliana
120 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0627437140.61
121 rdf:type schema:Person
122 sg:person.07476600303.53 schema:affiliation grid-institutes:grid.137628.9
123 schema:familyName Koop
124 schema:givenName David
125 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07476600303.53
126 rdf:type schema:Person
127 grid-institutes:grid.137628.9 schema:alternateName New York University, New York, USA
128 schema:name New York University, New York, USA
129 rdf:type schema:Organization
130 grid-institutes:grid.411173.1 schema:alternateName Universidade Federal Fluminense, Niterói, Brazil
131 schema:name Universidade Federal Fluminense, Niterói, Brazil
132 rdf:type schema:Organization
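In the table above, the author order is not stored as an array but as an RDF list: `schema:author` points to a blank node, and each blank node carries an `rdf:first` (the item) and an `rdf:rest` (the next node, or `rdf:nil` at the end). The sketch below walks that chain in Python. The node identifiers and person URIs are copied from the table; the dictionary layout and the function name `walk_rdf_list` are assumptions made for illustration.

```python
RDF_NIL = "rdf:nil"

# Head of the author list: the object of schema:author in the table above.
AUTHOR_HEAD = "N7c4871d2ab0643108aa4e9060cf7124e"

# blank node -> (rdf:first, rdf:rest), copied from the table above.
AUTHOR_NODES = {
    "N7c4871d2ab0643108aa4e9060cf7124e": ("sg:person.010453714753.09", "N134bd00533404c6da3cf299638ae4a9f"),
    "N134bd00533404c6da3cf299638ae4a9f": ("sg:person.010454345403.82", "N67d89ed3ed4b4361999ecbff25739eda"),
    "N67d89ed3ed4b4361999ecbff25739eda": ("sg:person.013726616522.62", "N7044395166db4e6fa219ab77ffa238de"),
    "N7044395166db4e6fa219ab77ffa238de": ("sg:person.07476600303.53", "Ncba0de2d8b12484fb6fef06dd052c926"),
    "Ncba0de2d8b12484fb6fef06dd052c926": ("sg:person.0627437140.61", RDF_NIL),
}

# schema:familyName for each person URI, also from the table.
FAMILY_NAMES = {
    "sg:person.010453714753.09": "Murta",
    "sg:person.010454345403.82": "Braganholo",
    "sg:person.013726616522.62": "Chirigati",
    "sg:person.07476600303.53": "Koop",
    "sg:person.0627437140.61": "Freire",
}


def walk_rdf_list(head, nodes):
    """Follow rdf:rest links from head until rdf:nil, collecting rdf:first values."""
    items = []
    seen = set()
    node = head
    while node != RDF_NIL:
        if node in seen:
            # Guard against malformed data that loops back on itself.
            raise ValueError(f"cycle detected at {node}")
        seen.add(node)
        first, rest = nodes[node]
        items.append(first)
        node = rest
    return items


ordered_authors = [FAMILY_NAMES[p] for p in walk_rdf_list(AUTHOR_HEAD, AUTHOR_NODES)]

if __name__ == "__main__":
    print(ordered_authors)
```

Walking the list recovers the same author order as the `author` array in the JSON-LD record, which is why the triples need the `rdf:first`/`rdf:rest` chain: a plain set of `schema:author` triples would carry no ordering.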
 



