CAREER: Querying Evolving Graphs View Homepage


Ontology type: schema:MonetaryGrant     


Grant Info

YEARS

2018-2023

FUNDING AMOUNT

497808.0 USD

ABSTRACT

Graphs are used to represent a plethora of phenomena, including the Web, social networks, biological pathways, transportation networks, and semantic knowledge bases. Many interesting and important questions about graphs concern their evolution rather than their static state: Which Web pages are showing an increasing popularity trend? How does influence propagate in social networks? How do the utilization of transportation options and the cost of ridership in a city change during the day and throughout the week? How does knowledge evolve? Formulating these questions as programs is currently beyond the skills of most data scientists. Executing such programs poses tremendous efficiency challenges, especially for graphs with billions of edges, and with non-trivial evolution rates. Much research and engineering effort today goes into developing sophisticated graph analytics and their efficient implementations, both stand-alone and in scope of data processing platforms. Yet, systematic support for querying and analysis of evolving graphs is still lacking. This support is urgently needed, due both to the scalability challenges inherent in evolving graph analysis, and to considerations of usability and ease of dissemination. This project will fill this gap by establishing the fundamental principles of effective modeling and efficient analysis of evolving graphs, and by making results available to the community of use in an open-source platform called Portal. This project will build on the state of the art in temporal data management, making the principles and techniques that were developed over decades of research and practice in that domain available to evolving graph applications. The project will develop: (1) a data model for evolving graphs and an expressive compositional algebra; (2) an efficient implementation of the data structures and of the algebraic operations, together with any necessary algebraic primitives and physical representations / access methods, in scope of a distributed data-parallel framework; (3) a declarative query language that supports concise specification of sophisticated graph analysis tasks, and a query optimizer that generates efficient query execution plans; (4) a principled evaluation methodology of usability and efficiency, based on real and synthetic datasets and analysis tasks. This project will impact research and practice in data management, by contributing novel representation, analysis and benchmarking methods for evolving graph data. Results of this project will help incorporate sophisticated evolving graph analysis into larger applications, and will enable scaling up to modern volumes. The Portal framework will support computational and data scientists who work with evolving graphs in social network analysis, knowledge management and network traffic analysis. A prominent set of use cases for this work will come from data science for social-good applications, including urban homelessness and analysis of transportation utilization and cost in cities. For further information see the project web page: portaldb.github.io. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria. More... »

URL

http://www.nsf.gov/awardsearch/showAward?AWD_ID=1916505&HistoricalAwards=false

Related SciGraph Publications

  • 2022-02-23. BugDoc in THE VLDB JOURNAL
  • 2022-01-31. Data distribution debugging in machine learning pipelines in THE VLDB JOURNAL
  • JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "type": "DefinedTerm"
          }
        ], 
        "amount": {
          "currency": "USD", 
          "type": "MonetaryAmount", 
          "value": 497808.0
        }, 
        "description": "Graphs are used to represent a plethora of phenomena, including the Web, social networks, biological pathways, transportation networks, and semantic knowledge bases. Many interesting and important questions about graphs concern their evolution rather than their static state: Which Web pages are showing an increasing popularity trend? How does influence propagate in social networks? How do the utilization of transportation options and the cost of ridership in a city change during the day and throughout the week? How does knowledge evolve? Formulating these questions as programs is currently beyond the skills of most data scientists. Executing such programs poses tremendous efficiency challenges, especially for graphs with billions of edges, and with non-trivial evolution rates. Much research and engineering effort today goes into developing sophisticated graph analytics and their efficient implementations, both stand-alone and in scope of data processing platforms. Yet, systematic support for querying and analysis of evolving graphs is still lacking. This support is urgently needed, due both to the scalability challenges inherent in evolving graph analysis, and to considerations of usability and ease of dissemination. This project will fill this gap by establishing the fundamental principles of effective modeling and efficient analysis of evolving graphs, and by making results available to the community of use in an open-source platform called Portal. This project will build on the state of the art in temporal data management, making the principles and techniques that were developed over decades of research and practice in that domain available to evolving graph applications. The project will develop: (1) a data model for evolving graphs and an expressive compositional algebra; (2) an efficient implementation of the data structures and of the algebraic operations, together with any necessary algebraic primitives and physical representations / access methods, in scope of a distributed data-parallel framework; (3) a declarative query language that supports concise specification of sophisticated graph analysis tasks, and a query optimizer that generates efficient query execution plans; (4) a principled evaluation methodology of usability and efficiency, based on real and synthetic datasets and analysis tasks. This project will impact research and practice in data management, by contributing novel representation, analysis and benchmarking methods for evolving graph data. Results of this project will help incorporate sophisticated evolving graph analysis into larger applications, and will enable scaling up to modern volumes. The Portal framework will support computational and data scientists who work with evolving graphs in social network analysis, knowledge management and network traffic analysis. A prominent set of use cases for this work will come from data science for social-good applications, including urban homelessness and analysis of transportation utilization and cost in cities. For further information see the project web page: portaldb.github.io. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.", 
        "endDate": "2023-02-28", 
        "funder": {
          "id": "http://www.grid.ac/institutes/grid.457785.c", 
          "type": "Organization"
        }, 
        "id": "sg:grant.7923555", 
        "identifier": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "grant.7923555"
            ]
          }, 
          {
            "name": "nsf_id", 
            "type": "PropertyValue", 
            "value": [
              "1916505"
            ]
          }
        ], 
        "keywords": [
          "web pages", 
          "data scientists", 
          "analysis tasks", 
          "data management", 
          "efficient query execution plans", 
          "efficient implementation", 
          "declarative query language", 
          "temporal data management", 
          "social networks", 
          "graph analysis", 
          "billions of edges", 
          "data processing platform", 
          "graph analysis tasks", 
          "network traffic analysis", 
          "query execution plans", 
          "semantic knowledge bases", 
          "data-parallel frameworks", 
          "open-source platform", 
          "communities of use", 
          "consideration of usability", 
          "project web page", 
          "graph analytics", 
          "query language", 
          "graph data", 
          "graph applications", 
          "portal framework", 
          "query optimizer", 
          "scalability challenges", 
          "algebraic primitives", 
          "execution plan", 
          "use cases", 
          "processing platform", 
          "social network analysis", 
          "data science", 
          "traffic analysis", 
          "data structure", 
          "concise specification", 
          "popularity trends", 
          "knowledge bases", 
          "data model", 
          "access methods", 
          "Evolving Graphs", 
          "synthetic datasets", 
          "efficiency challenges", 
          "compositional algebra", 
          "large applications", 
          "knowledge management", 
          "novel representation", 
          "effective modeling", 
          "modern volumes", 
          "graph", 
          "transportation network", 
          "algebraic operations", 
          "intellectual merit", 
          "systematic support", 
          "network", 
          "efficient analysis", 
          "usability", 
          "evaluation methodology", 
          "pages", 
          "task", 
          "platform", 
          "knowledge evolve", 
          "implementation", 
          "network analysis", 
          "querying", 
          "framework", 
          "analytics", 
          "primitives", 
          "efforts today", 
          "project", 
          "applications", 
          "computational", 
          "ease of dissemination", 
          "dataset", 
          "optimizer", 
          "Web", 
          "plethora of phenomena", 
          "portal", 
          "cost", 
          "challenges", 
          "specification", 
          "billions", 
          "language", 
          "prominent set", 
          "representation", 
          "management", 
          "information", 
          "support", 
          "set", 
          "utilization", 
          "transportation options", 
          "scope", 
          "research", 
          "art", 
          "method", 
          "scientists", 
          "ease", 
          "methodology", 
          "operation", 
          "modeling", 
          "decades of research", 
          "domain", 
          "fundamental principles", 
          "static state", 
          "plethora", 
          "such programs", 
          "today", 
          "principles", 
          "technique", 
          "city changes", 
          "dissemination", 
          "efficiency", 
          "program", 
          "mission", 
          "edge", 
          "work", 
          "model", 
          "state", 
          "data", 
          "results", 
          "evolve", 
          "further information", 
          "science", 
          "plan", 
          "evaluation", 
          "analysis", 
          "merits", 
          "practice", 
          "important questions", 
          "use", 
          "algebra", 
          "community", 
          "questions", 
          "ridership", 
          "consideration", 
          "gap", 
          "skills", 
          "city", 
          "basis", 
          "decades", 
          "trends", 
          "criteria", 
          "biological pathways", 
          "structure", 
          "evolution", 
          "awards", 
          "volume", 
          "cases", 
          "options", 
          "rate", 
          "phenomenon", 
          "changes", 
          "review criteria", 
          "influence", 
          "days", 
          "evolution rate", 
          "homelessness", 
          "weeks", 
          "pathway", 
          "statutory mission", 
          "urban homelessness"
        ], 
        "name": "CAREER: Querying Evolving Graphs", 
        "recipient": [
          {
            "id": "http://www.grid.ac/institutes/grid.137628.9", 
            "type": "Organization"
          }, 
          {
            "affiliation": {
              "id": "http://www.grid.ac/institutes/None", 
              "name": "New York University", 
              "type": "Organization"
            }, 
            "familyName": "Stoyanovich", 
            "givenName": "Julia", 
            "id": "sg:person.0615021500.86", 
            "type": "Person"
          }, 
          {
            "member": "sg:person.0615021500.86", 
            "roleName": "PI", 
            "type": "Role"
          }
        ], 
        "sameAs": [
          "https://app.dimensions.ai/details/grant/grant.7923555"
        ], 
        "sdDataset": "grants", 
        "sdDatePublished": "2022-08-04T17:24", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-springernature-scigraph/baseset/20220804/entities/gbq_results/grant/grant_56.jsonl", 
        "startDate": "2018-09-01", 
        "type": "MonetaryGrant", 
        "url": "http://www.nsf.gov/awardsearch/showAward?AWD_ID=1916505&HistoricalAwards=false"
      }
    ]
     

    Download the RDF metadata as:  json-ld nt turtle xml License info

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/grant.7923555'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/grant.7923555'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/grant.7923555'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/grant.7923555'


     

    This table displays all metadata directly associated to this object as RDF triples.

    206 TRIPLES      18 PREDICATES      183 URIs      175 LITERALS      5 BLANK NODES

    Subject Predicate Object
    1 sg:grant.7923555 schema:about anzsrc-for:08
    2 schema:amount Na8b36fe698094ed8825b14185376309b
    3 schema:description Graphs are used to represent a plethora of phenomena, including the Web, social networks, biological pathways, transportation networks, and semantic knowledge bases. Many interesting and important questions about graphs concern their evolution rather than their static state: Which Web pages are showing an increasing popularity trend? How does influence propagate in social networks? How do the utilization of transportation options and the cost of ridership in a city change during the day and throughout the week? How does knowledge evolve? Formulating these questions as programs is currently beyond the skills of most data scientists. Executing such programs poses tremendous efficiency challenges, especially for graphs with billions of edges, and with non-trivial evolution rates. Much research and engineering effort today goes into developing sophisticated graph analytics and their efficient implementations, both stand-alone and in scope of data processing platforms. Yet, systematic support for querying and analysis of evolving graphs is still lacking. This support is urgently needed, due both to the scalability challenges inherent in evolving graph analysis, and to considerations of usability and ease of dissemination. This project will fill this gap by establishing the fundamental principles of effective modeling and efficient analysis of evolving graphs, and by making results available to the community of use in an open-source platform called Portal. This project will build on the state of the art in temporal data management, making the principles and techniques that were developed over decades of research and practice in that domain available to evolving graph applications. The project will develop: (1) a data model for evolving graphs and an expressive compositional algebra; (2) an efficient implementation of the data structures and of the algebraic operations, together with any necessary algebraic primitives and physical representations / access methods, in scope of a distributed data-parallel framework; (3) a declarative query language that supports concise specification of sophisticated graph analysis tasks, and a query optimizer that generates efficient query execution plans; (4) a principled evaluation methodology of usability and efficiency, based on real and synthetic datasets and analysis tasks. This project will impact research and practice in data management, by contributing novel representation, analysis and benchmarking methods for evolving graph data. Results of this project will help incorporate sophisticated evolving graph analysis into larger applications, and will enable scaling up to modern volumes. The Portal framework will support computational and data scientists who work with evolving graphs in social network analysis, knowledge management and network traffic analysis. A prominent set of use cases for this work will come from data science for social-good applications, including urban homelessness and analysis of transportation utilization and cost in cities. For further information see the project web page: portaldb.github.io. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
    4 schema:endDate 2023-02-28
    5 schema:funder grid-institutes:grid.457785.c
    6 schema:identifier Nda9d82361e38403fad1d082c0b3b13a2
    7 Ne1fa127141224ca7840f7f2daec86ece
    8 schema:keywords Evolving Graphs
    9 Web
    10 access methods
    11 algebra
    12 algebraic operations
    13 algebraic primitives
    14 analysis
    15 analysis tasks
    16 analytics
    17 applications
    18 art
    19 awards
    20 basis
    21 billions
    22 billions of edges
    23 biological pathways
    24 cases
    25 challenges
    26 changes
    27 city
    28 city changes
    29 communities of use
    30 community
    31 compositional algebra
    32 computational
    33 concise specification
    34 consideration
    35 consideration of usability
    36 cost
    37 criteria
    38 data
    39 data management
    40 data model
    41 data processing platform
    42 data science
    43 data scientists
    44 data structure
    45 data-parallel frameworks
    46 dataset
    47 days
    48 decades
    49 decades of research
    50 declarative query language
    51 dissemination
    52 domain
    53 ease
    54 ease of dissemination
    55 edge
    56 effective modeling
    57 efficiency
    58 efficiency challenges
    59 efficient analysis
    60 efficient implementation
    61 efficient query execution plans
    62 efforts today
    63 evaluation
    64 evaluation methodology
    65 evolution
    66 evolution rate
    67 evolve
    68 execution plan
    69 framework
    70 fundamental principles
    71 further information
    72 gap
    73 graph
    74 graph analysis
    75 graph analysis tasks
    76 graph analytics
    77 graph applications
    78 graph data
    79 homelessness
    80 implementation
    81 important questions
    82 influence
    83 information
    84 intellectual merit
    85 knowledge bases
    86 knowledge evolve
    87 knowledge management
    88 language
    89 large applications
    90 management
    91 merits
    92 method
    93 methodology
    94 mission
    95 model
    96 modeling
    97 modern volumes
    98 network
    99 network analysis
    100 network traffic analysis
    101 novel representation
    102 open-source platform
    103 operation
    104 optimizer
    105 options
    106 pages
    107 pathway
    108 phenomenon
    109 plan
    110 platform
    111 plethora
    112 plethora of phenomena
    113 popularity trends
    114 portal
    115 portal framework
    116 practice
    117 primitives
    118 principles
    119 processing platform
    120 program
    121 project
    122 project web page
    123 prominent set
    124 query execution plans
    125 query language
    126 query optimizer
    127 querying
    128 questions
    129 rate
    130 representation
    131 research
    132 results
    133 review criteria
    134 ridership
    135 scalability challenges
    136 science
    137 scientists
    138 scope
    139 semantic knowledge bases
    140 set
    141 skills
    142 social network analysis
    143 social networks
    144 specification
    145 state
    146 static state
    147 statutory mission
    148 structure
    149 such programs
    150 support
    151 synthetic datasets
    152 systematic support
    153 task
    154 technique
    155 temporal data management
    156 today
    157 traffic analysis
    158 transportation network
    159 transportation options
    160 trends
    161 urban homelessness
    162 usability
    163 use
    164 use cases
    165 utilization
    166 volume
    167 web pages
    168 weeks
    169 work
    170 schema:name CAREER: Querying Evolving Graphs
    171 schema:recipient N1baa4a1f504c421c9182190a028e53e1
    172 sg:person.0615021500.86
    173 grid-institutes:grid.137628.9
    174 schema:sameAs https://app.dimensions.ai/details/grant/grant.7923555
    175 schema:sdDatePublished 2022-08-04T17:24
    176 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    177 schema:sdPublisher N39cb0e742e5d46acbbee3b78f8be9616
    178 schema:startDate 2018-09-01
    179 schema:url http://www.nsf.gov/awardsearch/showAward?AWD_ID=1916505&HistoricalAwards=false
    180 sgo:license sg:explorer/license/
    181 sgo:sdDataset grants
    182 rdf:type schema:MonetaryGrant
    183 N1baa4a1f504c421c9182190a028e53e1 schema:member sg:person.0615021500.86
    184 schema:roleName PI
    185 rdf:type schema:Role
    186 N39cb0e742e5d46acbbee3b78f8be9616 schema:name Springer Nature - SN SciGraph project
    187 rdf:type schema:Organization
    188 Na8b36fe698094ed8825b14185376309b schema:currency USD
    189 schema:value 497808.0
    190 rdf:type schema:MonetaryAmount
    191 Nda9d82361e38403fad1d082c0b3b13a2 schema:name nsf_id
    192 schema:value 1916505
    193 rdf:type schema:PropertyValue
    194 Ne1fa127141224ca7840f7f2daec86ece schema:name dimensions_id
    195 schema:value grant.7923555
    196 rdf:type schema:PropertyValue
    197 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
    198 rdf:type schema:DefinedTerm
    199 sg:person.0615021500.86 schema:affiliation grid-institutes:None
    200 schema:familyName Stoyanovich
    201 schema:givenName Julia
    202 rdf:type schema:Person
    203 grid-institutes:None schema:name New York University
    204 rdf:type schema:Organization
    205 grid-institutes:grid.137628.9 schema:Organization
    206 grid-institutes:grid.457785.c schema:Organization
     




    Preview window. Press ESC to close (or click here)


    ...