Web crawler semantic collaboration and competitive strategy based on conceptual background graph View Homepage


Ontology type: schema:MonetaryGrant     


Grant Info

YEARS

2012-2016

FUNDING AMOUNT

700000 CNY

ABSTRACT

In focused cralwing system, multi-crawlers crawl parallelly Web and download Web pages. It is one of hotspot research of search engine how the different focused crawlers avoid to visit the same URLs and they download efficiently Web pages related to the search topic. In order to accomplish rapidly the crawling tasks of the system for the specific topic, and embody fully every Web crawler's ability, we consider that these history visiting Web pages (URLs) of every focused crawler reflect their backgroup knowledge. On basis of cralwing independently, collaborating togather and competing with each other for Web crawlers of the system, we propose the novel understanding, cooperating and competing strategy of concept context graph by analyzing these Web page's content, extracting semantic features- - concepts of these Web pages in history collects of every Web crawlers as their backgroup knowledge and studing the semantic relationships of their backgroup knowledge. Our mainly researches are listed as follows: 1).Constructing the mathematical model of backgrounp knowledge of every Web crawler based on hierarchy concept context graph, according to the semantic characteristics- - concepts of Web pages and their semantic relationships among the concepts. 2).Studying the understanding method and model among Web crawler More... »

URL

http://npd.nsfc.gov.cn/projectDetail.action?pid=61271413

Related SciGraph Publications

  • 2017-10-21. Network Structural Balance Analysis for Sina Microblog Based on Particle Swarm Optimization Algorithm in INFORMATION RETRIEVAL
  • 2017. Mining Implicit Intention Using Attention-Based RNN Encoder-Decoder Model in INTELLIGENT COMPUTING METHODOLOGIES
  • 2016. A Novel Approach of Identifying User Intents in Microblog in INTELLIGENT COMPUTING METHODOLOGIES
  • 2016. A Novel Entity Relation Extraction Approach Based on Micro-Blog in INTELLIGENT COMPUTING METHODOLOGIES
  • 2015. Ranking Web Page with Path Trust Knowledge Graph in INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING. BIG DATA AND MACHINE LEARNING TECHNIQUES
  • 2015. Locality Preserving Based K-Means Clustering in INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING. BIG DATA AND MACHINE LEARNING TECHNIQUES
  • 2015. Locality Preserving One-Class Support Vector Machine in INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING. BIG DATA AND MACHINE LEARNING TECHNIQUES
  • JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/2208", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/2208", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "type": "DefinedTerm"
          }
        ], 
        "amount": {
          "currency": "CNY", 
          "type": "MonetaryAmount", 
          "value": "700000"
        }, 
        "description": "In  focused cralwing system, multi-crawlers crawl parallelly Web and download Web pages. It is one of hotspot research of search engine how the different focused crawlers avoid to visit the same URLs and they download efficiently Web pages related to the search topic. In order to accomplish rapidly the crawling tasks of the system for the specific topic, and embody fully every Web crawler's ability, we consider that these history visiting Web pages (URLs) of every focused crawler reflect their backgroup knowledge. On basis of cralwing independently, collaborating togather and competing with each other for Web crawlers of the system, we propose the novel understanding, cooperating and  competing strategy of concept context graph by analyzing these Web page's content, extracting semantic features- - concepts of these Web pages in history collects of every Web crawlers as their backgroup knowledge and studing the semantic relationships of their backgroup  knowledge. Our mainly researches are listed as follows:\n1).Constructing the mathematical model of backgrounp knowledge of every Web crawler based on hierarchy concept context graph, according to the semantic characteristics- - concepts of Web pages and their semantic relationships among the concepts.\n2).Studying the understanding method and model among Web crawler", 
        "endDate": "2016-12-30T00:00:00Z", 
        "funder": {
          "id": "https://www.grid.ac/institutes/grid.419696.5", 
          "type": "Organization"
        }, 
        "id": "sg:grant.7205762", 
        "identifier": [
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "7205762"
            ]
          }, 
          {
            "name": "nsfc_id", 
            "type": "PropertyValue", 
            "value": [
              "61271413"
            ]
          }
        ], 
        "inLanguage": [
          "zh"
        ], 
        "keywords": [
          "same URL", 
          "web crawler", 
          "download Web pages", 
          "understanding methods", 
          "task", 
          "concept context graph", 
          "multi-crawlers crawl", 
          "mathematical model", 
          "strategies", 
          "order", 
          "concept", 
          "conceptual background graph", 
          "history", 
          "search topics", 
          "semantic relationships", 
          "different focused crawlers", 
          "specific topics", 
          "hotspot research", 
          "model", 
          "competitive strategy", 
          "Web crawler's ability", 
          "semantic collaboration", 
          "hierarchy concept context graph", 
          "togather", 
          "knowledge", 
          "semantic characteristics- - concepts", 
          "web pages", 
          "crawlers", 
          "web", 
          "focused cralwing system", 
          "system", 
          "backgrounp knowledge", 
          "URLs", 
          "web page content", 
          "novel understanding", 
          "backgroup knowledge", 
          "backgroup", 
          "search engines", 
          "basis"
        ], 
        "name": "Web crawler semantic collaboration and competitive strategy based on conceptual background graph", 
        "recipient": [
          {
            "id": "https://www.grid.ac/institutes/grid.412983.5", 
            "type": "Organization"
          }, 
          {
            "affiliation": {
              "id": "https://www.grid.ac/institutes/grid.412983.5", 
              "name": "Xihua University", 
              "type": "Organization"
            }, 
            "familyName": "Du", 
            "givenName": "Ya Jun", 
            "id": "sg:person.012650165254.76", 
            "type": "Person"
          }, 
          {
            "member": "sg:person.012650165254.76", 
            "roleName": "PI", 
            "type": "Role"
          }
        ], 
        "sameAs": [
          "https://app.dimensions.ai/details/grant/grant.7205762"
        ], 
        "sdDataset": "grants", 
        "sdDatePublished": "2019-03-07T12:40", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com.uberresearch.data.processor/core_data/20181219_192338/projects/base/nsfc_projects_1.xml.gz", 
        "startDate": "2012-12-31T00:00:00Z", 
        "type": "MonetaryGrant", 
        "url": "http://npd.nsfc.gov.cn/projectDetail.action?pid=61271413"
      }
    ]
     

    Download the RDF metadata as:  json-ld nt turtle xml License info

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/grant.7205762'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/grant.7205762'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/grant.7205762'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/grant.7205762'


     

    This table displays all metadata directly associated to this object as RDF triples.

    83 TRIPLES      19 PREDICATES      61 URIs      53 LITERALS      5 BLANK NODES

    Subject Predicate Object
    1 sg:grant.7205762 schema:about anzsrc-for:2208
    2 schema:amount N50e2ee1e15dd4d3e800c1f4a3b213fc3
    3 schema:description In focused cralwing system, multi-crawlers crawl parallelly Web and download Web pages. It is one of hotspot research of search engine how the different focused crawlers avoid to visit the same URLs and they download efficiently Web pages related to the search topic. In order to accomplish rapidly the crawling tasks of the system for the specific topic, and embody fully every Web crawler's ability, we consider that these history visiting Web pages (URLs) of every focused crawler reflect their backgroup knowledge. On basis of cralwing independently, collaborating togather and competing with each other for Web crawlers of the system, we propose the novel understanding, cooperating and competing strategy of concept context graph by analyzing these Web page's content, extracting semantic features- - concepts of these Web pages in history collects of every Web crawlers as their backgroup knowledge and studing the semantic relationships of their backgroup knowledge. Our mainly researches are listed as follows: 1).Constructing the mathematical model of backgrounp knowledge of every Web crawler based on hierarchy concept context graph, according to the semantic characteristics- - concepts of Web pages and their semantic relationships among the concepts. 2).Studying the understanding method and model among Web crawler
    4 schema:endDate 2016-12-30T00:00:00Z
    5 schema:funder https://www.grid.ac/institutes/grid.419696.5
    6 schema:identifier N7abadeeaa16c43cdb30e581adb0559f0
    7 N8c1fd2e02ac240728cd6b0d861183692
    8 schema:inLanguage zh
    9 schema:keywords URLs
    10 Web crawler's ability
    11 backgrounp knowledge
    12 backgroup
    13 backgroup knowledge
    14 basis
    15 competitive strategy
    16 concept
    17 concept context graph
    18 conceptual background graph
    19 crawlers
    20 different focused crawlers
    21 download Web pages
    22 focused cralwing system
    23 hierarchy concept context graph
    24 history
    25 hotspot research
    26 knowledge
    27 mathematical model
    28 model
    29 multi-crawlers crawl
    30 novel understanding
    31 order
    32 same URL
    33 search engines
    34 search topics
    35 semantic characteristics- - concepts
    36 semantic collaboration
    37 semantic relationships
    38 specific topics
    39 strategies
    40 system
    41 task
    42 togather
    43 understanding methods
    44 web
    45 web crawler
    46 web page content
    47 web pages
    48 schema:name Web crawler semantic collaboration and competitive strategy based on conceptual background graph
    49 schema:recipient Nf76ed9d8734749d389afbddabdefee34
    50 sg:person.012650165254.76
    51 https://www.grid.ac/institutes/grid.412983.5
    52 schema:sameAs https://app.dimensions.ai/details/grant/grant.7205762
    53 schema:sdDatePublished 2019-03-07T12:40
    54 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    55 schema:sdPublisher Nc4e84968b027411881d0fba975f1f5b4
    56 schema:startDate 2012-12-31T00:00:00Z
    57 schema:url http://npd.nsfc.gov.cn/projectDetail.action?pid=61271413
    58 sgo:license sg:explorer/license/
    59 sgo:sdDataset grants
    60 rdf:type schema:MonetaryGrant
    61 N50e2ee1e15dd4d3e800c1f4a3b213fc3 schema:currency CNY
    62 schema:value 700000
    63 rdf:type schema:MonetaryAmount
    64 N7abadeeaa16c43cdb30e581adb0559f0 schema:name dimensions_id
    65 schema:value 7205762
    66 rdf:type schema:PropertyValue
    67 N8c1fd2e02ac240728cd6b0d861183692 schema:name nsfc_id
    68 schema:value 61271413
    69 rdf:type schema:PropertyValue
    70 Nc4e84968b027411881d0fba975f1f5b4 schema:name Springer Nature - SN SciGraph project
    71 rdf:type schema:Organization
    72 Nf76ed9d8734749d389afbddabdefee34 schema:member sg:person.012650165254.76
    73 schema:roleName PI
    74 rdf:type schema:Role
    75 anzsrc-for:2208 schema:inDefinedTermSet anzsrc-for:
    76 rdf:type schema:DefinedTerm
    77 sg:person.012650165254.76 schema:affiliation https://www.grid.ac/institutes/grid.412983.5
    78 schema:familyName Du
    79 schema:givenName Ya Jun
    80 rdf:type schema:Person
    81 https://www.grid.ac/institutes/grid.412983.5 schema:name Xihua University
    82 rdf:type schema:Organization
    83 https://www.grid.ac/institutes/grid.419696.5 schema:Organization
     




    Preview window. Press ESC to close (or click here)


    ...