Text Classification Techniques in Oil Industry Applications View Full Text


Ontology type: schema:Chapter     


Chapter Info

DATE

2014

AUTHORS

Nayat Sanchez-Pi , Luis Martí , Ana Cristina Bicharra Garcia

ABSTRACT

The development of automatic methods to produce usable structured information from unstructured text sources is extremely valuable to the oil and gas industry. A structured resource would allow researches and industry professionals to write relatively simple queries to retrieve all the information regards transcriptions of any accident. Instead of the thousands of abstracts provided by querying the unstructured corpus, the queries on structured corpus would result in a few hundred well-formed results. On this paper we propose and evaluate information extraction techniques in occupational health control process, particularly, for the case of automatic detection of accidents from unstructured texts. Our proposal divides the problem in subtasks such as text analysis, recognition and classification of failed occupational health control, resolving accidents. More... »

PAGES

211-220

References to SciGraph publications

  • 2007. Ontology-Based MEDLINE Document Classification in BIOINFORMATICS RESEARCH AND DEVELOPMENT
  • Book

    TITLE

    International Joint Conference SOCO’13-CISIS’13-ICEUTE’13

    ISBN

    978-3-319-01853-9
    978-3-319-01854-6

    Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22

    DOI

    http://dx.doi.org/10.1007/978-3-319-01854-6_22

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1045774509


    Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
    Incoming Citations Browse incoming citations for this publication using opencitations.net

    JSON-LD is the canonical representation for SciGraph data.

    TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Artificial Intelligence and Image Processing", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Information and Computing Sciences", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "Fluminense Federal University", 
              "id": "https://www.grid.ac/institutes/grid.411173.1", 
              "name": [
                "ADDLabs, Fluminense Federal University, Rua Passo da P\u00e1tria, 156., 24210-240\u00a0Niter\u00f3i, RJ, Brazil"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Sanchez-Pi", 
            "givenName": "Nayat", 
            "id": "sg:person.07411775305.08", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07411775305.08"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Pontifical Catholic University of Rio de Janeiro", 
              "id": "https://www.grid.ac/institutes/grid.4839.6", 
              "name": [
                "Dept. of Electrical Engineering, Pontif\u00edcia Universidade Cat\u00f3lica do Rio de Janeiro, R. Marqu\u00eas de S\u00e3o Vicente, 225., Rio de Janeiro, RJ, Brazil"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Mart\u00ed", 
            "givenName": "Luis", 
            "id": "sg:person.013310403353.54", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013310403353.54"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Fluminense Federal University", 
              "id": "https://www.grid.ac/institutes/grid.411173.1", 
              "name": [
                "ADDLabs, Fluminense Federal University, Rua Passo da P\u00e1tria, 156., 24210-240\u00a0Niter\u00f3i, RJ, Brazil"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Garcia", 
            "givenName": "Ana Cristina Bicharra", 
            "id": "sg:person.07430767131.99", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07430767131.99"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "https://doi.org/10.1002/(sici)1097-4571(199009)41:6<391::aid-asi1>3.0.co;2-9", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1012153938"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/505282.505283", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1023316280"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/1242572.1242778", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1031228233"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/978-3-540-71233-6_34", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1038142733", 
              "https://doi.org/10.1007/978-3-540-71233-6_34"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1006/knac.1993.1008", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1050172020"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1109/fskd.2007.432", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1093233721"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1109/icdm.2004.10077", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1095539674"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2014", 
        "datePublishedReg": "2014-01-01", 
        "description": "The development of automatic methods to produce usable structured information from unstructured text sources is extremely valuable to the oil and gas industry. A structured resource would allow researches and industry professionals to write relatively simple queries to retrieve all the information regards transcriptions of any accident. Instead of the thousands of abstracts provided by querying the unstructured corpus, the queries on structured corpus would result in a few hundred well-formed results. On this paper we propose and evaluate information extraction techniques in occupational health control process, particularly, for the case of automatic detection of accidents from unstructured texts. Our proposal divides the problem in subtasks such as text analysis, recognition and classification of failed occupational health control, resolving accidents.", 
        "editor": [
          {
            "familyName": "Herrero", 
            "givenName": "\u00c1lvaro", 
            "type": "Person"
          }, 
          {
            "familyName": "Baruque", 
            "givenName": "Bruno", 
            "type": "Person"
          }, 
          {
            "familyName": "Klett", 
            "givenName": "Fanny", 
            "type": "Person"
          }, 
          {
            "familyName": "Abraham", 
            "givenName": "Ajith", 
            "type": "Person"
          }, 
          {
            "familyName": "Sn\u00e1\u0161el", 
            "givenName": "V\u00e1clav", 
            "type": "Person"
          }, 
          {
            "familyName": "de Carvalho", 
            "givenName": "Andr\u00e9 C.P.L.F.", 
            "type": "Person"
          }, 
          {
            "familyName": "Bringas", 
            "givenName": "Pablo Garc\u00eda", 
            "type": "Person"
          }, 
          {
            "familyName": "Zelinka", 
            "givenName": "Ivan", 
            "type": "Person"
          }, 
          {
            "familyName": "Quinti\u00e1n", 
            "givenName": "H\u00e9ctor", 
            "type": "Person"
          }, 
          {
            "familyName": "Corchado", 
            "givenName": "Emilio", 
            "type": "Person"
          }
        ], 
        "genre": "chapter", 
        "id": "sg:pub.10.1007/978-3-319-01854-6_22", 
        "inLanguage": [
          "en"
        ], 
        "isAccessibleForFree": false, 
        "isPartOf": {
          "isbn": [
            "978-3-319-01853-9", 
            "978-3-319-01854-6"
          ], 
          "name": "International Joint Conference SOCO\u201913-CISIS\u201913-ICEUTE\u201913", 
          "type": "Book"
        }, 
        "name": "Text Classification Techniques in Oil Industry Applications", 
        "pagination": "211-220", 
        "productId": [
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1007/978-3-319-01854-6_22"
            ]
          }, 
          {
            "name": "readcube_id", 
            "type": "PropertyValue", 
            "value": [
              "da0b181a81ee0dd9f6231f7bc6e18e3f6cf540e7192cbaca35be18a82d21adb6"
            ]
          }, 
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1045774509"
            ]
          }
        ], 
        "publisher": {
          "location": "Cham", 
          "name": "Springer International Publishing", 
          "type": "Organisation"
        }, 
        "sameAs": [
          "https://doi.org/10.1007/978-3-319-01854-6_22", 
          "https://app.dimensions.ai/details/publication/pub.1045774509"
        ], 
        "sdDataset": "chapters", 
        "sdDatePublished": "2019-04-15T12:34", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8663_00000271.jsonl", 
        "type": "Chapter", 
        "url": "http://link.springer.com/10.1007/978-3-319-01854-6_22"
      }
    ]
     

    Download the RDF metadata as:  json-ld nt turtle xml License info

    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22'


     

    This table displays all metadata directly associated to this object as RDF triples.

    149 TRIPLES      23 PREDICATES      34 URIs      20 LITERALS      8 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1007/978-3-319-01854-6_22 schema:about anzsrc-for:08
    2 anzsrc-for:0801
    3 schema:author Ndf146223f5704db490302984e4ec793f
    4 schema:citation sg:pub.10.1007/978-3-540-71233-6_34
    5 https://doi.org/10.1002/(sici)1097-4571(199009)41:6<391::aid-asi1>3.0.co;2-9
    6 https://doi.org/10.1006/knac.1993.1008
    7 https://doi.org/10.1109/fskd.2007.432
    8 https://doi.org/10.1109/icdm.2004.10077
    9 https://doi.org/10.1145/1242572.1242778
    10 https://doi.org/10.1145/505282.505283
    11 schema:datePublished 2014
    12 schema:datePublishedReg 2014-01-01
    13 schema:description The development of automatic methods to produce usable structured information from unstructured text sources is extremely valuable to the oil and gas industry. A structured resource would allow researches and industry professionals to write relatively simple queries to retrieve all the information regards transcriptions of any accident. Instead of the thousands of abstracts provided by querying the unstructured corpus, the queries on structured corpus would result in a few hundred well-formed results. On this paper we propose and evaluate information extraction techniques in occupational health control process, particularly, for the case of automatic detection of accidents from unstructured texts. Our proposal divides the problem in subtasks such as text analysis, recognition and classification of failed occupational health control, resolving accidents.
    14 schema:editor N21b80e874f454cec9a337badad96534d
    15 schema:genre chapter
    16 schema:inLanguage en
    17 schema:isAccessibleForFree false
    18 schema:isPartOf N7bd1d39f535e4a7db8e0d98be09df0c1
    19 schema:name Text Classification Techniques in Oil Industry Applications
    20 schema:pagination 211-220
    21 schema:productId N92413479c4f14e979756384b35b25752
    22 Nc4c08aa4452e44e6aa2cf0ba9ed547c1
    23 Ndda4f65791b041639b23bd93007eb7d1
    24 schema:publisher N7bfce1334de6424aa413a690a38a3e24
    25 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045774509
    26 https://doi.org/10.1007/978-3-319-01854-6_22
    27 schema:sdDatePublished 2019-04-15T12:34
    28 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    29 schema:sdPublisher N719d6c5643474303a482b51bba0d34db
    30 schema:url http://link.springer.com/10.1007/978-3-319-01854-6_22
    31 sgo:license sg:explorer/license/
    32 sgo:sdDataset chapters
    33 rdf:type schema:Chapter
    34 N135642336e1043108997737bdfa2dd6d rdf:first N9738e9d2c9c94c3589b860bd81870416
    35 rdf:rest Nfa6d6ae10cea4c6ebb14f09a20ff0736
    36 N1553c801bbbb494e82ba265dc245d727 rdf:first sg:person.013310403353.54
    37 rdf:rest Nea8e3071194646c6be6f4e6c4c335ae7
    38 N21b80e874f454cec9a337badad96534d rdf:first N3cd13831f8604543ac695882a309a6f1
    39 rdf:rest N46ae6b6318ab4ada8293638242539d61
    40 N249419880b684d9a8959ab430a1570c0 rdf:first Nd099243ca3324a0ebf175af8fb52a9b7
    41 rdf:rest Na4081456f5f7447d8b065bee23ba10d8
    42 N3cd13831f8604543ac695882a309a6f1 schema:familyName Herrero
    43 schema:givenName Álvaro
    44 rdf:type schema:Person
    45 N46ae6b6318ab4ada8293638242539d61 rdf:first N50fef0ae2d434877b9b2ba734ead22f1
    46 rdf:rest N249419880b684d9a8959ab430a1570c0
    47 N50039d78a3734aad8f277fc358b7d236 rdf:first N54aa048c333342d3807bfbd99cfe62c3
    48 rdf:rest Ndf8db4c1736b4f63a55f88fe0ea9999f
    49 N50fef0ae2d434877b9b2ba734ead22f1 schema:familyName Baruque
    50 schema:givenName Bruno
    51 rdf:type schema:Person
    52 N54018826124f455ebcdcbb145b8bb983 schema:familyName Bringas
    53 schema:givenName Pablo García
    54 rdf:type schema:Person
    55 N54aa048c333342d3807bfbd99cfe62c3 schema:familyName Snášel
    56 schema:givenName Václav
    57 rdf:type schema:Person
    58 N6f0ddbfb1ed64b458991997d79e7fc33 schema:familyName de Carvalho
    59 schema:givenName André C.P.L.F.
    60 rdf:type schema:Person
    61 N719d6c5643474303a482b51bba0d34db schema:name Springer Nature - SN SciGraph project
    62 rdf:type schema:Organization
    63 N7bd1d39f535e4a7db8e0d98be09df0c1 schema:isbn 978-3-319-01853-9
    64 978-3-319-01854-6
    65 schema:name International Joint Conference SOCO’13-CISIS’13-ICEUTE’13
    66 rdf:type schema:Book
    67 N7bfce1334de6424aa413a690a38a3e24 schema:location Cham
    68 schema:name Springer International Publishing
    69 rdf:type schema:Organisation
    70 N92413479c4f14e979756384b35b25752 schema:name doi
    71 schema:value 10.1007/978-3-319-01854-6_22
    72 rdf:type schema:PropertyValue
    73 N9738e9d2c9c94c3589b860bd81870416 schema:familyName Quintián
    74 schema:givenName Héctor
    75 rdf:type schema:Person
    76 Na4081456f5f7447d8b065bee23ba10d8 rdf:first Nfb5be667970b4830b0061c8faa7f26ee
    77 rdf:rest N50039d78a3734aad8f277fc358b7d236
    78 Nb63c01ccac7d4d4083994e96b349e76f rdf:first Nc7ea24741e7a46d7aa83a8c0a32b4a10
    79 rdf:rest N135642336e1043108997737bdfa2dd6d
    80 Nc4c08aa4452e44e6aa2cf0ba9ed547c1 schema:name dimensions_id
    81 schema:value pub.1045774509
    82 rdf:type schema:PropertyValue
    83 Nc7ea24741e7a46d7aa83a8c0a32b4a10 schema:familyName Zelinka
    84 schema:givenName Ivan
    85 rdf:type schema:Person
    86 Nd099243ca3324a0ebf175af8fb52a9b7 schema:familyName Klett
    87 schema:givenName Fanny
    88 rdf:type schema:Person
    89 Ndda4f65791b041639b23bd93007eb7d1 schema:name readcube_id
    90 schema:value da0b181a81ee0dd9f6231f7bc6e18e3f6cf540e7192cbaca35be18a82d21adb6
    91 rdf:type schema:PropertyValue
    92 Ndf146223f5704db490302984e4ec793f rdf:first sg:person.07411775305.08
    93 rdf:rest N1553c801bbbb494e82ba265dc245d727
    94 Ndf8db4c1736b4f63a55f88fe0ea9999f rdf:first N6f0ddbfb1ed64b458991997d79e7fc33
    95 rdf:rest Nff72dfde837e4e8196b90793785a7d77
    96 Nea8e3071194646c6be6f4e6c4c335ae7 rdf:first sg:person.07430767131.99
    97 rdf:rest rdf:nil
    98 Nfa6d6ae10cea4c6ebb14f09a20ff0736 rdf:first Nfbf841e19cba4e228165600da85fed29
    99 rdf:rest rdf:nil
    100 Nfb5be667970b4830b0061c8faa7f26ee schema:familyName Abraham
    101 schema:givenName Ajith
    102 rdf:type schema:Person
    103 Nfbf841e19cba4e228165600da85fed29 schema:familyName Corchado
    104 schema:givenName Emilio
    105 rdf:type schema:Person
    106 Nff72dfde837e4e8196b90793785a7d77 rdf:first N54018826124f455ebcdcbb145b8bb983
    107 rdf:rest Nb63c01ccac7d4d4083994e96b349e76f
    108 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
    109 schema:name Information and Computing Sciences
    110 rdf:type schema:DefinedTerm
    111 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
    112 schema:name Artificial Intelligence and Image Processing
    113 rdf:type schema:DefinedTerm
    114 sg:person.013310403353.54 schema:affiliation https://www.grid.ac/institutes/grid.4839.6
    115 schema:familyName Martí
    116 schema:givenName Luis
    117 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013310403353.54
    118 rdf:type schema:Person
    119 sg:person.07411775305.08 schema:affiliation https://www.grid.ac/institutes/grid.411173.1
    120 schema:familyName Sanchez-Pi
    121 schema:givenName Nayat
    122 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07411775305.08
    123 rdf:type schema:Person
    124 sg:person.07430767131.99 schema:affiliation https://www.grid.ac/institutes/grid.411173.1
    125 schema:familyName Garcia
    126 schema:givenName Ana Cristina Bicharra
    127 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07430767131.99
    128 rdf:type schema:Person
    129 sg:pub.10.1007/978-3-540-71233-6_34 schema:sameAs https://app.dimensions.ai/details/publication/pub.1038142733
    130 https://doi.org/10.1007/978-3-540-71233-6_34
    131 rdf:type schema:CreativeWork
    132 https://doi.org/10.1002/(sici)1097-4571(199009)41:6<391::aid-asi1>3.0.co;2-9 schema:sameAs https://app.dimensions.ai/details/publication/pub.1012153938
    133 rdf:type schema:CreativeWork
    134 https://doi.org/10.1006/knac.1993.1008 schema:sameAs https://app.dimensions.ai/details/publication/pub.1050172020
    135 rdf:type schema:CreativeWork
    136 https://doi.org/10.1109/fskd.2007.432 schema:sameAs https://app.dimensions.ai/details/publication/pub.1093233721
    137 rdf:type schema:CreativeWork
    138 https://doi.org/10.1109/icdm.2004.10077 schema:sameAs https://app.dimensions.ai/details/publication/pub.1095539674
    139 rdf:type schema:CreativeWork
    140 https://doi.org/10.1145/1242572.1242778 schema:sameAs https://app.dimensions.ai/details/publication/pub.1031228233
    141 rdf:type schema:CreativeWork
    142 https://doi.org/10.1145/505282.505283 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023316280
    143 rdf:type schema:CreativeWork
    144 https://www.grid.ac/institutes/grid.411173.1 schema:alternateName Fluminense Federal University
    145 schema:name ADDLabs, Fluminense Federal University, Rua Passo da Pátria, 156., 24210-240 Niterói, RJ, Brazil
    146 rdf:type schema:Organization
    147 https://www.grid.ac/institutes/grid.4839.6 schema:alternateName Pontifical Catholic University of Rio de Janeiro
    148 schema:name Dept. of Electrical Engineering, Pontifícia Universidade Católica do Rio de Janeiro, R. Marquês de São Vicente, 225., Rio de Janeiro, RJ, Brazil
    149 rdf:type schema:Organization
     




    Preview window. Press ESC to close (or click here)


    ...