Text Classification Techniques in Oil Industry Applications


Ontology type: schema:Chapter     


Chapter Info

DATE

2014

AUTHORS

Nayat Sanchez-Pi , Luis Martí , Ana Cristina Bicharra Garcia

ABSTRACT

The development of automatic methods to produce usable structured information from unstructured text sources is extremely valuable to the oil and gas industry. A structured resource would allow researchers and industry professionals to write relatively simple queries to retrieve all the information regarding transcriptions of any accident. Instead of the thousands of abstracts returned by querying the unstructured corpus, queries on the structured corpus would return a few hundred well-formed results. In this paper we propose and evaluate information extraction techniques for the occupational health control process, in particular for the automatic detection of accidents from unstructured texts. Our proposal divides the problem into subtasks such as text analysis, recognition and classification of failed occupational health control, and resolving accidents.

PAGES

211-220

References to SciGraph publications

  • 2007. Ontology-Based MEDLINE Document Classification in BIOINFORMATICS RESEARCH AND DEVELOPMENT
Book

    TITLE

    International Joint Conference SOCO’13-CISIS’13-ICEUTE’13

    ISBN

    978-3-319-01853-9
    978-3-319-01854-6

    Identifiers

    URI

    http://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22

    DOI

    http://dx.doi.org/10.1007/978-3-319-01854-6_22

    DIMENSIONS

    https://app.dimensions.ai/details/publication/pub.1045774509



    JSON-LD is the canonical representation for SciGraph data.


    [
      {
        "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
        "about": [
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Artificial Intelligence and Image Processing", 
            "type": "DefinedTerm"
          }, 
          {
            "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
            "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
            "name": "Information and Computing Sciences", 
            "type": "DefinedTerm"
          }
        ], 
        "author": [
          {
            "affiliation": {
              "alternateName": "Fluminense Federal University", 
              "id": "https://www.grid.ac/institutes/grid.411173.1", 
              "name": [
                "ADDLabs, Fluminense Federal University, Rua Passo da P\u00e1tria, 156., 24210-240\u00a0Niter\u00f3i, RJ, Brazil"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Sanchez-Pi", 
            "givenName": "Nayat", 
            "id": "sg:person.07411775305.08", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07411775305.08"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Pontifical Catholic University of Rio de Janeiro", 
              "id": "https://www.grid.ac/institutes/grid.4839.6", 
              "name": [
                "Dept. of Electrical Engineering, Pontif\u00edcia Universidade Cat\u00f3lica do Rio de Janeiro, R. Marqu\u00eas de S\u00e3o Vicente, 225., Rio de Janeiro, RJ, Brazil"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Mart\u00ed", 
            "givenName": "Luis", 
            "id": "sg:person.013310403353.54", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013310403353.54"
            ], 
            "type": "Person"
          }, 
          {
            "affiliation": {
              "alternateName": "Fluminense Federal University", 
              "id": "https://www.grid.ac/institutes/grid.411173.1", 
              "name": [
                "ADDLabs, Fluminense Federal University, Rua Passo da P\u00e1tria, 156., 24210-240\u00a0Niter\u00f3i, RJ, Brazil"
              ], 
              "type": "Organization"
            }, 
            "familyName": "Garcia", 
            "givenName": "Ana Cristina Bicharra", 
            "id": "sg:person.07430767131.99", 
            "sameAs": [
              "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07430767131.99"
            ], 
            "type": "Person"
          }
        ], 
        "citation": [
          {
            "id": "https://doi.org/10.1002/(sici)1097-4571(199009)41:6<391::aid-asi1>3.0.co;2-9", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1012153938"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/505282.505283", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1023316280"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1145/1242572.1242778", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1031228233"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "sg:pub.10.1007/978-3-540-71233-6_34", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1038142733", 
              "https://doi.org/10.1007/978-3-540-71233-6_34"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1006/knac.1993.1008", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1050172020"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1109/fskd.2007.432", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1093233721"
            ], 
            "type": "CreativeWork"
          }, 
          {
            "id": "https://doi.org/10.1109/icdm.2004.10077", 
            "sameAs": [
              "https://app.dimensions.ai/details/publication/pub.1095539674"
            ], 
            "type": "CreativeWork"
          }
        ], 
        "datePublished": "2014", 
        "datePublishedReg": "2014-01-01", 
        "description": "The development of automatic methods to produce usable structured information from unstructured text sources is extremely valuable to the oil and gas industry. A structured resource would allow researches and industry professionals to write relatively simple queries to retrieve all the information regards transcriptions of any accident. Instead of the thousands of abstracts provided by querying the unstructured corpus, the queries on structured corpus would result in a few hundred well-formed results. On this paper we propose and evaluate information extraction techniques in occupational health control process, particularly, for the case of automatic detection of accidents from unstructured texts. Our proposal divides the problem in subtasks such as text analysis, recognition and classification of failed occupational health control, resolving accidents.", 
        "editor": [
          {
            "familyName": "Herrero", 
            "givenName": "\u00c1lvaro", 
            "type": "Person"
          }, 
          {
            "familyName": "Baruque", 
            "givenName": "Bruno", 
            "type": "Person"
          }, 
          {
            "familyName": "Klett", 
            "givenName": "Fanny", 
            "type": "Person"
          }, 
          {
            "familyName": "Abraham", 
            "givenName": "Ajith", 
            "type": "Person"
          }, 
          {
            "familyName": "Sn\u00e1\u0161el", 
            "givenName": "V\u00e1clav", 
            "type": "Person"
          }, 
          {
            "familyName": "de Carvalho", 
            "givenName": "Andr\u00e9 C.P.L.F.", 
            "type": "Person"
          }, 
          {
            "familyName": "Bringas", 
            "givenName": "Pablo Garc\u00eda", 
            "type": "Person"
          }, 
          {
            "familyName": "Zelinka", 
            "givenName": "Ivan", 
            "type": "Person"
          }, 
          {
            "familyName": "Quinti\u00e1n", 
            "givenName": "H\u00e9ctor", 
            "type": "Person"
          }, 
          {
            "familyName": "Corchado", 
            "givenName": "Emilio", 
            "type": "Person"
          }
        ], 
        "genre": "chapter", 
        "id": "sg:pub.10.1007/978-3-319-01854-6_22", 
        "inLanguage": [
          "en"
        ], 
        "isAccessibleForFree": false, 
        "isPartOf": {
          "isbn": [
            "978-3-319-01853-9", 
            "978-3-319-01854-6"
          ], 
          "name": "International Joint Conference SOCO\u201913-CISIS\u201913-ICEUTE\u201913", 
          "type": "Book"
        }, 
        "name": "Text Classification Techniques in Oil Industry Applications", 
        "pagination": "211-220", 
        "productId": [
          {
            "name": "doi", 
            "type": "PropertyValue", 
            "value": [
              "10.1007/978-3-319-01854-6_22"
            ]
          }, 
          {
            "name": "readcube_id", 
            "type": "PropertyValue", 
            "value": [
              "da0b181a81ee0dd9f6231f7bc6e18e3f6cf540e7192cbaca35be18a82d21adb6"
            ]
          }, 
          {
            "name": "dimensions_id", 
            "type": "PropertyValue", 
            "value": [
              "pub.1045774509"
            ]
          }
        ], 
        "publisher": {
          "location": "Cham", 
          "name": "Springer International Publishing", 
          "type": "Organisation"
        }, 
        "sameAs": [
          "https://doi.org/10.1007/978-3-319-01854-6_22", 
          "https://app.dimensions.ai/details/publication/pub.1045774509"
        ], 
        "sdDataset": "chapters", 
        "sdDatePublished": "2019-04-15T12:34", 
        "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
        "sdPublisher": {
          "name": "Springer Nature - SN SciGraph project", 
          "type": "Organization"
        }, 
        "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8663_00000271.jsonl", 
        "type": "Chapter", 
        "url": "http://link.springer.com/10.1007/978-3-319-01854-6_22"
      }
    ]
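    The record above is plain JSON, so it can be read with any JSON parser. The following is a minimal sketch, assuming the record has been saved as a string; `RECORD_JSON` is a trimmed copy that keeps only a few fields from the record above, and `summarize` is a hypothetical helper name, not part of any SciGraph tooling.

    ```python
    import json

    # Trimmed copy of the record above; only the fields used below are kept.
    RECORD_JSON = """
    [
      {
        "id": "sg:pub.10.1007/978-3-319-01854-6_22",
        "name": "Text Classification Techniques in Oil Industry Applications",
        "pagination": "211-220",
        "author": [
          {"familyName": "Sanchez-Pi", "givenName": "Nayat"},
          {"familyName": "Martí", "givenName": "Luis"},
          {"familyName": "Garcia", "givenName": "Ana Cristina Bicharra"}
        ],
        "productId": [
          {"name": "doi", "type": "PropertyValue",
           "value": ["10.1007/978-3-319-01854-6_22"]},
          {"name": "dimensions_id", "type": "PropertyValue",
           "value": ["pub.1045774509"]}
        ]
      }
    ]
    """


    def summarize(record_json: str) -> dict:
        """Return title, authors, DOI and page range from a SciGraph-style record."""
        # The top level is a one-element list holding the chapter object.
        record = json.loads(record_json)[0]
        authors = [
            f"{a['givenName']} {a['familyName']}" for a in record.get("author", [])
        ]
        # productId is a list of name/value pairs; each value is itself a list.
        ids = {p["name"]: p["value"][0] for p in record.get("productId", [])}
        return {
            "title": record["name"],
            "authors": authors,
            "doi": ids.get("doi"),
            "pages": record.get("pagination"),
        }


    summary = summarize(RECORD_JSON)
    print(summary)
    ```

    This treats the record as ordinary JSON and does not resolve the `@context`; a full JSON-LD processor would be needed to expand the terms into URIs.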
     


    HOW TO GET THIS DATA PROGRAMMATICALLY:

    JSON-LD is a popular format for linked data which is fully compatible with JSON.

    curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22'

    N-Triples is a line-based linked data format ideal for batch operations.

    curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22'

    Turtle is a human-readable linked data format.

    curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22'

    RDF/XML is a standard XML format for linked data.

    curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22'
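    The four curl commands above differ only in the `Accept` header, which is ordinary HTTP content negotiation. The same requests can be sketched in Python with the standard library. `ACCEPT_TYPES`, `build_request` and `fetch` are hypothetical names introduced for illustration, and whether the endpoint still responds is not verified here.

    ```python
    from urllib.request import Request, urlopen

    RECORD_URL = "https://scigraph.springernature.com/pub.10.1007/978-3-319-01854-6_22"

    # Media types taken from the curl commands above, keyed by a short format name.
    ACCEPT_TYPES = {
        "json-ld": "application/ld+json",
        "nt": "application/n-triples",
        "turtle": "text/turtle",
        "xml": "application/rdf+xml",
    }


    def build_request(fmt: str, url: str = RECORD_URL) -> Request:
        """Build a GET request that asks the server for the given RDF format."""
        return Request(url, headers={"Accept": ACCEPT_TYPES[fmt]})


    def fetch(fmt: str, url: str = RECORD_URL, timeout: float = 10.0) -> str:
        """Send the request and return the response body as text (needs network)."""
        with urlopen(build_request(fmt, url), timeout=timeout) as response:
            return response.read().decode("utf-8")


    # Building the request does not touch the network; only fetch() does.
    request = build_request("turtle")
    print(request.full_url, request.get_header("Accept"))
    ```

    Calling `fetch("nt")` would correspond to the N-Triples curl command; an unknown format name raises `KeyError` before any request is sent.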


     

    This table displays all metadata directly associated to this object as RDF triples.

    149 TRIPLES      23 PREDICATES      34 URIs      20 LITERALS      8 BLANK NODES

    Subject Predicate Object
    1 sg:pub.10.1007/978-3-319-01854-6_22 schema:about anzsrc-for:08
    2 anzsrc-for:0801
    3 schema:author N03d510521fea4f71a04fa188f54ccce0
    4 schema:citation sg:pub.10.1007/978-3-540-71233-6_34
    5 https://doi.org/10.1002/(sici)1097-4571(199009)41:6<391::aid-asi1>3.0.co;2-9
    6 https://doi.org/10.1006/knac.1993.1008
    7 https://doi.org/10.1109/fskd.2007.432
    8 https://doi.org/10.1109/icdm.2004.10077
    9 https://doi.org/10.1145/1242572.1242778
    10 https://doi.org/10.1145/505282.505283
    11 schema:datePublished 2014
    12 schema:datePublishedReg 2014-01-01
    13 schema:description The development of automatic methods to produce usable structured information from unstructured text sources is extremely valuable to the oil and gas industry. A structured resource would allow researches and industry professionals to write relatively simple queries to retrieve all the information regards transcriptions of any accident. Instead of the thousands of abstracts provided by querying the unstructured corpus, the queries on structured corpus would result in a few hundred well-formed results. On this paper we propose and evaluate information extraction techniques in occupational health control process, particularly, for the case of automatic detection of accidents from unstructured texts. Our proposal divides the problem in subtasks such as text analysis, recognition and classification of failed occupational health control, resolving accidents.
    14 schema:editor Naa4be1be2af84962be1401f2a5d932a4
    15 schema:genre chapter
    16 schema:inLanguage en
    17 schema:isAccessibleForFree false
    18 schema:isPartOf N07e1ea7f4a3147eca037672efe4ec949
    19 schema:name Text Classification Techniques in Oil Industry Applications
    20 schema:pagination 211-220
    21 schema:productId N48fd4e8f5fa84791bed3e4cd1509ccee
    22 N903fc9db4fca4506927ae4de1c9114aa
    23 Nb483ec91a4ab4269928f83bfb34dcebb
    24 schema:publisher N96f8adae6812450c9c0a8bdaa34dbb84
    25 schema:sameAs https://app.dimensions.ai/details/publication/pub.1045774509
    26 https://doi.org/10.1007/978-3-319-01854-6_22
    27 schema:sdDatePublished 2019-04-15T12:34
    28 schema:sdLicense https://scigraph.springernature.com/explorer/license/
    29 schema:sdPublisher N07fc968f2e39496caca54434900f5c60
    30 schema:url http://link.springer.com/10.1007/978-3-319-01854-6_22
    31 sgo:license sg:explorer/license/
    32 sgo:sdDataset chapters
    33 rdf:type schema:Chapter
    34 N03d510521fea4f71a04fa188f54ccce0 rdf:first sg:person.07411775305.08
    35 rdf:rest Nef52db56c35743f98916e3fe4daf8497
    36 N07e1ea7f4a3147eca037672efe4ec949 schema:isbn 978-3-319-01853-9
    37 978-3-319-01854-6
    38 schema:name International Joint Conference SOCO’13-CISIS’13-ICEUTE’13
    39 rdf:type schema:Book
    40 N07fc968f2e39496caca54434900f5c60 schema:name Springer Nature - SN SciGraph project
    41 rdf:type schema:Organization
    42 N16153a7efc4240b19125b765732b7619 schema:familyName Corchado
    43 schema:givenName Emilio
    44 rdf:type schema:Person
    45 N21ef9d514d334a589972a4e51421af93 schema:familyName Herrero
    46 schema:givenName Álvaro
    47 rdf:type schema:Person
    48 N256677945a4d494e95f68c8868041d32 rdf:first N816cc1f32bb64febbd25367717e1ee6f
    49 rdf:rest N967f4b6b0a984c2290564e4ce13952f6
    50 N275f209e9df94774a303dbcc6ea66d5b rdf:first sg:person.07430767131.99
    51 rdf:rest rdf:nil
    52 N33b063e0121c4e9e9a62ae86c736a87d schema:familyName de Carvalho
    53 schema:givenName André C.P.L.F.
    54 rdf:type schema:Person
    55 N48fd4e8f5fa84791bed3e4cd1509ccee schema:name dimensions_id
    56 schema:value pub.1045774509
    57 rdf:type schema:PropertyValue
    58 N49c813c4c0c44f398ef81995018716d0 schema:familyName Bringas
    59 schema:givenName Pablo García
    60 rdf:type schema:Person
    61 N7ea55ba4eb7d4fbc86e24b0002ad74f0 schema:familyName Abraham
    62 schema:givenName Ajith
    63 rdf:type schema:Person
    64 N816cc1f32bb64febbd25367717e1ee6f schema:familyName Klett
    65 schema:givenName Fanny
    66 rdf:type schema:Person
    67 N903fc9db4fca4506927ae4de1c9114aa schema:name readcube_id
    68 schema:value da0b181a81ee0dd9f6231f7bc6e18e3f6cf540e7192cbaca35be18a82d21adb6
    69 rdf:type schema:PropertyValue
    70 N941efffd404b4600861fbcabbe8e7b10 rdf:first Nd1cb5cb0e0c94430b1e1b5b526cf8f5d
    71 rdf:rest Nd5c214b302254aae9d442c8ae6cf9e66
    72 N9636bc9ff9d940b4943e95d913912f3a schema:familyName Snášel
    73 schema:givenName Václav
    74 rdf:type schema:Person
    75 N967f4b6b0a984c2290564e4ce13952f6 rdf:first N7ea55ba4eb7d4fbc86e24b0002ad74f0
    76 rdf:rest Nae19a1e6062e492fbf42a8f955f007b8
    77 N96f8adae6812450c9c0a8bdaa34dbb84 schema:location Cham
    78 schema:name Springer International Publishing
    79 rdf:type schema:Organisation
    80 N982fa6c147874b2a8c812a1d5fdb766e schema:familyName Quintián
    81 schema:givenName Héctor
    82 rdf:type schema:Person
    83 Naa4be1be2af84962be1401f2a5d932a4 rdf:first N21ef9d514d334a589972a4e51421af93
    84 rdf:rest Ne4b08c055a2940f59d8a05504364765a
    85 Nae19a1e6062e492fbf42a8f955f007b8 rdf:first N9636bc9ff9d940b4943e95d913912f3a
    86 rdf:rest Nb63820f2d918439884ff047c46171376
    87 Nb483ec91a4ab4269928f83bfb34dcebb schema:name doi
    88 schema:value 10.1007/978-3-319-01854-6_22
    89 rdf:type schema:PropertyValue
    90 Nb56d37965615453b9b818dc8efc7c56b rdf:first N16153a7efc4240b19125b765732b7619
    91 rdf:rest rdf:nil
    92 Nb63820f2d918439884ff047c46171376 rdf:first N33b063e0121c4e9e9a62ae86c736a87d
    93 rdf:rest Nfc974f7ca15149d6a3cbe506efaaaee6
    94 Nd0cadaf391d440d5b6a31306525a8f4f schema:familyName Baruque
    95 schema:givenName Bruno
    96 rdf:type schema:Person
    97 Nd1cb5cb0e0c94430b1e1b5b526cf8f5d schema:familyName Zelinka
    98 schema:givenName Ivan
    99 rdf:type schema:Person
    100 Nd5c214b302254aae9d442c8ae6cf9e66 rdf:first N982fa6c147874b2a8c812a1d5fdb766e
    101 rdf:rest Nb56d37965615453b9b818dc8efc7c56b
    102 Ne4b08c055a2940f59d8a05504364765a rdf:first Nd0cadaf391d440d5b6a31306525a8f4f
    103 rdf:rest N256677945a4d494e95f68c8868041d32
    104 Nef52db56c35743f98916e3fe4daf8497 rdf:first sg:person.013310403353.54
    105 rdf:rest N275f209e9df94774a303dbcc6ea66d5b
    106 Nfc974f7ca15149d6a3cbe506efaaaee6 rdf:first N49c813c4c0c44f398ef81995018716d0
    107 rdf:rest N941efffd404b4600861fbcabbe8e7b10
    108 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
    109 schema:name Information and Computing Sciences
    110 rdf:type schema:DefinedTerm
    111 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
    112 schema:name Artificial Intelligence and Image Processing
    113 rdf:type schema:DefinedTerm
    114 sg:person.013310403353.54 schema:affiliation https://www.grid.ac/institutes/grid.4839.6
    115 schema:familyName Martí
    116 schema:givenName Luis
    117 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.013310403353.54
    118 rdf:type schema:Person
    119 sg:person.07411775305.08 schema:affiliation https://www.grid.ac/institutes/grid.411173.1
    120 schema:familyName Sanchez-Pi
    121 schema:givenName Nayat
    122 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07411775305.08
    123 rdf:type schema:Person
    124 sg:person.07430767131.99 schema:affiliation https://www.grid.ac/institutes/grid.411173.1
    125 schema:familyName Garcia
    126 schema:givenName Ana Cristina Bicharra
    127 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07430767131.99
    128 rdf:type schema:Person
    129 sg:pub.10.1007/978-3-540-71233-6_34 schema:sameAs https://app.dimensions.ai/details/publication/pub.1038142733
    130 https://doi.org/10.1007/978-3-540-71233-6_34
    131 rdf:type schema:CreativeWork
    132 https://doi.org/10.1002/(sici)1097-4571(199009)41:6<391::aid-asi1>3.0.co;2-9 schema:sameAs https://app.dimensions.ai/details/publication/pub.1012153938
    133 rdf:type schema:CreativeWork
    134 https://doi.org/10.1006/knac.1993.1008 schema:sameAs https://app.dimensions.ai/details/publication/pub.1050172020
    135 rdf:type schema:CreativeWork
    136 https://doi.org/10.1109/fskd.2007.432 schema:sameAs https://app.dimensions.ai/details/publication/pub.1093233721
    137 rdf:type schema:CreativeWork
    138 https://doi.org/10.1109/icdm.2004.10077 schema:sameAs https://app.dimensions.ai/details/publication/pub.1095539674
    139 rdf:type schema:CreativeWork
    140 https://doi.org/10.1145/1242572.1242778 schema:sameAs https://app.dimensions.ai/details/publication/pub.1031228233
    141 rdf:type schema:CreativeWork
    142 https://doi.org/10.1145/505282.505283 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023316280
    143 rdf:type schema:CreativeWork
    144 https://www.grid.ac/institutes/grid.411173.1 schema:alternateName Fluminense Federal University
    145 schema:name ADDLabs, Fluminense Federal University, Rua Passo da Pátria, 156., 24210-240 Niterói, RJ, Brazil
    146 rdf:type schema:Organization
    147 https://www.grid.ac/institutes/grid.4839.6 schema:alternateName Pontifical Catholic University of Rio de Janeiro
    148 schema:name Dept. of Electrical Engineering, Pontifícia Universidade Católica do Rio de Janeiro, R. Marquês de São Vicente, 225., Rio de Janeiro, RJ, Brazil
    149 rdf:type schema:Organization
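    In the table above, the ordered author and editor lists are encoded as RDF collections: each blank node carries an `rdf:first` item and an `rdf:rest` pointer to the next node, ending in `rdf:nil`. The following sketch walks the author list using plain tuples copied from the table; `walk_list` is a hypothetical helper, and an RDF library would normally do this for you.

    ```python
    # The six triples that make up the author list, copied from the table above.
    TRIPLES = [
        ("N03d510521fea4f71a04fa188f54ccce0", "rdf:first", "sg:person.07411775305.08"),
        ("N03d510521fea4f71a04fa188f54ccce0", "rdf:rest", "Nef52db56c35743f98916e3fe4daf8497"),
        ("Nef52db56c35743f98916e3fe4daf8497", "rdf:first", "sg:person.013310403353.54"),
        ("Nef52db56c35743f98916e3fe4daf8497", "rdf:rest", "N275f209e9df94774a303dbcc6ea66d5b"),
        ("N275f209e9df94774a303dbcc6ea66d5b", "rdf:first", "sg:person.07430767131.99"),
        ("N275f209e9df94774a303dbcc6ea66d5b", "rdf:rest", "rdf:nil"),
    ]


    def walk_list(head: str, triples: list) -> list:
        """Follow rdf:first / rdf:rest links from head until rdf:nil is reached."""
        first = {s: o for s, p, o in triples if p == "rdf:first"}
        rest = {s: o for s, p, o in triples if p == "rdf:rest"}
        items = []
        seen = set()
        node = head
        while node != "rdf:nil":
            if node in seen:
                # Guard against malformed data that loops back on itself.
                raise ValueError(f"cycle detected at {node}")
            seen.add(node)
            items.append(first[node])
            node = rest[node]
        return items


    # The head node is the object of schema:author in the table above.
    authors = walk_list("N03d510521fea4f71a04fa188f54ccce0", TRIPLES)
    print(authors)
    ```

    The same function would recover the editor order if given the `schema:editor` head node and the corresponding triples.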
     



