Co-author inclusion: A novel recursive algorithmic method for dealingwith homonyms in bibliometric analysis View Full Text


Ontology type: schema:ScholarlyArticle     


Article Info

DATE

2006-01

AUTHORS

Steven Wooding, Kate Wilcox-Jay, Grant Lewison, Jonathan Grant

ABSTRACT

Large scale bibliometric analysis is often hindered by the presence of homonyms, or namesakes, of the researchers of interest in literature databases. This makes it difficult to build up a true picture of a researcher's publication record, as publications by another researcher with the same name will be included in search results. Using additional information such as title and author addresses, an expert in the field can generally tell if a paper is by a researcher or a namesake; however, manual checking is not practical in large scale studies. Previously various methods have been used to address this problem, chiefly based on filtering by subject, funding acknowledgement or author address. Co-author inclusion is a novel algorithmic method based on co-authorship for dealing with problems of homonyms in large bibliometric surveys. We compared co-author inclusion and subject and funding based filter against the manual assignment of papers by a subject expert (which we assumed to be correct). The subject and funding based filtering identifies only 75% as many papers as assigned by manual scoring. By using co-author inclusion once we increase this to 95%, two further rounds produces 99% as many papers as manual filtering. Although the number of papers identified that were not assigned to the PIs manually also increases, the absolute number is low: rising from 0.2% papers with subject and funding filtering, to 3% papers for three rounds of co-author inclusion. More... »

PAGES

11-21

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/s11192-006-0002-7

DOI

http://dx.doi.org/10.1007/s11192-006-0002-7

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1023043897


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0801", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Artificial Intelligence and Image Processing", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "RAND Europe", 
          "id": "https://www.grid.ac/institutes/grid.425785.9", 
          "name": [
            "RAND Europe, Westbrook Centre, Milton Road, Cambridge CB4 1YG, England"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Wooding", 
        "givenName": "Steven", 
        "id": "sg:person.01007356215.87", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01007356215.87"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "City, University of London", 
          "id": "https://www.grid.ac/institutes/grid.28577.3f", 
          "name": [
            "Department of Information Science, City University, London EC1V 0HB (England)"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Wilcox-Jay", 
        "givenName": "Kate", 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "City, University of London", 
          "id": "https://www.grid.ac/institutes/grid.28577.3f", 
          "name": [
            "Department of Information Science, City University, London EC1V 0HB (England)"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Lewison", 
        "givenName": "Grant", 
        "id": "sg:person.0661232163.87", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0661232163.87"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "RAND Europe", 
          "id": "https://www.grid.ac/institutes/grid.425785.9", 
          "name": [
            "RAND Europe, Cambridge (England)"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Grant", 
        "givenName": "Jonathan", 
        "id": "sg:person.01055471415.15", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01055471415.15"
        ], 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "https://doi.org/10.1093/rheumatology/38.1.13", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1026162028"
        ], 
        "type": "CreativeWork"
      }, 
      {
        "id": "sg:pub.10.1186/1478-4505-1-2", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1052367250", 
          "https://doi.org/10.1186/1478-4505-1-2"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2006-01", 
    "datePublishedReg": "2006-01-01", 
    "description": "Large scale bibliometric analysis is often hindered by the presence of homonyms, or namesakes, of the researchers of interest in literature databases. This makes it difficult to build up a true picture of a researcher's publication record, as publications by another researcher with the same name will be included in search results. Using additional information such as title and author addresses, an expert in the field can generally tell if a paper is by a researcher or a namesake; however, manual checking is not practical in large scale studies. Previously various methods have been used to address this problem, chiefly based on filtering by subject, funding acknowledgement or author address. Co-author inclusion is a novel algorithmic method based on co-authorship for dealing with problems of homonyms in large bibliometric surveys. We compared co-author inclusion and subject and funding based filter against the manual assignment of papers by a subject expert (which we assumed to be correct). The subject and funding based filtering identifies only 75% as many papers as assigned by manual scoring. By using co-author inclusion once we increase this to 95%, two further rounds produces 99% as many papers as manual filtering. Although the number of papers identified that were not assigned to the PIs manually also increases, the absolute number is low: rising from 0.2% papers with subject and funding filtering, to 3% papers for three rounds of co-author inclusion.", 
    "genre": "research_article", 
    "id": "sg:pub.10.1007/s11192-006-0002-7", 
    "inLanguage": [
      "en"
    ], 
    "isAccessibleForFree": false, 
    "isPartOf": [
      {
        "id": "sg:journal.1089056", 
        "issn": [
          "0138-9130", 
          "1588-2861"
        ], 
        "name": "Scientometrics", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "1", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "66"
      }
    ], 
    "name": "Co-author inclusion: A novel recursive algorithmic method for dealingwith homonyms in bibliometric analysis", 
    "pagination": "11-21", 
    "productId": [
      {
        "name": "readcube_id", 
        "type": "PropertyValue", 
        "value": [
          "6dc938368bf4d272395a9486efbed61a06610bd9dc47187e9dd0c02bfce4cfa0"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/s11192-006-0002-7"
        ]
      }, 
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1023043897"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1007/s11192-006-0002-7", 
      "https://app.dimensions.ai/details/publication/pub.1023043897"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2019-04-10T15:54", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-uberresearch-data-dimensions-target-20181106-alternative/cleanup/v134/2549eaecd7973599484d7c17b260dba0a4ecb94b/merge/v9/a6c9fde33151104705d4d7ff012ea9563521a3ce/jats-lookup/v90/0000000001_0000000264/records_8664_00000521.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "http://link.springer.com/10.1007%2Fs11192-006-0002-7"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s11192-006-0002-7'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s11192-006-0002-7'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s11192-006-0002-7'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s11192-006-0002-7'


 

This table displays all metadata directly associated to this object as RDF triples.

92 TRIPLES      21 PREDICATES      29 URIs      19 LITERALS      7 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/s11192-006-0002-7 schema:about anzsrc-for:08
2 anzsrc-for:0801
3 schema:author N4d89c9dde2f745fc96cc1dd6cc826230
4 schema:citation sg:pub.10.1186/1478-4505-1-2
5 https://doi.org/10.1093/rheumatology/38.1.13
6 schema:datePublished 2006-01
7 schema:datePublishedReg 2006-01-01
8 schema:description Large scale bibliometric analysis is often hindered by the presence of homonyms, or namesakes, of the researchers of interest in literature databases. This makes it difficult to build up a true picture of a researcher's publication record, as publications by another researcher with the same name will be included in search results. Using additional information such as title and author addresses, an expert in the field can generally tell if a paper is by a researcher or a namesake; however, manual checking is not practical in large scale studies. Previously various methods have been used to address this problem, chiefly based on filtering by subject, funding acknowledgement or author address. Co-author inclusion is a novel algorithmic method based on co-authorship for dealing with problems of homonyms in large bibliometric surveys. We compared co-author inclusion and subject and funding based filter against the manual assignment of papers by a subject expert (which we assumed to be correct). The subject and funding based filtering identifies only 75% as many papers as assigned by manual scoring. By using co-author inclusion once we increase this to 95%, two further rounds produces 99% as many papers as manual filtering. Although the number of papers identified that were not assigned to the PIs manually also increases, the absolute number is low: rising from 0.2% papers with subject and funding filtering, to 3% papers for three rounds of co-author inclusion.
9 schema:genre research_article
10 schema:inLanguage en
11 schema:isAccessibleForFree false
12 schema:isPartOf N1983ccfc71ba4d8d9eeee0242f83b230
13 Nefe21a922a3b4a92b0c8d04127b379ec
14 sg:journal.1089056
15 schema:name Co-author inclusion: A novel recursive algorithmic method for dealingwith homonyms in bibliometric analysis
16 schema:pagination 11-21
17 schema:productId N0e16038c1da24e32ba58192d78871f8b
18 N244dba22726843b5bc8310e2af18b07a
19 Nb1a8fbcc411440e18fd4ccb01cbd80e0
20 schema:sameAs https://app.dimensions.ai/details/publication/pub.1023043897
21 https://doi.org/10.1007/s11192-006-0002-7
22 schema:sdDatePublished 2019-04-10T15:54
23 schema:sdLicense https://scigraph.springernature.com/explorer/license/
24 schema:sdPublisher Nb4093ea0c6434c0ab788254ce053d9d7
25 schema:url http://link.springer.com/10.1007%2Fs11192-006-0002-7
26 sgo:license sg:explorer/license/
27 sgo:sdDataset articles
28 rdf:type schema:ScholarlyArticle
29 N0e16038c1da24e32ba58192d78871f8b schema:name readcube_id
30 schema:value 6dc938368bf4d272395a9486efbed61a06610bd9dc47187e9dd0c02bfce4cfa0
31 rdf:type schema:PropertyValue
32 N1983ccfc71ba4d8d9eeee0242f83b230 schema:issueNumber 1
33 rdf:type schema:PublicationIssue
34 N244dba22726843b5bc8310e2af18b07a schema:name dimensions_id
35 schema:value pub.1023043897
36 rdf:type schema:PropertyValue
37 N2830b852fcc2469c899153e16b8a3e1a rdf:first sg:person.0661232163.87
38 rdf:rest N578ad1e268464ad5a5b6a3fbadcdb94b
39 N4d89c9dde2f745fc96cc1dd6cc826230 rdf:first sg:person.01007356215.87
40 rdf:rest N793904d70758458a80733edaa9be10f0
41 N578ad1e268464ad5a5b6a3fbadcdb94b rdf:first sg:person.01055471415.15
42 rdf:rest rdf:nil
43 N6fa860a498ff4ba89286b5daf9ee4884 schema:affiliation https://www.grid.ac/institutes/grid.28577.3f
44 schema:familyName Wilcox-Jay
45 schema:givenName Kate
46 rdf:type schema:Person
47 N793904d70758458a80733edaa9be10f0 rdf:first N6fa860a498ff4ba89286b5daf9ee4884
48 rdf:rest N2830b852fcc2469c899153e16b8a3e1a
49 Nb1a8fbcc411440e18fd4ccb01cbd80e0 schema:name doi
50 schema:value 10.1007/s11192-006-0002-7
51 rdf:type schema:PropertyValue
52 Nb4093ea0c6434c0ab788254ce053d9d7 schema:name Springer Nature - SN SciGraph project
53 rdf:type schema:Organization
54 Nefe21a922a3b4a92b0c8d04127b379ec schema:volumeNumber 66
55 rdf:type schema:PublicationVolume
56 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
57 schema:name Information and Computing Sciences
58 rdf:type schema:DefinedTerm
59 anzsrc-for:0801 schema:inDefinedTermSet anzsrc-for:
60 schema:name Artificial Intelligence and Image Processing
61 rdf:type schema:DefinedTerm
62 sg:journal.1089056 schema:issn 0138-9130
63 1588-2861
64 schema:name Scientometrics
65 rdf:type schema:Periodical
66 sg:person.01007356215.87 schema:affiliation https://www.grid.ac/institutes/grid.425785.9
67 schema:familyName Wooding
68 schema:givenName Steven
69 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01007356215.87
70 rdf:type schema:Person
71 sg:person.01055471415.15 schema:affiliation https://www.grid.ac/institutes/grid.425785.9
72 schema:familyName Grant
73 schema:givenName Jonathan
74 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.01055471415.15
75 rdf:type schema:Person
76 sg:person.0661232163.87 schema:affiliation https://www.grid.ac/institutes/grid.28577.3f
77 schema:familyName Lewison
78 schema:givenName Grant
79 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.0661232163.87
80 rdf:type schema:Person
81 sg:pub.10.1186/1478-4505-1-2 schema:sameAs https://app.dimensions.ai/details/publication/pub.1052367250
82 https://doi.org/10.1186/1478-4505-1-2
83 rdf:type schema:CreativeWork
84 https://doi.org/10.1093/rheumatology/38.1.13 schema:sameAs https://app.dimensions.ai/details/publication/pub.1026162028
85 rdf:type schema:CreativeWork
86 https://www.grid.ac/institutes/grid.28577.3f schema:alternateName City, University of London
87 schema:name Department of Information Science, City University, London EC1V 0HB (England)
88 rdf:type schema:Organization
89 https://www.grid.ac/institutes/grid.425785.9 schema:alternateName RAND Europe
90 schema:name RAND Europe, Cambridge (England)
91 RAND Europe, Westbrook Centre, Milton Road, Cambridge CB4 1YG, England
92 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...