A probabilistic similarity metric for Medline records: a model for author name disambiguation.
- 1 January 2003
- journal article
- Vol. 2003, 1033
Abstract
We present a model for automatically generating training sets and estimating the probability that a pair of Medline records sharing a last and first name initial are authored by the same individual, based on shared title words, journal name, co-authors, medical subject headings, language, and affiliation, as well as distinctive features of the name itself (i.e., presence of middle initial, suffix, and prevalence in Medline).This publication has 0 references indexed in Scilit: