ABSTRACT:
SUBMITTER: Monasterio L
PROVIDER: S-EPMC5421764 | biostudies-literature | 2017
REPOSITORIES: biostudies-literature
This paper presents a method for classifying the ancestry of Brazilian surnames based on historical sources. The information obtained forms the basis for applying fuzzy matching and machine learning classification algorithms that sort more than 46 million workers into 5 categories: Iberian, Italian, Japanese, German and East European. The vast majority (96.7%) of the single surnames were identified using fuzzy matching, and the rest using a method proposed by Cavnar and Trenkle (1994). A comparison of t ...
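The two-stage idea in the abstract (fuzzy matching first, then the Cavnar and Trenkle n-gram method for the remainder) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The reference surnames, the 0.85 similarity threshold, the use of `difflib.SequenceMatcher` as the fuzzy matcher, and all function names are assumptions made for illustration, and only three of the five categories are populated.

```python
from collections import Counter
from difflib import SequenceMatcher

# Hypothetical reference lists (illustrative only; the paper's historical
# sources are not reproduced here).
REFERENCE = {
    "Iberian": ["silva", "santos", "oliveira", "pereira", "ferreira"],
    "Italian": ["rossi", "bianchi", "ferrari", "romano", "colombo"],
    "German": ["schmidt", "schneider", "muller", "fischer", "weber"],
}


def fuzzy_match(surname, threshold=0.85):
    """Return (ancestry, score) for the closest reference surname,
    or None if no score reaches the (assumed) threshold."""
    best = (None, 0.0)
    for ancestry, names in REFERENCE.items():
        for name in names:
            score = SequenceMatcher(None, surname.lower(), name).ratio()
            if score > best[1]:
                best = (ancestry, score)
    return best if best[1] >= threshold else None


def ngram_profile(words, n_max=3, top=50):
    """Map the `top` most frequent character n-grams to their rank (0 = most frequent)."""
    counts = Counter()
    for word in words:
        padded = f"_{word.lower()}_"  # mark word boundaries
        for n in range(1, n_max + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, breaking ties alphabetically for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {gram: rank for rank, (gram, _) in enumerate(ranked[:top])}


def out_of_place(doc_profile, cat_profile):
    """Cavnar-Trenkle out-of-place distance: sum of rank differences,
    with a maximum penalty for n-grams missing from the category profile."""
    max_penalty = len(cat_profile)
    return sum(
        abs(rank - cat_profile[gram]) if gram in cat_profile else max_penalty
        for gram, rank in doc_profile.items()
    )


# One n-gram profile per ancestry, built from the reference surnames.
PROFILES = {a: ngram_profile(names) for a, names in REFERENCE.items()}


def classify(surname):
    """Try fuzzy matching first; fall back to the nearest n-gram profile."""
    hit = fuzzy_match(surname)
    if hit is not None:
        return hit[0]
    doc = ngram_profile([surname])
    return min(PROFILES, key=lambda a: out_of_place(doc, PROFILES[a]))
```

Calling `classify("Silva")` resolves through the fuzzy-matching stage, while a surname with no close reference entry falls through to the n-gram profile comparison. With reference lists this small, the fallback stage is only a demonstration of the mechanics, not a meaningful classifier.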