An effective named entity similarity metric for comparing data from multiple sources with varying syntax

David Croft, Stephen Brown, Simon Coupland

Research output: Contribution to journalArticle

2 Downloads (Pure)

Abstract

This paper describes and demonstrates a names entity similarity metric developed for, and currentlyin use by, the FuzzyPhoto project. The presented metric is effective at comparing named entity datain and across syntax less data schemas such as are often encounter in GLAM collections. Theefficiency of the approach was compared to an existing named entity similarity metric and is shownto be a significant improvement when comparing messy named entity data. Publisher Statement: This is a pre-copyedited, author-produced version of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record Croft, D, Brown, S & Coupland, S 2016, 'An effective Named Entity similarity metric for use with syntax independent data' Digital Scholarship in the Humanities, vol 32, no. 4, pp. 779-787 is available online at: https://dx.doi.org/10.1093/llc/fqw035
Original languageEnglish
Pages (from-to)779-787
Number of pages9
JournalDigital Scholarship in the Humanities
Volume32
Issue number4
DOIs
Publication statusPublished - 26 Aug 2016

Fingerprint

syntax
peer review
Syntax
Entity

Bibliographical note

This is a pre-copyedited, author-produced version of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record Croft, D, Brown, S & Coupland, S 2016, 'An effective Named Entity similarity metric for use with syntax independent data' Digital Scholarship in the Humanities, vol 32, no. 4, pp. 779-787 is available online at: https://dx.doi.org/10.1093/llc/fqw035

Cite this

An effective named entity similarity metric for comparing data from multiple sources with varying syntax. / Croft, David; Brown, Stephen; Coupland, Simon.

In: Digital Scholarship in the Humanities, Vol. 32, No. 4, 26.08.2016, p. 779-787.

Research output: Contribution to journalArticle

Croft, David ; Brown, Stephen ; Coupland, Simon. / An effective named entity similarity metric for comparing data from multiple sources with varying syntax. In: Digital Scholarship in the Humanities. 2016 ; Vol. 32, No. 4. pp. 779-787.
@article{b47e31a7f2a547fe921322a4e4f24cf8,
title = "An effective named entity similarity metric for comparing data from multiple sources with varying syntax",
abstract = "This paper describes and demonstrates a names entity similarity metric developed for, and currentlyin use by, the FuzzyPhoto project. The presented metric is effective at comparing named entity datain and across syntax less data schemas such as are often encounter in GLAM collections. Theefficiency of the approach was compared to an existing named entity similarity metric and is shownto be a significant improvement when comparing messy named entity data. Publisher Statement: This is a pre-copyedited, author-produced version of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record Croft, D, Brown, S & Coupland, S 2016, 'An effective Named Entity similarity metric for use with syntax independent data' Digital Scholarship in the Humanities, vol 32, no. 4, pp. 779-787 is available online at: https://dx.doi.org/10.1093/llc/fqw035",
author = "David Croft and Stephen Brown and Simon Coupland",
note = "This is a pre-copyedited, author-produced version of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record Croft, D, Brown, S & Coupland, S 2016, 'An effective Named Entity similarity metric for use with syntax independent data' Digital Scholarship in the Humanities, vol 32, no. 4, pp. 779-787 is available online at: https://dx.doi.org/10.1093/llc/fqw035",
year = "2016",
month = "8",
day = "26",
doi = "10.1093/llc/fqw035",
language = "English",
volume = "32",
pages = "779--787",
journal = "Digital Scholarship in the Humanities",
issn = "2055-7671",
publisher = "Oxford University Press",
number = "4",

}

TY - JOUR

T1 - An effective named entity similarity metric for comparing data from multiple sources with varying syntax

AU - Croft, David

AU - Brown, Stephen

AU - Coupland, Simon

N1 - This is a pre-copyedited, author-produced version of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record Croft, D, Brown, S & Coupland, S 2016, 'An effective Named Entity similarity metric for use with syntax independent data' Digital Scholarship in the Humanities, vol 32, no. 4, pp. 779-787 is available online at: https://dx.doi.org/10.1093/llc/fqw035

PY - 2016/8/26

Y1 - 2016/8/26

N2 - This paper describes and demonstrates a names entity similarity metric developed for, and currentlyin use by, the FuzzyPhoto project. The presented metric is effective at comparing named entity datain and across syntax less data schemas such as are often encounter in GLAM collections. Theefficiency of the approach was compared to an existing named entity similarity metric and is shownto be a significant improvement when comparing messy named entity data. Publisher Statement: This is a pre-copyedited, author-produced version of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record Croft, D, Brown, S & Coupland, S 2016, 'An effective Named Entity similarity metric for use with syntax independent data' Digital Scholarship in the Humanities, vol 32, no. 4, pp. 779-787 is available online at: https://dx.doi.org/10.1093/llc/fqw035

AB - This paper describes and demonstrates a names entity similarity metric developed for, and currentlyin use by, the FuzzyPhoto project. The presented metric is effective at comparing named entity datain and across syntax less data schemas such as are often encounter in GLAM collections. Theefficiency of the approach was compared to an existing named entity similarity metric and is shownto be a significant improvement when comparing messy named entity data. Publisher Statement: This is a pre-copyedited, author-produced version of an article accepted for publication in Digital Scholarship in the Humanities following peer review. The version of record Croft, D, Brown, S & Coupland, S 2016, 'An effective Named Entity similarity metric for use with syntax independent data' Digital Scholarship in the Humanities, vol 32, no. 4, pp. 779-787 is available online at: https://dx.doi.org/10.1093/llc/fqw035

U2 - 10.1093/llc/fqw035

DO - 10.1093/llc/fqw035

M3 - Article

VL - 32

SP - 779

EP - 787

JO - Digital Scholarship in the Humanities

JF - Digital Scholarship in the Humanities

SN - 2055-7671

IS - 4

ER -