Identifying data set specific duplicate patient records

Publication Type	poster
School or College	School of Medicine
Department	Biomedical Informatics
Creator	DuVall, Scott L.
Title	Identifying data set specific duplicate patient records
Date	2009
Description	Probabilistic models are commonly used in the identification of duplicate records. These methods are usually more accurate than deterministic methods, but are exponentially more computationally complex. Thus to make them computationally feasible, they rely on deterministic blocking strategies. This project investigates how machine learning methods can be used to automatically determine an optimal blocking strategy using duplicate records already identified.
Type	Text
Publisher	University of Utah
Subject	Probabilistic models; Duplicate records; Duplicate patient records; Trapeze Interactive Poster
Subject LCSH	Medical informatics; Medical records -- Data processing
Language	eng
Bibliographic Citation	DuVall, S. (2009). Identifying data set specific duplicate patient records. University of Utah.
Rights Management	©Scott DuVall
Format Medium	application/pdf
Format Extent	180,197 bytes
Identifier	ir-main,11054
ARK	ark:/87278/s6zk61cp
Setname	ir_uspace
ID	707874
Reference URL	https://collections.lib.utah.edu/ark:/87278/s6zk61cp