Title |
SymText : a natural language understanding system for encoding free text medical data |
Publication Type |
dissertation |
School or College |
School of Medicine |
Department |
Biomedical Informatics |
Author |
Koehler, Spencer B. |
Date |
1998-06 |
Description |
As a need for access to information grows, the lack of accessible information becomes more evident, particularly in clinical settings. Hospital information systems, electronic medical records, and other efforts to computerize data for improved patient care have shown a need for more structured, or coded, data. As a natural and primary conveyance of clinical data, free text is a rich source of information; however, the lack of structure leaves much of the information hidden to automated systems. A method of transforming free text to useful, coded data is that of applied natural language processing; but this field is still being developed. Theoretical and implementational techniques for accurate processing have been developed and evaluated in a search for better parsing solutions. One question in particular is the relationship and importance of syntax, or structure, and semantics, or meaning, for deciphering language. This research describes the theories and development of SymText, a symbolic text processing system that specifies a formal model for combining syntax and semantics to encode free text. The system is designed to separate inferencing from knowledge so that it can be easily modified and extended both within a domain and to alternate domains of knowledge. It details the use of Bayesian networks as a model for context. This model is constructed for the domain of dictated chest x-ray reports. Within this domain the knowledge modules of the system are tuned through two developmental iterations after the initial development. The system is proven to respond to training with an overall increase in coding accuracy on an independent data set from 48.1% to 86.1% in recall and from 70.3% to 85.6% in precision. We conclude that the applied techniques for understanding natural language are generalizable and a promising beginning for solutions to generating coded data. |
Type |
Text |
Publisher |
University of Utah |
Subject |
Hospital Information Systems; Semantics |
Subject MESH |
Medical Records; Automatic Data Processing; Medical Informatics Computing |
Dissertation Institution |
University of Utah |
Dissertation Name |
PhD |
Language |
eng |
Relation is Version of |
Digital reproduction of "SymText: a natural language understanding system for encoding free text medical data". Spencer S. Eccles Health Sciences Library. Print version of "SymText: a natural language understanding system for encoding free text medical data". available at J. Willard Marriott Library Special Collection R117.5 1998 .K63. |
Rights Management |
© Spencer B. Koehler. |
Format |
application/pdf |
Format Medium |
application/pdf |
Format Extent |
3,553,882 bytes |
Identifier |
undthes,4541 |
Source |
Original: University of Utah Spencer S. Eccles Health Sciences Library (no longer available). |
Funding/Fellowship |
Grants R01LM05323 and 1R1HL53427 from the National Library of Medicine. |
Master File Extent |
3,553,956 bytes |
ARK |
ark:/87278/s6qf8vq9 |
Setname |
ir_etd |
ID |
191445 |
Reference URL |
https://collections.lib.utah.edu/ark:/87278/s6qf8vq9 |