Publication Type |
Journal Article |
School or College |
College of Engineering |
Department |
Computing, School of |
Creator |
Riloff, Ellen M. |
Other Author |
Thelen, Michael |
Title |
Bootstrapping method for learning semantic lexicons using extraction pattern contexts |
Date |
2002 |
Description |
This paper describes a bootstrapping algorithm called Basilisk that learns high-quality semantic lexicons for multiple categories. Basilisk begins with an unannotated corpus and seed words for each semantic category, which are then bootstrapped to learn new words for each category. Basilisk hypothesizes the semantic class of a word based on collective information over a large body of extraction pattern contexts. We evaluate Basilisk on six semantic categories. The semantic lexicons produced by Basilisk have higher precision than those produced by previous techniques, with several categories showing substantial improvement. |
Type |
Text |
Publisher |
Association for Computational Linguistics |
First Page |
1 |
Last Page |
8 |
Subject |
Basilisk; Bootstrapping method; Semantic lexicons |
Subject LCSH |
Information retrieval; Programming languages (Electronic computers) -- Semantics |
Language |
eng |
Bibliographic Citation |
Thelen, M., & Riloff, E. M. (2002). Bootstrapping method for learning semantic lexicons using extraction pattern contexts. Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP-02), 1-8. |
Rights Management |
(c) Thelen, M., & Riloff, E. M. |
Format Medium |
application/pdf |
Format Extent |
270,153 bytes |
Identifier |
ir-main,12424 |
ARK |
ark:/87278/s6xs6d1h |
Setname |
ir_uspace |
ID |
706910 |
Reference URL |
https://collections.lib.utah.edu/ark:/87278/s6xs6d1h |