Publication Type |
Journal Article |
School or College |
College of Engineering |
Department |
Computing, School of |
Creator |
Riloff, Ellen M. |
Other Author |
Furnkranz, Johannes; Mitchell, Tom |
Title |
Case study in using linguistic phrases for text categorization on the WWW |
Date |
1998 |
Description |
Most learning algorithms that are applied to text categorization problems rely on a bag-of-words document representation, i.e., each word occurring in the document is considered as a separate feature. In this paper, we investigate the use of linguistic phrases as input features for text categorization problems. These features are based on information extraction patterns that are generated and used by the AUTOSLOG-TS system. We present experimental results on using such features as background knowledge for two machine learning algorithms on a classification task on the WWW. The results show that phrasal features can improve the precision of learned theories at the expense of coverage. |
Type |
Text |
Publisher |
Association for the Advancement of Artificial Intelligence (AAAI) |
First Page |
1 |
Last Page |
8 |
Subject |
Learning algorithms; Text categorization; Linguistic phrases; Information extraction patterns; AutoSlog-TS |
Subject LCSH |
Information retrieval |
Language |
eng |
Bibliographic Citation |
Furnkranz, J., Mitchell, T., & Riloff, E. M. (1998). Case study in using linguistic phrases for text categorization on the WWW. AAAI/ICML Workshop on Learning for Text Categorization, 1-8. |
Rights Management |
(c)AAAI http://www.aaai.org/ |
Format Medium |
application/pdf |
Format Extent |
962,469 bytes |
Identifier |
ir-main,12440 |
ARK |
ark:/87278/s6h13kb5 |
Setname |
ir_uspace |
ID |
704338 |
Reference URL |
https://collections.lib.utah.edu/ark:/87278/s6h13kb5 |