# | Creator | Title | Description | Subject | Date
---|---|---|---|---|---
1 | Riloff, Ellen M. | Automatically generating extraction patterns from untagged text | Many corpus-based natural language processing systems rely on text corpora that have been manually annotated with syntactic or semantic tags. In particular, all previous dictionary construction systems for information extraction have used an annotated training corpus or some form of annotated input... | Information extraction; Automatically generating; Extraction patterns; Untagged text; Corpus-based; AutoSlog-TS; AutoSlog system; MUC-4; Dictionary construction | 1996
2 | Riloff, Ellen M. | Case study in using linguistic phrases for text categorization on the WWW | Most learning algorithms that are applied to text categorization problems rely on a bag-of-words document representation, i.e., each word occurring in the document is considered as a separate feature. In this paper, we investigate the use of linguistic phrases as input features for text categoriz... | Learning algorithms; Text categorization; Linguistic phrases; Information extraction patterns; AutoSlog-TS | 1998