@inproceedings{DBLP:conf/sigir/Finch95,
  author    = {Steven Finch},
  editor    = {Edward A. Fox and Peter Ingwersen and Raya Fidel},
  title     = {Partial Orders for Document Representation: A New Methodology for Combining Document Features},
  booktitle = {SIGIR'95, Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Seattle, Washington, USA, July 9-13, 1995 (Special Issue of the SIGIR Forum)},
  publisher = {ACM Press},
  year      = {1995},
  isbn      = {0-89791-714-6},
  pages     = {264-272},
  ee        = {db/conf/sigir/Finch95.html},
  crossref  = {DBLP:conf/sigir/95},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
This paper describes a novel paradigm for representing many types of information about documents in a manner particularly suited to text categorization by a trivial empirical rule induction system. It also has potential application to full-text retrieval paradigms.
The paradigm allows many different types of document predicates to be combined while controlling for the logical dependencies among them. This approach is shown to be justified under any reasonable model of descriptor inference, and the effect of increasing the sophistication of the representation is demonstrated on two corpora.
Copyright © 1995 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.