Little Words Can Make a Big Difference for Text Classification.
Ellen Riloff:
Little Words Can Make a Big Difference for Text Classification.
SIGIR 1995: 130-136@inproceedings{DBLP:conf/sigir/Riloff95,
author = {Ellen Riloff},
editor = {Edward A. Fox and
Peter Ingwersen and
Raya Fidel},
title = {Little Words Can Make a Big Difference for Text Classification},
booktitle = {SIGIR'95, Proceedings of the 18th Annual International ACM SIGIR
Conference on Research and Development in Information Retrieval.
Seattle, Washington, USA, July 9-13, 1995 (Special Issue of
the SIGIR Forum)},
publisher = {ACM Press},
year = {1995},
isbn = {0-89791-714-6},
pages = {130-136},
ee = {db/conf/sigir/Riloff95.html},
crossref = {DBLP:conf/sigir/95},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
Most information retrieval systems use stopword lists and stemming algorithms.
However, we have found that recognizing singular and plural nouns, vexb forms, negation, and prepositions can produce dramatically different text classification results.
We present results from text classification experiments that compare relevancy signatures, which use local linguistic context, with corresponding indexing terms that do not.
In two different domains, relevancy signatures produced better results than the simple indexing terms.
These experiments suggest that stopword lists and stemming algorithms may remove or conflate many words that could be used to create more effective indexing terms.
Copyright © 1995 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
CDROM Version: Load the CDROM "Volume 2 Issue 3, SIGIR, DASFAA'97, OODBS'86" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
BibTeX
Printed Edition
Edward A. Fox, Peter Ingwersen, Raya Fidel (Eds.):
SIGIR'95, Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Seattle, Washington, USA, July 9-13, 1995 (Special Issue of the SIGIR Forum).
ACM Press 1995, ISBN 0-89791-714-6
Contents BibTeX
Citation page
Referenced by
- Jeffrey A. Goldman, Douglas Stott Parker Jr., Wesley W. Chu:
Knowledge Discovery in an Earthquake Text Database: Correlation between Significant Earthquakes and the Time of Day.
SSDBM 1997: 12-21
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:38:48 2009