Classifying News Stories using Memory Based Reasoning.
Brij M. Masand, Gordon Linoff, David L. Waltz:
Classifying News Stories using Memory Based Reasoning.
SIGIR 1992: 59-65
@inproceedings{DBLP:conf/sigir/MasandLW92,
author = {Brij M. Masand and
Gordon Linoff and
David L. Waltz},
editor = {Nicholas J. Belkin and
Peter Ingwersen and
Annelise Mark Pejtersen},
title = {Classifying News Stories using Memory Based Reasoning},
booktitle = {Proceedings of the 15th Annual International ACM SIGIR Conference
on Research and Development in Information Retrieval. Copenhagen,
Denmark, June 21-24, 1992},
publisher = {ACM},
year = {1992},
isbn = {0-89791-523-2},
pages = {59-65},
ee = {db/conf/sigir/MasandLW92.html},
crossref = {DBLP:conf/sigir/92},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
We describe a method for classifying news stories using Memory Based Reasoning (MBR), a
k-nearest neighbor method that does not require manual topic definitions. Using an already coded
training database of about 50,000 stories from the Dow Jones Press Release News Wire, and
SEEKER [Stanfill] (a text retrieval system that supports relevance feedback) as the underlying match
engine, codes are assigned to new, unseen stories with a recall of about 80% and precision of about
70%. There are about 350 different codes to be assigned. Using a massively parallel supercomputer,
we leverage the information already contained in the thousands of coded stories and are able to code
a story in about 2 seconds. Given SEEKER, the text retrieval system, we achieved these results in
about two person-months. We believe this approach is effective in reducing the development time to
implement classification systems involving a large number of topics, for purposes such as
classification and message routing.
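
The k-nearest neighbor idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: SEEKER and its relevance-feedback matching are replaced here by a plain cosine similarity over word counts, and the function names, the value of k, and the vote threshold are all assumptions chosen for illustration. A new story receives every code whose similarity-weighted share of the votes among its k nearest coded stories reaches the threshold. Precision and recall are then computed per story in the usual way.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase, split on whitespace, keep purely alphabetic tokens.
    return [w for w in text.lower().split() if w.isalpha()]


def cosine(a, b):
    # Cosine similarity between two word-count vectors (Counters).
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def assign_codes(story, training, k=3, threshold=0.5):
    # training: list of (text, set_of_codes) pairs, i.e. already coded stories.
    # k and threshold are illustrative values, not taken from the paper.
    query = Counter(tokenize(story))
    scored = sorted(
        ((cosine(query, Counter(tokenize(text))), codes) for text, codes in training),
        key=lambda pair: pair[0],
        reverse=True,
    )[:k]
    total = sum(score for score, _ in scored)
    if total == 0:
        return set()  # no neighbor shares any word with the story
    votes = Counter()
    for score, codes in scored:
        for code in codes:
            votes[code] += score  # each neighbor votes with its similarity
    return {code for code, vote in votes.items() if vote / total >= threshold}


def precision_recall(predicted, actual):
    # Precision: fraction of assigned codes that are correct.
    # Recall: fraction of correct codes that were assigned.
    hits = len(predicted & actual)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(actual) if actual else 0.0
    return precision, recall


# Made-up toy data, only to show the call pattern.
training = [
    ("oil prices rise as crude supply falls", {"energy"}),
    ("crude oil output cut lifts energy shares", {"energy", "markets"}),
    ("bank reports quarterly earnings growth", {"earnings"}),
    ("chip maker posts record quarterly earnings", {"earnings", "technology"}),
]
codes = assign_codes("crude oil prices climb on supply fears", training, k=2)
print(codes, precision_recall(codes, {"energy"}))
```

Raising the threshold in this sketch assigns fewer codes, which tends to trade recall for precision; the figures quoted in the abstract (about 80% recall and 70% precision) reflect one such operating point on the authors' own data and system.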
Copyright © 1992 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
Printed Edition
Nicholas J. Belkin, Peter Ingwersen, Annelise Mark Pejtersen (Eds.):
Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Copenhagen, Denmark, June 21-24, 1992.
ACM 1992, ISBN 0-89791-523-2