ACM SIGMOD Anthology SIGIR dblp.uni-trier.de

Using n-Grams for Korean Text Retrieval.

Joon Ho Lee, Jeong Soo Ahn: Using n-Grams for Korean Text Retrieval. SIGIR 1996: 216-224
@inproceedings{DBLP:conf/sigir/LeeA96,
  author    = {Joon Ho Lee and
               Jeong Soo Ahn},
  editor    = {Hans-Peter Frei and
               Donna Harman and
               Peter Sch{\"a}uble and
               Ross Wilkinson},
  title     = {Using n-Grams for Korean Text Retrieval},
  booktitle = {Proceedings of the 19th Annual International ACM SIGIR Conference
               on Research and Development in Information Retrieval, SIGIR'96,
               August 18-22, 1996, Zurich, Switzerland (Special Issue of the
               SIGIR Forum)},
  publisher = {ACM},
  year      = {1996},
  isbn      = {0-89791-792-8},
  pages     = {216-224},
  ee        = {db/conf/sigir/LeeA96.html},
  crossref  = {DBLP:conf/sigir/96},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX

Abstract

There is a difficulty in applying the conventional word-based indexing to Korean. The indexable segment of a word, i.e. stem is often a compound noun, which results in the serious decrease of retrieval effectiveness. The morpheme-based indexing, which decomposes a compound noun into simple nouns, has been developed to overcome the problem of compound nouns. It, however, requires a large dictionary and complex linguistic knowledge. In this paper we propose a new indexing method by combining the word-based indexing and the n-gram indexing. The proposed method alleviates the problem of compound nouns without dictionaries and linguistic knowledge. Experiment al results show that the proposed method might be almost as effective as the morpheme-based indexing.

Copyright © 1996 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.


ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 2 Issue 3, SIGIR, DASFAA'97, OODBS'86" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ... BibTeX

Printed Edition

Hans-Peter Frei, Donna Harman, Peter Schäuble, Ross Wilkinson (Eds.): Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'96, August 18-22, 1996, Zurich, Switzerland (Special Issue of the SIGIR Forum). ACM 1996, ISBN 0-89791-792-8
Contents BibTeX

Online Edition: ACM Digital Library

Citation page
BibTeX
ACM SIGMOD Anthology - DBLP: [Home | Search: Author, Title | Conferences | Journals]
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:38:51 2009