
A New Character-based Indexing Organization using Frequency Data for Japanese Documents.

Yasushi Ogawa, Masajirou Iwasaki: A New Character-based Indexing Organization using Frequency Data for Japanese Documents. SIGIR 1995: 121-129
@inproceedings{DBLP:conf/sigir/OgawaI95,
  author    = {Yasushi Ogawa and
               Masajirou Iwasaki},
  editor    = {Edward A. Fox and
               Peter Ingwersen and
               Raya Fidel},
  title     = {A New Character-based Indexing Organization using Frequency Data
               for Japanese Documents},
  booktitle = {SIGIR'95, Proceedings of the 18th Annual International ACM SIGIR
               Conference on Research and Development in Information Retrieval.
               Seattle, Washington, USA, July 9-13, 1995 (Special Issue of
               the SIGIR Forum)},
  publisher = {ACM Press},
  year      = {1995},
  isbn      = {0-89791-714-6},
  pages     = {121-129},
  ee        = {db/conf/sigir/OgawaI95.html},
  crossref  = {DBLP:conf/sigir/95},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

Character-based indexing is preferable for Japanese IR systems because Japanese text is not segmented into words. This paper proposes a new character-based indexing method that enhances our previous method, which divided character-pair index entries into disjoint groups based on character classes. Because frequency data is used to determine the hashed entries for character pairs and to establish a special string index, both search speed and precision are improved. Moreover, bit strings are managed using small and large blocks, which accelerates both registration and retrieval. Experiments using patent abstracts showed that these proposals are quite effective.
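
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of a character-pair index backed by bit strings, not the authors' method. It assumes that "frequency data" means the most frequent pairs receive a dedicated entry while the remaining pairs share hashed entries; the class name `PairIndex`, the parameters `n_dedicated` and `n_buckets`, and the use of CRC32 as the hash are all hypothetical choices made for illustration. The character-class grouping, the special string index, and the small/large block management mentioned above are not modelled.

```python
import zlib
from collections import Counter


def char_pairs(text):
    """Return the set of adjacent character pairs (bigrams) in text."""
    return {text[i:i + 2] for i in range(len(text) - 1)}


class PairIndex:
    """Toy character-pair index; one bit string (a Python int) per entry."""

    def __init__(self, docs, n_dedicated=4, n_buckets=8):
        self.docs = docs
        self.n_buckets = n_buckets
        # Document frequency of every character pair in the collection.
        freq = Counter(p for d in docs for p in char_pairs(d))
        # Assumption: the most frequent pairs get their own entry;
        # all other pairs fall into shared hashed entries.
        self.dedicated = {p for p, _ in freq.most_common(n_dedicated)}
        # Entry key -> bit string; bit i is set if document i has the entry.
        self.bits = {}
        for i, d in enumerate(docs):
            for p in char_pairs(d):
                k = self._key(p)
                self.bits[k] = self.bits.get(k, 0) | (1 << i)

    def _key(self, pair):
        if pair in self.dedicated:
            return ("pair", pair)
        return ("bucket", zlib.crc32(pair.encode("utf-8")) % self.n_buckets)

    def candidates(self, query):
        """AND the bit strings of all query pairs; may include false positives."""
        mask = (1 << len(self.docs)) - 1
        for p in char_pairs(query):
            mask &= self.bits.get(self._key(p), 0)
        return [i for i in range(len(self.docs)) if (mask >> i) & 1]

    def search(self, query):
        """Verify candidates against the text to discard false positives."""
        return [i for i in self.candidates(query) if query in self.docs[i]]


docs = ["情報検索システム", "文書検索の高速化", "特許情報の分類"]
index = PairIndex(docs)
print(index.search("検索"))  # [0, 1]
print(index.search("情報"))  # [0, 2]
```

In this sketch, shared hashed entries can return documents that do not contain the query, which is why `search` re-checks each candidate against the document text. Giving frequent pairs dedicated entries is one plausible way frequency data could reduce such false matches.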

Copyright © 1995 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.



Printed Edition

Edward A. Fox, Peter Ingwersen, Raya Fidel (Eds.): SIGIR'95, Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Seattle, Washington, USA, July 9-13, 1995 (Special Issue of the SIGIR Forum). ACM Press 1995, ISBN 0-89791-714-6

Online Edition: ACM Digital Library


Referenced by

  1. Yasushi Ogawa: Effective & Efficient Document Ranking without using a Large Lexicon. VLDB 1996: 192-202
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:38:48 2009