Building Hierarchical Classifiers Using Class Proximity.
Ke Wang, Senqiang Zhou, Shiang Chen Liew:
Building Hierarchical Classifiers Using Class Proximity.
VLDB 1999: 363-374@inproceedings{DBLP:conf/vldb/WangZL99,
author = {Ke Wang and
Senqiang Zhou and
Shiang Chen Liew},
editor = {Malcolm P. Atkinson and
Maria E. Orlowska and
Patrick Valduriez and
Stanley B. Zdonik and
Michael L. Brodie},
title = {Building Hierarchical Classifiers Using Class Proximity},
booktitle = {VLDB'99, Proceedings of 25th International Conference on Very
Large Data Bases, September 7-10, 1999, Edinburgh, Scotland,
UK},
publisher = {Morgan Kaufmann},
year = {1999},
isbn = {1-55860-615-7},
pages = {363-374},
ee = {db/conf/vldb/WangZL99.html},
crossref = {DBLP:conf/vldb/99},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
In this paper, we address the need to automatically classify text documents into
topic hierarchies like those in ACM Digital Library and Yahoo!. The existing local
approach constructs a classifier at each split of the topic hierarchy. However, the
local approach does not address the closeness of classification in hierarchical
classification where the concern often is how close a classification is, rather than
simply correct or wrong. Also, the local approach puts its bet on classification at
higher levels where the classification structure often diminishes. To address these
issues, we propose the notion of class proximity and cast the hierarchical
classification as a at classification with the class proximity modeling the closeness
of classes. Our approach is global in that it constructs a single classifier based on
the global information about all classes and class proximity. We leverage generalized
association rules as the rule/feature space to address several other issues in
hierarchical classification.
Copyright © 1999 by the VLDB Endowment.
Permission to copy without fee all or part of this material is granted provided that the copies are not made or
distributed for direct commercial advantage, the VLDB
copyright notice and the title of the publication and
its date appear, and notice is given that copying
is by the permission of the Very Large Data Base
Endowment. To copy otherwise, or to republish, requires
a fee and/or special permission from the Endowment.
Online Paper
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
BibTeX
Printed Edition
Malcolm P. Atkinson, Maria E. Orlowska, Patrick Valduriez, Stanley B. Zdonik, Michael L. Brodie (Eds.):
VLDB'99, Proceedings of 25th International Conference on Very Large Data Bases, September 7-10, 1999, Edinburgh, Scotland, UK.
Morgan Kaufmann 1999, ISBN 1-55860-615-7
Contents BibTeX
References
- [AAK96]
- Hussein Almuallim, Yasuhiro Akiba, Shigeo Kaneda:
An Efficient Algorithm for Finding Optimal Gain-Ratio Multiple-Split Tests on Hierarchical Attributes in Decision Tree Learning.
AAAI/IAAI, Vol. 1 1996: 703-708 BibTeX
- [AIS93]
- Rakesh Agrawal, Tomasz Imielinski, Arun N. Swami:
Mining Association Rules between Sets of Items in Large Databases.
SIGMOD Conference 1993: 207-216 BibTeX
- [AS94]
- Rakesh Agrawal, Ramakrishnan Srikant:
Fast Algorithms for Mining Association Rules in Large Databases.
VLDB 1994: 487-499 BibTeX
- [CDAR97-a]
- Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan:
Using Taxonomy, Discriminants, and Signatures for Navigating in Text Databases.
VLDB 1997: 446-455 BibTeX
- [CDAR97-b]
- Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan:
Scalable Feature Selection, Classification and Signature Generation for Organizing Large Text Databases into Hierarchical Topic Taxonomies.
VLDB J. 7(3): 163-178(1998) BibTeX
- [CDI98]
- Soumen Chakrabarti, Byron Dom, Piotr Indyk:
Enhanced Hypertext Categorization Using Hyperlinks.
SIGMOD Conference 1998: 307-318 BibTeX
- [HF95]
- Jiawei Han, Yongjian Fu:
Discovery of Multiple-Level Association Rules from Large Databases.
VLDB 1995: 420-431 BibTeX
- [KS97]
- ...
- [LHM98]
- Bing Liu, Wynne Hsu, Yiming Ma:
Integrating Classification and Association Rule Mining.
KDD 1998: 80-86 BibTeX
- [WZL99]
- ...
- [Q93]
- J. Ross Quinlan:
C4.5: Programs for Machine Learning.
Morgan Kaufmann 1993, ISBN 1-55860-238-0
BibTeX
- [SA95]
- Ramakrishnan Srikant, Rakesh Agrawal:
Mining Generalized Association Rules.
VLDB 1995: 407-419 BibTeX
- [SG92]
- Padhraic Smyth, Rodney M. Goodman:
An Information Theoretic Approach to Rule Induction from Databases.
IEEE Trans. Knowl. Data Eng. 4(4): 301-316(1992) BibTeX
- [SHP95]
- Hinrich Schütze, David A. Hull, Jan O. Pedersen:
A Comparison of Classifiers and Document Representations for the Routing Problem.
SIGIR 1995: 229-237 BibTeX
- [SOM]
- Self-organizing Map.
http://www.cis.hut.fi/nnrc/nnrc-programs.html BibTeX
- [YP97]
- ...
Referenced by
- Ke Wang, Yu He, Jiawei Han:
Mining Frequent Itemsets Using Support Constraints.
VLDB 2000: 43-52
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
VLDB Proceedings: Copyright © by VLDB Endowment,
ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:46:27 2009