@inproceedings{DBLP:conf/dasfaa/LamL99,
  author    = {Savio L. Y. Lam and Dik Lun Lee},
  editor    = {Arbee L. P. Chen and Frederick H. Lochovsky},
  title     = {Feature Reduction for Neural Network Based Text Categorization},
  booktitle = {Proceedings of the Sixth International Conference on Database Systems for Advanced Applications (DASFAA), April 19-21, Hsinchu, Taiwan},
  publisher = {IEEE Computer Society},
  year      = {1999},
  isbn      = {0-7695-0084-6},
  pages     = {195-202},
  ee        = {db/conf/dasfaa/LamL99.html},
  crossref  = {DBLP:conf/dasfaa/99},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
In a text categorization model using an artificial neural network as the text classifier, scalability is poor if the neural network is trained on the raw feature space, since textual data has a very high-dimensional feature space.
We proposed and compared four dimensionality reduction techniques to reduce the feature space into an input space of much lower dimension for the neural network classifier. To test the effectiveness of the proposed model, experiments were conducted using a subset of the Reuters-22173 test collection for text categorization.
The results showed that the proposed model was able to achieve high categorization effectiveness as measured by precision and recall. Among the four dimensionality reduction techniques proposed, Principal Component Analysis was found to be the most effective in reducing the dimensionality of the feature space.
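The PCA-based reduction the abstract names can be sketched as follows: center the term-document matrix and project each document vector onto the top-k principal directions, yielding a low-dimensional input for the neural network classifier. This is a minimal illustrative sketch with toy data, not the paper's implementation (which used a subset of Reuters-22173).

```python
import numpy as np

# Toy term-document matrix: rows = documents, columns = term frequencies.
# (Hypothetical data for illustration only.)
X = np.array([
    [2.0, 0.0, 1.0, 0.0, 3.0],
    [0.0, 1.0, 0.0, 2.0, 0.0],
    [1.0, 0.0, 2.0, 0.0, 2.0],
    [0.0, 2.0, 0.0, 1.0, 1.0],
])

def pca_reduce(X, k):
    """Project feature vectors onto the top-k principal components."""
    X_centered = X - X.mean(axis=0)
    # SVD of the centered matrix; rows of Vt are the principal directions,
    # ordered by decreasing explained variance.
    _, _, Vt = np.linalg.svd(X_centered, full_matrices=False)
    return X_centered @ Vt[:k].T

Z = pca_reduce(X, k=2)
print(Z.shape)  # each document is now a 2-dimensional input vector
```

The reduced matrix `Z` would then feed the neural network in place of the raw term frequencies, shrinking the input layer from the vocabulary size to k units.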
Copyright © 1999 by The Institute of Electrical and Electronics Engineers, Inc. (IEEE). Abstract used with permission.