Learning to Extract Information From Text Based on User-Provided Examples.
Scott B. Huffman:
Learning to Extract Information From Text Based on User-Provided Examples.
CIKM 1996: 154-163@inproceedings{DBLP:conf/cikm/Huffman96,
author = {Scott B. Huffman},
title = {Learning to Extract Information From Text Based on User-Provided
Examples},
booktitle = {CIKM '96, Proceedings of the Fifth International Conference on
Information and Knowledge Management, November 12 - 16, 1996,
Rockville, Maryland, USA},
publisher = {ACM},
year = {1996},
pages = {154-163},
ee = {db/conf/cikm/Huffman96.html, http://doi.acm.org/10.1145/238355.238477},
crossref = {DBLP:conf/cikm/96},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
A growing population of users want to extract a growing variety of
information from on-line texts. Unfortunately, current information
extraction systems typically require experts to hand-build
dictionaries of extraction patterns for each new type of information
to be extracted. This paper presents a system that can learn
dictionaries of extraction patterns directly from user-provided
examples of texts and events to be extracted from them. The system,
called LIEP, learns patterns that recognize relationships between key
constituents based on local syntax. Patterns take the form of paths
through a finite-state machine. Sets of patterns learned by LIEP for
a sample extraction task perform nearly at the level of a hand-built
dictionary of patterns.
Copyright © 1996 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
CDROM Version: Load the CDROM "Volume 2 Issue 4, CIKM, DOLAP, GIS, SIGFIDET, ..." and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
BibTeX
Printed Edition
CIKM '96, Proceedings of the Fifth International Conference on Information and Knowledge Management, November 12 - 16, 1996, Rockville, Maryland, USA.
ACM 1996
Contents BibTeX
Online Edition
Citation Page
BibTeX
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
CIKM 1996 Proceedings, ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:01:52 2009