LT World

General Information
  • Language Technology
  • About LT World
  • Intern. Advisory Board
  • LT World Back Issues

ACL Anthology Searchbench Logo

You are here: Home kb Information & Knowledge Technologies Information Extraction

Information Extraction

FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text.
Hobbs, Appelt, Bear, Israel, Kameyama, Stickel and Tyson.
Roche and Schabes. MIT Press. Cambridge MA. 1996.

Maximum entropy Markov models for information extraction and segmentation.
A. McCallum, D. Freitag, and F. Pereira. 2000.

SRA: Description of the IE-System used for MUC-7.
Chinatsu Aone and Lauren Halverson and Tom Hampton and Mila Ramos-Santacruz.
Proceedings of MUC-7. 1998.

Message Understanding Conference -- 6: A Brief History.
R. Grishman and B. Sundheim.
coling96. Kopenhagen, Denmark, Europe. 1996.

An Information Extraction Core System for Real World German Text Processing.
G. Neumann and R. Backofen and J. Baur and M. Becker and C. Braun.
anlp97. Washington, USA. March 1997.

Unsupervised Discovery of Scenario-Level Patterns for Information Extraction.
R. Yangarber and R. Grishman and P. Tapanainen and S. Huttunen.
Proceedings of the 6th ANLP. Seattle, USA. April 2000.

The Generic Information Extraction System.
J. Hobbs.

FASTUS: A Finite State Processor for Information Extraction from Real World Text.
D. Appelt and J. Hobbs and J. Bear and D. Israel and M. Tyson.
ijcai93. Chambery, France. August 1993.

Information Extraction.
J.Cowie and W.Lehnert.
Communications of the ACM. 39 (1). 1996. 51-87.

  • Message Understanding Conferences Series (MUC)
  • Proteus Project
  • Information Extraction at the University of Sheffield
  • Inxight
  • Introduction to Information Extraction Technology
  • Intelligente Informationsextraktion

  • SRI AI Labs (AIC)
  • Text Analysis International Inc.
  • Language Technology Lab (LT Lab)
  • University of Sheffield
  • Inxight Software
  • insiders technologies GmbH
  • Center for Automated Learning and Discovery (CALD)
  • Compris Intelligence GmbH (

  • Dayne Freitag
  • Yorick Wilks
  • Hamish Cunningham
  • Günter Neumann
  • Tom Mitchell
  • Robert Gaizauskas
  • Douglas E. Appelt
  • Ellen Riloff
  • Ralph Grishman

  • Proteus
  • Whiteboard
  • System for Large Scale Information Extraction (LaSIE)
  • Parameterizable Domain-Adaptive Information and Message Extraction (PARADIME)
  • Finite-State Automa-based Text Understanding System (Fastus)
  • Genome Information Extraction (GenIE)

  • GATE
  • FactMiner
  • Textpro

The goal of information extraction (IE) is to build systems that find and link relevant information from natural language text ignoring irrelevant information. The information of interest is typically pre-specified in form of uninstantiated frame-like structures also called templates. The templates are domain and task specific. The major task of an IE-system is then the identification of the relevant parts of the text which are used to fill a template's slots.