Personal tools
You are here: Home Publikationen Alle Publikationen Natural Language Processing for Web Document Analysis
Document Actions

Manuela Kunze and Dietmar Rösner (2003)

Natural Language Processing for Web Document Analysis

In: Web Document Analysis – Challenges and Opportunities, edited by A. Antonacopoulos and J. Hu. World Scientific Publishing Co. Pte.Ltd., New Jersey, chapter 4, pages 59-78.

In this chapter we present an approach to the analysis of web documents — and other electronically available document collections — that is based on the combination of XML technology with NLP techniques. A key issue addressed is to offer end users a collection of highly interoperable and flexible tools for their experiments with document collections. These tools should be easy to use and as robust as possible. XML is chosen as a uniform encoding for all kinds of data: input and output of modules, process information and linguistic resources. This allows effective sharing and reuse of generic solutions for many tasks (e.g. search, presentation, statistics, transformation).
ISBN 981-238-582-7
 

Powered by Plone, the Open Source Content Management System