Home -> Extract / excerpt creation
Number of found records: 40

Author

BRANDOW, Ronald; MITZE, Karl; RAU, Lisa F.
Title
Automatic condensation of electronic publications by sentence selection.
Source
Information Processing and Management, 1995, vol. 31, n. 5, pp.675-685.
Support
On line (11/05/2005)
Abstract
Contribution to a special issue on summarizing text. As electronic information access becomes the norm, and the variety of retrievable material increases, automatic methods of summarizing or condensing text will become critical. Describes a system that performs domain-independent automatic condensation of news from a large commercial news service encompassing 41 different publications. This system was evaluated against a system that condensed the same articles using only the 1st portion of the texts (the lead), up to the target length of the summaries. 3 lengths of articles were evaluated for 250 documents by both systems, totalling 1500 suitability judgements in all. The outcome of perhaps the largest evaluation of human versus machine summarization performed to date was unexpected. The lead-based summaries outperformed the `intelligent' summaries significantly, achieving acceptability ratings of over 90 per cent, compared to 74.4 per cent. Reviews the literature, details the implications of these results, and addresses the remaining hopes for content based summarization. The results should be useful to other researchers currently investigating the viability of summarization through sentence selection heuristics. (DB)
Keywords
Automatic abstracting; Sentences
Assessment

Author

CHAKRABARTI, Soumen; DOM, Byron; RAGHAVAN, Prabhakar; et al
Title
Automatic Resource List Compilation by Analyzing Hyperlink Structure and Associated Text
Source
Proceedings of the 7th International World Wide Web Conference, 1998.
Support
On line (11/05/2005)
Abstract
We describe the design, prototyping and evaluation of ARC, a system for automatically compiling a list of authoritative web resources on any (sufficiently broad) topic. The goal of ARC is to compile resource lists similar to those provided by Yahoo! or Infoseek. The fundamental difference is that these services construct lists either manually or through a combination of human and automated effort, while ARC operates fully automatically. We describe the evaluation of ARC, Yahoo!, and Infoseek resource lists by a panel of human users. This evaluation suggests that the resources found by ARC frequently fare almost as well as, and sometimes better than, lists of resources that are manually compiled or classified into a topic. We also provide examples of ARC resource lists for the reader to examine. (AU)
Keywords
Search; taxonomies; link analysis; anchor text; information retrieval
Assessment

Author

CHOWDHURY, Gobinda G.
Title
Template mining for information extraction from digital documents.
Source
Library Trends, 1999, vol.48, n.1, pp.182-208
Support
On line (11/05/2005)
Abstract
Article included in an issue devoted to the theme: Knowledge discovery in bibliographic databases. With the rapid growth of digital information resources, information extraction (IE), the process of automatically extracting information from natural language texts, is becoming more important. A number of IE systems, particularly in the areas of news/fact retrieval and in domain specific areas, such as in chemical and patent information retrieval, have been developed in the recent past using the template mining approach that involves a natural language processing (NLP) technique to extract data directly from text if either the data and/or text surrounding the data form recognizable patterns. When text matches a template, the system extracts data according to the instructions associated with that template. Briefly reviews template mining research and shows how templates are used in World Wide Web search engines, such as Alta Vista and in metasearch engines, such as Ask Jeeves, for helping end users generate natural language search expressions. Some potential areas of application of template mining for extraction of different kinds of information from digital documents are highlighted and how such applications are used is indicated. (AU)
Keywords
Databases; Searching; Knowledge discovery; Data mining; Knowledge management; Templates
Assessment

Author

CHOWDHURY, Gobinda G.
Title
Natural Language Processing
Source
Annual Review of Information Science and Technology (ARIST), 2003, vol. 37, pp. 51-89
Support
On line ( 09/2005) (Only UGR)
Abstract
Discusses issues related to natural language processing, including theoretical developments; natural language understanding; tools and techniques; natural language text processing systems; abstracting; information extraction; information retrieval; interfaces; software; Internet, Web, and digital library applications; machine translation for multilingual retrieval; and evaluation. (Contains 136 references.) (DB)
Keywords
Information retrieval; Natural language processing
Assessment
Showing page 2 of 10

Previous 1 2 3 4 5 6 7 8 9 10 Next

Director: © Maria Pinto (UGR)

Creation 31/07/2005 | Update 11/04/2011 | Tutorial | Map | e-mail