Number of found records: 40
|
BRANDOW, Ronald; MITZE, Karl; RAU, Lisa F. |
|
Automatic condensation of electronic publications by sentence selection. |
|
Information Processing and Management, 1995, vol. 31, n. 5, pp.675-685. |
|
On line (11/05/2005) |
|
Contribution to a special issue on summarizing text. As electronic information access becomes the norm, and the variety of retrievable material increases, automatic methods of summarizing or condensing text will become critical. Describes a system that performs domain-independent automatic condensation of news from a large commercial news service encompassing 41 different publications. This system was evaluated against a system that condensed the same articles using only the 1st portion of the texts (the lead), up to the target length of the summaries. 3 lengths of articles were evaluated for 250 documents by both systems, totalling 1500 suitability judgements in all. The outcome of perhaps the largest evaluation of human versus machine summarization performed to date was unexpected. The lead-based summaries outperformed the `intelligent' summaries significantly, achieving acceptability ratings of over 90 per cent, compared to 74.4 per cent. Reviews the literature, details the implications of these results, and addresses the remaining hopes for content based summarization. The results should be useful to other researchers currently investigating the viability of summarization through sentence selection heuristics. (DB) |
|
Automatic abstracting; Sentences |
Assessment |
|
|
|
|
CHAKRABARTI, Soumen; DOM, Byron; RAGHAVAN, Prabhakar; et al |
|
Automatic Resource List Compilation by Analyzing Hyperlink Structure and Associated Text |
|
Proceedings of the 7th International World Wide Web Conference, 1998. |
|
On line (11/05/2005) |
|
We describe the design, prototyping and evaluation of ARC, a system for automatically compiling a list of authoritative web resources on any (sufficiently broad) topic. The goal of ARC is to compile resource lists similar to those provided by Yahoo! or Infoseek. The fundamental difference is that these services construct lists either manually or through a combination of human and automated effort, while ARC operates fully automatically. We describe the evaluation of ARC, Yahoo!, and Infoseek resource lists by a panel of human users. This evaluation suggests that the resources found by ARC frequently fare almost as well as, and sometimes better than, lists of resources that are manually compiled or classified into a topic. We also provide examples of ARC resource lists for the reader to examine. (AU) |
|
Search; taxonomies; link analysis; anchor text; information retrieval |
Assessment |
|
|
|
|
CHOWDHURY, Gobinda G. |
|
Template mining for information extraction from digital documents. |
|
Library Trends, 1999, vol.48, n.1, pp.182-208 |
|
On line (11/05/2005) |
|
Article included in an issue devoted to the theme: Knowledge discovery in bibliographic databases. With the rapid growth of digital information resources, information extraction (IE), the process of automatically extracting information from natural language texts, is becoming more important. A number of IE systems, particularly in the areas of news/fact retrieval and in domain specific areas, such as in chemical and patent information retrieval, have been developed in the recent past using the template mining approach that involves a natural language processing (NLP) technique to extract data directly from text if either the data and/or text surrounding the data form recognizable patterns. When text matches a template, the system extracts data according to the instructions associated with that template. Briefly reviews template mining research and shows how templates are used in World Wide Web search engines, such as Alta Vista and in metasearch engines, such as Ask Jeeves, for helping end users generate natural language search expressions. Some potential areas of application of template mining for extraction of different kinds of information from digital documents are highlighted and how such applications are used is indicated. (AU) |
|
Databases; Searching; Knowledge discovery; Data mining; Knowledge management; Templates |
Assessment |
|
|
|
|
CHOWDHURY, Gobinda G. |
|
Natural Language Processing |
|
Annual Review of Information Science and Technology (ARIST), 2003, vol. 37, pp. 51-89 |
|
On line ( 09/2005) (Only UGR) |
|
Discusses issues related to natural language processing, including theoretical developments; natural language understanding; tools and techniques; natural language text processing systems; abstracting; information extraction; information retrieval; interfaces; software; Internet, Web, and digital library applications; machine translation for multilingual retrieval; and evaluation. (Contains 136 references.) (DB) |
|
Information retrieval; Natural language processing |
Assessment |
|
|
|
|