(PDF) e-Book The Need For A Thesaurus In Automated Information Retrieval Full Download

Explorations in Automatic Thesaurus Discovery

Gregory Grefenstette

Published: 2012-12-06

Total Pages: 313

Explorations in Automatic Thesaurus Discovery presents an automated method for creating a first-draft thesaurus from raw text. It describes natural processing steps of tokenization, surface syntactic analysis, and syntactic attribute extraction. From these attributes, word and term similarity is calculated and a thesaurus is created showing important common terms and their relation to each other, common verb--noun pairings, common expressions, and word family members. The techniques are tested on twenty different corpora ranging from baseball newsgroups, assassination archives, medical X-ray reports, abstracts on AIDS, to encyclopedia articles on animals, even on the text of the book itself. The corpora range from 40,000 to 6 million characters of text, and results are presented for each in the Appendix. The methods described in the book have undergone extensive evaluation. Their time and space complexity are shown to be modest. The results are shown to converge to a stable state as the corpus grows. The similarities calculated are compared to those produced by psychological testing. A method of evaluation using Artificial Synonyms is tested. Gold Standards evaluation show that techniques significantly outperform non-linguistic-based techniques for the most important words in corpora. Explorations in Automatic Thesaurus Discovery includes applications to the fields of information retrieval using established testbeds, existing thesaural enrichment, semantic analysis. Also included are applications showing how to create, implement, and test a first-draft thesaurus.

NBS Technical Note

Published: 1967-12

Total Pages: 220

Get eBook

The Automated Approach to Technical Information Retrieval

United States. Navy Dept. Bureau of Ships. Technical Library

Published: 1964

Total Pages: 60

Get eBook

Evaluation of Information Systems

Madeline M. Henderson

Published: 1967

Total Pages: 114

Get eBook

The Need for a Thesaurus in Automated Information Retrieval

Charles F. Balz

Published: 1962

Total Pages: 32

Get eBook

Introduction to Information Retrieval

Christopher D. Manning

Published: 2008-07-07

Total Pages:

Get eBook

Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.

Mathematical Linguistics and Automatic Language Processing

Published: 1968

Total Pages: 268

Get eBook

Comptes Rendus 28th Conference

M. Williams

Published: 2013-10-22

Total Pages: 449

Get eBook

Comptes Rendus 28th Conference contains information concerning the various aspects or activities during the National Adhering Organizations at 28th Council Meeting. This book is composed of 69 chapters that include information on the members of different divisions and committees, as well as the minutes of the 28th Council Meeting.

Information Retrieval

William Hersh

Published: 2006-05-04

Total Pages: 524

Get eBook

Coupled with the growth of the World Wide Web, the topic of health information retrieval has had a tremendous impact on consumer health information. With the aid of newly added questions and discussions at the end of each chapter, this Second Edition covers theory practical applications, evaluation, and research directions of all aspects of medical information retireval systems.

Library & Information Bulletin

Published: 1974

Total Pages: 172

Get eBook

Books and Bottles

Download Free The Need For A Thesaurus In Automated Information Retrieval Book in PDF and EPUB Free Download. You can read online The Need For A Thesaurus In Automated Information Retrieval and write the review.

Explorations in Automatic Thesaurus Discovery

NBS Technical Note

The Automated Approach to Technical Information Retrieval

Evaluation of Information Systems

The Need for a Thesaurus in Automated Information Retrieval

Introduction to Information Retrieval

Mathematical Linguistics and Automatic Language Processing

Comptes Rendus 28th Conference

Information Retrieval

Library & Information Bulletin

New Books