Download Free Lexical Conceptual Structure And Generation In Machine Translation Book in PDF and EPUB Free Download. You can read online Lexical Conceptual Structure And Generation In Machine Translation and write the review.

To determine the correct lexical items and syntactical realization associated with the surface form in such cases, the underlying lexical-semantic forms are systematically mapped to the target-language syntactic structures. The model described constitutes a lexical-semantic extention to UNITRAN [Dorr, 1987], a syntactic-based translation system that is bidirectional between Spanish and English."
Current approaches to generation for machine translation make use of direct-replacement templates, large grammars, and knowledge-based inferencing techniques. Not only are rules language-specific, but they are too simplistic to handle sentences that exhibit more complex phenomena. Furthermore, these systems are not easily extendable to other languages because the rules that map the internal representation to the surface form are entirely dependent on both the domain of the system and the language being generated. Finally an adequate interlingual representation has not yet been discovered; thus, knowledge-based inferencing is necessary and syntactic cross-linguistic generalization cannot be exploited. This report introduces a plan for the development of a theoretically based computational scheme of natural language generation for a translation system. The emphasis of the project is the mapping from the lexical conceptual structure of sentences to an underlying or base syntactic structure called deep structure. This approach tackles the problems of thematic and structural divergence, i.e., it allows generation of target language sentences that are not thematically or structurally equivalent to their conceptually equivalent source language counterparts. Two other more secondary tasks, construction of a dictionary and mapping from deep structure to surface structure, will also be discussed. The generator operates on a constrained grammatical theory rather than on a set of surface level transformations. (kr).
This paper describes a system for generating natural-language sentences from an interlingual representation, Lexical Conceptual Structure (LCS). The system has been developed as part of a Chinese-English Machine Translation system; however, it is designed to be used for many other MT language pairs and Natural Language applications. The contributions of this work include: (1) Development of a language-independent generation system that maximizes efficiency through the use of a hybrid rule-based/statistical module; (2) Enhancements to an interlingual representation and associated algorithms for interpretation of multiply ambiguous input sentences; (3) Development of an efficient reusable language-independent linearization module with a grammar description language that can be used with other systems; (4) Improvements to an earlier algorithm for hierarchically mapping thematic roles to surface positions; (5) Development of a diagnostic tool for lexicon coverage and correctness and use of the tool for verification of English, Spanish, and Chinese lexicons. An evaluation of translation quality shows comparable performance with commercial translation system. The generation system can also be straightforwardly extended to their languages and this is demonstrated and evaluated for Spanish.
This paper describes a system for generating natural-language sentences from an interlingual representation, Lexical Conceptual Structure (LCS). The system has been developed as part of a Chinese-English Machine Translation system; however, it is designed to be used for many other MT language pairs and Natural Language applications. The contributions of this work include: (1) Development of a language-independent generation system that maximizes efficiency through the use of a hybrid rule-based/statiscal module; (2) Enhancements to an interlingual representation and associated algorithms for interpretation of multiply ambiguous input sentences; (3) Development of an efficient reusable language-independent linearization module with a grammar description language that can be used with other systems; (4) Improvements to an earlier algorithm for hierarchically mapping thematic roles to surface positions; (5) Development of a diagnostic tool for lexicon coverage and correctness and use of the tool for verification of English, Spanish, and Chinese lexicons. An evaluation of translation quality shows comparable performance with commercial translation system. The generation system can also be straightforwardly extended to ther languages and this is demonstrated and evaluated for Spanish.
Abstract: "This report describes the organization and content of lexical information required for the task of machine translation. In particular, the lexical-conceptual basis for UNITRAN, an implemented machine translation system, will be described. UNITRAN uses an underlying form called lexical conceptual structure to perform two difficult, but crucial, tasks: lexical selection (i.e., choosing the appropriate target-language terms for a given source-language sentence) and syntactic realization (i.e., mapping an underlying lexical representation to a corresponding syntactic structure)
This book describes a novel, cross-linguistic approach to machine translation that solves certain classes of syntactic and lexical divergences by means of a lexical conceptual structure that can be composed and decomposed in language-specific ways. This approach allows the translator to operate uniformly across many languages, while still accounting for knowledge that is specific to each language.
This volume constitutes the proceedings of the Third International Workshop of the European Association for Machine Translation, held in Heidelberg, Germany in April 1993. The EAMT Workshops traditionally aim at bringing together researchers, developers, users, and others interested in the field of machine or computer-assisted translation research, development and use. The volume presents thoroughly revised versions of the 15 best workshop contributions together with an introductory survey by the volume editor. The presentations are centered primarily on questions of acquiring, sharing, and managing lexical data, but also address aspects of lexical description.
Machine translation of natural languages is one of the most complex and comprehensive applications of computational linguistics and artificial intelligence. This is especially true of knowledge-based machine translation (KBMT) systems, which require many knowledge resources and processing modules to carry out the necessary levels of analysis, representation and generation of meaning and form. The number of real-world problems, tasks, and solutions involved in developing any realistic-size knowledge-based machine translation system is enormous. It is thus difficult for researchers in the field to learn what a system "really does". This book fills that need with a detailed case study of a KBMT system implemented at the Center for Machine Translation at Carnegie Mellon University. The research consists in part of the creation of a system for translation between English and Japanese. The corpora used in the project were manuals for installing and maintaining IBM personal computers (sponsorship by IBM, through its Tokyo Research Laboratory) Individual chapters describe the interlingua texts used in knowledge-based machine translation, the grammar formalism embodied in the system, the grammars and lexicons and their roles in the translation process, the process of source language analysis, an augmentation module that interactively and automatically resolves ambiguities remaining after source language analysis, and the generator, which produces target language sentences. Detailed appendices illustrate the process from analysis through generation. This book is intended for developers, researchers and advanced students in natural language processing and computational linguistics, including all those who have an interest in machine translation and machine-aided translation.
On-line information -- and free text in particular -- has emerged as a major, yet unexploited, resource available in raw form. Available, but not accessible. The lexicon provides the major key for enabling accessibility to on-line text. The expert contributors to this book explore the range of possibilities for the generation of extensive lexicons. In so doing, they investigate the use of existing on-line dictionaries and thesauri, and explain how lexicons can be acquired from the corpus -- the text under investigation -- itself. Leading researchers in four related fields offer the latest investigations: computational linguists cover the natural language processing aspect; statisticians point out the issues involved in the use of massive data; experts discuss the limitations of current technology; and lexicographers share their experience in the design of the traditional dictionaries.