Comparable Corpora And Computer Assisted Translation

Author: Estelle Maryline Delpech
Publisher: John Wiley & Sons
ISBN: 1119002702

Computer-assisted translation (CAT) has always used translation memories, which require the translator to have a corpus of previous translations that the CAT software can use to generate bilingual lexicons. This can be problematic when the translator does not have such a corpus, for instance, when the text belongs to an emerging field. To solve this issue, CAT research has looked into leveraging comparable corpora, i.e. sets of texts, in two or more languages, which deal with the same topic but are not translations of one another. This work has two primary objectives. The first is to assess the contribution of lexicons extracted from comparable corpora in the context of a specialized human translation task. The second is to identify the bilingual-lexicon-extraction methods which best match translators’ needs, determining the current limits of these techniques and suggesting improvements. The author focuses, in particular, on the identification of fertile translations, the management of multiple morphological structures, and the ranking of candidate translations. The experiments are carried out on two language pairs (English–French and English–German) and on specialized texts dealing with breast cancer. This research puts significant emphasis on applicability: methodological choices are guided by the needs of the final users. The book is organized in two parts: the first presents the applicative and scientific context of the research, and the second is given over to efforts to improve compositional translation. The research work presented in this book received the 2014 PhD Thesis award from the French association for natural language processing (ATALA).

Building And Using Comparable Corpora

Author: Serge Sharoff
Publisher: Springer Science & Business Media
ISBN: 3642201288

The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than are translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.

Corpus Linguistics For Translation And Contrastive Studies

Author: Mikhail Mikhailov
Publisher: Routledge
ISBN: 1317229398

Corpus Linguistics for Translation and Contrastive Studies provides a clear and practical introduction to using corpora in these fields. Giving special attention to parallel corpora, which are collections of texts in two or more languages, and demonstrating the potential benefits of multilingual corpus linguistics research to both translators and researchers, this book: explores the different types of parallel corpora available, and shows how to use basic and advanced search procedures to analyse them; explains how to compile a parallel corpus, and discusses the uses of such corpora for translation purposes and for researching linguistic phenomena across languages; demonstrates the use of corpus extracts across a wide range of texts, including dictionaries, novels by authors including Jane Austen and Mikhail Bulgakov, and newspapers such as The Sunday Times; is illustrated with case studies from a range of languages including Finnish, Russian, English and French. Written by two experienced researchers and practitioners, Corpus Linguistics for Translation and Contrastive Studies is essential reading for postgraduate students and researchers working within the area of translation and contrastive studies.

Human Language Technologies The Baltic Perspective

Author: A. Tavast
Publisher: IOS Press
ISBN: 1614991332

Human language technologies continue to play an important part in the modern information society. This book contains papers presented at the fifth international conference ‘Human Language Technologies – The Baltic Perspective (Baltic HLT 2012)’, held in Tartu, Estonia, in October 2012. Baltic HLT provides a special venue for new and ongoing work in computational linguistics and related disciplines, both in the Baltic states and in a broader geographical perspective. It brings together scientists, developers, providers and users of HLT, and is a forum for the sharing of new ideas and recent advances in human language processing, promoting cooperation between the research communities of computer science and linguistics from the Baltic countries and the rest of the world. Twenty long papers, as well as the posters or demos accepted for presentation at the conference, are published here. They cover a wide range of topics: morphological disambiguation, dependency syntax and valency, computational semantics, named entities, dialogue modeling, terminology extraction and management, machine translation, corpus and parallel corpus compiling, speech modeling and multimodal communication. Some of the papers also give a general overview of the state of the art of human language technology and language resources in the Baltic states. This book will be of interest to all those whose work involves the use and application of computational linguistics and related disciplines.

Corpus Based Perspectives In Linguistics

Author: Yuji Kawaguchi
Publisher: John Benjamins Publishing
ISBN: 9027292388

UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the second UBLI workshop on Corpus Linguistics – Research Domain, which was held on September 14, 2006. The first part, consisting of eleven presentations given at this workshop, covers a wide range of subjects within the area of corpus-based research, such as dictionaries, linguistic atlases, dialects, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part of this volume comprises ten additional contributions on both written and spoken corpora by the members and research assistants of UBLI.

Translation A Multidisciplinary Approach

Author: J. House
Publisher: Springer
ISBN: 1137025484

The cross-linguistic and cross-cultural practice of translation is a field of rapidly growing international importance. World-renowned experts offer new and multidisciplinary insights on this subject, viewing translation as social action and intercultural communication, and as a phenomenon of languages in contact and a socio-cognitive process.

Brain Computer Interfaces 2

Author: Maureen Clerc
Publisher: John Wiley & Sons
ISBN: 1119347742

Brain–computer interfaces (BCI) are devices which measure brain activity and translate it into messages or commands, thereby opening up many possibilities for investigation and application. This book provides keys for understanding and designing these multi-disciplinary interfaces, which require many fields of expertise such as neuroscience, statistics, informatics and psychology. This second volume, Technology and Applications, is focused on the field of BCI from the perspective of its end users, from those with disabilities to practitioners. Covering clinical applications and the field of video games, the book then goes on to explore the user needs which drive the design and development of BCI. The software used for their design, primarily OpenViBE, is explained step by step, before a discussion of the use of BCI from ethical, philosophical and social perspectives. The basic notions developed in this reference book are intended to be accessible to all readers interested in BCI, whatever their background. More advanced material is also offered for readers who want to expand their knowledge in the disciplinary fields underlying BCI.

Natural Language Processing And Computational Linguistics 2

Author: Mohamed Zakaria Kurdi
Publisher: John Wiley & Sons
ISBN: 1119419727

Natural Language Processing (NLP) is a scientific discipline found at the intersection of fields such as Artificial Intelligence, Linguistics, and Cognitive Psychology. This book presents, in four chapters, the state of the art and fundamental concepts of key NLP areas. The first chapter presents the fundamental concepts in lexical semantics, lexical databases, knowledge representation paradigms, and ontologies. The second chapter is about combinatorial and formal semantics. Discourse and text representation, automatic discourse segmentation and interpretation, and anaphora resolution are the subject of the third chapter. Finally, in the fourth chapter, I will cover some aspects of large-scale applications of NLP, such as software architecture and its relation to cognitive models of NLP, as well as the evaluation paradigms of NLP software. I will also present in this chapter the main NLP applications, such as Machine Translation (MT), Information Retrieval (IR), and Big Data and Information Extraction tasks such as event extraction, sentiment analysis and opinion mining.

Incorporating Corpora

Author: Gunilla M. Anderman
Publisher: Multilingual Matters
ISBN: 1853599859

Covering a number of European languages from Portuguese to Hungarian, this volume includes many new studies of translation patterns using parallel corpora focusing on particular linguistic features, as well as broader-ranging contributions on translation 'universals'.

Contemporary Corpus Linguistics

Author: Paul Baker
Publisher: A&C Black
ISBN: 1441181334

This book acts as a one-volume resource, providing an introduction to every aspect of corpus linguistics as it is currently practised.