Sign-In / RegisterMy AccountMy BookshelfShopping Cart
Home   ·   About M&C   ·   For Authors   ·   Contacts   ·   Digital Libraries   ·   International Partners

Advanced Search
Browse by Subject
  • Artificial Intelligence and Machine Learning
  • Computer Architecture
  • Computer Science
  • Computer Vision
  • Data Management
  • Data Mining and Knowledge Discovery
  • Data, Semantics, and Knowledge
  • Distributed Computing Theory
  • Emerging Trends in Librarianship
  • Engineering, Science & Technology
  • Games and Computational Intelligence
  • Human Language Technologies
  • Human-Centered Informatics
  • Image, Video, and Multimedia Processing
  • Information Concepts, Retrieval, and Services
  • Information Security, Privacy, and Trust
  • Learning, Networks, and Algorithms
  • Mathematics and Statistics
  • Mobile and Pervasive Computing
  • Network Systems
  • Professionalism and Career Advancement for Scientists and Engineers
  • Quantum Computing
  • Software Engineering
  • Technology and Health
  • Technology, Management, and Entrepreneurship
  • Threatcasting
  • Visual Computing: Computer Graphics, Animation, Computational Photography & Imaging
  • Visualization
  • Advances in Automotive Technology
  • Algorithms and Software in Engineering
  • Antennas
  • Biomedical Engineering
  • Biotechnology
  • Case Studies in Engineering
  • Chemical Engineering and Biochemical Engineering
  • Communication Networks
  • Communications
  • Computation and Analytics
  • Computational Electromagnetics
  • Control and Mechatronics
  • Digital Circuits and Systems
  • Electrical Engineering
  • Electromagnetics
  • Emerging Engineering Technologies
  • Engineering, Science & Technology
  • Engineers, Technology, and Society
  • Global Engineering
  • Image, Video, and Multimedia Processing
  • Materials and Optics
  • Mathematics and Statistics
  • Mechanical Engineering
  • Nuclear Technology and Society
  • Ocean Systems Engineering
  • Operations Research and Applications
  • Power Electronics
  • Professionalism and Career Advancement for Scientists and Engineers
  • Renewable Energy Technologies
  • RF/Microwaves
  • Signal Processing
  • Solid State Materials and Devices
  • Speech and Audio Processing
  • Sustainable Development
  • Synthetic Biology
  • Tissue Engineering
  • Wave Phenomena in the Physical Sciences
  • Building Blocks of the Cell
  • Cell Biology of Medicine
  • Developing Brain
  • Developmental Biology
  • Genetic Basis of Disease
  • Genomic Medicine
  • Integrated Physiology
  • Learning and Memory
  • Neurobiology of Alzheimer's
  • Neuroglia
  • Neuropeptides
  • Protein Activation & Cancer
  • Quantitative Cell Biology
  • Stem Cell Biology
ACM Books
  • Applied and Industrial Physics
  • Applied Mechanics and Fluid Dynamics
  • Astronomy and Astrophysics
  • Atomic and Molecular Physics
  • Biophysics
  • Chemical Physics
  • Computational Physics
  • Condensed Matter and Materials Science
  • Electromagnetics
  • Electronic Materials and Devices
  • Energy and Environmental Physics
  • General and Introductory Physics
  • Geophysics
  • History of Physics
  • Lasers, Optics and Photonics
  • Mathematical Physics
  • Medical Physics
  • Nonlinear Physics
  • Nuclear Physics
  • Particle/High Energy Physics
  • Popular Interest
  • Plasma Physics
  • Quantum Physics
  • Sensor Science and Technology
  • Statistical Physics and Thermodynamics
  • Tools and Methods
  • Waves and Wave Phenomena
Society for Experimental Mechanics Books
Information
Shipping & Returns
Privacy Notice
Conditions of Use
Membership Benefits

Human Language Technologies >> Syntax-based Statistical Machine Translation
Syntax-based Statistical Machine Translation

Syntax-based Statistical Machine Translation

Philip Williams, University of Edinburgh,
Rico Sennrich, University of Edinburgh,
Matt Post, Johns Hopkins University,
Philipp Koehn, Johns Hopkins University

ISBN: 9781627059008 | PDF ISBN: 9781627055024
Copyright © 2016 | 209 Pages
DOI: 10.2200/S00716ED1V04Y201604HLT033
Many institutions worldwide provide digital library access to Morgan & Claypool titles. You can check for personal access by clicking on the DOI link.

READ A SAMPLE

Ordering Options:
Paperback $70.00   |   E-book $56.00   |   Paperback & E-book Combo $87.50

Purchasing Options:


This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models.

The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.

Top | Description | Table of Contents | Author Information | Reviews

Table of Contents

Preface
Acknowledgments
Models
Learning from Parallel Text
Decoding I: Preliminaries
Decoding II: Tree Decoding
Decoding III: String Decoding
Selected Topics
Closing Remarks
Bibliography
Authors' Biographies
Author Index
Index

Top | Description | Table of Contents | Author Information | Reviews

About the Author(s)

Philip Williams, University of Edinburgh
Philip Williams is a Research Associate at the University of Edinburgh, where he completed his Ph.D. in 2014. His main research interest is the integration of linguistic information into statistical machine translation. In his thesis, he applied unification-based constraints to syntax-based statistical machine translation. He is the main contributor to the syntax-based models in the Moses toolkit.

Rico Sennrich, University of Edinburgh
Rico Sennrich is a Research Associate at the University of Edinburgh. He received his Ph.D. in Computational Linguistics from the University of Zurich in 2013. His research focuses on data-driven natural language processing, in particular machine translation, syntax, and morphology. His contributions to syntax-based machine translation include a more efficient algorithm for SCFG decoding, and novel models for syntactic language modelling and productive generation of compounds. He developed syntax-based SMT systems for English-German that were tied for first place in the shared translation tasks of WMT 2014 and 2015.

Matt Post, Johns Hopkins University
Matt Post is a Senior Research Scientist at the Human Language Technology Center of Excellence at Johns Hopkins University, where he has been since completing his Ph.D. at the University of Rochester in 2011. Since 2012, he has co-organized the WMT Conference on Statistical Machine Translation. He is the maintainer of the Apache Joshua statistical machine translation toolkit.

Philipp Koehn, Johns Hopkins University
Philipp Koehn is a Professor of Computer Science at Johns Hopkins University, where he is affiliated with the Center for Language and Speech Processing. He also is the Chair of Machine Translation at the University of Edinburgh. He received his Ph.D. in 2003 from the University of Southern California. He is the creator and maintainer of Moses, the de facto statistical machine translation system, used throughout the world in both research and industry. He is a co-founder of the WMT Conference on Statistical Machine Translation, and author of the 2009 textbook Statistical Machine Translation.

Top | Description | Table of Contents | Author Information | Reviews

Reviews

I found this book to be a useful reference book for those interested in syntax-based SMT. The book is well organized, which makes it easy for readers to refer to specific aspects of syntax-based SMT. An improvement can be made to the presentation of ideas in this book. Throughout the book, there are many technical keywords, resulting from the complexity of syntax-based SMT. It would be useful to highlight these keywords in a side bar to remind readers that they are important keywords. In addition, although examples are given throughout the book, it would be even more useful to use these examples to illustrate how the algorithms work, so that readers can gain a better understanding of the algorithms.
Christian Hadiwinoto, National University of Singapore - Computational Linguistics, Volume 43, Number 4

Write a Review

Top | Description | Table of Contents | Author Information | Reviews
Reviews (1)

Customers who bought this product also purchased
Metaphor
Metaphor
Automatic Text Simplification
Automatic Text Simplification
Planning with Markov Decision Processes
Planning with Markov Decision Processes
Case-Based Reasoning
Case-Based Reasoning
A Concise Introduction to Models and Methods for Automated Planning
A Concise Introduction to Models and Methods for Automated Planning
Computational Modeling of Narrative
Computational Modeling of Narrative
Latest News
Tweets by MorganClaypool

Coming Soon
9781636392837.png
Disability Interactions
This book explores disability interactions (DIX), a new approach to using HCI methods and theories to co-create new technologies, experiences, and ways of working with disabled people...

Read More


Newsletter
Note: Registered customers go to: Your Account to subscribe.

E-Mail Address:

Your Name:
 

Copyright 2022 © Morgan & Claypool Publishers. All rights reserved.