Training Data – Conversion and Corpus Preparation

Published on Jul 7, 2014

This presentation and screencast describes the required training data format for the Moses SMT system and shows how to convert data into this format. It also shows how to align text from translated documents and how to convert TMX files to source more data for SMT training.