Lab 11 - Word Alignment with IBM Model 1

The goal of the lab is to implement IBM Model 1 for word alignment (a model that considers only lexical values of words, i.e. the words as they are written, not their position etc.)

  1. Implement the IBM model 1 as shown in pseudocode in the slides from MT Marathon 2010 (Patrick Lambert, slides originally by Philipp Koehn).
  2. Download manual word alignments: czenali.gz (2501 lines)
  3. Evaluate and report alignment error rate, precision and recall of your IBM1 alignments against the manual alignments.
  4. Your solutions are one part of HW04. From this lab, I will need: