I just stumbled across an important article in the history of statistical machine translation. It is available online, so I thought I would post it here for future reference: The Mathematics of Statistical Machine Translation: Parameter Estimation.