It happened in prepare.py. Can you provide the corpus you are using and the address where it was downloaded.