@schani now ,I want to traning my transformer model wtih corpus chinese-japanese,I have corpus about 10 million, 1 ,generator traing and dev data,the code adding my data in word2def.py , as follows: from future import absolute_import from future import division from future import print_function import os import tarfile from tensor2tensor.data_generators import problem from tensor2tensor.data_gener