Image by authorBuilding a transformer model from scratch can often be the only option for many more specific use cases. Although BERT and other transformer models have been pre-trained for many languages and domains, they do not cover everything. Often, these less common use cases stand to gain the most from having someone come along and build a specific transformer model. It could be for an uncom