Bling Fire Tokenizer provides state-of-the-art performance for natural language text tokenization. Bling Fire supports the following tokenization algorithms:

- Pattern-based tokenization
- WordPiece tokenization
- SentencePiece Unigram LM
- SentencePiece BPE
- Induced/learned syllabification patterns (identifies possible hyphenation points within a token)

Bling Fire provides a uniform interface for working wi