Understanding the content of Tweets is important for many reasons: grasping a user’s interests (which in turn lets us show more relevant content), improving search, and fighting spam. A typical natural language processing pipeline involves many steps, but one of the first and most fundamental is language identification: determining the language in which a piece of text is written.
![Evaluating language identification performance](https://cdn-ak-scissors.b.st-hatena.com/image/square/a7dfbba80c1b2e3d08e53178d335285f8dc68548/height=288;version=1;width=512/https%3A%2F%2Fcdn.cms-twdigitalassets.com%2Fcontent%2Fdam%2Fblog-twitter%2Fengineering%2Fen_us%2Fmain-template-assets%2FEng_EXPLORE_Pink.png.twimg.768.png)