As it turns out, the commenters who noted that the top and bottom languages were likely because of small samples were correct. Although the confidence ranges of the top and bottom groups don’t overlap, the difference is not as clear-cut as the means would suggest. I’m going to try to gather some data from sparser-represented languages to clean this up, and will update here when I have better numb