Cookbooks, Wikipedia, and auto-generated Spanglish: The quirky ways AI researchers gather data

Here are four of the most creative data collection methods used by experts at the leading annual conference on natural-language processing.

Data is the oil that fuels AI development, and it gives us many of the advances we take for granted: YouTube captions, Spotify music recommendations, those creepy ad
![Cookbooks, Wikipedia, and auto-generated Spanglish: The quirky ways AI researchers gather data](https://cdn-ak-scissors.b.st-hatena.com/image/square/1ae4ffd940d6345be5216385fbcdfe08b3d30885/height=288;version=1;width=512/https%3A%2F%2Fwp.technologyreview.com%2Fwp-content%2Fuploads%2F2018%2F11%2Fgettyimages-466343187-9.jpg%3Fresize%3D1200%2C600)