A growing number of consumer devices, including smart speakers, headphones, and watches, use speech as the primary means of user input. As a result, voice trigger detection systems—a mechanism that uses voice recognition technology to control access to a particular device or feature—have become an important component of the user interaction pipeline as they signal the start of an interaction betwe
![Apple Machine Learning Journal](https://cdn-ak-scissors.b.st-hatena.com/image/square/8791a6c496638ee5b11fe79215c9f49e6e52d1e4/height=288;version=1;width=512/https%3A%2F%2Fmlr.cdn-apple.com%2Fmedia%2FHome_1200x630_48225d82e9.png)