Data2vec 2.0: Highly efficient self-supervised learning for vision, speech and text

Many recent breakthroughs in AI have been powered by self-supervised learning, which enables machines to learn without relying on labeled data. But current algorithms have several significant limitations: they are often specialized for a single modality (such as images or text) and require lots of computat