Releasing Pythia for vision and language multimodal AI models

What it is: Pythia is a deep learning framework that supports multitasking in the vision and language domain. Built on our open-source PyTorch framework, Pythia has a modular, plug-and-play design that lets researchers quickly build, reproduce, and benchmark AI models. Pythia is designed for vision and language tasks, such as answering question
![Releasing Pythia for vision and language multimodal AI models](https://cdn-ak-scissors.b.st-hatena.com/image/square/0579ddb5cce7cc85be5a49f56a4f611e0ad21132/height=288;version=1;width=512/https%3A%2F%2Fengineering.fb.com%2Fwp-content%2Fuploads%2F2019%2F03%2FOSIBBlue1.jpg)