Releasing 3B and 7B RedPajama-INCITE family of models including base, instruction-tuned & chat models The RedPajama project aims to create a set of leading open-source models and to rigorously understand the ingredients that yield good performance. A few weeks ago we released the RedPajama base dataset based on the LLaMA paper, which has galvanized the open-source community. The 5 terabyte datase