## Model Summary

Phi-2 is a Transformer with 2.7 billion parameters. It was trained using the same data sources as Phi-1.5, augmented with a new data source consisting of various NLP synthetic texts and filtered websites (for safety and educational value). When assessed against benchmarks testing common sense, language understanding, and logical reasoning, Phi-2 showcased nearly state-of-the-art performance among models with fewer than 13 billion parameters.
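To make the summary concrete, here is a minimal sketch of loading the model for text generation with the transformers library. The checkpoint id `microsoft/phi-2`, the half-precision setting, and the prompt are assumptions for illustration, not details stated in the summary above.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed checkpoint id on the Hugging Face Hub.
model_id = "microsoft/phi-2"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision keeps the 2.7B weights around 5.4 GB
    device_map="auto",
)

prompt = "Instruct: Explain what a Transformer is.\nOutput:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```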
Note: This blog post is also available as a documentation page on Transformers.

Large Language Models (LLMs) such as GPT3/4, Falcon, and Llama are rapidly advancing in their ability to tackle human-centric tasks, establishing themselves as essential tools in modern knowledge-based industries. Deploying these models in real-world tasks remains challenging, however: to exhibit near-human text understanding and generation capabilities, LLMs currently need to be composed of billions of parameters, which greatly amplifies the memory demands of inference.
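To see why the memory demand grows so quickly, note that a model's weights alone occupy roughly the number of parameters times the bytes per value. The back-of-the-envelope sketch below is illustrative, not taken from the post; the parameter counts are the commonly cited sizes of the respective checkpoints, and it ignores activations and the KV cache.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Rough lower bound: weights only, ignoring activations and the KV cache."""
    return num_params * bytes_per_param / 1024**3

# Commonly cited parameter counts (assumptions for illustration).
models = {"Falcon-40B": 40e9, "Llama-2-70B": 70e9}

for name, n in models.items():
    for dtype, nbytes in [("float32", 4), ("bfloat16", 2), ("int8", 1)]:
        print(f"{name} in {dtype}: ~{weight_memory_gb(n, nbytes):.0f} GB")
```

Even in bfloat16, a 70B-parameter model needs on the order of 130 GB just for its weights, which is why deployment typically involves lower precision, sharding across devices, or both.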
rinna/japanese-hubert-base","children":[],"isValid":true,"title":"rinna/japanese-hubert-base"},{"id":"overview","label":"Overview","children":[],"isValid":true,"title":"Overview"},{"id":"how-to-use-the-model","label":"How to use the model","children":[],"isValid":true,"title":"How to use the model"},{"id":"how-to-cite","label":"How to cite","children":[],"isValid":true,"title":"How to cite"},{"id"
We are excited to officially release the integration of trl with peft to make Large Language Model (LLM) fine-tuning with Reinforcement Learning more accessible to anyone! In this post, we explain why this is a competitive alternative to existing fine-tuning approaches. Note that peft is a general tool that can be applied to many ML use cases, but it is particularly interesting for RLHF, as this method is especially memory-hungry.
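As a flavor of what the integration builds on, here is a minimal sketch of wrapping a causal LM with a LoRA adapter via peft so that only a small fraction of the parameters is trained. The base checkpoint and the LoRA hyperparameters are illustrative assumptions, not the values used in the post.

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Small illustrative base model; the post targets much larger checkpoints.
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Typical LoRA settings (assumed values, tune for your use case).
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # prints trainable vs. total parameter counts
```

In the RLHF setting this matters because only the adapter weights need gradients and optimizer state, which is where the bulk of the memory savings comes from.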