yuiseki's Bookmarks - Hatena Bookmark

Implement Flash Attention Option · Issue #19 · ggerganov/llama.cpp

yuiseki 2024/05/22

ggml : add Flash Attention by ggerganov · Pull Request #5021 · ggerganov/llama.cpp

yuiseki 2024/05/22

https://github.com/ggerganov/llama.cpp/tree/master/examples/train-text-from-scratch

yuiseki 2024/03/14

llama : add BERT support · Issue #2872 · ggerganov/llama.cpp

yuiseki 2024/03/03

Add `gemma` model by postmasters · Pull Request #5631 · ggerganov/llama.cpp

yuiseki 2024/02/22

Windows XP: support MinGW 8.1.0 by guilt · Pull Request #3419 · ggerganov/llama.cpp

yuiseki 2024/01/15

GGUF file format specification by philpax · Pull Request #302 · ggerganov/ggml

yuiseki 2023/09/07

GGUF by ggerganov · Pull Request #2398 · ggerganov/llama.cpp

ref: GGUF file format specification (ggml#302); llama : refactor model loading code (#1991). This PR paves the way for integrating more models into llama.cpp. It changes the file format into which models are converted by extending it with key-value metadata. This metadata is flexible and allows adding specific information about the model being converted. This is a breaking change, meaning

yuiseki 2023/09/07

Difference in different quantization methods · ggerganov/llama.cpp · Discussion #2094

yuiseki 2023/07/21

llama.cpp/examples/server/README.md at master · ggerganov/llama.cpp

yuiseki 2023/07/20

mpi : attempt inference of 65B LLaMA on a cluster of Raspberry Pis · Issue #2164 · ggerganov/llama.cpp

yuiseki 2023/07/17

GitHub - ggerganov/llama.cpp: LLM inference in C/C++

The main goal of llama.cpp is to enable LLM inference with minimal setup and state-of-the-art performance on a wide variety of hardware, locally and in the cloud. Plain C/C++ implementation without any dependencies. Apple silicon is a first-class citizen, optimized via ARM NEON, Accelerate and Metal frameworks. AVX, AVX2 and AVX512 support for x86 architectures. 1.5-bit, 2-bit, 3-bit, 4-bit, 5-bit,

yuiseki 2023/07/11

GitHub - ggerganov/wave-share: Serverless, peer-to-peer, local file sharing through sound

A proof-of-concept for WebRTC signaling using sound. Works with all devices that have microphone + speakers. Runs in the browser. Nearby devices negotiate the WebRTC connection by exchanging the necessary Session Description Protocol (SDP) data via a sequence of audio tones. Upon successful negotiation, a local WebRTC connection is established between the browsers allowing data to be exchanged via