This article covers the following “hyperparameters”, sorted by the stage they apply to. In the ingestion stage of a RAG pipeline, you can achieve performance improvements with:

- Data cleaning
- Chunking
- Embedding models
- Metadata
- Multi-indexing
- Indexing algorithms

And in the inferencing stage (retrieval and generation), you can tune:

- Query transformations
- Retrieval parameters
- Advanced retrieval strategies
- Re-ranking
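As an illustration of the chunking knob listed above, here is a minimal sketch of fixed-size chunking with overlap. The function name, the character-based splitting, and the default sizes are assumptions for illustration, not the article's own implementation:

```python
def chunk_text(text: str, chunk_size: int = 50, overlap: int = 10) -> list[str]:
    """Split text into fixed-size character chunks, where consecutive
    chunks share `overlap` characters so context is not cut mid-thought."""
    chunks = []
    step = chunk_size - overlap  # how far the window advances each iteration
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # last window already reached the end of the text
    return chunks
```

Chunk size and overlap are exactly the kind of hyperparameters the ingestion stage exposes: larger chunks preserve more context per retrieved passage, while more overlap reduces the chance a relevant sentence is split across a boundary.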
Let’s see a brief description of the columns of our dataset:

- age (numeric)
- job: type of job (categorical: “admin.”, “unknown”, “unemployed”, “management”, “housemaid”, “entrepreneur”, “student”, “blue-collar”, “self-employed”, “retired”, “technician”, “services”)
- marital: marital status (categorical: “married”, “divorced”, “single”; note: “divorced” means divorced or widowed)
- education (categorical: “
Image by author

With the release of LLaMA v1, we saw a Cambrian explosion of fine-tuned models, including Alpaca, Vicuna, and WizardLM, among others. This trend encouraged different businesses to launch their own base models with licenses suitable for commercial use, such as OpenLLaMA, Falcon, XGen, etc. The release of Llama 2 now combines the best elements from both sides: it offers a highly effic
The Quick-start Guide Isn’t Enough

“Retrieval augmented generation is the process of supplementing a user’s input to a large language model (LLM) like ChatGPT with additional information that you (the system) have retrieved from somewhere else. The LLM can then use that information to augment the response that it generates.” — Cory Zue

LLMs are an amazing invention, prone to one key issue. They mak
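The process Zue describes — retrieving additional information and supplementing the user's input with it — can be sketched as a simple prompt-assembly step. This is a minimal illustration under my own assumptions (the function name, the prompt wording, and the numbered-context format are not from the article):

```python
def build_rag_prompt(question: str, retrieved_docs: list[str]) -> str:
    """Assemble a prompt that asks the LLM to ground its answer
    in the retrieved passages rather than in its own memory."""
    # Number each retrieved passage so the model (and the reader) can
    # trace which piece of context supports the answer.
    context = "\n\n".join(f"[{i + 1}] {doc}" for i, doc in enumerate(retrieved_docs))
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )
```

The resulting string is what actually gets sent to the LLM; the retrieval step that produces `retrieved_docs` is where most of the tuning discussed elsewhere in these articles happens.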
Image created by the authors.

How can we test applications built with LLMs? In this post we look at the concept of testing applications (or prompts) built with language models, in order to better understand their capabilities and limitations. We focus entirely on testing in this article, but if you are interested in tips for writing better prompts, check out our Art of Prompt Design series (ongoing
While working on this blog post I had the privilege of interacting with key developers and leadership from each of the search engines: Bob van Luijt and Etienne Dilocker (Weaviate), Greg Kogan (Pinecone), Pat Lasserre and George Williams (GSI Technologies Inc), Filip Haltmayer (Milvus), Jo Kristian Bergum (Vespa), Kiichiro Yukawa (Vald), and Andre Zayarni (Qdrant). This blog has been discussed on HN: https://news.ycombi