2023

2023/11/02
in Personal
1 min read

AI Engineer Keynote: Pydantic is all you need

2023/09/17
in Language Models, Retrieval Augmented Generation, Query Understanding, Search Systems
7 min read

RAG is more than just embedding search

With the advent of large language models (LLMs), retrieval augmented generation (RAG) has become a hot topic. However, throughout the past year of helping startups integrate LLMs into their stack, I've noticed that the pattern of taking user queries, embedding them, and directly searching a vector store is effectively demoware.

What is RAG?

Retrieval augmented generation (RAG) is a technique that uses an LLM to generate responses, but uses a search backend to augment the generation. In the past year, using text embeddings with a vector database has been the most popular approach I've seen being socialized.

Simple RAG that embeds the user query and makes a search.

So let's kick things off by examining what I like to call the 'Dumb' RAG Model—a basic setup that's more common than you'd think.
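
As a rough illustration of that basic setup, here is a minimal sketch using only the Python standard library. Everything in it is an assumption made for illustration: `embed` is a toy bag-of-words stand-in for a real embedding model, `search` is a linear scan standing in for a vector store, and `DOCS` is made-up data. The final prompt would be passed to an LLM, which is left out here.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def search(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Embed the raw user query and return the k most similar documents."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query: str, docs: list[str], k: int = 2) -> str:
    """Stuff the retrieved documents into a prompt for the LLM."""
    context = "\n".join(f"- {d}" for d in search(query, docs, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

# Hypothetical document collection, for illustration only.
DOCS = [
    "Refunds are processed within five business days.",
    "Our office is closed on public holidays.",
    "Shipping is free for orders over fifty dollars.",
]

if __name__ == "__main__":
    print(build_prompt("how long do refunds take", DOCS, k=1))
```

Note that the user's query goes to the search step untouched: there is no query rewriting, filtering, or routing, which is what makes this version so simple.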

2023/06/01
in Language Models
2 min read

Kojima's Philosophy in LLMs: From Sticks to Ropes

Hideo Kojima's unique perspective on game design, emphasizing empowerment over guidance, offers a striking parallel to the evolving world of Large Language Models (LLMs). Kojima advocates for giving players a rope, not a stick, signifying support that encourages exploration and personal growth. This concept, when applied to LLMs, raises a critical question: Are we merely using these models as tools for straightforward tasks, or are we empowering users to think critically and creatively?

2023/04/04
in Observability, Language Models
3 min read

Good LLM Observability is just plain observability

In this post, I aim to demystify the concept of LLM observability. I'll illustrate how everyday tools employed in system monitoring and debugging can be effectively harnessed to enhance AI agents. Using OpenTelemetry, we'll delve into creating comprehensive telemetry for intricate agent actions, spanning from question answering to autonomous decision-making.

What is OpenTelemetry?

Essentially, OpenTelemetry comprises a suite of APIs, tools, and SDKs that facilitate the creation, collection, and export of telemetry data (such as metrics, logs, and traces). This data is crucial for analyzing and understanding the performance and behavior of software applications.

2023/02/05
in LLM
3 min read

Centaur Chess: AI as a Collaborative Partner

This is an experimental post; please leave feedback in the comments below.

This was an essay written by ChatGPT given a quick transcript of a 5-minute monologue. The goal was to see if I could use ChatGPT to write a blog post. I think it did a pretty good job, but I'll let you be the judge.

It is my intention that by the end you'll understand that AI is not a threat to human intelligence, but rather a tool that can be used to augment human creativity and productivity.

2023/02/05
in Personal
1 min read

Freediving under ice

Growing up, I wasn't very physically active. However, as I got older and had more time, I made a conscious effort to get in shape and improve my relationship with my body. During the Covid pandemic, I developed RSI in my thumbs from coding too much, which prevented me from participating in any sports.