All posts

statistics hypothesis testing experimentation data analysis onboarding

codex coding agents rust observability developer tools

Agentusage would not exist without open source.

Jul 12, 2026

Statistical Tests for Data Analysis: A Practical Onboarding Guide

DATA SCIENCE

Learn how to choose and use statistical tests without turning analysis into a p-value checklist—from experimental design and assumptions to effect sizes, confidence intervals, and...

Jul 10, 2026

Use Local Models in VS Code Copilot with LM Studio and Unsloth Studio

AI ENGINEERING LARGE LANGUAGE MODELS

vscode copilot lm studio unsloth local llm byok

Before you begin: install VS Code with Copilot Chat, then download a model in LM Studio or Unsloth Studio.

Jul 5, 2026

KV-Centric LLM Serving: vLLM, SGLang, and Disaggregated Attention

vllm sglang paged attention radix attention kv cache disaggregated serving prefill decode

The more I look at LLM serving, the more it feels like the main object is not the request, the model, or even the GPU.

Jul 3, 2026

Reproducing vLLM and LMCache KV Cache Reuse on a CPU-Only MacBook

AI ENGINEERING DATA ENGINEERING SOFTWARE ENGINEERING

vllm lmcache kv cache cpu uv

I became interested in LMCache because it sits in the part of LLM serving that feels both very practical and very under-discussed: KV cache movement.

Jun 23, 2026

Use Databricks Models with VS Code Copilot and Copilot CLI

databricks copilot vscode llm model serving

I wanted one Databricks-hosted model to work in two developer surfaces:

Jun 18, 2026

A Git-Native Message Channel for Local Coding Agents

coding agents git multi-agent systems

My previous local development workflow was simple:

Mar 15, 2026

Attention Dilution

llm fundamentals rag

Attention dilution (also called context dilution) is one of the fundamental limitations of transformer-based LLMs when dealing with long contexts or extended agent memory.

Mar 8, 2026

AI Terminology: Agents, Skills, RAG, MCP, and the Layers Beneath the Hype

agents rag mcp

How many of these terms do you actually recognize?

Jan 4, 2026

ChatGPT in 2025: A Year in Review

llm fundamentals

ChatGPT Stats, ChatGPT Growth, ChatGPT Revenue

Nov 27, 2025

The Mandate for Leadership in AI Engineering

leadership

Over the next 12 to 24 months, the differentiator among engineers will shift from mastery of programming languages like Rust, Go, or Python, or the volume of code produced, to the...

Oct 30, 2025

LLM Interview Questions

llm fundamentals

Hyperparameters are external settings chosen before training, such as the learning rate or regularization strength.

Oct 29, 2025

LLM Training Epoch

model training

As large language models (LLMs) scale up, researchers have begun to notice a growing imbalance between model size and the availability of high-quality training tokens. The...

Oct 20, 2025

vLLM Throughput

llmops

In large-language-model (LLM) inference serving contexts, once the model compute becomes sufficiently fast, the performance bottleneck often shifts to the key-value (KV) cache...

Oct 19, 2025

LangGraph Reflection

langgraph agents

Reflection is related to agent self-improvement or reasoning feedback loops.

Oct 2, 2025

LangGraph Sample Project

langgraph

- [x] Independently deployable services
  - Each agent can scale horizontally (e.g., analysisservice replicas)
  - You can version and deploy agents independently

Sep 29, 2025

LangChain/LangGraph Q&A

langchain langgraph

Its advantages over traditional sequential chains are evident in two areas:

Aug 10, 2025

Training LLM From Zero

model training

1. Objective 2. Environment Setup

Jul 16, 2025

FastMCP MCP Server Hub

mcp

MCP Server Hub: Currently, our projects use a variety of MCP servers. To streamline and unify the process, we plan to implement a HUB MCP server that can handle multiple...

Jul 11, 2025

How LLM Tools work

tool use

Tools in Large Language Models (LLMs): Tools enable large language models (LLMs) to interact with external systems, APIs, or data sources, extending their capabilities beyond text...

Jul 1, 2025

LangChain Retry Logic

langchain

LangChain Invoke Retry Logic: LLM calls are not always stable and may fail because of network issues or other reasons, so retry logic is necessary.

Jun 23, 2025

MCP Transports

mcp

| Feature | stdio | sse (Server-Sent Events) | streamable-http |
|---|---|---|---|

May 4, 2025

Text to SQL (Smolagents)

llm apps sql

Out: None [Step 1: Duration 146.87 seconds | Input tokens: 2,113 | Output tokens: 923] Step 2: Executing...

Apr 25, 2025

MCP Server & Client (SSE)

mcp

Step-by-Step Guide: Building an MCP Server Using the Python SDK, AlphaVantage, and Claude AI. Model Context Protocol (MCP) lab.

Apr 22, 2025

RAG-Reranking

rag

Retrieval-Augmented Generation (RAG) is a powerful approach that combines retrieval and generation to produce high-quality responses. However, the quality of the final response can...

Apr 21, 2025

Ollama Import GGUF Models

llmops

You start by creating a Modelfile, which acts as a key to unlock any GGUF model you want to use.

Mar 29, 2025

GenAI Projects

llm apps

"Learning never exhausts the mind." - Leonardo da Vinci

Feb 16, 2025

Crawling the Web with LLM

llm apps

Skyvern, ScrapegraphAI, Crawl4AI, Reader, Firecrawl, Markdowner

Feb 9, 2025

LangGraph VS AutoGen

agents langgraph autogen

| Feature | LangGraph | AutoGen |
|---|---|---|
| Core Concept | Graph-based workflow for LLM chaining | Multi-agent system with customizable agents |
| Architecture | Node-based computation...

Feb 8, 2025

Autogen Intro and RAG Workflow

agents autogen rag

AutoGen is a framework for creating multi-agent AI applications that can act autonomously or work alongside humans.

Feb 2, 2025

Local LLM Setup

llmops

If you find this in your VSCode, congratulations! You have successfully set up Ollama for code generation and assistance in Visual Studio Code.

Dec 15, 2024

Gradio with Ollama

llm apps python llmops


Nov 15, 2024

PySpark Dataframe Transformation

spark python dataframe

```python
spark = (
    SparkSession.builder.master("local[*]").appName("test").getOrCreate()
)
d = [
    Event(1, "abc"),
    Event(2, "ddd"),
]
```

Nov 1, 2024

Databricks Wheel Job

data platform spark python

My previous Spark project was Scala-based, and I used IDEA to compile and test it conveniently. The Databricks Jobs UI saves you time when creating a JAR job.

Oct 23, 2024

Python Decorator

python

A decorator extends a function's behavior at runtime.

Oct 16, 2024

ZIO

functional programming

This video is helpful for understanding it.

Oct 13, 2024

Reflex Learning

python web development

Reflex (formerly Pynecone) is a library for building full-stack web apps in pure Python.

Oct 5, 2024

Snowflake Data Science Training Summary

DATA SCIENCE

data platform

I enrolled in a private Snowflake Data Science training. Here is what I learned from it.

Sep 8, 2024

AutoGen HttpClient

agents autogen


Sep 8, 2024

How to execute python modules

python

We can use the built-in runpy module to execute different modules in our project.

Aug 12, 2024

Model Registry

MACHINE LEARNING

mlops

Problem: how to introduce ML-based products and features to cross-functional teams.

Jul 18, 2021

Setup Minikube

DEVOPS

containers

bin/spark-submit \ --master k8s://https://192.168.99.100:8443 \ --deploy-mode cluster \ --name spark-pi \ --class org.apache.spark.examples.SparkPi \ --conf spark.driver.cores=1 \ --conf...

Nov 18, 2020

Azure Data Factory (Data Flow)

etl cloud

Recently I have been working in Azure to implement ETL jobs. The main tool is ADF (Azure Data Factory). This post shows some solutions to issues I ran into in my work.

Mar 1, 2020

Spark Dataframe window function

spark dataframe

scala ref create dataframe

Feb 21, 2020

Spark SQL

spark sql optimization

```txt
--master MASTER_URL --> run mode, e.g. spark://host:port, mesos://host:port, yarn, or local.
```

Feb 21, 2020

Spark Optimization

spark optimization

PROCESS_LOCAL: data is in the same JVM as the running code. This is the best locality possible. NODE_LOCAL: data is on the same node. Examples might be in HDFS on the same node, or in...

Feb 11, 2020

Airflow

orchestration

import airflow; from airflow.models import DAG; from airflow.operators.python_operator import PythonOperator

Feb 11, 2020

Whitening transformation

MACHINE LEARNING

linear algebra

Whitening Transformation

Feb 8, 2020

Spark Structured Streaming