Bin Zhang Data, AI, and engineering notes
Home Blog Archive

Tags

  • #agents
  • #autogen
  • #cloud
  • #containers
  • #data platform
  • #dataframe
  • #deep learning
  • #etl
  • #functional programming
  • #langchain
  • #langgraph
  • #leadership
  • #linear algebra
  • #llm apps
  • #llm fundamentals
  • #llmops
  • #mcp
  • #mlops
  • #model training
  • #optimization
  • #orchestration
  • #python
  • #rag
  • #spark
  • #sql
  • #streaming
  • #tool use
  • #web development

Attention Dilution

March 15, 2026
LARGE LANGUAGE MODELS
llm fundamentalsrag

Attention dilution (also called context dilution) is one of the fundamental limitations of transformer-based LLMs when dealing with long contexts or...

From Prompt to Response: A Step-by-Step Walkthrough of LLM Inference

March 7, 2026
LARGE LANGUAGE MODELS
llm fundamentals

From input to output, a prompt generally goes through seven steps: request packaging, tokenization, inference scheduling, prefill, and decode before...

ChatGPT in 2025: A Year in Review

January 4, 2026
LARGE LANGUAGE MODELS
llm fundamentals

ChatGPT Stats ChatGPT Growth ChatGPT Revenue

LLM Interview Questions

October 30, 2025
LARGE LANGUAGE MODELS
llm fundamentals

Hyperparameters are external settings chosen before training, such as the learning rate or regularization strength.

© 2026 Bin Zhang. All rights reserved.