Bin Zhang Data, AI, and engineering notes
Home Blog Archive

Tags

  • #agents
  • #autogen
  • #cloud
  • #containers
  • #data platform
  • #dataframe
  • #deep learning
  • #etl
  • #functional programming
  • #langchain
  • #langgraph
  • #leadership
  • #linear algebra
  • #llm apps
  • #llm fundamentals
  • #llmops
  • #mcp
  • #mlops
  • #model training
  • #optimization
  • #orchestration
  • #python
  • #rag
  • #spark
  • #sql
  • #streaming
  • #tool use
  • #web development

Spark SQL

February 21, 2020
DATA ENGINEERING
sparksqloptimization

```txt master MASTERURL --> 运行模式 例:spark://host:port, mesos://host:port, yarn, or local.

Spark Optimization

February 21, 2020
DATA ENGINEERING
sparkoptimization

PROCESSLOCAL data is in the same JVM as the running code. This is the best locality possible NODELOCAL data is on the same node. Examples might be in...

Gradient Descent

February 2, 2020
MACHINE LEARNING
optimization

Vanilla gradient descent, aka batch gradient descent, computes the gradient of the cost function w.r.t. to the parameters θ

© 2026 Bin Zhang. All rights reserved.