Andrew
Home
Library
Categories
History
About Me
Other
Protection
Eden Switch
Photography
Technical Interview Assessment
Shopee-SZ
LLM
Structured Output
Function Calling
ReAct
Agent Skills
MCP
Prompt-Engineering
Java Web
JVM
JUC
Java
Redis
MySQL
CS Fundamentals
Computer Networks
Big Data
Hadoop
Spark
Go
gRPC
Chassis
Go
Last update: 2026-03-22
RDD
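Distributed Computing Process
What Is Shuffle?
Why Is Spark SQL Faster Than Native RDDs?
Differences Between the Sorting Clauses
1. ORDER BY: Global Sort
2. SORT BY: Local Sort and Write
3. DISTRIBUTE BY: Repartition by Key and Write
4. CLUSTER BY: Distribute and Locally Sort (Same Column), Then Write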
DISTRIBUTE BY + SORT BY
Spark Execution Plan Analysis
DRIVER LOG
Spark UI (HISTORY URL)
All Execution Plans on the Cluster
Job
Stage
Task
Executors
TRACKING URL
AM URL
How to Write Logs