Spark 源码系列 - DAGScheduler -> runJob
作者:互联网
目录
结论
DAGScheduler -> runJob
def runJob[T, U](
val waiter = submitJob(rdd, func, partitions, callSite, resultHandler, properties)
DAGScheduler -> submitJob
- 将this, 新生成的jobid, 分区数 生成对象 JobWaiter
- 将JobWatier放入阻塞队列EventLoop, EventLoop是通过内部的死循环机制,不停遍历阻塞队列,当有数据就调用onReceive方法。
def submitJob[T, U](...): JobWaiter[U] = {
...
val jobId = nextJobId.getAndIncrement()
...
val waiter = new JobWaiter[U](this, jobId, partitions.size, resultHandler)
// eventPorcessLoop 是 阻塞队列 LinkedBlockingDeque
eventProcessLoop.post(JobSubmitted(
jobId, rdd, func2, partitions.toArray, callSite, waiter,
Utils.cloneProperties(properties))) // end post
waiter
}
标签:submitJob,...,val,waiter,jobId,DAGScheduler,Spark,源码 来源: https://www.cnblogs.com/abc608088/p/16323384.html