Spark1.6-----原始碼解讀之BlockManager元件MemoryStore

阿新 • • 發佈：2018-12-16

MemoryStore負責將沒有序列化的java物件陣列或者序列化的ByteBuffer儲存到記憶體中：

MemoryStore記憶體模型

maxUnrollMemory：當前Driver或者Executor的block最多提前佔用的記憶體的大小，每個執行緒都能佔記憶體。(類似上課佔座，人沒到，但是位置有了)

maxMemory：當前Driver或者Executor儲存所能利用最大記憶體大小。

currentMemoey：當前Driver或者Executor以及用了記憶體。

freeMemory：當前Driver或者Executor為使用的記憶體。

currentUnrollMemory：當前Driver或者Executor的block已經提前佔用的記憶體的大小，所有執行緒block已經提前佔用的記憶體的大小之和

unrollMemoryMap:都存入map中 key為執行緒id,value為每個執行緒佔用的記憶體。

private[spark] class MemoryStore(blockManager: BlockManager, memoryManager: MemoryManager)
  extends BlockStore(blockManager) {

  // Note: all changes to memory allocations, notably putting blocks, evicting blocks, and
  // acquiring or releasing unroll memory, must be synchronized on `memoryManager`!

  private val conf = blockManager.conf
  private val entries = new LinkedHashMap[BlockId, MemoryEntry](32, 0.75f, true)

  // A mapping from taskAttemptId to amount of memory used for unrolling a block (in bytes)
  // All accesses of this map are assumed to have manually synchronized on `memoryManager`
  private val unrollMemoryMap = mutable.HashMap[Long, Long]()
  // Same as `unrollMemoryMap`, but for pending unroll memory as defined below.
  // Pending unroll memory refers to the intermediate memory occupied by a task
  // after the unroll but before the actual putting of the block in the cache.
  // This chunk of memory is expected to be released *as soon as* we finish
  // caching the corresponding block as opposed to until after the task finishes.
  // This is only used if a block is successfully unrolled in its entirety in
  // memory (SPARK-4777).
  private val pendingUnrollMemoryMap = mutable.HashMap[Long, Long]()

  // Initial memory to request before unrolling any block
  private val unrollMemoryThreshold: Long =
    conf.getLong("spark.storage.unrollMemoryThreshold", 1024 * 1024)
  /** Total amount of memory available for storage, in bytes. */
  private def maxMemory: Long = memoryManager.maxStorageMemory

MemoryStore繼承自BlockStore。實現了getBytes,putBytes,putArray,putIterator,getValues等方法。

資料儲存putBytes

如何Block的儲存級別為能序列化，則先進行序列化再呼叫putIterator，否則呼叫TryPut.

override def putBytes(blockId: BlockId, _bytes: ByteBuffer, level: StorageLevel): PutResult = {
    // Work on a duplicate - since the original input might be used elsewhere.
    val bytes = _bytes.duplicate()
    bytes.rewind()
    if (level.deserialized) {
      val values = blockManager.dataDeserialize(blockId, bytes)
      putIterator(blockId, values, level, returnValues = true)
    } else {
      val droppedBlocks = new ArrayBuffer[(BlockId, BlockStatus)]
      tryToPut(blockId, bytes, bytes.limit, deserialized = false, droppedBlocks)
      PutResult(bytes.limit(), Right(bytes.duplicate()), droppedBlocks)
    }
  }

Iterator寫入方法putIterator

呼叫unrollSafely測試看看能不能去佔用Block塊大小的記憶體，如果返回的資料型別為Left(array Values)說明記憶體能裝下，呼叫putArray寫入記憶體。

返回的為Right（array Values）說明記憶體不足將寫入硬碟或者拋棄。

/**
   * Attempt to put the given block in memory store.
   *
   * There may not be enough space to fully unroll the iterator in memory, in which case we
   * optionally drop the values to disk if
   *   (1) the block's storage level specifies useDisk, and
   *   (2) `allowPersistToDisk` is true.
   *
   * One scenario in which `allowPersistToDisk` is false is when the BlockManager reads a block
   * back from disk and attempts to cache it in memory. In this case, we should not persist the
   * block back on disk again, as it is already in disk store.
   */
  private[storage] def putIterator(
      blockId: BlockId,
      values: Iterator[Any],
      level: StorageLevel,
      returnValues: Boolean,
      allowPersistToDisk: Boolean): PutResult = {
    val droppedBlocks = new ArrayBuffer[(BlockId, BlockStatus)]
    val unrolledValues = unrollSafely(blockId, values, droppedBlocks)
    unrolledValues match {
      case Left(arrayValues) =>
        // Values are fully unrolled in memory, so store them as an array
        val res = putArray(blockId, arrayValues, level, returnValues)
        droppedBlocks ++= res.droppedBlocks
        PutResult(res.size, res.data, droppedBlocks)
      case Right(iteratorValues) =>
        // Not enough space to unroll this block; drop to disk if applicable
        if (level.useDisk && allowPersistToDisk) {
          logWarning(s"Persisting block $blockId to disk instead.")
          val res = blockManager.diskStore.putIterator(blockId, iteratorValues, level, returnValues)
          PutResult(res.size, res.data, droppedBlocks)
        } else {
          PutResult(0, Left(iteratorValues), droppedBlocks)
        }
    }
  }

記憶體寫入PutArray

  override def putArray(
      blockId: BlockId,
      values: Array[Any],
      level: StorageLevel,
      returnValues: Boolean): PutResult = {
    val droppedBlocks = new ArrayBuffer[(BlockId, BlockStatus)]
    if (level.deserialized) {
      //估算物件大小
      val sizeEstimate = SizeEstimator.estimate(values.asInstanceOf[AnyRef])
      //嘗試去寫入記憶體
      tryToPut(blockId, values, sizeEstimate, deserialized = true, droppedBlocks)
      PutResult(sizeEstimate, Left(values.iterator), droppedBlocks)
    } else {
      val bytes = blockManager.dataSerialize(blockId, values.iterator)
      tryToPut(blockId, bytes, bytes.limit, deserialized = false, droppedBlocks)
      PutResult(bytes.limit(), Right(bytes.duplicate()), droppedBlocks)
    }
  }

嘗試寫入記憶體方法tryToPut

  private def tryToPut(
      blockId: BlockId,
      value: () => Any,
      size: Long,
      deserialized: Boolean,
      droppedBlocks: mutable.Buffer[(BlockId, BlockStatus)]): Boolean = {

    /* TODO: Its possible to optimize the locking by locking entries only when selecting blocks
     * to be dropped. Once the to-be-dropped blocks have been selected, and lock on entries has
     * been released, it must be ensured that those to-be-dropped blocks are not double counted
     * for freeing up more space for another block that needs to be put. Only then the actually
     * dropping of blocks (and writing to disk if necessary) can proceed in parallel. */
    //多執行緒，必須要鎖住
    memoryManager.synchronized {
      // Note: if we have previously unrolled this block successfully, then pending unroll
      // memory should be non-zero. This is the amount that we already reserved during the
      // unrolling process. In this case, we can just reuse this space to cache our block.
      // The synchronization on `memoryManager` here guarantees that the release and acquire
      // happen atomically. This relies on the assumption that all memory acquisitions are
      // synchronized on the same lock.
      releasePendingUnrollMemoryForThisTask()
     //在測試一下，看現在記憶體還能不能放下該Block，因為多執行緒緣故，可能剛才滿足現在不滿足條件
      val enoughMemory = memoryManager.acquireStorageMemory(blockId, size, droppedBlocks)
      if (enoughMemory) {
        // We acquired enough memory for the block, so go ahead and put it
        val entry = new MemoryEntry(value(), size, deserialized)
        entries.synchronized {
          //能放下就寫入記憶體
          entries.put(blockId, entry)
        }
        val valuesOrBytes = if (deserialized) "values" else "bytes"
        logInfo("Block %s stored as %s in memory (estimated size %s, free %s)".format(
          blockId, valuesOrBytes, Utils.bytesToString(size), Utils.bytesToString(blocksMemoryUsed)))
      } else {
        // Tell the block manager that we couldn't put it in memory so that it can drop it to
        // disk if the block allows disk storage.
        lazy val data = if (deserialized) {
          Left(value().asInstanceOf[Array[Any]])
        } else {
          Right(value().asInstanceOf[ByteBuffer].duplicate())
        }
        val droppedBlockStatus = blockManager.dropFromMemory(blockId, () => data)
        droppedBlockStatus.foreach { status => droppedBlocks += ((blockId, status)) }
      }
      enoughMemory
    }
  }

Spark1.6-----原始碼解讀之BlockManager元件MemoryStore

MemoryStore負責將沒有序列化的java物件陣列或者序列化的ByteBuffer儲存到記憶體中： MemoryStore記憶體模型 maxUnrollMemory：當前Driver或者Executor的block最多提前佔用的記憶體的大小，每個執行緒都能佔記憶體。(類似上課佔座，人沒

Spark1.6-----原始碼解讀之BlockManager元件shuffle服務和客戶端

spark是分散式部署的，每個Task最終都執行在不同的機器節點上，map任務的輸出結果直接儲存到map任務所在的機器的儲存體系，reduce極有可能不再同一個機器上執行，所以需要遠端下載map任務的中間輸出。所以儲存系統中也包含ShuffleClient。在BlockManager 176行

Spark1.6-----原始碼解讀之BlockManager的概述

BlockManager的實現 BlockManager是spark儲存體系中的核心元件，Driver 和Executor都會建立BlockManager。在SparkEnv 364行會建立BlockManager： // NB: blockManager is not val

Spark1.6-----原始碼解讀之TaskScheduler啟動

必須啟動TaskScheduler才能讓他發揮作用 SparkContext 530行： _taskScheduler.start() 實際去調TaskSchedulerImpl 143行： override def start() { backend.start

Spark1.6-----原始碼解讀之DAGScheduler

純滑鼠點程式碼寫出來的，閱讀時希望你能跟著這樣操作。 DAGScheduler的主要用於在任務正式提交給TaskSchedulerImpl提交之前做一些準備工作。比如建立job，將DAG的RDD劃分到不同的stage，提交stage SparkContext 525行建立DAGSchedul

Spark1.6-----原始碼解讀之TaskScheduler

TaskScheduler是SparkContext重要成員之一，負責任務的提交，並且請求叢集管理器對任務排程。他也可以看做任務排程的客戶端。 SparkContext 522行建立TaskScheduler： val (sched, ts) = SparkContex

Spark1.6-----原始碼解讀之SparkEnv

我是跟著原始碼點進去一步一寫的，所以觀看時希望大家能一步一跟著原始碼走，不要只看博文。在SparkContext 284行建立SparkEnv: // This function allows components created by SparkEnv to be mocked in

Spring原始碼解讀之Spring MVC HandlerMapping元件（二）

一、HandlerMapping HandlerMapping作用是根據request找到相應的處理器Handler和Interceptors，並將Handler和Interceptors封裝成HandlerExecutionChain 物件返回。Handler

Spring原始碼解讀之——元件註冊(隨筆)

@ComponentScan value:指定要掃描的包 excludeFilters = Filter[] ：指定掃描的時候按照什麼規則排除那些元件 includeFilters = Filt

【1】pytorch torchvision原始碼解讀之Alexnet

最近開始學習一個新的深度學習框架PyTorch。框架中有一個非常重要且好用的包：torchvision，顧名思義這個包主要是關於計算機視覺cv的。這個包主要由3個子包組成，分別是：torchvision.datasets、torchvision.models、torchvision.trans

java原始碼解讀之HashMap

1:首先下載openjdk(http://pan.baidu.com/s/1dFMZXg1),把原始碼匯入eclipse,以便看到jdk原始碼 Windows-Prefe

PyTorch原始碼解讀之torch.utils.data.DataLoader(轉)

原文連結 https://blog.csdn.net/u014380165/article/details/79058479 寫得特別好！最近正好在學習pytorch，學習一下！ PyTorch中資料讀取的一個重要介面是torch.utils.data.DataLoade

PyTorch原始碼解讀之torchvision.models(轉)

原文地址：https://blog.csdn.net/u014380165/article/details/79119664 PyTorch框架中有一個非常重要且好用的包：torchvision，該包主要由3個子包組成，分別是：torchvision.datasets、torchvision.mode

PyTorch原始碼解讀之torchvision.transforms（轉）

原文地址：https://blog.csdn.net/u014380165/article/details/79167753 PyTorch框架中有一個非常重要且好用的包：torchvision，該包主要由3個子包組成，分別是：torchvision.dat

jQuery原始碼解讀之init函式

jQuery的構造方法： // 直接new了一個物件。同時根據jQuery.fn = jQuery.prototype，jQuery.fn相當於jQuery.prototype。 jQuery = function( selector, context ) { return

PyTorch原始碼解讀之torchvision.transforms

PyTorch框架中有一個非常重要且好用的包：torchvision，該包主要由3個子包組成，分別是：torchvision.datasets、torchvision.models、torchvision.transforms。這3個子包的具體介紹可以參考

eureka原始碼解讀之服務端

剖析eureka服務端啟動流程服務端啟動類-入口處 @EnableEurekaServer @SpringBootApplication public class EurekaServerApplication { public static void main(Strin

Dubbo原始碼解讀之動態代理

前言或許我們已悉知Java的動態代理的方式：jdk——通過介面中的方法名，在動態生成的代理類中呼叫業務實現類的同名方法；cglib——通過繼承業務類，生成的動態代理類是業務類的子類，通過重寫業務方法進行代理。dubbo在沿用java的jdk方式外，還採取了javassist方式——通過

Lumen開發：lumen原始碼解讀之初始化(2)——門面(Facades)與資料庫(db)

緊接上一篇 $app->withFacades();//為應用程式註冊門面。 $app->withEloquent();//為應用程式載入功能強大的庫。先來看看withFacades() /** * Register the facades

Lumen開發：lumen原始碼解讀之初始化(1)——app例項

先來看看入口檔案public/index.php //請求頭 header('Content-Type: application/json; charset=utf-8'); /* |-------------------------------------------------

Spark1.6-----原始碼解讀之BlockManager元件MemoryStore

資料儲存putBytes

Iterator寫入方法putIterator

記憶體寫入PutArray

嘗試寫入記憶體方法tryToPut

相關推薦