MIT-6.824 lab1

阿新 • • 發佈：2019-02-18

member tar image art merger new con ase ats

github：https://github.com/haoweiz/MIT-6.824

Part1：

　　第一部分比較簡單，我們只需要修改doMap和doReduce函數即可，主要涉及Go語言對Json文件的讀寫。簡單說說part1的測試流程吧，Sequential部分代碼如下

 1 func TestSequentialSingle(t *testing.T) {
 2     mr := Sequential("test", makeInputs(1), 1, MapFunc, ReduceFunc)
 3     mr.Wait()
 4     check(t, mr.files)
 5     checkWorker(t, mr.stats)
 
 6     cleanup(mr)
 7 }
 8 
 9 func TestSequentialMany(t *testing.T) {
10     mr := Sequential("test", makeInputs(5), 3, MapFunc, ReduceFunc)
11     mr.Wait()
12     check(t, mr.files)
13     checkWorker(t, mr.stats)
14     cleanup(mr)

　　makeInputs（M int）對於0-100000個數字平均分成了M個文件寫入，，根據題目要求，我們需要將每個文件分成N個文件寫出，因此doMap過程總共產生M*N個文件，我們可以先將文件的所有鍵值對通過mapF函數（本質上是test_test.go中的MapFunc函數）存儲在數組keyvalue中，然後調用getStartEnd函數活動第Number個reduce文件應存儲的keyvalue的切片範圍，利用Encoder寫入文件即可

 1 // Get the start and end index of Number if we divide Total keyvalue into nReduce parts
 2 func getStartEnd(Number, nReduce, Total int) (start, end int) {
 3     part := Total/nReduce
 4     if Number == nReduce-1 {
 5         start = Number*part
 6         end = Total
 7     } else {
 8         start = Number*part
 
 9         end = (Number+1)*part
10     }
11     return
12 }
13 
14 // doMap manages one map task: it reads one of the input files
15 // (inFile), calls the user-defined map function (mapF) for that file‘s
16 // contents, and partitions the output into nReduce intermediate files.
17 func doMap(
18     jobName string, // the name of the MapReduce job
19     mapTaskNumber int, // which map task this is
20     inFile string,
21     nReduce int, // the number of reduce task that will be run ("R" in the paper)
22     mapF func(file string, contents string) []KeyValue,
23 ) {
24     //
25     // You will need to write this function.
26     //
27     // The intermediate output of a map task is stored as multiple
28     // files, one per destination reduce task. The file name includes
29     // both the map task number and the reduce task number. Use the
30     // filename generated by reduceName(jobName, mapTaskNumber, r) as
31     // the intermediate file for reduce task r. Call ihash() (see below)
32     // on each key, mod nReduce, to pick r for a key/value pair.
33     //
34     // mapF() is the map function provided by the application. The first
35     // argument should be the input file name, though the map function
36     // typically ignores it. The second argument should be the entire
37     // input file contents. mapF() returns a slice containing the
38     // key/value pairs for reduce; see common.go for the definition of
39     // KeyValue.
40     //
41     // Look at Go‘s ioutil and os packages for functions to read
42     // and write files.
43     //
44     // Coming up with a scheme for how to format the key/value pairs on
45     // disk can be tricky, especially when taking into account that both
46     // keys and values could contain newlines, quotes, and any other
47     // character you can think of.
48     //
49     // One format often used for serializing data to a byte stream that the
50     // other end can correctly reconstruct is JSON. You are not required to
51     // use JSON, but as the output of the reduce tasks *must* be JSON,
52     // familiarizing yourself with it here may prove useful. You can write
53     // out a data structure as a JSON string to a file using the commented
54     // code below. The corresponding decoding functions can be found in
55     // common_reduce.go.
56     //
57     //   enc := json.NewEncoder(file)
58     //   for _, kv := ... {
59     //     err := enc.Encode(&kv)
60     //
61     // Remember to close the file after you have written all the values!
62     //
63     
64     // Read from inFile and save all keys and values in keyvalue 
65     var keyvalue []KeyValue
66     fi, err := os.Open(inFile)
67     if err != nil {
68         log.Fatal("doMap Open: ", err)
69     }
70     defer fi.Close()
71     br := bufio.NewReader(fi)
72     for {
73         a, _, c := br.ReadLine()
74         if c == io.EOF {
75             break
76         }
77         kv := mapF(inFile, string(a))
78         for i := 0; i != len(kv); i++ {
79             keyvalue = append(keyvalue, kv[i])
80         }
81     }
82 
83     // Divide keyvalue into nReduce parts and save them in nReduce files
84     var names []string
85     for r := 0; r != nReduce; r++ {
86         names = append(names, fmt.Sprintf("mrtmp.test-%d-%d", mapTaskNumber, r))
87         file, err := os.Create(names[r])
88         if err != nil {
89             log.Fatal("doMap Create: ", err)
90         }
91         start, end := getStartEnd(r, nReduce, len(keyvalue))
92         enc := json.NewEncoder(file)
93         for _, kv := range keyvalue[start:end] {
94             enc.Encode(kv)
95         }
96         file.Close()
97     }
98 }

　　對於doReduce函數我們需要讀取nMap個文件，將所有鍵值對解碼並重新編碼寫出到outFile中

 1 // doReduce manages one reduce task: it reads the intermediate
 2 // key/value pairs (produced by the map phase) for this task, sorts the
 3 // intermediate key/value pairs by key, calls the user-defined reduce function
 4 // (reduceF) for each key, and writes the output to disk.
 5 func doReduce(
 6     jobName string, // the name of the whole MapReduce job
 7     reduceTaskNumber int, // which reduce task this is
 8     outFile string, // write the output here
 9     nMap int, // the number of map tasks that were run ("M" in the paper)
10     reduceF func(key string, values []string) string,
11 ) {
12     //
13     // You will need to write this function.
14     //
15     // You‘ll need to read one intermediate file from each map task;
16     // reduceName(jobName, m, reduceTaskNumber) yields the file
17     // name from map task m.
18     //
19     // Your doMap() encoded the key/value pairs in the intermediate
20     // files, so you will need to decode them. If you used JSON, you can
21     // read and decode by creating a decoder and repeatedly calling
22     // .Decode(&kv) on it until it returns an error.
23     //
24     // You may find the first example in the golang sort package
25     // documentation useful.
26     //
27     // reduceF() is the application‘s reduce function. You should
28     // call it once per distinct key, with a slice of all the values
29     // for that key. reduceF() returns the reduced value for that key.
30     //
31     // You should write the reduce output as JSON encoded KeyValue
32     // objects to the file named outFile. We require you to use JSON
33     // because that is what the merger than combines the output
34     // from all the reduce tasks expects. There is nothing special about
35     // JSON -- it is just the marshalling format we chose to use. Your
36     // output code will look something like this:
37     //
38     // enc := json.NewEncoder(file)
39     // for key := ... {
40     //     enc.Encode(KeyValue{key, reduceF(...)})
41     // }
42     // file.Close()
43     //
44 
45     // Read all mrtmp.xxx-m-reduceTaskNumber and write to outFile
46     var names []string
47     file, err := os.Create(outFile)
48     if err != nil {
49         log.Fatal("doReduce Create: ", err)
50     }
51     enc := json.NewEncoder(file)
52     defer file.Close()
53 
54     // Read all contents from mrtmp.xxx-m-reduceTaskNumber
55     for m := 0; m != nMap; m++ {
56         names = append(names,  fmt.Sprintf("mrtmp.test-%d-%d", m, reduceTaskNumber))
57         fi, err := os.Open(names[m])
58         if err != nil {
59             log.Fatal("doReduce Open: ", err)
60         }
61         dec := json.NewDecoder(fi)
62         for {
63             var kv KeyValue
64             err = dec.Decode(&kv)
65             if err != nil {
66                 break
67             }
68             enc.Encode(kv)
69         }
70         fi.Close()
71     }
72 }

　　通過測試

技術分享圖片

MIT-6.824 lab1

MIT 6.824 Lab1 mapreduce

這學期選了分散式計算這門課。不得不說真的是一門有料的課程。所有的東西講的都是MIT 6.824上的，Lab也是一樣。不過乾貨太多帶來的也就是，需要花費比較多的時間去看資料和寫程式碼。但是我喜歡這種感覺。課程網址是 http://nil.csail.mit.e

MIT-6.824 lab1

member tar image art merger new con ase ats github：https://github.com/haoweiz/MIT-6.824 Part1：　　第一部分比較簡單，我們只需要修改doMap和doReduce函數即可，主要涉及G

MIT 6.824 lab1:mapreduce

line function not eating them 鏈接 collected rpc ner 這是 MIT 6.824 課程 lab1 的學習總結，記錄我在學習過程中的收獲和踩的坑。我的實驗環境是 windows 10，所以對lab的code 做了一些環境上的修

MIT-6.828 Lab1實驗報告

imp 彈出遞歸調用 switch語句 kde 信息編譯器化工 x86匯編 Lab1:Booting a PC 概述本文主要介紹lab1，從內容上分為三部分，part1簡單介紹了匯編語言，物理內存地址空間，BIOS。part2介紹了BIOS從磁盤0號扇區讀取boot

MIT 6.824拾遺（一）聊聊basic-paxos

前言 ====== > The Paxos algorithm, when presented in plain English, is very simple. > —————— Lamport，《Paxos Made Simple》 > > Unfortunat

[分布式系統學習] 6.824 LEC2 RPC和線程筆記

amp star nbsp 機制並且 als goroutine 操作 page 6.824的課程通常是在課前讓你做一些準備。一般來說是先讀一篇論文，然後請你提一個問題，再請你回答一個問題。然後上課，然後布置Lab。第二課的準備-Crawler 第二課的準備不是論文

MIT 6.00.1x學習心得

學習 6.0 try tom 復雜度都是本科編寫流程　　現在是大三上半學期，看了蕭井陌的編程入門指南之後，用了大概一個月的時間終於把MIT 6.00.1x 課程學完了，有編程經驗，但是因為本科是信息工程準備跨考計算機科學，而且不願意當一個只會敲代碼的碼農，所以對於

Mit6.824 Lab1-MapReduce

named 裏的 them 切片我們 doc 感受控制高並發前言 Mit6.824 是我在學習一些分布式系統方面的知識的時候偶然看到的，然後就開始嘗試跟課。不得不說，國外的課程難度是真的大，一周的時間居然要學一門 Go 語言，然後還要讀論文，進而做MapReduce

MIT 6.031 Software Construction 學習筆記:(三) Mutability & Immutability

這節主要是講可變物件給程式設計帶來的危害，所謂不可變物件，就是整個生命週期中不可變的物件(廢話)， e.g. : String 具體來說參見 Basic Java when we discussed snapshot diagrams Risks of mutation r

《MIT 6.828 Homework 1: boot xv6》解題報告

本作業的網站連結：MIT 6.828 Homework 1: boot xv6 問題 Exercise: What is on the stack? While stopped at the above breakpoint, look at the registers and the stack

MIT 6.828課程引導部分的解讀

引導程式碼位於boot資料夾下，由一個16位與32位彙編混合的彙編檔案（boot.S）和一個C語言檔案（main.c）組成。程式的入口在boot.S中，採用的是AT&T語法，下面先對這個檔案進行分析： #include <inc/mmu.h> 在inc資料夾下有一個mm

《MIT 6.828 Lab 1 Exercise 12》實驗報告

本實驗的網站連結：MIT 6.828 Lab 1 Exercise 12。題目 Exercise 12. Modify your stack backtrace function to display, for each eip, the function name, source file na

MIT 6.001：SICP 2nd （word版和chm版）共享下載

MIT 6.001：SICP 2nd （Structure and Interpretation of Computer Programs -Second edition） Harold Abelson and Gerald Jay Sussman 這本

MIT-6.828-JOS-lab5:File system, Spawn and Shell

依然目前相同系統服務 ace 訪問 structure rst 根目錄 Lab 5: File system, Spawn and Shell tags: mit-6.828 os 概述本lab將實現JOS的文件系統，只要包括如下四部分：引入一個文件系統進程（

MIT-6.828-JOS-lab5:Network Driver

MIT-6.828 Lab 6: Network Driver (default final project) tags: mit-6.828 os 概述本lab是6.828預設的最後一個實驗，圍繞網路展開。主要就做了一件事情。從0實現網路驅動。還提到一些比較重要的概念：記憶體對映I

MIT-6.828 環境搭建

MIT 6.828是作業系統中最經典的一門課程。完成所有的lab就相當於完成了一個迷你的作業系統。我跟的是2018年的課程，課程首頁在6.828課程官網。當然所有資料都是英文的，所以難度也不低，這裡推薦幾本非常有用的參考書：《x86組合語言-從真實模式到保護模式》，《程式設計師的自我修養-連結、裝載與庫》，《

MIT-6.828 Lab2實驗報告

MIT-6.828 Lab 2: Memory Management實驗報告 tags:mit-6.828 os 概述本文主要介紹lab2，講的是作業系統記憶體管理，從內容上分為三部分：第一部分講的是實體記憶體管理，要進行記憶體管理首先需要知道哪些實體記憶體是空閒的，哪些是被使用的。還需要實現一些函式

MIT-6.828 Lab3實驗報

Lab 3: User Environments實驗報告 tags:mit-6.828 os 概述：本文是lab3的實驗報告，主要介紹JOS中的程序，異常處理，系統呼叫。內容上分為三部分：使用者環境建立，可以載入使用者ELF檔案並執行。建立異常處理機制。提供系統呼叫的能力。 Part A: Us

MIT 6.031 Software Construction 學習筆記:(三) Mutability & Immutability

這節主要是講可變物件給程式設計帶來的危害，所謂不可變物件，就是整個生命週期中不可變的物件(廢話)， e.g. : String Risks of mutation risk1：passing mutable values 看以下兩段程式碼: /** @ret

MIT 6.031 Software Construction 學習筆記:(四) Avoiding Debugging

這章Reading 9: Avoiding Debugging給了我很大的收穫，以前一些 ACMer 的不好的程式碼習慣(當然是為了快速coding)都在這裡暴露無遺 First Defense: Make Bugs Impossible 主要是前面講的一些內容

MIT-6.824 lab1

相關推薦