
Key MapReduce Concepts Explained Through Examples: Custom MapReduce Data Types (1) - Implementing the Writable Interface

Implementing the Writable Interface

The code below defines a custom MapReduce data type and uses it in the WordCount class. A custom type implements the Writable interface by providing write(DataOutput), which serializes its fields, and readFields(DataInput), which must read them back in exactly the same order they were written.

WordCountWritable

import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;
import org.apache.hadoop.io.Writable;


/**
 * Custom data type for the word-count result.
 * @author lyd
 */
public class WordCountWritable implements Writable {
	public String word;
	public int counter;
	
	// no-arg constructor: Hadoop instantiates Writables reflectively during deserialization
	public WordCountWritable() {
	}
	
	public WordCountWritable(String word, int counter) {
		this.word = word;
		this.counter = counter;
	}

	/**
	 * Serialize the fields to the output stream.
	 */
	@Override
	public void write(DataOutput out) throws IOException {
		out.writeUTF(word);
		out.writeInt(counter);
	}

	/**
	 * Deserialize the fields, in exactly the same order they were written.
	 */
	@Override
	public void readFields(DataInput in) throws IOException {
		this.word = in.readUTF();
		this.counter = in.readInt();
	}

	/**
	 * @return the word
	 */
	public String getWord() {
		return word;
	}

	/**
	 * @param word the word to set
	 */
	public void setWord(String word) {
		this.word = word;
	}

	/**
	 * @return the counter
	 */
	public int getCounter() {
		return counter;
	}

	/**
	 * @param counter the counter to set
	 */
	public void setCounter(int counter) {
		this.counter = counter;
	}

	/* (non-Javadoc)
	 * @see java.lang.Object#toString()
	 */
	@Override
	public String toString() {
		return word + ":" + counter;
	}
}
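
Note that WordCountWritable appears only as a reduce output key here, so implementing Writable is sufficient. If a custom type were used as a map output key, it would also need to be sortable, because the framework sorts keys during the shuffle; in that case the type should implement WritableComparable. Below is a minimal sketch of such a variant (the class name WordCountKey and its ordering are illustrative additions, not part of the original example):

WordCountKey

import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;
import org.apache.hadoop.io.WritableComparable;

/**
 * Illustrative key type: a word-count pair that can be shuffled and sorted.
 */
public class WordCountKey implements WritableComparable<WordCountKey> {
	private String word;
	private int counter;

	// no-arg constructor required for reflective instantiation
	public WordCountKey() {
	}

	public WordCountKey(String word, int counter) {
		this.word = word;
		this.counter = counter;
	}

	@Override
	public void write(DataOutput out) throws IOException {
		out.writeUTF(word);
		out.writeInt(counter);
	}

	@Override
	public void readFields(DataInput in) throws IOException {
		this.word = in.readUTF();
		this.counter = in.readInt();
	}

	// sort by word first, then by counter (ascending), as an example ordering
	@Override
	public int compareTo(WordCountKey o) {
		int cmp = word.compareTo(o.word);
		return cmp != 0 ? cmp : Integer.compare(counter, o.counter);
	}

	// keys should define hashCode/equals consistently with compareTo,
	// since the default HashPartitioner routes keys by hashCode
	@Override
	public int hashCode() {
		return word.hashCode() * 31 + counter;
	}

	@Override
	public boolean equals(Object obj) {
		if (!(obj instanceof WordCountKey)) {
			return false;
		}
		WordCountKey other = (WordCountKey) obj;
		return word.equals(other.word) && counter == other.counter;
	}
}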

WordCount

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

import edu.qianfeng.mr.day01.WordCountWritable;

public class WordCount {
/**
 * Custom mapper class
 * @author lyd
 */
public static class MyMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
	
	Text word = new Text();
	IntWritable one = new IntWritable(1);
	
	@Override
	protected void map(LongWritable key, Text value, Context context)
			throws IOException, InterruptedException {
		// get the line of input data
		String line = value.toString();
		// split the data on spaces, e.g. [hello,qianfeng,hi,qianfeng] [hello,1603] [hi,hadoop,hi,spark]
		String[] words = line.split(" ");
		// emit (word, 1) for each token in the array
		for (String s : words) {
			word.set(s);
			context.write(word, one);
		}
	}
}

/**
 * Custom reducer class
 * @author lyd
 */
public static class MyReducer extends Reducer<Text, IntWritable, WordCountWritable, NullWritable> {
	
	@Override
	protected void reduce(Text key, Iterable<IntWritable> value, Context context)
			throws IOException, InterruptedException {
		// define a counter
		int counter = 0;
		// loop over the values and sum them
		for (IntWritable i : value) {
			counter += i.get();
		}
		// create an instance of the custom data type
		WordCountWritable wc = new WordCountWritable(key.toString(), counter);
		// final output of the reduce phase; NullWritable.get() is safer than passing null
		context.write(wc, NullWritable.get());
	}
}
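
/*
 * Side note (illustrative addition, not part of the original code):
 * MyReducer cannot double as a combiner, because a combiner's output types
 * must match the map output types (Text, IntWritable), while MyReducer emits
 * (WordCountWritable, NullWritable). If map-side pre-aggregation is wanted,
 * a separate combiner like this sketch would work; it would be registered
 * in main() with job.setCombinerClass(MyCombiner.class).
 */
public static class MyCombiner extends Reducer<Text, IntWritable, Text, IntWritable> {

	@Override
	protected void reduce(Text key, Iterable<IntWritable> value, Context context)
			throws IOException, InterruptedException {
		// pre-sum the 1s on the map side to shrink shuffle traffic
		int counter = 0;
		for (IntWritable i : value) {
			counter += i.get();
		}
		context.write(key, new IntWritable(counter));
	}
}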

/**
 * Main entry point of the job
 * @param args input path and output path
 */
public static void main(String[] args) {
	
	try {
		// get a configuration object
		Configuration conf = new Configuration();
		// create the job (Job.getInstance replaces the deprecated Job constructor)
		Job job = Job.getInstance(conf, "wordcount");
		// set the driver class for the job
		job.setJarByClass(WordCount.class);
		
		// configure the map phase
		job.setMapperClass(MyMapper.class);
		job.setMapOutputKeyClass(Text.class);
		job.setMapOutputValueClass(IntWritable.class);
		FileInputFormat.addInputPath(job, new Path(args[0]));
		
		
		// configure the reduce phase
		job.setReducerClass(MyReducer.class);
		job.setOutputKeyClass(WordCountWritable.class);
		job.setOutputValueClass(NullWritable.class);
		FileOutputFormat.setOutputPath(job, new Path(args[1]));
		
		// submit the job and print progress info
		int isok = job.waitForCompletion(true) ? 0 : 1;
		// exit with the job's status code
		System.exit(isok);
		
	} catch (IOException | ClassNotFoundException | InterruptedException e) {
		e.printStackTrace();
	}
	}
}
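
For reference, assuming the sample input hinted at in the mapper comment (three lines: "hello qianfeng hi qianfeng", "hello 1603", "hi hadoop hi spark") and a single reducer, the output file would contain one word:counter line per key, following the toString() format of WordCountWritable, in lexicographic order of the Text keys:

1603:1
hadoop:1
hello:2
hi:3
qianfeng:2
spark:1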