高效篩選兩個List中的不同的元素

阿新 • • 發佈：2019-01-28

問題記錄：

開發過程中，需要把兩個List中不同的元素篩選出來，這兩個List的資料量都很大，如果按照一般的方法，分別去遍歷兩個List，然後分別對每一個元素做比較，時間消耗將會達到m*n，處理效率顯然不盡人意。

解決思路：

使用一個Map來對2個List中的元素進行計數：

即把List的元素作為Map的Key，Entry的Value為Integer型別，用於記錄元素在兩個集合中出現的次數。

解決方案：

先遍歷一個List中的所有元素，put進Map，初始出現次數為1；

再遍歷第二個List中的所有元素，與map已有的元素進行比較：

如果Map中不存在這個元素，就把這個元素插入結果集，

如果Map中存在這個元素，則把這個元素的出現次數置為2。

程式碼示例：

示例實體類Product：

public class Product {

	private Integer id;
	
	private String name;

	public Product(Integer id, String name) {
		this.id = id;
		this.name = name;
	}

	public Integer getId() {
		return id;
	}

	public String getName() {
		return name;
	}
	
	@Override
	public String toString() {
		return "Product [id=" + id + ", name=" + name + "]";
	}
	
	public boolean equals(Object o){
		if (o == null) {
			return false;
		}
		if (this == o) {
			return true;
		}
		if (o instanceof Product) {
			Product p = (Product) o;
			if (p.getId() == this.getId() && p.getName().equals(this.getName())) {
				return true;
			}else {
				return false;
			}
		}
		return false;
	}
	
	public int hashCode(){
		int result = 17;
		result = result*37 + id;
		result = result*37 + name.hashCode();
		return result;
	}
}

示例解決方法：

public static Collection<Product> getDiffrent(Collection<Product> col1, Collection<Product> col2){
		//建立返回結果
		Collection<Product> diffrentResult = new ArrayList<>();
		//比較出兩個集合的大小，在新增進map的時候先遍歷較大集合，這樣子可以減少沒必要的判斷
		Collection<Product> bigCol = null;
		Collection<Product> smallCol = null;
		if (col1.size() > col2.size()) {
			bigCol = col1;
			smallCol = col2;
		}else {
			bigCol = col2;
			smallCol = col1;
		}
		//建立 Map<物件,出現次數> (直接指定大小減少空間浪費)
		Map<Object, Integer> map = new HashMap<>(bigCol.size());
		//遍歷大集合把元素put進map，初始出現次數為1
		for(Product p : bigCol) {
			map.put(p, 1);
		}
		//遍歷小集合，如果map中不存在小集合中的元素，就新增到返回結果，如果存在，把出現次數置為2
		for(Product p : smallCol) {
			if (map.get(p) == null) {
				diffrentResult.add(p);
			}else {
				map.put(p, 2);
			}
		}
		//把出現次數為1的 Key:Value 撈出，並把Key新增到返回結果
		for(Map.Entry<Object, Integer> entry : map.entrySet()) {
			if (entry.getValue() == 1) {
				diffrentResult.add((Product) entry.getKey());
			}
		}
		
		return diffrentResult;
	}

測試程式碼：

public static void main(String[] args) {
		List<Product> list1 = new ArrayList<>();
		List<Product> list2 = new ArrayList<>();
		for (int i = 0; i < 10; i++) {
			list1.add(new Product(i, "Product"+String.valueOf(i)));
		}
		for (int i = 0; i < 10; i = i + 2) {
			list2.add(new Product(i, "Product"+String.valueOf(i)));
		}
		Collection<Product> result = getDiffrent(list1, list2);
		for(Product p : result) {
			System.out.println(p.toString());
		}
	}

測試結果：

Product [id=7, name=Product7]
Product [id=1, name=Product1]
Product [id=9, name=Product9]
Product [id=3, name=Product3]
Product [id=5, name=Product5]

解決過程中遇到的問題：

由於是把自定義類作為Map的Key，勢必會存在一個問題：

Map在get的時候，是沒有辦法直接get到這個Key對應的鍵值對的。

解決辦法：

由HashMap的get方法的原始碼：

    public V get(Object key) {
        if (key == null)
            return getForNullKey();
        Entry<K,V> entry = getEntry(key);

        return null == entry ? null : entry.getValue();
    }

再來看看getEntry方法的原始碼：

  final Entry<K,V> getEntry(Object key) {
        if (size == 0) {
            return null;
        }

        int hash = (key == null) ? 0 : hash(key);
        for (Entry<K,V> e = table[indexFor(hash, table.length)];
             e != null;
             e = e.next) {
            Object k;
            if (e.hash == hash &&
                ((k = e.key) == key || (key != null && key.equals(k))))
                return e;
        }
        return null;
    }

由原始碼可以看出，HashMap在根據Key查詢的時候，是根據hashCode的值和equals方法來查詢這個Key所對應的鍵值對的，顯然我們需要重寫自定義類的equals()方法和hashCode()方法。

由於這裡只需要判斷物件的邏輯相等，重寫的equals()方法只需要判斷各個屬性值是否相等即可

	public boolean equals(Object o){
		if (o == null) {
			return false;
		}
		if (this == o) {
			return true;
		}
		if (o instanceof Product) {
			Product p = (Product) o;
			if (p.getId() == this.getId() && p.getName().equals(this.getName())) {
				return true;
			}else {
				return false;
			}
		}
		return false;
	}

重寫hashCode()方法

學習了《Effective Java》中提出的一種簡單通用的hashCode演算法

1. 初始化一個整形變數，為此變數賦予一個非零的常數值，比如int result = 17;

2. 選取equals方法中用於比較的所有域，然後針對每個域的屬性進行計算：

(1) 如果是boolean值，則計算f ? 1:0

(2) 如果是byte\char\short\int,則計算(int)f

(3) 如果是long值，則計算(int)(f ^ (f >>> 32))

(4) 如果是float值，則計算Float.floatToIntBits(f)

(5) 如果是double值，則計算Double.doubleToLongBits(f)，然後返回的結果是long,再用規則(3)去處理

long得到int

(6) 如果是物件應用，如果equals方法中採取遞迴呼叫的比較方式，那麼hashCode中同樣採取遞迴呼叫

hashCode的方式。否則需要為這個域計算一個正規化，比如當這個域的值為null的時候，那麼hashCode值為0。

(7) 如果是陣列，那麼需要為每個元素當做單獨的域來處理。如果你使用的是1.5及以上版本的JDK，那麼沒

必要自己去重新遍歷一遍陣列，java.util.Arrays.hashCode方法包含了8種基本型別陣列和引用陣列的

hashCode計算，演算法同上。

public int hashCode(){
int result = 17;
if (id != null) {
result = result*37 + id;
}
if (name != null) {
result = result*37 + name.hashCode();
}
return result;
}

至此問題完美解決，總的來說這是一個以空間換時間的解決方案。

高效篩選兩個List中的不同的元素

問題記錄：開發過程中，需要把兩個List中不同的元素篩選出來，這兩個List的資料量都很大，如果按照一般的方法，分別去遍歷兩個List，然後分別對每一個元素做比較，時間消耗將會達到m*n，處理效率顯

高效比較兩個list中不同的元素

為知具體出處，望作者見諒！！ package com.syl.test; import java.util.*; /** * 獲取兩個List的不同元素（假設List自身不存在重複元素） * Created by syl on 2017/12/26 0026. *

一，比較兩個陣列中不同元素

1,兩個陣列，找出其中一個比另一個多的元素，例如輸入{"1","2","3"} 和{"1","4","5"},結果為{"2","3"} private Set<String>findScope(String [] oldArray, String [] n

java兩個list中儲存bean物件，找出其中某一屬性不同的元素

在java中運用List集合儲存物件，如果想找到兩個list中不同的部分，可以用ArrayList的contains方法，遍歷每一個物件,判斷是否是相等的，如下： public stati

找出list中的不同元素、刪除兩個list中相同的物件

package com.test; import java.util.ArrayList; import java.util.Arrays; import java.util.Collections; import java.util.List; /** * *

時間殺手—for迴圈—如何找出兩個list中的相同元素

import numpy import datetime a = numpy.random.randint( 5,1000,100000 ) b = numpy.random startt1 = datetime.datetime.now() l11 = sorted(list(set(a)))

php獲取兩個陣列相同的元素（交集）以及比較兩個陣列中不同的元素（差集）

（一）php獲取兩個陣列相同元素　　array array_intersect(array $array1, array $array2, [, array $...]) 　　array array_int

js找出兩個陣列中不同的元素

function arr（array，array2）{ var arr3 = []; for（鍵入陣列）{ var stra = arra

setdiff：查詢兩個向量中不同的元素 + 外兩則去掉矩陣相同的東西

轉自：http://blog.csdn.net/tina_lulu_21/article/details/6273646設有向量A和B，要求出A和B中的不同元素，可使用matlab自帶的setdiff函式。語法為： c = setdiff(A, B) 其計算公式為c

兩個List去掉重複元素放在一個List中去【兩個Listsize值非常大】

/* * 思路： * 1.取得兩個list的相同元素：list.retainAll(E)方法 * 2.兩個list分別去掉相同的元素：list.removeAll(E); * 3.將剩下的兩個

提取兩個數組中不同元素

ring arr 結果 () array cep 一個 [] clas 假設數組： string[] listA ={"1","2","3","4","5"}; string[] listB = {"1","4","5"}; 那麽，提

Excel中篩選兩個表中相同的資料和快速填充一列的公式

將兩個工作表放在一個檔案中，使用if函式和countif函式判斷 =if(判斷條件countif(區域,條件),真值,[假值]) 例項 =if(countif(Sheet2!$A$1:$A$44,A2),"S","F") "$"的用法 A1相對引用 $A1絕對引用列 A$1絕對引用行 $A$1絕對引用行

Python程式碼比較兩個列表中的元素是否相等，並且返回相等元素的列表索引

list1 = [1,2,'a','b',5,67,78,99,"ji"] list2 = [1,"a","b",2,87,34,67,"ji"] for i in range(len(list1)): for j in range(len(list2)):

兩個陣列提取相同元素，兩個陣列提取不同元素

兩個陣列提取相同元素 const getArrEqual = (arr1, arr2) => { let newArr = []; for (let i = 0; i < arr2.length; i++) { for (let j = 0; j <

java對List去重並排序、如何快速地去掉兩個List中相同的部分

1：去重並排序 package twolist; import java.util.Collections; import java.util.Comparator; import java.util.HashMap; import java.util.Has

1點兒優化：比較兩個List中是否有相同的String

一般寫法（雙層for迴圈+if語句）複雜 for(int i = 0; i < list2.size(); i++){ for(int j = 0; j < list3.siz

mysql 查詢兩個表中不同字段的和，並通過兩個表的時間來分組

mysql data new 字段 class 兩張 time sele group ( SELECT sum( a.cost_sum ) AS sum_cost, sum( a.phone_sum ) AS sum_phone, s

工具類：關於如何找到兩個List數組中不同的數據的算法！

開發人員 uri print clas 數據結構 blank _id integer public 找到兩個List數組中不同的數據的算法！ import java.util.ArrayList;import java.util.HashMap;import java.ut

從兩個陣列中找不同元素

#include<stdio.h> int main() {int a[10],b[10],i,j,n,m,k; scanf("%d",&n); for(i=0;i<n;i++) scanf("%d",a[i]); scanf("%d",m); for

關於python中求出兩個列表的相同元素和不同元素

用列表推導式來寫 list1 = [1,3,65,2,7] list2 = [3,2,5,4] c = [x for x in list1 if x in list2] d = [y for y in (list1+list2) if y not in c] prin

高效篩選兩個List中的不同的元素

相關推薦