ElasticSearch最佳入門實踐（四十一）query string 的分詞以及 mapping 引入案例遺留問題的大揭祕

阿新 • • 發佈：2018-11-07

1、query string分詞

query string必須以和index建立時相同的analyzer進行分詞

query string對exact value和full text的區別對待
date：exact value
_all：full text
比如我們有一個document，其中有一個field，包含的value是：hello you and me，建立倒排索引
我們要搜尋這個document對應的index，搜尋文字是hell me，這個搜尋文字就是query string
query string，預設情況下，es會使用它對應的field建立倒排索引時相同的分詞器去進行分詞，分詞和normalization，只有這樣，才能實現正確的搜尋
我們建立倒排索引的時候，將dogs --> dog，結果你搜索的時候，還是一個dogs，那不就搜尋不到了嗎？所以搜尋的時候，那個dogs也必須變成dog才行。才能搜尋到。
知識點：不同型別的field，可能有的就是full text，有的就是exact value
post_date，date：exact value
_all：full text，分詞，normalization

2、mapping引入案例遺留問題大揭祕

GET /website/article/_search?q=2017
搜尋的是_all field，document所有的field都會拼接成一個大串，進行分詞
2017-01-02 my second article this is my second article in this website 11400

	doc1	doc2	doc3
2017	*	*	*
01	*
02		*
03			*

_all，2017，自然會搜尋到3個docuemnt
GET /website/article/_search?q=2017-01-01
_all，2017-01-01，query string會用跟建立倒排索引一樣的分詞器去進行分詞
2017
01
01

GET /website/article/_search?q=post_date:2017-01-02
date，會作為exact value去建立索引

	doc1	doc2	doc3
2017-01-01	*
2017-01-02		*
2017-01-03			*

所以post_date:2017-01-01，2017-01-01，doc1一條document

GET /website/article/_search?q=post_date:2017
因為是es 5.2以後做的一個優化

3、測試分詞器

$GET /_analyze{"analyzer": "standard","text": "Text to analyze"}$

ElasticSearch最佳入門實踐（四十一）query string 的分詞以及 mapping 引入案例遺留問題的大揭祕

1、query string分詞 query string必須以和index建立時相同的analyzer進行分詞 query string對exact value和full text的區別對待 date：exact value _all：full text

ElasticSearch最佳入門實踐（三十六）query string search 語法以及 _all metadata 原理揭祕

1、query string基礎語法 GET /test_index/test_type/_search?q=test_field:test GET /test_index/test_type/_search?q=+test_field:test

ElasticSearch最佳入門實踐（四十七）query DSL搜尋語法

1、一個例子讓你明白什麼是Query DSL GET /_search { "query": { "match_all": {} } } 2、Query DSL的基本語

ElasticSearch最佳入門實踐（四十二）什麼是mapping再次回爐透徹理解

（1）往es裡面直接插入資料，es會自動建立索引，同時建立type以及對應的mapping （2）mapping中就自動定義了每個field的資料型別（3）不同的資料型別（比如說text和date），可能有的是exact value，有的是full text （4）exac

ElasticSearch最佳入門實踐（三十一）document查詢內部原理揭祕

1、客戶端傳送請求到任意一個node，成為coordinate node 對於讀請求，不一定所有的請求都發送的primary shard 上去，也可以轉發到replied shard 上去，因為replied shard 也是可以服務所有讀請求的 2、coordin

ElasticSearch最佳入門實踐（四十九）各種query搜尋語法

1、match all 查詢所有 GET /_search { "query": { "match_all": {} } } 2、match 匹配某一個filed是否包含文字 GET /_search {

ElasticSearch最佳入門實踐（四十八）_filter與query深入對比解密：相關度，效能

1、filter 與 query 示例先構建兩條資料搜尋請求：年齡必須大於等於30，同時join_date必須是2018-01-01 2、filter與query對比大解密 filter，僅僅只是按照搜尋條件過濾出需要的資料而已

ElasticSearch最佳入門實踐（六十一）修改分詞器以及定製自己的分詞器

1、預設的分詞器 standard 其餘： standard tokenizer：以單詞邊界進行切分 standard token filter：什麼都不做 lowercase token filter：將所有字母轉換為小寫 stop token filer

ElasticSearch最佳入門實踐（三十七）用一個例子告訴你 mapping 到底是什麼

1、插入幾條資料 PUT /website/article/1 { "post_date": "2017-01-01", "title": "my first article", "content": "this is my first article in this w

ElasticSearch最佳入門實踐（五十八）搜尋相關引數梳理以及bouncing results問題解決方案

1、preference 決定了哪些shard會被用來執行搜尋操作 _primary, _primary_first, _local, _only_node:xyz, _prefer_node:xyz, _shards:2,3 bounci

ElasticSearch最佳入門實踐（四十）分詞器的內部組成到底是什麼，以及內建分詞器的介紹

1、什麼是分詞器一個分詞器，很重要，將一段文字進行各種處理，最後處理好的結果才會拿去建立倒排索引切分詞語，normalization（提升recall召回率）給你一段句子，然後將這段句子拆分成一個一個的單個的單詞，同時對每個單詞進行normalizat

ElasticSearch最佳入門實踐（四十四）手動建立和修改mapping以及定製string型別資料是否分詞

1、如何建立索引如果想設定 string 為分詞把它設定為 analyzed not_analyzed 則是設定為 exact value 全匹配 no 則是不能被索引和匹配 2、修改mapping 注意事項：只能建立index時手動建立mapp

ElasticSearch最佳入門實踐（三十九）倒排索引核心原理揭祕

1、例子，兩段文字 doc1：I really liked my small dogs, and I think my mom also liked them doc2：He never liked any dogs, so I hope that my m

ElasticSearch最佳入門實踐（三十八）精確匹配與全文搜尋的對比分析

1、ES中的兩種搜尋模式 1、exact value 2、full text 2、exact value 2017-01-01，exact value，搜尋的時候，必須輸入2017-01-01，才能搜尋出來。如果你輸入一個01，是搜尋不

ElasticSearch最佳入門實踐（三十五）分頁搜尋以及deep paging效能問題深度揭祕

1、如何使用es進行分頁搜尋的語法 size，from GET /_search?size=10 GET /_search?size=10&from=0 GET /_search?size=10&from=20 假設將這6條資料分成3頁，每一頁是2

ElasticSearch最佳入門實踐（三十二）bulk api的奇特json格式與底層效能優化關係揭祕

1、bulk api奇特的json格式 {"action": {"meta"}}\n {"data"}\n {"action": {"meta"}}\n {"data"}\n 2、bulk中的每個操作都可能要轉發到不同的node的shard去執行 3、如果採用比較良好的js

ElasticSearch最佳入門實踐（二十九）document增刪改內部原理揭祕

步驟（1）客戶端選擇一個node傳送請求過去，這個node就是coordinating node（協調節點）（2）coordinating node，對document進行路由，將請求轉發給對應的node（有primary shard）（3）實際的node上的prima

ElasticSearch最佳入門實踐（二十八）剖析document資料路由原理

1、document路由到shard上是什麼意思？我們這段，一個index的資料會被分為多片，每個片都在一個shard中，所以說，一個document存在於一個shard中當客戶端建立的時候，es此時就需要決定說，這個document存在於那個shard上。這個過程就稱

ElasticSearch最佳入門實踐（二十五）mget批量查詢api

1、批量查詢的好處就是一條一條的查詢，比如說要查詢100條資料，那麼就要傳送100次網路請求，這個開銷還是很大的如果進行批量查詢的話，查詢100條資料，就只要傳送1次網路請求，網路請求的效能開銷縮減100倍 2、mget的語法可以說mget是很重要

ElasticSearch最佳入門實踐（二十七）總結以及什麼是distributed document store

1、總結快速入門了一下，最基本的原理，最基本的操作在入門之後，對ES的分散式的基本原理，進行了相對深入一些的剖析圍繞著document這個東西，進行操作，進行講解和分析 2、什麼是distributed document s

ElasticSearch最佳入門實踐（四十一）query string 的分詞以及 mapping 引入案例遺留問題的大揭祕

1、query string分詞

2、mapping引入案例遺留問題大揭祕

3、測試分詞器

相關推薦