Full-text search: match_phrase

Posted: 2022-11-02 23:12:28

Tags: name, xiaohong, full-text search, offset, phrase, type, match, desc

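The match_phrase query performs phrase matching.

As an example, index one document (which also creates the index), then query it, starting with the phrase "name is xiaohong".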

PUT /test/_doc/1
{
  "desc":"hello, my name is xiaohong"
}

Looking at the mapping gives the result below. The desc field was mapped as text, which means its value is analyzed (tokenized).

Request:
GET /test/_mapping

Response:
{
  "test" : {
    "mappings" : {
      "properties" : {
        "desc" : {
          "type" : "text",
          "fields" : {
            "keyword" : {
              "type" : "keyword",
              "ignore_above" : 256
            }
          }
        }
      }
    }
  }
}

Since no analyzer was specified, the index uses the default one, the standard analyzer. Let's see how it tokenizes the text.

Request:
GET _analyze
{
  "analyzer": "standard",
  "text": ["hello, my name is xiaohong"]
}

Response:
{
  "tokens" : [
    {
      "token" : "hello",
      "start_offset" : 0,
      "end_offset" : 5,
      "type" : "<ALPHANUM>",
      "position" : 0
    },
    {
      "token" : "my",
      "start_offset" : 7,
      "end_offset" : 9,
      "type" : "<ALPHANUM>",
      "position" : 1
    },
    {
      "token" : "name",
      "start_offset" : 10,
      "end_offset" : 14,
      "type" : "<ALPHANUM>",
      "position" : 2
    },
    {
      "token" : "is",
      "start_offset" : 15,
      "end_offset" : 17,
      "type" : "<ALPHANUM>",
      "position" : 3
    },
    {
      "token" : "xiaohong",
      "start_offset" : 18,
      "end_offset" : 26,
      "type" : "<ALPHANUM>",
      "position" : 4
    }
  ]
}

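As shown, the text is split into individual words. The standard analyzer breaks on word boundaries (here the spaces and the comma), drops the punctuation, lowercases each token, and records each token's position.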
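
The token list above can be reproduced approximately in a few lines of Python. This is only a rough sketch, not the real standard analyzer: it assumes plain ASCII input and treats every run of letters and digits as one token, and the function name analyze is made up for illustration.

```python
import re

def analyze(text):
    """Roughly mimic the standard analyzer for simple ASCII text.

    Assumption: a token is any run of letters/digits. The real analyzer
    follows Unicode text segmentation rules and handles many more cases.
    """
    tokens = []
    for position, match in enumerate(re.finditer(r"[A-Za-z0-9]+", text)):
        tokens.append({
            "token": match.group().lower(),   # tokens are lowercased
            "start_offset": match.start(),    # character offset where the token starts
            "end_offset": match.end(),        # character offset just past the token
            "position": position,             # ordinal position used by phrase queries
        })
    return tokens

for t in analyze("hello, my name is xiaohong"):
    print(t)
```

The position values are what matter for match_phrase; the offsets only describe where each token sits in the original string.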

Now for the query tests.

Querying with the complete phrase returns the document.

Request:
GET /test/_search
{
  "query": {
    "match_phrase": {
      "desc": "name is xiaohong"
    }
  }
}

Response:
{
  "took" : 1,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 1,
      "relation" : "eq"
    },
    "max_score" : 0.8630463,
    "hits" : [
      {
        "_index" : "test",
        "_type" : "_doc",
        "_id" : "1",
        "_score" : 0.8630463,
        "_source" : {
          "desc" : "hello, my name is xiaohong"
        }
      }
    ]
  }
}

Reverse the word order and query with the phrase "xiaohong is name".
No results are returned.

Request:
GET /test/_search
{
  "query": {
    "match_phrase": {
      "desc": "xiaohong is name"
    }
  }
}

Response:
{
  "took" : 9,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 0,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  }
}

Skip a word in the phrase and query with "name xiaohong".
Again, no results are returned.

Request:
GET /test/_search
{
  "query": {
    "match_phrase": {
      "desc": "name xiaohong"
    }
  }
}

Response:
{
  "took" : 2,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 0,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [ ]
  }
}

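So when querying with match_phrase, the query terms must appear in the document in the same order and at adjacent positions (the default slop is 0); otherwise nothing is found.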
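
The three results above can be explained by token positions. The following is a simplified sketch of the idea, not how Elasticsearch or Lucene actually implements it: it assumes a crude letters-and-digits tokenizer in place of the standard analyzer, only models the slop = 0 case, and the helper names tokenize and phrase_matches are hypothetical.

```python
import re

def tokenize(text):
    # Assumption: lowercase runs of letters/digits stand in for the standard analyzer.
    return [m.group().lower() for m in re.finditer(r"[A-Za-z0-9]+", text)]

def phrase_matches(doc_text, phrase):
    """Return True if the phrase terms occur in the document in the same
    order at consecutive positions (the slop = 0 case)."""
    doc = tokenize(doc_text)
    terms = tokenize(phrase)
    if not terms:
        return False
    n = len(terms)
    # Slide a window of n tokens over the document and compare it to the phrase.
    return any(doc[i:i + n] == terms for i in range(len(doc) - n + 1))

doc = "hello, my name is xiaohong"
for phrase in ["name is xiaohong", "xiaohong is name", "name xiaohong"]:
    print(phrase, "->", phrase_matches(doc, phrase))
```

Only the first phrase matches, which mirrors the three search responses. The real match_phrase query also accepts a slop parameter that relaxes the adjacency requirement; this sketch does not model it.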

Source: https://www.cnblogs.com/yeasxy/p/16852891.html
