javascript - Elasticsearch: sort by custom item weight

Tags: javascript elasticsearch

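I have stored documents that include a status property. I want to sort the documents by status priority rather than by the alphabetical order of the status. I followed previous answers and wrote the following function, but it still does not work as expected; the documents come back sorted by status name (alphabetically):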

function getESSortingByStatusQuery(query, order) {
    let statusOrder = ['BLUE', 'RED', 'BLACK', 'YELLOW', 'GREEN'];
    if (order == 'desc') {
        statusOrder.reverse();
    }
    const functions = statusOrder.map((item) => {
        const idx = statusOrder.indexOf(item);
        return {
            filter: {match: {statusColor: item}},
            weight: (idx + 1) * 50
        };
    });
    const queryModified = {
        "function_score": {
            "query": {"match_all": {}}, // this is for testing purposes and should be replaced with original query
            "boost": "5",
            "functions": functions,
            "score_mode": "multiply",
            "boost_mode": "replace"
        }
    };
    return queryModified;
}

I would appreciate it if someone could suggest a way to sort items by a predefined priority of a property (the status, in this case).

Best answer

Below is a sample custom sort script, which I think is what you are looking for. I have added a sample mapping, documents, query, and response below.

Mapping:

PUT color_index
{
  "mappings": {
    "properties": {
      "color":{
        "type": "keyword"
      },
      "product":{
        "type": "text"
      }
    }
  }
}

Sample documents:

POST color_index/_doc/1
{
  "color": "BLUE",
  "product": "adidas and nike"
}

POST color_index/_doc/2
{
  "color": "GREEN",
  "product": "adidas and nike and puma"
}

POST color_index/_doc/3
{
  "color": "GREEN",
  "product": "adidas and nike"
}

POST color_index/_doc/4
{
  "color": "RED",
  "product": "nike"
}

POST color_index/_doc/5
{
  "color": "RED",
  "product": "adidas and nike"
}

Query:

POST color_index/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "query_string": {
            "default_field": "*",
            "query": "adidas OR nike"
          }
        }
      ]
    }
  },
  "sort": [
    { "_score": { "order": "desc"} },
    { "_script": {
            "type": "number",
            "script": {
                "lang": "painless",
                "source": "if(params.scores.containsKey(doc['color'].value)) { return params.scores[doc['color'].value];} return 100000;",
                "params": {
                    "scores": {
                        "BLUE": 0,
                        "RED": 1,
                        "BLACK": 2,
                        "YELLOW": 3,
                        "GREEN": 4
                    }
                }
            },
            "order": "asc"
        }

    }
  ]
}

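The documents are sorted by score first; the second sort is then applied to that result, so it decides the order among documents with the same score.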

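For the second sort, the script sort, note how I have assigned a numeric value to each color in the scores section. You need to construct your query accordingly.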

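The logic lives in the source section, which I believe is self-explanatory. I used doc['color'].value there because color is the field I am applying the custom sort logic to. Any color that is missing from scores gets the value 100000, so it sorts last in 'asc' order.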
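
To connect this with the JavaScript function in the question, here is a minimal sketch that builds the same sort clause from an ordered list of statuses. The helper name buildStatusSort and its parameters are hypothetical; they are not part of Elasticsearch or of the original answer. The generated clause is meant to mirror the sort section of the query above.

```javascript
// Hypothetical helper (not an Elasticsearch API): builds the "sort" array
// of a search request body from a list of statuses, highest priority first.
function buildStatusSort(statusOrder, order = 'asc', field = 'color') {
    // Map each status to its position in the list: { BLUE: 0, RED: 1, ... }
    const scores = {};
    statusOrder.forEach((status, idx) => {
        scores[status] = idx;
    });

    // Same Painless logic as in the query: look up the rank of the field value,
    // fall back to 100000 for values that are not in the list.
    const source =
        `if(params.scores.containsKey(doc['${field}'].value)) ` +
        `{ return params.scores[doc['${field}'].value];} return 100000;`;

    return [
        { _score: { order: 'desc' } },   // first sort key: relevance score
        {
            _script: {                   // second sort key: status priority
                type: 'number',
                script: { lang: 'painless', source, params: { scores } },
                order
            }
        }
    ];
}

const sort = buildStatusSort(['BLUE', 'RED', 'BLACK', 'YELLOW', 'GREEN']);
console.log(JSON.stringify(sort, null, 2));
```

The returned array would go under the "sort" key of the search request body. Note that with order set to 'desc', values missing from the list would sort first, because the fallback value is 100000.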

Response:

{
  "took" : 5,
  "timed_out" : false,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  },
  "hits" : {
    "total" : {
      "value" : 5,
      "relation" : "eq"
    },
    "max_score" : null,
    "hits" : [
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "1",
        "_score" : 0.5159407,
        "_source" : {
          "color" : "BLUE",
          "product" : "adidas and nike"
        },
        "sort" : [
          0.5159407,                     <--- This value is the score (sorted in descending order)
          0.0                            <--- This value comes from the script sort: the color is BLUE, which maps to 0 in the script, sorted in 'asc' order
        ]
      },
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "5",
        "_score" : 0.5159407,
        "_source" : {
          "color" : "RED",
          "product" : "adidas and nike"
        },
        "sort" : [
          0.5159407,
          1.0
        ]
      },
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "3",
        "_score" : 0.5159407,
        "_source" : {
          "color" : "GREEN",
          "product" : "adidas and nike"
        },
        "sort" : [
          0.5159407,
          4.0
        ]
      },
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "2",
        "_score" : 0.40538198,
        "_source" : {
          "color" : "GREEN",
          "product" : "adidas and nike and puma"
        },
        "sort" : [
          0.40538198,
          4.0
        ]
      },
      {
        "_index" : "color_index",
        "_type" : "_doc",
        "_id" : "4",
        "_score" : 0.10189847,
        "_source" : {
          "color" : "RED",
          "product" : "nike"
        },
        "sort" : [
          0.10189847,
          1.0
        ]
      }
    ]
  }
}

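Note the first three documents: they have exactly the same product value but different color values. They are grouped together because we sort by _score first and then by color.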
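
As a rough illustration of this two-level ordering, the following sketch reproduces it in plain JavaScript on the five hits from the response. It only simulates the comparison locally (score descending, then color rank ascending) and says nothing about how Elasticsearch implements sorting; the names priority, rank and sortHits are made up for this example.

```javascript
// Color ranks copied from the "scores" params of the script sort.
const priority = { BLUE: 0, RED: 1, BLACK: 2, YELLOW: 3, GREEN: 4 };

// The five hits from the response, deliberately listed out of order.
const hits = [
    { id: '4', score: 0.10189847, color: 'RED' },
    { id: '3', score: 0.5159407, color: 'GREEN' },
    { id: '2', score: 0.40538198, color: 'GREEN' },
    { id: '5', score: 0.5159407, color: 'RED' },
    { id: '1', score: 0.5159407, color: 'BLUE' }
];

// Rank of a color, with the same 100000 fallback as the Painless script.
function rank(color) {
    return Object.prototype.hasOwnProperty.call(priority, color)
        ? priority[color]
        : 100000;
}

// Sort a copy: higher score first; on equal scores, lower color rank first.
function sortHits(list) {
    return [...list].sort(
        (a, b) => (b.score - a.score) || (rank(a.color) - rank(b.color))
    );
}

const orderedIds = sortHits(hits).map((hit) => hit.id);
console.log(orderedIds.join(', ')); // 1, 5, 3, 2, 4
```

The resulting order matches the response: the three documents with the same score appear as BLUE, RED, GREEN, followed by the lower-scoring documents.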

Let me know if this helps!

Regarding "javascript - Elasticsearch: sort by custom item weight", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/58522821/
