es7.17.0: still getting the startOffset error when using version 7.17.0 #284

Open
zt5062 opened this issue Mar 14, 2023 · 3 comments

zt5062 commented Mar 14, 2023

On ES 7.17.0, using version 7.17.0 still fails with: startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards
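
For readers unfamiliar with the message, the three conditions it names can be sketched as a check over a stream of (startOffset, endOffset) pairs. This is only an illustration of the stated rule, not the code that raises the error; the function name and the sample offsets are hypothetical and are not taken from the plugin's output.

```python
from typing import Iterable, Optional, Tuple


def first_offset_violation(offsets: Iterable[Tuple[int, int]]) -> Optional[int]:
    """Return the index of the first token that breaks the rule, else None."""
    last_start = 0
    for i, (start, end) in enumerate(offsets):
        # 1) start must be non-negative
        # 2) end must be >= start
        # 3) start must not be smaller than the previous token's start
        if start < 0 or end < start or start < last_start:
            return i
        last_start = start
    return None


# Hypothetical offsets, for illustration only.
ok = [(0, 4), (0, 4), (5, 7)]    # repeated start offsets are allowed
bad = [(0, 4), (5, 7), (4, 5)]   # third token starts before the second one

print(first_offset_violation(ok))   # None
print(first_offset_violation(bad))  # 2
```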


zt5062 commented Mar 17, 2023

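The error always occurs when the inserted data contains English letters followed by a symbol, for example CCTV-高清, whereas CCTV高清- can be inserted normally. The same error is also reported when the text contains Chinese punctuation, such as the Chinese book-title marks <>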


zt5062 commented Mar 20, 2023

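After sorting through the cases, characters fall into three classes: Chinese characters, English letters, and symbols (both Chinese and English punctuation).
Text starting with a Chinese character: no problem.
Text starting with a symbol: always fails.
Text starting with English letters: fine if the English string is followed by Chinese characters; fails if it is followed by a symbol.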
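
The rules above can be written down as a small predictor, which may help when building a list of test inputs. This is a sketch of the observations in this comment only, not of the plugin's logic. The class boundaries are assumptions: "han" is taken as U+4E00 to U+9FFF, "latin" as ASCII letters, "symbol" as Unicode punctuation or symbol categories, and anything else (digits, whitespace) is "other", for which the comment states no rule.

```python
import unicodedata


def char_class(ch: str) -> str:
    """Classify one character as 'han', 'latin', 'symbol', or 'other' (assumed boundaries)."""
    if "\u4e00" <= ch <= "\u9fff":
        return "han"
    if ch.isascii() and ch.isalpha():
        return "latin"
    if unicodedata.category(ch)[0] in ("P", "S"):
        return "symbol"
    return "other"


def predicted_to_fail(text: str) -> bool:
    """Apply the reported rules; False means no rule predicts an error."""
    if not text:
        return False
    first = char_class(text[0])
    if first == "symbol":
        return True  # symbol at the start: always fails
    if first == "latin":
        # Look at the first character that ends the leading English run.
        for ch in text:
            cls = char_class(ch)
            if cls != "latin":
                return cls == "symbol"
    return False  # Chinese start, pure English, or no applicable rule


for sample in ["CCTV-高清", "CCTV高清-", "高清-CCTV", "-CCTV"]:
    print(sample, predicted_to_fail(sample))
```

The first two samples are the ones reported in this thread; the last two are hypothetical inputs derived from the rules and have not been confirmed against the plugin.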

@xiaoshi2013

I tried this on Elasticsearch 8.4.1 and it works:
POST /medcl/_doc
{
  "name": "CCTV-高清"
}

POST medcl/_search
{
  "query": {
    "match": {
      "name.pinyin": {
        "query": "cctvgq"
      }
    }
  }
}

Output:
{
  "took": 3,
  "timed_out": false,
  "_shards": {
    "total": 1,
    "successful": 1,
    "skipped": 0,
    "failed": 0
  },
  "hits": {
    "total": {
      "value": 1,
      "relation": "eq"
    },
    "max_score": 2.4876804,
    "hits": [
      {
        "_index": "medcl",
        "_id": "zl8E_Y0BKB1kh6YAO_6p",
        "_score": 2.4876804,
        "_source": {
          "name": "CCTV-高清"
        }
      }
    ]
  }
}
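
To compare versions more directly, it may help to look at the token offsets themselves. The Elasticsearch `_analyze` API returns `start_offset` and `end_offset` for each token, so a request like the one built below could be sent to both 7.17.0 and 8.4.1. This is a sketch under assumptions: the index `medcl` and field `name.pinyin` come from this thread, but the mapping behind that field is not shown here, and whether `_analyze` raises the same exception as indexing has not been confirmed. The helper name is hypothetical, and the code only builds the request; it does not contact a cluster.

```python
import json
from typing import Tuple

INDEX = "medcl"        # index name used in this thread
FIELD = "name.pinyin"  # field queried in this thread; its mapping is an assumption


def analyze_request(text: str) -> Tuple[str, str]:
    """Build the path and JSON body for an _analyze call on FIELD."""
    path = f"/{INDEX}/_analyze"
    body = json.dumps({"field": FIELD, "text": text}, ensure_ascii=False)
    return path, body


# The two inputs reported earlier in the thread.
for sample in ["CCTV-高清", "CCTV高清-"]:
    path, body = analyze_request(sample)
    print("GET", path)
    print(body)
```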
