Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DOC] Change documentation for Rerank processor, add special case of Normalization processor #7392

Closed
1 of 4 tasks
martin-gaievski opened this issue Jun 14, 2024 · 0 comments · Fixed by #7513
Closed
1 of 4 tasks
Assignees
Labels
2 - In progress Issue/PR: The issue or PR is in progress.

Comments

@martin-gaievski
Copy link
Member

What do you want to do?

  • Request a change to existing documentation
  • Add new documentation
  • Report a technical problem with the documentation
  • Other

Tell us about your request. Provide a summary of the request and all versions that are affected.
Existing documentation for Rerank processor should be enhanced, we need to add a special case when Rerank and Normalization processors are working together. In such case scores of Normalization processor will be updated by Rerank, and that may be an unexpected behavior for end user.

This behavior is not related to any specific version and was there from the moment of release of both processors. Ideally this change should be added as soon as possible, this behavior is there for users of 2.12 and above.

Suggested change:

Special Scenario: Interaction with Normalization Processor and Hybrid Query
When using the rerank processor in conjunction with the normalization processor and a hybrid query, there is an important behavior to be aware of: the rerank processor will alter the result scores of the documents. This is due to the fact that the rerank processor operates after the normalization processor in the search pipeline.

Understanding the Order of Processing
Normalization Processor: This processor normalizes the scores of the documents based on the configured normalization method. For more details on how this processor works, refer to the Normalization Processor documentation.
Rerank Processor: Following the normalization, the rerank processor will further adjust the scores of the documents. This adjustment can significantly impact the final ordering of search results.
Implications

  • Score Modification: The rerank processor will modify the scores that were initially adjusted by the normalization processor, potentially leading to different ranking results than initially expected.
  • Hybrid Queries: In the context of hybrid queries, where multiple types of queries and scoring mechanisms are combined, this behavior is particularly noteworthy. The combined scores from the initial query will be normalized first and then reranked, resulting in a two-stage scoring modification.

What other resources are available? Provide links to related issues, POCs, steps for testing, etc.
Rerank processor page: https://opensearch.org/docs/latest/search-plugins/search-relevance/reranking-search-results/
Normalization processor page: https://opensearch.org/docs/latest/search-plugins/search-pipelines/normalization-processor/

@hdhalter hdhalter added 1 - Backlog - DEV Developer assigned to issue is responsible for creating PR. and removed untriaged labels Jun 14, 2024
@hdhalter hdhalter added 2 - In progress Issue/PR: The issue or PR is in progress. and removed 1 - Backlog - DEV Developer assigned to issue is responsible for creating PR. labels Jun 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2 - In progress Issue/PR: The issue or PR is in progress.
Projects
None yet
3 participants