Adding additional distance #37826
Replies: 4 comments 14 replies
-
It is not a simple job to add a new index type or new metric type into milvus. |
Beta Was this translation helpful? Give feedback.
-
Milvus has already support SCANN and so many other indexes, you can use any one since TLSH satisfies the inequality triangle theorem. For more suggestion, HNSW can provide better performance&recall but need more memory, DiskANN can utilize disk to enlarge your capacity, SCANN can provide not bad performance and suitable for k>1K scenarios like all other IVF-series indexes. However, we only support IVFFlat for binary embedding. Not sure what is your data type in this case. To support more metrics, @foxspy can you help on this? |
Beta Was this translation helpful? Give feedback.
-
I think it's always better to start from https://github.com/zilliztech/knowhere.
|
Beta Was this translation helpful? Give feedback.
-
I'm building a service that finding similarity between malware and virus based TLSH.
TLSH is a high dimension vector represents the binary of the malware/virus, that the distance between two vectors satisfy the inequality triangle theorem.
I want to index and search these vector to find/group similar malware and virus, but the distance algorithm is quite unique that does not based on any distance Milvus supports, and can not be approximate to any of the supported distances (cosine, euclidean, dot, hamming....).
Can anybody please give me a good tutorial where should I start forking the code base and build the software.
I am completely new to Milvus, so please advise me on the index as well, I'm thinking of SCANN.
Beta Was this translation helpful? Give feedback.
All reactions