Streaming Data - calling a function when anomaly is detected #66

Open
vinura opened this issue Oct 25, 2019 · 2 comments

Comments

@vinura

vinura commented Oct 25, 2019

I am using the streaming mode of rrcf to detect anomalies in data generated by a sensor, and I was able to get it working. Is there a built-in function to classify data points, rather than inspecting the graph to spot the anomaly? (Put simply, I want to call a custom Python function when an anomaly occurs.)

I could apply a threshold to the CoDisp value and call my function whenever the threshold is exceeded. However, I have around 40 sensor inputs with different data patterns. Is the CoDisp value relative to each individual case, or can I use one general threshold for all of them?

@vinura vinura changed the title Streaming Data and classifying Streaming Data - calling a function when anomaly is detected Oct 25, 2019
@mdbartos
Member

Ultimately you will need some kind of threshold test on CoDisp that will be application-dependent. Using a percentile score is a pretty reliable approach.

To answer the second part, I would need more information about the format of your data points and what information each of them includes.
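As an illustration of the percentile test on CoDisp suggested above, here is a minimal sketch that uses only the standard library. It assumes the CoDisp scores are already being computed elsewhere (for example in an rrcf streaming loop) and are passed in one at a time. The names `PercentileAlarm`, `percentile`, `on_anomaly`, `window`, and `warmup` are hypothetical and not part of rrcf, and the 99th percentile is an arbitrary starting point that would need tuning per application.

```python
from collections import deque


def percentile(values, q):
    """Return the q-th percentile (0-100) of values, with linear interpolation."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty sequence")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


class PercentileAlarm:
    """Call on_anomaly when a score exceeds a rolling percentile of past scores.

    Hypothetical helper, not part of rrcf: it only consumes scores.
    """

    def __init__(self, on_anomaly, q=99.0, window=1000, warmup=50):
        self.on_anomaly = on_anomaly          # custom function to call
        self.q = q                            # percentile level (assumed)
        self.warmup = warmup                  # scores needed before testing
        self.history = deque(maxlen=window)   # rolling window of past scores

    def update(self, index, score):
        fired = False
        if len(self.history) >= self.warmup:
            threshold = percentile(self.history, self.q)
            if score > threshold:
                self.on_anomaly(index, score, threshold)
                fired = True
        # The new score joins the history only after it has been tested.
        self.history.append(score)
        return fired


# Usage with synthetic scores standing in for CoDisp values.
alarms = []
alarm = PercentileAlarm(lambda i, s, t: alarms.append(i),
                        q=99.0, window=200, warmup=20)
scores = [1.0 + 0.01 * (i % 5) for i in range(100)]
scores[60] = 25.0  # injected spike
for i, s in enumerate(scores):
    alarm.update(i, s)
```

Because the threshold is recomputed from recent history, it adapts to the scale of the scores it sees, at the cost of a warm-up period during which nothing is flagged.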

@vinura
Author

vinura commented Oct 28, 2019

Say the data is a water pressure reading from a pipeline. Each pipe has a different water pressure. I get the output as a number when I send a command to the sensor. Can I use a global threshold for CoDisp, or should it be unique to each pipe?

The idea I have is to instantiate a new object for each pipe. But could the threshold for CoDisp be global, or must it be unique to each instance? If it is unique to the instance, how can I work out different CoDisp thresholds for all 40 pipes?

Also, the reason I chose this algorithm is that it can detect anomalies in streaming data. The pressure in the pipes changes with the time of day, and rrcf detects those anomalies without a problem. However, this threshold is hard to figure out. I hope it is a global one :D
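One possible middle ground between a global and a per-pipe threshold, sketched here as an assumption rather than as anything rrcf provides: keep a single global percentile level, but derive each pipe's actual CoDisp cutoff from that pipe's own score history. The class `PerPipeThresholds`, the pipe identifiers, and the parameter values are all hypothetical, and the scores below are synthetic stand-ins for CoDisp values.

```python
import math
from collections import defaultdict, deque


class PerPipeThresholds:
    """One shared quantile level, one score history (and cutoff) per pipe."""

    def __init__(self, q=0.99, window=500, warmup=30):
        self.q = q              # global quantile level in (0, 1], assumed
        self.warmup = warmup    # scores required before a pipe is tested
        self.history = defaultdict(lambda: deque(maxlen=window))

    def threshold(self, pipe_id):
        """Return the pipe's current cutoff, or None during warm-up."""
        scores = sorted(self.history[pipe_id])
        if len(scores) < self.warmup:
            return None
        # Nearest-rank quantile over this pipe's own history.
        k = max(0, math.ceil(self.q * len(scores)) - 1)
        return scores[k]

    def is_anomaly(self, pipe_id, score):
        cutoff = self.threshold(pipe_id)
        self.history[pipe_id].append(score)
        return cutoff is not None and score > cutoff


# Two pipes whose scores live on different scales.
monitor = PerPipeThresholds(q=0.99, window=500, warmup=30)
for i in range(100):
    monitor.is_anomaly("pipe_a", 2.0 + (i % 4) * 0.1)
    monitor.is_anomaly("pipe_b", 40.0 + (i % 4) * 1.0)

# The same raw score is judged against each pipe's own history.
flag_a = monitor.is_anomaly("pipe_a", 10.0)
flag_b = monitor.is_anomaly("pipe_b", 10.0)
```

With this arrangement only one number (the quantile level) has to be chosen for all 40 pipes, while the raw cutoffs are free to differ from pipe to pipe.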
