Feature-by-case request: Nonlinear time-series clustering based on their statistical similarity #427

sa18 · 2021-11-02T16:01:05Z

There is a numeric series of temporal data, for example, temperature.

It is required to colorize it by segments, where the same colors would mean the statistical similarity of the data under each segment. I would do it like this:

Split the series into equal segments of a given length.
For all pairs of segments, perform statistical similarity test. The result higher than 70% should mean the pair of segments are similar, and we'll assign the same color on them. Otherwise, we assign different colors.

Expectation from the math library:

Support for optimal storage of time-series (in this case 1D, but in a more general case - multidimensional).
Functional library to perform statical tests (Kolmogorov-Smirnov, Cucconi and others)
Ability to generate permutations, incl. random (required by Cucconi test implementation), with maximum performance and minimum memory consumption.

Here is (more complicated) description of classification by stat tests.

sa18 mentioned this issue Nov 2, 2021

Feature-by-case request: Extracting cycles & trend from time series data #428

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature-by-case request: Nonlinear time-series clustering based on their statistical similarity #427

Feature-by-case request: Nonlinear time-series clustering based on their statistical similarity #427

sa18 commented Nov 2, 2021

Feature-by-case request: Nonlinear time-series clustering based on their statistical similarity #427

Feature-by-case request: Nonlinear time-series clustering based on their statistical similarity #427

Comments

sa18 commented Nov 2, 2021