pysad.models.IForestASD

class pysad.models.IForestASD(initial_window_X=None, window_size=2048)[source]

An Anomaly Detection Approach Based on Isolation Forest Algorithm for Streaming Data using Sliding Window [BDF13]. Note that concept drift is not implemented since it is a part of the simulation. See Algorithm 2 in “An Anomaly Detection Approach Based on Isolation Forest Algorithm for Streaming Data using Sliding Window” paper. This method is unsupervised so it is not needed to give y as parameter.

Parameters:
  • initial_window_X (np.float64 array of shape (num_initial_instances,num_features)) – The initial window to fit for initial calibration period. We simply apply fit to these instances (Default=None).

  • window_size (int) – The size of the reference window and its sliding (Default=2048).

Methods

__init__([initial_window_X, window_size])

param model_cls:

fit(X[, y])

Fits the model to all instances in order.

fit_partial(X[, y])

Fits the model to next instance.

fit_score(X[, y])

This helper method applies fit_score_partial to all instances in order.

fit_score_partial(X[, y])

Applies fit_partial and score_partial to the next instance, respectively.

reset_model()

Removes the old model from the memory and instantiates a new one.

score(X)

Scores all instaces via score_partial iteratively.

score_partial(X)

Scores the anomalousness of the next instance.

fit(X, y=None)

Fits the model to all instances in order.

Parameters:
  • X (np.float64 array of shape (num_instances, num_features)) – The instances in order to fit.

  • y (int) – The labels of the instances in order to fit (Optional for unsupervised models, default=None).

Returns:

Fitted model.

Return type:

object

fit_partial(X, y=None)

Fits the model to next instance.

Parameters:
  • X (np.float64 array of shape (num_features,)) – The instance to fit.

  • y (int) – The label of the instance (Optional for unsupervised models, default=None).

Returns:

self.

Return type:

object

fit_score(X, y=None)

This helper method applies fit_score_partial to all instances in order.

Parameters:
  • X (np.float64 array of shape (num_instances, num_features)) – The instances in order to fit.

  • y (np.int32 array of shape (num_instances, )) – The labels of the instances in order to fit (Optional for unsupervised models, default=None).

Returns:

The anomalousness scores of the instances in order.

Return type:

np.float64 array of shape (num_instances,)

fit_score_partial(X, y=None)

Applies fit_partial and score_partial to the next instance, respectively.

Parameters:
  • X (np.float64 array of shape (num_features,)) – The instance to fit and score.

  • y (int) – The label of the instance (Optional for unsupervised models, default=None).

Returns:

The anomalousness score of the input instance.

Return type:

float

reset_model()

Removes the old model from the memory and instantiates a new one.

score(X)

Scores all instaces via score_partial iteratively.

Parameters:

X (np.float64 array of shape (num_instances, num_features)) – The instances in order to score.

Returns:

The anomalousness scores of the instances in order.

Return type:

np.float64 array of shape (num_instances,)

score_partial(X)

Scores the anomalousness of the next instance.

Parameters:

X (np.float64 array of shape (num_features,)) – The instance to score. Higher scores represent more anomalous instances whereas lower scores correspond to more normal instances.

Returns:

The anomalousness score of the input instance.

Return type:

float