Batch scenario:
whole set of data is available, focus - complete events
Batch scenario:
whole set of data is available, focus - complete events
Data stream scenario: continuous, unbounded, flow at high speed, high volume
Image credit: Wikimedia Commons
Figure: Extreme value distributions corresponding to m = 1; 10; 100; 1000, each describing where the maximum of m samples drawn from N(0; 1) will lie.
Let X=X1,X2,...,Xm
be a sequence of independent and identically distributed random variables and Xmax=max(X)
. If there exist centering constant dm(∈R)
and normalizing constant cm(>0)
, and some non-degenerate distribution function H+
such that
then H+
belongs to one of the following three distribution functions:
Figure: Distribution of 1000 extremes generated from bivariate kernel density function with m=500
Ψ
-transform space, using the Ψ
-transformation defined byΨ
-transform maps the density values back into space into which a Gumbel distribution can be fitted.Figure: Distribution of transformed values
Image credit: Wikimedia Commons
oddstream::find_odd_streams(train_data, test_stream)
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |