Modeling Data Transfers: Change Point and Anomaly Detection
2018
To help the operations and resource planning of a large experimental facility, we model the time needed for
transferringthe data files produced by the facility to a computer center, with the goals of predicting expected
file transfertime and identifying unusually slow
transfersthat might require attention from human operators. The
file transfertime can be thought of having two parts: a base time depending on the hardware and software involved, and a congestion part due to uncontrollable interferences from other operations on the
shared resourcesincluding network links,
disk storagesystems, and CPU involved in the
transfers. Since many parameters important to the
transfertime are not available to us, we employ a change point detection algorithm to separate the data records into time periods (called segments) with relatively stable behavior. Within each segment, we apply a non-parametric model to describe the congestion time. When the observed
file transfertime is significantly longer than typical expected time, we declare the particular
file transferto be unusually slow. When many of these unusually slow
file transfersare observed, it is worthwhile to notify the human operators to investigate the abnormal behavior of the system.
Keywords:
-
Correction
-
Source
-
Cite
-
Save
13
References
5
Citations
NaN
KQI