Modeling Data Transfers: Change Point and Anomaly Detection

2018
To help the operations and resource planning of a large experimental facility, we model the time needed for transferringthe data files produced by the facility to a computer center, with the goals of predicting expected file transfertime and identifying unusually slow transfersthat might require attention from human operators. The file transfertime can be thought of having two parts: a base time depending on the hardware and software involved, and a congestion part due to uncontrollable interferences from other operations on the shared resourcesincluding network links, disk storagesystems, and CPU involved in the transfers. Since many parameters important to the transfertime are not available to us, we employ a change point detection algorithm to separate the data records into time periods (called segments) with relatively stable behavior. Within each segment, we apply a non-parametric model to describe the congestion time. When the observed file transfertime is significantly longer than typical expected time, we declare the particular file transferto be unusually slow. When many of these unusually slow file transfersare observed, it is worthwhile to notify the human operators to investigate the abnormal behavior of the system.
    • Correction
    • Source
    • Cite
    • Save
    13
    References
    5
    Citations
    NaN
    KQI
    []
    Baidu
    map