Prospective anomaly detection methods such as the Modified EARS C2 are commonly adapted and used in public health syndromic surveillance systems. These methods however can produce an excessive false alert rate. We present a combined use of retrospective (e.g., Change Point Analysis (or CPA)) and prospective (e.g., C2) anomaly detection methods. This combined approach will help detect sudden aberrations in addition to subtle changes in local trends, help rule out alarm investigations, and assist with retrospective follow-ups. Examples on the utility of this combined approach in working collaboratively with the scientific community are applied to BioSense emergency departments' visits due to ILI. Methods, limitations, future work, and invitation to the scientific community to collaborate with us will be discussed at this talk.
1. Change Point Analysis Zhiheng (Roy) Xu, MS (PhD Candidate) Senior Research Scientist Taha A. Kass-Hout, MD, MS Deputy Director for Information Science (Acting) and BioSense Program Manager Division of Healthcare Information (DHI) Public Health Surveillance Program Office (PHSPO) Office of Surveillance, Epidemiology, and Laboratory Services (OSELS) Centers for Disease Control & Prevention (CDC) Any views or opinions expressed here do not necessarily represent the views of the CDC, HHS, or any other entity of the United States government. Furthermore, the use of any product names, trade names, images, or commercial sources is for identification purposes only, and does not imply endorsement or government sanction by the U.S. Department of Health and Human Services.
2.
3.
4.
5.
6. Control Chart Upper Control Limit (UCL)= µ + 3 σ Lower Control Limit (LCL) = µ - 3 σ where µ is the sample mean (central line) and σ is the sample standard deviation. 3 σ 3 σ
7. SIX SIGMA A six-sigma process is one in which 99.99966% of the products manufactured are free of defects.
8. CPA vs. Control Charts CPA Control Charts Data type Any Normal distributed data Type of changes Major and subtle changes Major changes only Mean Mean-shift Stable mean Computation Depends on the algorithms Simple and fast
9.
10.
11. CUSUM* Step 1: sample mean Step 2: residuals Step 3: cusum of residuals 0 ε 1 ε 1 + ε 2 ε 1 + ε 2 + ε 3 … ε 1 + ε 2 +…+ ε n * Kass-Hout, et al, The Joint Statistical Meeting, Vancouver, CA. August, 2010.
12. CUSUM* Level 1: Find a change point maximizing |S| Step 4: plot the cusum and find where is the maximum of absolute cusum. * Kass-Hout, et al, The Joint Statistical Meeting, Vancouver, CA. August, 2010.
13. CUSUM* Level 2: Find a change point on each sub-series * Kass-Hout, et al, The Joint Statistical Meeting, Vancouver, CA. August, 2010. Step 5: Break the time-series into two segments and repeat step 1-5.
14. CUSUM* Level n: Final result * Kass-Hout, et al, The Joint Statistical Meeting, Vancouver, CA. August, 2010.
18. CUSUM vs. SCM “ I have long given up on CUSUM type procedures (and any of the variants). The tests are plagued with problems of non-monotonic power and to get a date and confidence interval for the break date is not trivial and most methods don't work well.” “ The main difference is that I do not use asymptotic results, but instead employ the computer intensive bootstrapping approach to determine confidence levels and intervals so as to make the procedure nonparametric. ” Wayne Taylor, Ph.D. Pierre Perron, Ph.D.
32. Acknowledgement CDC Sam Groseclose, DVM, MPH Paul McMurray, MDS Soyoun Park, MS Others Rafal Raciborshi, Ph.D, Econometrician, STATA Corp, College Station, TX. Wayne Taylor, Ph.D, President of Taylor Enterprise, Inc. Pierre Perron, Ph.D, Professor of Economics, Boston University Yajun Mei, Ph.D, Asst. Professor of Statistics, Georgia Tech Elena Pesavento, Ph.D, Assoc. Professor of Economics, Emory University