Usage of a linear regression models to determine and rank different factors that threaten the data integrity is discussed. A model that predicts the number of data entry errors depending on five features: working experience, qualification and age of an operator as well as the data entry time and complexity of entered information is presented. Results of modeling can be used for designing and implementing an effective information security policy
information security, behavioral analytics, quality of information, information stability, linear regression, coefficient of determination, leverage
1. Shirlee-ann Knight and Janice Burn. Developing a Framework for Assessing Information Quality on the World Wide Web. Australia: EdithCowan University, Perth, 2005. Vol. 8. Pp. 161-172.
2. John Neter, Michael H. Kutner, Christopher J. Nachtsheim, William Wasserman. Applied Linear Statistical Models. 4E - Illinois: WCB McGraw-Hill, 1996.