Detection of Hospital Acquired Infections in sparse and noisy Swedish patient records: A machine learning approach using Naïve Bayes, Support Vector Machines and C4.5
2012 (English). In: Proceedings of the Sixth Workshop on Analytics for Noisy Unstructured Text Data, Association for Computing Machinery (ACM), 2012. Conference paper (Refereed)
Hospital Acquired Infections (HAI) pose a significant risk to patients’ health, while their surveillance is an additional workload for hospital medical staff and hospital management. Our overall aim is to build a system that reliably retrieves all patient records that potentially include HAI, to reduce the burden on hospital staff of manually checking patient records. In other words, we emphasize recall when detecting HAI (aiming at 100%) with the highest precision possible. The present study is experimental in nature, focusing on the application of Naïve Bayes (NB), Support Vector Machines (SVM) and a C4.5 Decision Tree to the problem and on the evaluation of the efficiency of this approach. The three classifiers showed similar overall performance. SVM yielded the best recall value, 89.8%, for records that contain HAI. We present a machine learning approach as an alternative to rule-based systems, which are more common in this task. The classifiers were applied to a small and noisy dataset, generating results that indicate the potential of using learning algorithms for detecting HAI. Further research will have to focus on optimizing the performance of the classifiers and on testing them on larger datasets.
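The recall-first goal stated in the abstract (retrieve every HAI record, then maximize precision) can be illustrated with a small sketch. It is not the authors' method: the function names, the toy labels and the scores below are hypothetical, and the scores stand in for the output of any classifier that ranks records by how likely they are to contain HAI.

```python
def precision_recall(y_true, y_pred):
    """Return (precision, recall) for binary labels, where 1 marks an HAI record."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def pick_threshold(y_true, scores, min_recall=1.0):
    """Among thresholds that reach min_recall, return the one with the best precision.

    Returns a (threshold, precision, recall) tuple, or None if no threshold qualifies.
    """
    best = None
    for thr in sorted(set(scores)):
        # A record is flagged for manual review when its score reaches the threshold.
        y_pred = [1 if s >= thr else 0 for s in scores]
        p, r = precision_recall(y_true, y_pred)
        if r >= min_recall and (best is None or p > best[1]):
            best = (thr, p, r)
    return best


# Hypothetical toy data: 1 = record contains HAI, 0 = it does not.
y_true = [1, 0, 1, 0, 0, 1]
scores = [0.9, 0.4, 0.35, 0.2, 0.6, 0.8]  # assumed classifier scores
print(pick_threshold(y_true, scores))
```

On this toy data, full recall forces the threshold down to the lowest-scoring HAI record, so two non-HAI records are flagged as well. That is the trade-off the abstract describes: staff review a few extra records in exchange for missing none.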
Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2012.
Keywords
Hospital Acquired Infections, clinical text mining, Swedish clinical text, support vector machine, machine learning
Research subject Computer and Systems Sciences
Identifiers
URN: urn:nbn:se:su:diva-85409
ISBN: 978-1-4503-1919-5
OAI: oai:DiVA.org:su-85409
DiVA: diva2:583671
AND 2012, December 9, 2012, IIT Bombay, Mumbai, India