Comparing Support Vector Regression and Random Forests for Predicting Malaria Incidence in Mozambique
2013 (English)In: 2013 International Conference on Advances in ICT for Emerging Regions (ICTer), IEEE Computer Society, 2013, 217-221 p.Conference paper (Refereed)
Accurate prediction of malaria incidence is essentialfor the management of several activities in the ministry of health in Mozambique. This study investigates the comparison ofsupport vector machines (SVMs) and random forests (RFs) forthis purpose. A dataset with records of malaria cases covering theperiod 1999-2008 was used to evaluate predictive models on thelast year when developed from one up to nine years of historicaldata. Mean squared error (MSE) was used as performancemetric. The scheme for estimating variable importance commonlyemployed for RFs was also adopted for SVMs. SVMs developedfrom two year of historical data obtained the best predictionaccuracy. Hence, if we are interested in predicting the actualnumber of malaria cases the support vector machines modelshould be chosen. In the analysis of variable importance, IndoorResidual Spray (IRS), the districts of Manhiça and Matola andmonth of January turned out to be the most important predictorsin both the SVM and RF models.
Place, publisher, year, edition, pages
IEEE Computer Society, 2013. 217-221 p.
Research subject Computer and Systems Sciences
IdentifiersURN: urn:nbn:se:su:diva-97714DOI: 10.1109/ICTer.2013.6761181ISBN: 978-1-4799-1274-2OAI: oai:DiVA.org:su-97714DiVA: diva2:679944
2013 International Conference on Advances in ICT for Emerging Regions (ICTer), 11-15 December 2013, Colombo (Sri Lanka)