Respondent-driven sampling bias induced by community structure and response rates in social networks
Number of Authors: 4
2017 (English)In: Journal of the Royal Statistical Society: Series A (Statistics in Society), ISSN 0964-1998, E-ISSN 1467-985X, Vol. 180, no 1, 99-118 p.Article in journal (Refereed) Published
Sampling hidden populations is particularly challenging by using standard sampling methods mainly because of the lack of a sampling frame. Respondent-driven sampling is an alternative methodology that exploits the social contacts between peers to reach and weight individuals in these hard-to-reach populations. It is a snowball sampling procedure where the weight of the respondents is adjusted for the likelihood of being sampled due to differences in the number of contacts. The structure of the social contacts thus regulates the process by constraining the sampling within subregions of the network. We study the bias induced by network communities, which are groups of individuals more connected between themselves than with individuals in other groups, in the respondent-driven sampling estimator. We simulate different structures and response rates to reproduce real settings. We find that the prevalence of the estimated variable is associated with the size of the network community to which the individual belongs and observe that low degree nodes may be undersampled if the sample and the network are of similar size. We also find that respondent-driven sampling estimators perform well if response rates are relatively large and the community structure is weak, whereas low response rates typically generate strong biases irrespectively of the community structure.
Place, publisher, year, edition, pages
2017. Vol. 180, no 1, 99-118 p.
Complex networks, Network sampling, Public health, Respondent-driven sampling bias
IdentifiersURN: urn:nbn:se:su:diva-141276DOI: 10.1111/rssa.12180ISI: 000397117600005OAI: oai:DiVA.org:su-141276DiVA: diva2:1086999