Digital Epidemiology: Use of Digital Data Collected for Non-epidemiological Purposes in Epidemiological Studies.

dc.contributor.authorPark, Hyeoun-Ae
dc.contributor.authorJung, Hyesil
dc.contributor.authorOn, Jeongah
dc.contributor.authorPark, Seul Ki
dc.contributor.authorKang, Hannah
dc.date.accessioned2020-06-08T14:28:42Z
dc.date.available2020-06-08T14:28:42Z
dc.date.issued0000-00-00
dc.description.abstractObjectives We reviewed digital epidemiological studies to characterize how researchers are using digital data by topic domain study purpose data source and analytic method Methods We reviewed research articles published within the last decade that used digital data to answer epidemiological research questions Data were abstracted from these articles using a data collection tool that we developed Finally we summarized the characteristics of the digital epidemiological studies Results We identified six main topic domains infectious diseases 58 7 non communicable diseases 29 4 mental health and substance use 8 3 general population behavior 4 6 environmental dietary and lifestyle 4 6 and vital status 0 9 We identified four categories for the study purpose description 22 9 exploration 34 9 explanation 27 5 and prediction and control 14 7 We identified eight categories for the data sources web search query 52 3 social media posts 31 2 web portal posts 11 9 webpage access logs 7 3 images 7 3 mobile phone network data 1 8 global positioning system data 1 8 and others 2 8 Of these 50 5 used correlation analyses 41 3 regression analyses 25 6 machine learning and 19 3 descriptive analyses Conclusions Digital data collected for non epidemiological purposes are being used to study health phenomena in a variety of topic domains Digital epidemiology requires access to large datasets and advanced analytics Ensuring open access is clearly at odds with the desire to have as little personal data as possible in these large datasets to protect privacy Establishment of data cooperatives with restricted access may be a solution to this dilemma
dc.identifier.urihttp://dx.doi.org/10.4258/hir.2018.24.4.253
dc.identifier.urihttps://lib.digitalsquare.io/handle/123456789/64302
dc.relation.uriHealthcare informatics research
dc.titleDigital Epidemiology: Use of Digital Data Collected for Non-epidemiological Purposes in Epidemiological Studies.en
dcterms.abstractObjectives We reviewed digital epidemiological studies to characterize how researchers are using digital data by topic domain study purpose data source and analytic method Methods We reviewed research articles published within the last decade that used digital data to answer epidemiological research questions Data were abstracted from these articles using a data collection tool that we developed Finally we summarized the characteristics of the digital epidemiological studies Results We identified six main topic domains infectious diseases 58 7 non communicable diseases 29 4 mental health and substance use 8 3 general population behavior 4 6 environmental dietary and lifestyle 4 6 and vital status 0 9 We identified four categories for the study purpose description 22 9 exploration 34 9 explanation 27 5 and prediction and control 14 7 We identified eight categories for the data sources web search query 52 3 social media posts 31 2 web portal posts 11 9 webpage access logs 7 3 images 7 3 mobile phone network data 1 8 global positioning system data 1 8 and others 2 8 Of these 50 5 used correlation analyses 41 3 regression analyses 25 6 machine learning and 19 3 descriptive analyses Conclusions Digital data collected for non epidemiological purposes are being used to study health phenomena in a variety of topic domains Digital epidemiology requires access to large datasets and advanced analytics Ensuring open access is clearly at odds with the desire to have as little personal data as possible in these large datasets to protect privacy Establishment of data cooperatives with restricted access may be a solution to this dilemma
dcterms.contributorPark, Hyeoun-Ae
dcterms.contributorJung, Hyesil
dcterms.contributorOn, Jeongah
dcterms.contributorPark, Seul Ki
dcterms.contributorKang, Hannah
dcterms.identifierhttp://dx.doi.org/10.4258/hir.2018.24.4.253
dcterms.relationHealthcare informatics research
dcterms.titleDigital Epidemiology: Use of Digital Data Collected for Non-epidemiological Purposes in Epidemiological Studies.en
Files
Collections