TY - JOUR
T1 - A hierarchical study for urban statistical indicators on the prevalence of COVID-19 in Chinese city clusters based on multiple linear regression (MLR) and polynomial best subset regression (PBSR) analysis
AU - Cheshmehzangi, Ali
AU - Li, Yujian
AU - Li, Haoran
AU - Zhang, Shuyue
AU - Huang, Xiangliang
AU - Chen, Xu
AU - Su, Zhaohui
AU - Sedrez, Maycon
AU - Dawodu, Ayotunde
N1 - The funding was provided by National Natural Science Foundation of China (NSFC) (71850410544, 72050410358,
71950410760) and also by the Ministry of Education, Culture, Sports, Science and Technology, Japan.
PY - 2022/2/4
Y1 - 2022/2/4
N2 - With evidence-based measures, COVID-19 can be effectively controlled by advanced data analysis and prediction. However, while valuable insights are available, there is a shortage of robust and rigorous research on what factors shape COVID-19 transmissions at the city cluster level. Therefore, to bridge the research gap, we adopted a data-driven hierarchical modeling approach to identify the most influential factors in shaping COVID-19 transmissions across different Chinese cities and clusters. The data used in this study are from Chinese officials, and hierarchical modeling conclusions drawn from the analysis are systematic, multifaceted, and comprehensive. To further improve research rigor, the study utilizes SPSS, Python and RStudio to conduct multiple linear regression and polynomial best subset regression (PBSR) analysis for the hierarchical modeling. The regression model utilizes the magnitude of various relative factors in nine Chinese city clusters, including 45 cities at a different level of clusters, to examine these aspects from the city cluster scale, exploring the correlation between various factors of the cities. These initial 12 factors are comprised of ‘Urban population ratio’, ‘Retail sales of consumer goods’, ‘Number of tourists’, ‘Tourism Income’, ‘Ratio of the elderly population (> 60 year old) in this city’, ‘population density’, ‘Mobility scale (move in/inbound) during the spring festival’, ‘Ratio of Population and Health facilities’, ‘Jobless rate (%)’, ‘The straight-line distance from original epicenter Wuhan to this city’, ‘urban per capita GDP’, and ‘the prevalence of the COVID-19’. The study’s results provide rigorously-tested and evidence-based insights on most instrumental factors that shape COVID-19 transmissions across cities and regions in China. Overall, the study findings found that per capita GDP and population mobility rates were the most affected factors in the prevalence of COVID-19 in a city, which could inform health experts and government officials to design and develop evidence-based and effective public health policies that could curb the spread of the COVID-19 pandemic.
AB - With evidence-based measures, COVID-19 can be effectively controlled by advanced data analysis and prediction. However, while valuable insights are available, there is a shortage of robust and rigorous research on what factors shape COVID-19 transmissions at the city cluster level. Therefore, to bridge the research gap, we adopted a data-driven hierarchical modeling approach to identify the most influential factors in shaping COVID-19 transmissions across different Chinese cities and clusters. The data used in this study are from Chinese officials, and hierarchical modeling conclusions drawn from the analysis are systematic, multifaceted, and comprehensive. To further improve research rigor, the study utilizes SPSS, Python and RStudio to conduct multiple linear regression and polynomial best subset regression (PBSR) analysis for the hierarchical modeling. The regression model utilizes the magnitude of various relative factors in nine Chinese city clusters, including 45 cities at a different level of clusters, to examine these aspects from the city cluster scale, exploring the correlation between various factors of the cities. These initial 12 factors are comprised of ‘Urban population ratio’, ‘Retail sales of consumer goods’, ‘Number of tourists’, ‘Tourism Income’, ‘Ratio of the elderly population (> 60 year old) in this city’, ‘population density’, ‘Mobility scale (move in/inbound) during the spring festival’, ‘Ratio of Population and Health facilities’, ‘Jobless rate (%)’, ‘The straight-line distance from original epicenter Wuhan to this city’, ‘urban per capita GDP’, and ‘the prevalence of the COVID-19’. The study’s results provide rigorously-tested and evidence-based insights on most instrumental factors that shape COVID-19 transmissions across cities and regions in China. Overall, the study findings found that per capita GDP and population mobility rates were the most affected factors in the prevalence of COVID-19 in a city, which could inform health experts and government officials to design and develop evidence-based and effective public health policies that could curb the spread of the COVID-19 pandemic.
KW - Diseases
KW - COVID-19
KW - multiple linear regression
KW - polynomial best subset regression
U2 - 10.1038/s41598-022-05859-8
DO - 10.1038/s41598-022-05859-8
M3 - Article
C2 - 35121784
SN - 2045-2322
VL - 12
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 1964 (2022)
ER -