Logo-jrhs
J Res Health Sci. 21(2):e00518. doi: 10.34172/jrhs.2021.46

Short Communication

Predictive Role of Population Density and Use of Public Transport for Major Outcomes of SARS-CoV-2 Infection in the Italian Population: An Ecological Study

Alfonso Ilardi 1, *, Sergio Chieffi 2, Ciro Rosario Ilardi 3
1Department of Internal Medicine, A.O.R.N. "Antonio Cardarelli Hospital", Naples, Italy
2Department of Experimental Medicine, University of Campania "Luigi Vanvitelli", Naples, Italy
3Department of Psychology, University of Campania "Luigi Vanvitelli", Caserta, Italy
* Correspondence: Alfonso Ilardi (MD) Tel: +39 3494741157 E-mail: alfonso.ilardi@aocardarelli.it

Abstract

Background: This study aimed at assessing how population density (PD), aging index (AI), use of public transport (URPT), and PM10 concentration (PI) modulated the trajectory of the main COVID-19 pandemic outcomes in Italy, also in the recrudescence phase of the epidemic.

Study design: Ecological study.

Methods: For each region, we recovered data about cases, deaths, and case fatality rate (CFR) recorded since both the beginning of the epidemic and September 1, 2020. Data about total hospitalizations were included as well.

Results: PD correlated with, and was the best predictor of, total and partial cases, total and partial deaths, and total hospitalizations. Moreover, URPT correlated with, and was the best predictor of, total CFR. Besides, PI correlated significantly with total and partial cases, total and partial deaths, and total hospitalizations.

Conclusions: PD explains COVID-19 morbidity, mortality, and severity while URPT is the best predictor of disease lethality. These findings should be interpreted with caution due to the ecological fallacy.

Keywords: Air Pollution, COVID-19, Ecological Study, Population Density, Public Transport

Copyright

© 2021 The Author(s); Published by Hamadan University of Medical Sciences.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Introduction

Respiratory infections, such as Coronavirus Disease 2019 (COVID-19) spread through droplets (5-10 μm) and aerosols (smaller than 5 μm) exhaled by infected individuals during quiet breathing, simple conversation, or through coughing and sneezing 1 . Hence, there is a need to comply with the recommendations provided by health authorities on social distancing (physical distancing of one meter or more) and the use of personal protective equipment. These measures, together with regular hand washing, disinfection of surfaces, and full personal hygiene, can be valuable contributions to counteract the spread of contagion 2,3 .

After an initial decrease in cases and admissions to the intensive care unit, the contagion curve increased again with the approach of autumn. Consequently, in many European countries, including Italy, the political authorities imposed new restrictions; however, some matters remained controversial.

The increase in tourist flows, which generally occurs between July and August, may have contributed to the second wave of the pandemic. However, this hypothesis is more applicable to the countries of the northern hemisphere, rather than the countries of the southern hemisphere where the summer season begins on December 21st and ends on March 20th.

Recent studies have suggested that population density (PD) is one of the key factors favoring transmissibility 4,5 . The use of public transport has also been reported as a possible factor increasing the spread of contagion 6 . The number of public transport users is higher in the most densely populated territories, and the restricted environments in which the transport occurs may not allow social distancing. Although the disease primarily affects older and multimorbid adults, the results of previous investigations have indicated that the aging index (AI) does not affect COVID-19 morbidity, mortality, and lethality 4 . There are also non-demographic variables that could moderate the impact of the COVID-19 outbreak. For instance, it has been hypothesized that PM10 concentrations exceeding 0.05 mg/m3 may result in accelerated dissemination of the virus 7 .

In light of issues such as those raised above, the present ecological study aimed at estimating the contribution of PD, AI, public transport use, and PM10 concentration in modulating the trajectory of the main COVID-19 pandemic outcomes in Italy, also in the recrudescence phase of the epidemic.


Methods

To detect any relationships between the available demographic/epidemiological data and the impact of COVID-19 pandemic on the Italian population 8-13 , we characterized each of the 20 Italian regions based on the PD (i.e., number of inhabitants per km 2 ), AI (i.e., the number of elders per 100 persons younger than 15 years old; a value higher than 100 indicates a higher number of older subjects than younger ones), utilization rate of public transport users (URPT), annual average of PM10 daily mean concentration (pollution index [PI]), total number of positive cases, total number of hospitalized patients (i.e., ordinary hospitalization and intensive care), total number of deaths, and case fatality rate (CFR) (i.e., the proportion of deceased patients among the total number of positive cases).

The data about the total number of cases, deaths, hospitalizations, and the total CFR, were collected from the beginning of the epidemic until November 4, 2020. Our archival research also included the partial estimate of cases, deaths, and CFR calculated from September 1, 2020, until November 4, 2020. These estimates will be referred to as partial cases, partial deaths, and partial CFR, respectively (Table 1). Data about PD, AI, and URPT were provided by the Italian National Institute of Statistics 10-12 while those about IP were collected from the Global Health Observatory (World Health Organization) 13 .

Table 1. Descriptive statistics for each region (Data updated on November 4, 2020)
Region PD
(Inhab /Km2)
AI (%) URPT (%) a PI b Total cases Total deaths Total hospitalizations c Partial cases d Partial deaths d CFR (%) Partial
CFR (%) d
Abruzzo120.6197.755.524.312,5435684638,787964.531.09
Basilicata55.3200.550.418.72,788541042,266261.941.15
Calabria126.416946.322.96,0921252234,602272.050.59
Campania423.2134.749.931.169,6137391,74462,5052941.060.47
Emilia Romagna199.0186.462.224.862,9144,6991,71531,0312367.470.76
Friuli Venezia Giulia152.7223.158.621.912,2644142558,495663.380.78
Lazio340.4167.757.725.355,2731,3092,53444,0824312.370.98
Liguria284.9260.753.220.732,1171,8341,27321,1662635.711.24
Lombardia423.4169.760.329.5224,19117,8485,525124,1169837.960.79
Marche161.5202.358.423.916,2611,0324529,021456.350.50
Molise67.8225.550.918.92,01641351,491182.031.21
Piemonte171.0211.35626.381,4094,4813,75848,5283355.500.69
Puglia205.1175.449.823.222,08576386116,6452073.451.24
Sardegna67.7221.649.922.410,6402413898,4471072.271.27
Sicilia192.31594721.726,0805691,25321,7632832.181.30
Toscana161.9209.859.622.752,8151,4611,51640,9573042.770.74
Trentino-Alto Adige79.014262.818.119,96977356411,938763.870.64
Umbria104.0210.555.222.212,05615435610,263741.280.72
Valle d’Aosta38.5188.261.621.43,7201811632,479354.871.41
Veneto267.5178.261.727.665,5312,4781,22542,6023583.780.84

PD: population density; AI: aging index; URPT: utilization rate of public transport; PI: pollution index

aaUtilization rate of public transport users in 2019

b Annual average of PM10 daily mean concentration (mg/m3) during 2013-2016

c Ordinary hospitalization and intensive care

d Partial estimate of cases, deaths, and CFR since September 1, 2020

In particular, data about URPT were acquired through a mixed approach (i.e., computer-assisted web interviews, computer-assisted personal interviews, and paper-and-pencil interviews) involving approximately 25,000 families distributed across about 800 Italian municipalities of different demographic sizes 12 . Conversely, the data about PI were regularly acquired and processed through monitors located within the main Italian metropolitan areas 13 . Variables concerning the COVID-19 outcomes were finally extracted from the historical data provided by the Italian Ministry of Health and National Health Institute 8,9 .

Statistical analyses were performed by means IBM SPSS Statistics for Windows (version 26). Bivariate correlation analysis was used for exploratory purposes. Cohen’s conventions (weak: <0.30; moderate: 0.30–0.50; strong: >0.50) were employed to interpret effect size. Subsequently, stepwise regression analyses were performed to study the relationship of sociodemographic (i.e., PD, AI, URPT) and environmental (i.e., PI) variables with the morbidity, mortality, severity, and lethality of COVID-19.

We ran seven stepwise regression models entering PD, AI, URPT, and PI as independent variables, and total cases, total hospitalized patients, total deaths, total CFR, partial cases, partial deaths, and partial CFR as separate dependent ones. In SPSS, the stepwise regression is a statistical data-driven regression technique combining forward and backward regression methods. The analysis started with no independent variables in the model.

At each step, the predictor explaining the largest amount of variance was added to the model. Variables entered in the model were systematically re-evaluated to establish whether their unique contribution to the variance was still significant after the addition of other predictors. Candidate predictors were dropped if they were no longer significant 14 . For all analyses, the significance threshold was set at α≤0.05 and adjusted according to Bonferroni’s correction method.


Results

About half of the variables under examination (i.e., total cases, total hospitalized patients, total deaths, partial cases, and partial deaths) did not satisfy the assumption of normality based on the examination of skewness and kurtosis values and results of the Shapiro-Wilk test. Therefore, non-parametric Spearman’s correlations (rrho) were performed. Table 2 summarizes the results of correlation analyses.

Table 2. Correlation matrix
Variables PD AI URPT PI Total
cases
Total
deaths
Total
hospitalizations
Partial
cases
Partial
deaths
CFR Partial
CFR
Population density-–0.360.05 0.70* 0.87* 0.75* 0.84* 0.87* 0.81* 0.27–0.21
Aging index–0.36-–0.01–0.34–0.35–0.17–0.35–0.40–0.310.090.21
Utilization rate of public transport0.05–0.01-0.110.320.530.230.260.23 0.62* –0.21
Pollution index 0.70* –0.340.11- 0.73* 0.58* 0.68* 0.72* 0.67* 0.24–0.39

Adjusted alpha (α/7)=0.007

*P ≤ 0.007

The PD showed a strong association with total cases (rrho=0.87, P<0.001), total hospitalized patients (rrho=0.84, P<0.001), and total deaths (rrho=0.75, P<0.001). Furthermore, PD had a strong correlation with partial cases (rrho=0.87, P<0.001) and deaths (rrho=0.81, P<0.001), while it did not correlate with total or partial CFR. Moreover, AI did not correlate with any of the variables under examination, while URPT showed a strong correlation with total CFR (rrho=0.62, P=0.003). Finally, PI had a significant and strong association with total (rrho=0.73, P<0.001) and partial cases (rrho=0.72, P<0.001), total (rrho=0.58, P=0.007) and partial deaths (rrho=0.67, P=0.001), and total hospitalized patients (rrho=0.68, P=0.001). Nevertheless, no correlation was found between PI and total or partial CFR.

As shown in Table 2, PD and PI were strongly interrelated (rrho=0.70, P<0.001). The PD and PI as well as AI and URPT entered seven regression models as candidate predictors of each outcome variable. Therefore, the finding of a bivariate relationship between these two variables suggested further examination of the presence of multicollinearity. In the case of correlation among predictors in a regression analysis, the variance of regression coefficients can be inflated. Therefore, we calculated the variance inflation factor (VIF) for each predictor to determine whether and how much the variance of regression coefficients was inflated.

The following general rules were applied to interpret the VIF values: VIF=1, no collinearity; VIF=1 to 5, moderate collinearity; VIF>5, high collinearity 15 . The variances of PD and PI coefficients were inflated by factors of 2.71 and 2.86, respectively. Furthermore, the tolerance values (i.e., the amount of variability not explained by the other independent variables) were higher than 0.10 (PD=0.37, PI=0.36) which indicates the absence of statistically significant multicollinearity 15 .

Overall, these results suggest the presence of moderate multicollinearity but not severe enough to affect the interpretation of regression models or warrant further corrective measures. Besides excluding multicollinearity, other assumptions of linear regression were satisfied: 1) both independent and dependent variables were quantitative, ii) there was a linear relationship between outcome and independent variables, and iii) the standardized residuals showed no drastic deviation from normality based on inspection of both P-P plots and histograms. Table 3 tabulates the results of regression analyses.

It was found that PD was the best predictor of the main pandemic outcomes. Particularly, the variances of total and partial cases, total and partial deaths, and total hospitalizations were explained only by increasing PD (total cases: R2=0.55, B=326.93, P<0.001; partial cases: R2=0.68, B=211.36, P<0.001; total deaths: R2=0.32, B=19.65, P=0.009; partial deaths: R2=0.64, B=1.57, P<0.001; total hospitalizations: R2=0.51, B=8.65, P<0.001). The variance of total CFR was explained by URPT; accordingly, a higher URPT predicted an increase in total CFR (R2=0.35, B=0.22, P=0.006). However, none of the candidate predictors sufficiently explained the variance of partial CFR.


Discussion

Based on the results of the present study, it is confirmed that PD increases the likelihood of interpersonal contacts and hence viral transmission. In line with previous studies, PD was found to be associated with the number of cases and deaths 4 . Moreover, PD showed a positive linear association with the total number of hospitalizations. This result suggests that the likelihood of developing the “severe form” of the COVID-19 that requires hospitalization is higher in the most populated regions of Italy. Furthermore, PD was strongly associated with the number of cases and deaths recorded during the second wave of the pandemic.

These findings, together with the confirmatory results of regression analysis, prove that PD is still the best demographic predictor of the spread of contagion and mortality of SARS-CoV-2 in the Italian population, also in the recrudescence phase of the epidemic. Unlike the findings of our previous study 4 , no positive relationship was found between PD and CFR. In this respect, some important considerations need to be discussed.

On the one hand, in our previous study, the statistical adjustment for multiple comparisons was not applied to avoid being too conservative. On the other hand, it should also be noted that, compared to the first months of the epidemic, a more widespread use of nasopharyngeal swabs for the assessment of cases, even asymptomatic ones, has involved increasingly broader population groups. Early detection of cases and their isolation might have contributed to the limitation of the viral circulation within the most vulnerable populations which, in turn, has led to lower CFR and the loss of correlation.

According to the results of regression analysis, URPT was the best predictor of total CFR. In other words, sharing the confined environment of a public transport vehicle or crowded public transport stations might increase the lethality of SARS-CoV-2. On average, people using public transports are younger (students and workers), and the infection tends to progress in an apparently asymptomatic manner or with few symptoms in younger subjects 16 . Therefore, this may have favored the circulation of asymptomatic/paucisymptomatic cases, and consequently the transmission of coronavirus to the most vulnerable populations including older adults and multimorbid patients.

Table 3. Predictors of the main COVID-19 pandemic outcomes
Variables B t P F R 2
Total cases F (1,18)=22.215, P<0.001 0.55
Population density326.934.7130.001
Aging index–0.01–0.060.950
Utilization rate of public transport0.281.9100.073
Pollution index0.311.2190.240
Total hospitalized patients F (1,18)=19.086, P<0.001 0.51
Population density8.654.3690.001
Aging index0.040.210.836
Utilization rate of public transport0.211.3240.203
Pollution index0.270.990.338
Total deaths F (1,18)=8.518, P=0.009 0.32
Population density19.652.9190.009
Aging index0.060.300.765
Utilization rate of public transport0.311.6670.114
Pollution index0.220.6750.509
Total case fatality rate F (1,18)=9.581, P=0.006 0.35
Population density0.190.9970.333
Aging index0.160.8150.427
Utilization rate of public transport0.223.0950.006
Pollution index0.170.8740.394
Partial cases F (1,18)=38.700, P<0.001 0.68
Population density211.366.2210.001
Aging index–0.06–0.3950.698
Utilization rate of public transport0.211.6690.113
Pollution index0.321.5140.148
Partial deaths F (1,18)=32.089, P<0.001 0.64
Population density1.575.6650.001
Aging index0.020.1480.884
Utilization rate of public transport0.191.4170.175
Pollution index0.090.3800.709
Partial case fatality rate F (1,18)=4.425, P=0.050 0.20
Population density0.130.3530.728
Aging index0.180.7900.442
Utilization rate of public transport–0.22–1.0280.318
Pollution index–0.04–2.1040.050

Adjusted alpha (α/4)=0.01

As previously mentioned, no association was found between AI and the main pandemic outcomes. Generally speaking, the females are the least affected, and the mean survival rate is higher in elderly multimorbid females with COVID-19 17 . Since the infection affects males more than females 18,19 , the higher prevalence of females in the older age groups of the Italian population may have blunted the association of AI with morbidity, mortality, disease severity, and lethality of SARS-CoV-2.

It is widely acknowledged that biological and chemical components of air pollution can have a negative impact on human health. Air pollution represents a well-known cause of prolonged inflammation as a result of an over-expression of inflammatory cytokines, even in healthy subjects 20 . Based on previous research, there is a positive association between the time of exposure to particulate matter, prior to the pandemic period, and the increase in vulnerability to SARS-CoV-2 21 . Furthermore, air pollution appears to increase the risk of mortality from COVID-19 22 . In line with this evidence, we found that PI has a significant correlation with morbidity, mortality, and total number of hospitalizations; however, no correlation was observed between PI and CFR. It should be noted that the greatest exposure to PM10 is likely prevalent in younger subjects who use public transport daily for study or work needs.

Ecological fallacy is the main limitation of the present study. Ecological fallacy is a bias affecting the interpretation of results since a relationship observed among variables on an aggregate level does not necessarily represent with confidence the association existing at an individual level 23 . For instance, from a statistical standpoint, a correlation tends to be stronger when an association is assessed at an aggregate level rather than an individual level. Still, details about individual profiles may be missed when analyses are conducted on aggregate data. Consequently, the findings of the present study should be interpreted with caution.


Conclusion

The current pandemic has brought to the attention of political and health authorities several issues that directly affect the daily lives of citizens. These issues extend from travel patterns within the cities of residence to the frequency of schools. Epidemiological studies involving demographic and environmental variables are needed to help policy-makers and administrative authorities to make informed and targeted decisions that can substantially change daily life.

In the present study, it was confirmed that PD plays a striking role in exacerbating the spread of contagion, and its relationship with disease severity and mortality. Furthermore, a strong association was found between URPT and COVID-19 lethality. It was hypothesized that the use of public transport by younger individuals, who are more likely to be asymptomatic, may promote the circulation of the virus and, consequently, its spread to more vulnerable individuals.

The limitation of ecological studies, such as this one, is that they provide trend information only on aggregated data. Nevertheless, the present study, which “photographs” the impact of some relevant aspects of the current pandemic in the Italian context, could serve as a guide for political and administrative management, and as a warning for future research.


Conflict of interest

The authors have no conflicts of interest to declare.


Funding

No funding to declare.


Highlights

  • Population density, frequent use of public transports, and larger PM10 concentrations may modulate the trajectory of the main COVID-19 pandemic outcomes.

  • Population density is the best predictor of morbidity, mortality, and disease severity.

  • Higher use of public transports explains the fatality of COVID-19.


References

  1. Prather KA, Wang CC, Schooley RT. Reducing transmission of SARS-CoV-2. Science 2020; 368(6498):1422-4.
  2. Chu DK, Akl EA, Duda S, Solo K, Yaacoub S, Schünemann HJ. Physical distancing, face masks, and eye protection to prevent person-to-person transmission of SARS-CoV-2 and COVID-19: a systematic review and meta-analysis. Lancet 2020; 395(10242):1973-87.
  3. Saghafipour A. Indirect and Potential Impacts of the COVID-19 Pandemic on the Public Health. J Res Health Sci 2020; 20(3):e00492.
  4. Ilardi A, Chieffi S, Iavarone A, Ilardi CR. SARS-CoV-2 in Italy: Population Density Correlates with Morbidity and Mortality. Jpn J Infect Dis 2021; 74(1):61-4.
  5. de Oliveira RP, Achcar JA, Nunes AA. Modeling the incidence and death rates of COVID-19 pandemic in different regions of the world. Epidemiol Methods 2020.
  6. Tirachini A, Cats O. COVID-19 and public transportation: Current assessment, prospects, and research needs. J Public Trans 2020; 22(1):1-21.
  7. Setti L, Passarini F, De Gennaro G, Barbieri P, Licen S, Perrone MG. Potential role of particulate matter in the spreading of COVID-19 in Northern Italy: first observational study based on initial epidemic diffusion. BMJ Open 2020; 10(9):e039338.
  8. Coronavirus Statistics. 2020 [Updated 4 November 2020]; Available from: https://statistichecoronavirus.it.
  9. Ministry of Health and National Health Institute. 2020 [Updated 4 November 2020]; Available from: https://opendatadpc.maps.arcgis.com/apps/opsdashboard/index.html#/b0c68bce2cce478eaac82fe38d4138b1.
  10. Italian National Institute of Statistics. Boundaries of administrative units and territorial bases 2020; Available from: http://dati.istat.it/Index.aspx?DataSetCode=DCCV_CARGEOMOR_ST_COM.
  11. Italian National Institute of Statistics. Nowcast system for demographic indicators 2020; Available from: http://dati.istat.it/Index.aspx?DataSetCode=DCIS_INDDEMOG1.
  12. Italian National Institute of Statistics. Land moves before COVID-19 2019; Available from: https://www.istat.it/it/archivio/242574.
  13. The Global Health Observatory. Ambient air pollution. WHO website. 2020 [Updated 2016]; Available from: www.who.int/airpollution/data/cities/en.
  14. Warner RM. Applied statistics: From bivariate through multivariate techniques. 2nd ed. Sage Publications; 2012.
  15. Daoud JI. Multicollinearity and regression analysis. J Phys Conf Ser 2017; 949(1):012009.
  16. Li R, Pei S, Chen B, Song Y, Zhang T, Yang W. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2). Science 2020; 368(6490):489-93.
  17. Ilardi A, Politi C, Ciarambino T. COVID-19: could sex and age be a risk factor? Minerva Med. 2020.
  18. Ambrosino I, Barbagelata E, Corbi G, Ciarambino T, Politi C, Moretti AM. Gender differences in treatment of Coronavirus Disease-2019. Monaldi Arch Chest Dis 2020; 90(4):646-56.
  19. Peckham H, de Gruijter NM, Raine C, Radziszewska A, Ciurtin C, Wedderburn LR. Male sex identified by global COVID-19 meta-analysis as a risk factor for death and ITU admission. Nat Commun 2020; 11(1):6317.
  20. Tsai DH, Riediker M, Berchet A, Paccaud F, Waeber G, Vollenweider P. Effects of short- and long-term exposures to particulate matter on inflammatory marker levels in the general population. Environ Sci Pollut Res Int 2019; 26(19):19697-704.
  21. Yao Y, Pan J, Wang W, Liu Z, Kan H, Qiu Y. Association of particulate matter pollution and case fatality rate of COVID-19 in 49 Chinese cities. Sci Total Environ 2020; 741:140396.
  22. Pozzer A, Dominici F, Haines A, Witt C, Münzel T, Lelieveld J. Regional and global contributions of air pollution to risk of death from COVID-19. Cardiovasc Res 2020; 116(14):2247-53.
  23. Shahbazi F, Khazaei S. Socio-economic inequality in global incidence and mortality rates from coronavirus disease 2019: an ecological study. New Microbes New Infect 2020; 38:100762.
Submitted: 11 Feb 2021
Revised: 30 Jun 2021
First published online: 12 Apr 2021
EndNote EndNote

(Enw Format - Win & Mac)

BibTeX BibTeX

(Bib Format - Win & Mac)

Bookends Bookends

(Ris Format - Mac only)

EasyBib EasyBib

(Ris Format - Win & Mac)

Medlars Medlars

(Txt Format - Win & Mac)

Mendeley Web Mendeley Web
Mendeley Mendeley

(Ris Format - Win & Mac)

Papers Papers

(Ris Format - Win & Mac)

ProCite ProCite

(Ris Format - Win & Mac)

Reference Manager Reference Manager

(Ris Format - Win only)

Refworks Refworks

(Refworks Format - Win & Mac)

Zotero Zotero

(Ris Format - FireFox Plugin)

Abstract View: 471
PDF Download: 179
Full Text View: 165