Skip to main content
Advertisement
  • Loading metrics

An investigation of spatial-temporal patterns and predictions of the coronavirus 2019 pandemic in Colombia, 2020–2021

  • Amna Tariq ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Resources, Software, Visualization, Writing – original draft, Writing – review & editing

    atariq1@student.gsu.edu

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • Tsira Chakhaia,

    Roles Writing – review & editing

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • Sushma Dahal,

    Roles Writing – review & editing

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • Alexander Ewing,

    Roles Writing – review & editing

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • Xinyi Hua,

    Roles Writing – review & editing

    Affiliation Department of Biostatistics, Epidemiology and Environmental Health Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, Georgia, United States of America

  • Sylvia K. Ofori,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Department of Biostatistics, Epidemiology and Environmental Health Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, Georgia, United States of America

  • Olaseni Prince,

    Roles Writing – review & editing

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • Argita D. Salindri,

    Roles Writing – review & editing

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • Ayotomiwa Ezekiel Adeniyi,

    Roles Investigation, Writing – review & editing

    Affiliation Department of Computer Science, College of Arts and Sciences, Georgia State University, Atlanta, Georgia, United States of America

  • Juan M. Banda,

    Roles Funding acquisition, Investigation, Methodology, Resources, Writing – review & editing

    Affiliation Department of Computer Science, College of Arts and Sciences, Georgia State University, Atlanta, Georgia, United States of America

  • Pavel Skums,

    Roles Funding acquisition, Methodology, Resources, Writing – review & editing

    Affiliation Department of Computer Science, College of Arts and Sciences, Georgia State University, Atlanta, Georgia, United States of America

  • Ruiyan Luo,

    Roles Writing – review & editing

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • Leidy Y. Lara-Díaz,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Centro de Investigación en Ingeniería Matemática (CI2MA) and Departamento de Ingeniería Matemática, Universidad de Concepción, Concepción, Chile

  • Raimund Bürger,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Centro de Investigación en Ingeniería Matemática (CI2MA) and Departamento de Ingeniería Matemática, Universidad de Concepción, Concepción, Chile

  • Isaac Chun-Hai Fung,

    Roles Writing – review & editing

    Affiliation Department of Biostatistics, Epidemiology and Environmental Health Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, Georgia, United States of America

  • Eunha Shim,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Department of Mathematics and Integrative Institute of Basic Sciences, Soongsil University, Seoul, Republic of Korea

  • Alexander Kirpich,

    Roles Writing – review & editing

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • Anuj Srivastava,

    Roles Methodology, Resources, Writing – review & editing

    Affiliation Department of Statistics, Florida State University, Tallahassee, Florida, United States of America

  •  [ ... ],
  • Gerardo Chowell

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – review & editing

    Affiliation Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, Georgia, United States of America

  • [ view all ]
  • [ view less ]

Abstract

Colombia announced the first case of severe acute respiratory syndrome coronavirus 2 on March 6, 2020. Since then, the country has reported a total of 5,002,387 cases and 127,258 deaths as of October 31, 2021. The aggressive transmission dynamics of SARS-CoV-2 motivate an investigation of COVID-19 at the national and regional levels in Colombia. We utilize the case incidence and mortality data to estimate the transmission potential and generate short-term forecasts of the COVID-19 pandemic to inform the public health policies using previously validated mathematical models. The analysis is augmented by the examination of geographic heterogeneity of COVID-19 at the departmental level along with the investigation of mobility and social media trends. Overall, the national and regional reproduction numbers show sustained disease transmission during the early phase of the pandemic, exhibiting sub-exponential growth dynamics. Whereas the most recent estimates of reproduction number indicate disease containment, with Rt<1.0 as of October 31, 2021. On the forecasting front, the sub-epidemic model performs best at capturing the 30-day ahead COVID-19 trajectory compared to the Richards and generalized logistic growth model. Nevertheless, the spatial variability in the incidence rate patterns across different departments can be grouped into four distinct clusters. As the case incidence surged in July 2020, an increase in mobility patterns was also observed. On the contrary, a spike in the number of tweets indicating the stay-at-home orders was observed in November 2020 when the case incidence had already plateaued, indicating the pandemic fatigue in the country.

Author summary

As the COVID-19 pandemic continues to spread across Colombia, studies highlighting the intensity of the pandemic have become imperative for appropriate resource allocation and informing public health policies. In this study we utilize mathematical models to infer the transmission dynamics of SARS-CoV-2 at the regional and national level as well as short-term forecasting of the COVID-19 epidemic trajectory. Moreover, we examine the geographic heterogeneity of the COVID-19 case incidence in Colombia along with the analysis of mobility and social media trends in relation to the observed COVID-19 case incidence in the country. The estimates of reproduction numbers at the national and regional level show a decline in disease transmission as of October 31, 2021. Moreover, the 30-day ahead short-term forecasts for the most recent time-period (October 2-October 31, 2021) generated using the mathematical models needs to be interpreted with caution as the single peak models point towards a sustained decline in case incidence contrary to the sub-epidemic wave model. Nevertheless, the spatial analysis in Colombia shows distinct variations in incidence rate patterns across different departments which can be grouped into four distinct clusters. Moreover, the social media and mobility trends also explain the occurrence of case resurgences over time.

Introduction

Ever since the detection of the first cluster of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in December 2019, the coronavirus disease 2019 (COVID-19) pandemic continues to threaten the world [1]. As this pandemic gears to continue in the third year, it has already resulted in devastating global socioeconomic and political consequences [2]. The virus’s novelty and complex transmission dynamics have resulted in non-pharmaceutical public health measures including social distancing mandates and intermittent lockdowns as the primary weapons to fight COVID-19 [3]. Moreover, as new variants of SARS-CoV-2 continue to emerge amidst vaccination campaigns globally, it remains unclear how the COVID-19 pandemic will unfold [4,5]. Despite the uncertainty in the development of the pandemic, SARS-CoV-2 continues to exert substantial morbidity and mortality burden surpassing 246 million confirmed cases and nearly 5 million associated deaths worldwide as of October 31, 2021 [6]. The mortality and morbidity of the COVID-19 pandemic continues to overwhelm the health care systems of many nations including the United States, Colombia, Mexico, Brazil and Argentina in the Americas [7].

As the third COVID-19 wave strikes globally, it remains imperative to understand the dynamics of disease transmission and the potential role of mitigation strategies at different spatial scales including the municipality, regional or departmental level in order to contain the virus [8]. Assessing the geographical patterns of the cases and identifying the COVID-19 impact at the regional level can provide valuable insights regarding the exposure to the virus and the distribution of public health resources required to mitigate its spread [9,10]. Pandemic emergency response which includes the implementation and lifting of dynamic lockdowns [11], social distancing, mask mandates, population testing, and provision of vaccinations has varied across countries in the same region and in different cities within the same country [12]. As with Latin America, one of the epicenters of the pandemic [13], the evolution of the pandemic and the quality of public health responses have varied across nations. While Chile aimed to mitigate the virus to slow down the disease and reduce peak in health care demand, Brazil, Ecuador, and Colombia’s objective was to contain the spread of the virus by minimizing the risk of transmission from infected to non-infected individuals to stop the outbreak [14,15]. Despite aiming to contain virus transmission in the country, Colombia became the second country in Latin America and eighth globally to reach one million cases by the end of October 2020 [16]. Colombia has also seen the fastest increase in total COVID-19 associated deaths compared to other Latin American countries as reported by the end of the year 2020 [14].

In Latin America, where the first confirmed COVID-19 case was reported in Brazil on February 26, 2020 [17], the intensity and diversity of social distancing efforts have varied widely. Since the detection of the first imported confirmed case on March 6, 2020, Colombia has observed three epidemic peaks [13,18]. As the virus disseminated throughout the country, initial prevention recommendations provided between March 9–11, 2020 from the local government authorities conflicted with those at the national level [19], creating confusion amongst the public about the preventive measures required to protect themselves [19]. Therefore, the case incidence continued to rise leading to the declaration of the national emergency on March 12, 2020 [19]. As a part of the prompt escalation of measures to prevent a full-blown outbreak most educational institutions were closed by March 16, 2020, followed by the closure of the country borders the next day [19].

As cases continued to increase rapidly, domestic and international flights were suspended on March 23, 2020 [19]. Subsequently, to limit virus transmission within the country, a nationwide mandatory quarantine was implemented on March 24, 2020. Meanwhile a phased reopening of the economy was initiated as early as April 27, 2020, under strict protocols, to support the country’s dwindling economy [19].

The evolution of the pandemic in Colombia has justified the five extensions of the mandatory quarantine that lasted until August 31, 2020 [20]. At this point, the country transitioned into a period of “selective isolation with responsible individual distancing” as the daily incidence in the country’s main cities including Bogotá, Medellín, Cali, Bucaramanga and Pasto levelled off and eventually leaned towards a downward trend [21,22]. Moreover, Barranquilla, Cartagena, Leticia and Quibdó had overcome the worst part of the first wave by August 25, 2020 [21]. The chosen selective isolation strategy prioritized tracing of suspected cases, those with infection, and their contacts, while reactivating the social and economic life of the country. As the cases continued to increase in the subsequent months, the government imposed and lifted dynamic lockdowns in multiple cities across the country. By the end of year 2020, COVID-19 cases were mainly concentrated in Bogotá (314,745), the capital city of Colombia followed by Antioquia (163,952), and Valle del Cauca (81,101) [23].

As the pandemic entered its second year, a second wave of COVID-19 was detected, primarily fueled by the P.1 (Gamma) variant of SARS-CoV-2 initially circulating in Brazil [24]. Soon after, the government announced a mass vaccination strategy that began on February 20, 2021 [25]. Around the same time, on February 25, 2021, the president of Colombia extended the duration of the national health emergency to last until May 31, 2021 [26]. Amidst delayed vaccine acquisition and slow vaccine distribution in the country, the cases continued to rise in some cities like Bogota and Barranquilla [27,28]. This resulted in the implementation of curfews in the cities and municipalities followed by the opening of businesses based on the occupancy rates of the intensive care units (ICU) in the hospitals beginning March 25, 2021 [27]. From March to June 2021, Colombia experienced a massive third wave of the COVID-19 pandemic, which evolved into a two stage peak-within-a-peak surge [28]. The month of May 2021 was reported to be the deadliest month resulting in an average of ~20,000 cases and 500 deaths per day [29]. While the impact of COVID-19 pandemic was not uniform across the entire country, Bogotá, Antioquia and Valle del Cauca have been the hardest hit areas in Colombia [30].

The Colombian government’s response to contain the COVID-19 pandemic has been declared the third worst out of 116 countries evaluated by the Lowy Institute in Australia, only ahead of Mexico and Peru [31]. Despite being one of the first countries in Latin America to offer diagnostic tests for COVID-19 [32], the testing and vaccination rate for Colombians remains low [33], with 0.79 tests per 1000 people per day and 0.39 vaccine doses administered per 100 people as of October 31, 2021. [34]. Hence, the country stands with a total of 5,002,387 cases and 127,258 deaths as of October 31, 2021 [30]. The factors contributing to the current COVID-19 outbreak are myriad. However, three distinguishable events have particularly interacted synergistically to add to the complexity of the pandemic in Colombia. These include the COVID-19 outbreaks in prisons and nursing homes that affected the vulnerable communities of the society [35,36] and the April 2021 Colombian protests provoked by the government policy proposals [28]. The Colombian prisons in Cali, Villavicencio, and Bogotá were severely impacted by the COVID-19 pandemic due to over-crowding, inadequate medical supplies and unhygienic conditions of facilities, which led to many infected inmates [36].

The COVID-19 pandemic in Colombia presents complex risk dynamics of SARS-CoV-2 transmission with a simultaneous interplay of epidemiological, behavioral, and political factors. Hence assessing the temporal variation of transmission in near real time becomes vital to evaluate the effectiveness of intervention strategies and possible reopening of the economy. Moreover, forecasting the COVID-19 trajectory can help understand the trends of the disease and estimate its potential burden. As the epidemic trajectory of the COVID-19 pandemic continues to unfold we forecast the COVID-19 trajectory in near-real time utilizing the mathematical models that have been validated for previous infectious disease outbreaks such as Ebola, Zika and the COVID-19 pandemic [3740]. Moreover, we specifically investigate the transmission dynamics of SARS-CoV-2 at the national and regional levels. We augment the analysis by investigating the spatial epidemic curves that allow us to analyze geographic heterogeneity that is lost while evaluating the aggregated epidemic curves and examine the social media and mobility trends concerning the implementation of lockdowns.

Methods

Data

Six sources of data are analyzed in this study. A brief description of the data sets is provided as follows:

  1. (i) Case incidence data

Case incidence data based on the dates of symptom onset is used to generate the epidemic curve, scaled incidence curve (see S1 Text), short-term forecasts of the COVID-19 pandemic, assess the spatial heterogeneity in Colombia and estimate the national and regional reproduction numbers. Specifically, we utilize publicly available time series of laboratory-confirmed cases based on the dates of symptom onset for the national and regional COVID-19 case incidence. These data are retrieved from the Colombian Ministry of Health as of October 31, 2021 [23]. Colombian departments and their four main districts; Barranquilla, Bogotá, Cartagena, and Santa Marta can be grouped into five regions as follows: the Amazon, the Andean, the Orinoquía, the Caribbean and the Pacific [41]. To evaluate the spatial heterogeneity and estimate the regional reproduction numbers, for simplicity we assume a regional level distribution as follows (S1 Fig):

  1. Amazon region: Amazonas, Caquetá, Guainía, Guaviare, Putumayo and Vaupés.
  2. Andean region: Antioquia, Boyacá, Bogotá, Caldas, Cauca, Cundinamarca, Huila, Norte De Santander, Quindío, Risaralda, Santander, and Tolima.
  3. Orinoquía region: Arauca, Casanare, Meta, and Vichada
  4. Caribbean region: Atlántico, Bolívar, Cesar, Córdoba, La Guajira, Magdalena, San Andrés, Sucre, Barranquilla, Cartagena, and Santa Marta
  5. Pacific region: Chocó, Nariño, and Valle Del Cauca

Based on the geographical distribution of the departments, Nariño, Valle Del Cauca, Guainía, and Cauca appear in two regions each. However, we include them in one region as stated above, to avoid the duplication of data for our analysis.

  1. (ii) Mortality data

COVID-19 mortality data based on the date of death is retrieved from the Colombian Ministry of Health as of October 31, 2021 [23]. Mortality data is used to generate the national short-term forecast of the COVID-19 pandemic and estimate the national reproduction number.

  1. (iii) Genomic data

To estimate the reproduction number from the genomic data, from the 1855 samples available as of June 30, 2021, 136 SARS-CoV-2 genome samples for Colombia are obtained from the “Global Initiative on Sharing Avian Influenza Data” (GISAID) repository [42] between February 27 and April 5, 2020 to estimate the reproduction number.

  1. (iv) Mobility trends data

Two sources of mobility data are analyzed at the national level from March 6, 2020, to October 31, 2021. These data were retrieved as of October 31, 2021.

Apple’s mobility data is a set of data that summarizes the number of requests made to Apple Maps for directions. The data are categorized by countries/regions, sub-regions, and cities. Apple has released the data for the three modes of human mobility: driving, walking and public transit. These data are published publicly by Apple’s mobility trends reports [43]. These aggregated and anonymized data are updated daily and includes the relative volume of directions requests per country compared to the baseline volume on January 13, 2020. To emphasize on the change rather than the gross number of requests, Apple has normalized all the data. The number of direction requests for each country or city and the type of direction request (driving, walking and public transit) has been set to 100 for January 13, 2020. All subsequent data are relative to the starting point.

Google’s mobility data [44] is retrieved from users who have turned on the location history in their Google account. Google’s mobility data shows the changes in mobility in each geographic and administrative region for visits to different places, such as grocery stores, parks, and recreation spots. The baseline day represents the normal value on the day of the week. The baseline day is the median value from the 5-week period from January 3, 2020, to February 6, 2020.

  1. (v) Twitter data

We retrieved data from the publicly available Twitter data set of the COVID-19 chatter from March 12, 2020 to October 31, 2021 [45]. A detailed description of Twitter data acquisition and processing is provided in a prior study [45].

Modeling framework for forecast generation

We utilize and compare three dynamic phenomenological growth models to generate short-term (i.e., 30-day ahead) forecasts for Colombia. These models have been applied to various infectious diseases including SARS, foot and mouth disease, Ebola [4648] and the current COVID-19 outbreak [4951]. The phenomenological growth models applied in this study include the two models defined by one scalar differential equation namely the generalized logistic growth model (GLM) [47] and the Richards growth model [52]. Additionally we apply the sub-epidemic wave model [46] which captures diverse epidemic trajectories such as the multimodal outbreaks. The forecasts obtained from these dynamic growth models can assess the potential scope of the pandemic in near real-time, provide insights on the contribution of disease transmission pathways, predict the impact of control interventions, and evaluate optimal resource allocation to inform public health policies. A detailed description of these models is provided in S2 Text.

Model calibration and forecasting approach

We utilize the national and regional level case incidence data retrieved from the Colombian Ministry of Health based on the dates of symptom onset as of October 31, 2021. We also utilize the national level mortality data based on the date of death as of October 31, 2021. Each forecast is fitted to the daily case counts based on the dates of symptom onset and daily death counts based on the date of death between July 4, 2021, and October 1, 2021 (90 days calibration period) to conduct a 30-day ahead short-term forecast for each model. The data from October 2, 2020, to October 31, 2021, is utilized to assess the performance of our 30-day ahead short-term forecasts. Details of the model calibration and forecasting approach are provided in S3 Text.

Performance metrics

The following five performance metrics are used to assess the quality of our model fit and the 30-day ahead short-term forecasts: the mean absolute error (MAE) [53]; the root mean squared error (RMSE) [54]; the coverage of the 95% prediction intervals (95% PI) [54]; the mean interval score (MIS) [54] and the weighted interval score (WIS) [55]; for each of the three models: GLM; Richards model; and the sub-epidemic wave model. A detailed description of the performance metrics is provided in S4 Text.

Reproduction number

In order to understand the transmission dynamics of COVID-19, we estimate the effective reproduction number, Rt, for the early ascending phase from February 27, 2020, to March 27, 2020, and the effective reproduction number Rt throughout the pandemic for the national and regional COVID-19 epidemic curves. The effective reproduction number, Rt, is the key parameter that characterizes the average number of secondary cases generated by a primary case at calendar time (t) during the outbreak. This quantity is crucial for identifying the magnitude of public health interventions required to contain an epidemic [5658]. The estimates of Rt indicate whether widespread disease transmission continues (Rt>1) or disease transmission declines (Rt<1). Therefore, to contain an outbreak, it is vital to maintain Rt<1. We utilize the case incidence, mortality, and the genomic data to estimate the reproduction numbers using three rigorous methods to compare the estimates of Rt obtained from each method and data source.

Effective reproduction number Rt, using the generalized growth model (GGM)

We estimate the national and regional reproduction numbers by calibrating GGM to the early growth phase of the pandemic [59], from February 27, 2020 to March 27, 2020. The description of the GGM is provided in S2 Text. The generation interval of SARS-CoV-2 is modeled with the assumption of a gamma distribution with a mean of 5.2 days and a standard deviation of 1.72 days [60]. We first characterize the daily incidence of local cases using the GGM. The progression of local incidence cases by dates of symptom onset, Ii, is simulated using the calibrated GGM model and account for the daily series of imported cases by dates of symptom onset, Ji, into the renewal equation to estimate the effective reproduction, as

The factor Ji represents the imported cases at time ti, Ii denotes the local case incidence at calendar time ti and ρj represents the discretized probability distribution of the generation interval. The factor 0≤α≤1 represents the relative contribution of imported cases to secondary disease transmission. We perform sensitivity analysis by setting α = 0.15, α = 1.00 and α = 0.00 to assess the relative contribution of the imported cases to the secondary disease transmission [40]. The numerator represents the total new cases Ii, and the denominator represents the total number of primary cases that contributes to the generation the new cases Ii (i.e., as secondary cases) at time ti. Hence, Rt, represents the average number of secondary cases generated by a single case at calendar time t. The uncertainty bounds around the curve of are derived directly from the 300 bootstrap samples generated from the GGM model with estimated parameters () where i = 1,2, …S. We assume a negative binomial error structure where the variance is assumed to be three times the mean based on the noise of the data [61]. This method is utilized to derive the early estimates of reproduction number and has been employed in several prior studies [50,61,62].

Since the national and regional epidemic curves have a distinct date of onset according to the seeding of the first local case, therefore, Rt for the national and regional level is estimated based on the date of onset of the first local case. For the national and regional epidemic curves, we estimate Rt for the first 30 days (Table 1).

thumbnail
Table 1. Dates for Rt estimation for the national and regional epidemic curves for the first 30 epidemic days.

https://doi.org/10.1371/journal.pntd.0010228.t001

Effective reproduction number Rt, using the Cori et al. method

We estimate the regional and national effective reproduction numbers (also known as the time dependent instantaneous reproduction numbers) using the local and imported case incidence data based on the date of symptoms onset as of October 31, 2021. The total number of incident cases arising at time step t, It, is the sum of the number of local incident cases () and imported incident cases (). Hence,

We assume that if the imported cases exist, they can be distinguished from the local cases through epidemiological investigations so that and can be observed at each time step [63,64].

The time dependent (instantaneous) Rt is defined as the ratio of the number of new locally infected cases generated at calendar time t (), and the total infectiousness across all infected individuals at time t given by [65,66]. Hence Rt can be written as

In this equation, the factor ws represents the infectivity function, which is the infectivity profile of the infected individual. The infectivity function is dependent on the time since infection (s), but is independent of the calendar time (t) [67,68].

The term describes the sum of infection incidence up to time step t − 1, weighted by the values of infectivity function ws. The distribution of the generation time can be applied to approximate ws, however, since the time of infection is a rarely observed event, measuring the distribution of generation time becomes difficult [65]. Therefore, the time of symptom onset is usually used to estimate the distribution of the serial interval (SI), which is defined as the time interval between the dates of symptom onset among two successive cases in a disease transmission chain [69].

The infectiousness of a case is a function of the time since infection, which is proportional to ws if the timing of infection in the primary case is set as time zero of ws and we assume that the generation interval equals the serial interval (SI). The SI is assumed to follow a gamma distribution with a mean of 5.2 days and a standard deviation of 1.72 days [60]. We also estimate Rt for the mortality curve by the date of death as of October 31, 2021. The analytical estimates of Rt are obtained within a Bayesian framework using the Cori et al. method, employing the EpiEstim package in R version 4.0.3. [69]. The values for Rt are estimated using 1-weekly sliding window. We report the median and 95% credible interval (CrI).

Reproduction number, R, from the genomic analysis

To estimate the reproduction number for SARS-CoV-2 between February 27, 2020 and April 5, 2020 from the genomic data, 136 SARS-CoV-2 genome samples from Colombia and their sampling times are obtained from the GISAID repository [42,70]. Short sequences and sequences with significant number of gaps and non-identified nucleotides are removed. For clustering, they are complemented by the sequences from other geographical regions across the globe, downsampled to n = 1470 representative sequences using a country specific sub-sampling algorithm of Nextstrain. Briefly, the algorithm first chooses the sequences from the focal country, and then complements them with the sequences obtained from other geographical regions, prioritizing sequences from the geographical neighbors and the sequences that are genetically close to the focal sequences. This decreases the size of the genomic data set to allow for feasible phylogenetic inference. We use the sequence subsample from Nextstrain (www.nextstrain.org) global analysis as of July 4, 2021. These sequences are aligned to the reference genome obtained from the literature [71] using Muscle [72] and trimmed using Molecular Evolutionary Genetics Analysis version: 7 (Mega 7) [73].

The largest Colombian cluster that possibly corresponds to within-country transmissions has been identified using hierarchical clustering of sequences. Phylodynamics analysis of that cluster is carried out using Bayesian Evolutionary Analysis by Sampling Trees (BEAST) [74] version 2. We use a strict molecular clock and a tree prior with exponential growth coalescent. Markov Chain Monte Carlo (MCMC) sampling is run for 10,000,000 iterations, and the parameters are sampled every 1000 iterations. The results are acceptable if the effective sample sizes are above 200 for all parameters. The exponential growth rate, f estimated by BEAST is used to calculate the basic reproduction number, R0. We utilize the standard assumption that SARS-CoV-2 generation intervals (times between infection and onward transmission) are gamma-distributed [75]. In this case, R0 can be estimated as where μ and σ are the mean and standard deviation of the gamma distribution, respectively. The values of μ = 5.2 days and σ = 1.72 days are taken from a prior study [60]. The formula for R0 is a strictly monotone transformation of , therefore it also transforms the 95% highest posterior density (HPD) intervals for f into those for R0.

Spatial analysis

We quantify and statistically analyze the shapes of the incidence rate curves. Shape analysis of the national and regional incidence rate curves is performed following the analytical methodology as described in previous literature [76] including steps to pre-process the daily cumulative COVID-19 case data based on the dates of symptom onset. Then, we analyze the shapes of these rate curves to compare, cluster and summarize the incidence rates. The pre-processing steps applied to the COVID-19 daily count data are as follows:

  1. Smoothing: Cumulative case curves are smoothed over 10 days using the smooth function in the MATLAB. The smooth function returns a moving average of the elements of a vector using a fixed window length which is determined heuristically, and in the case of spatial analysis, it is 10 days. The window slides down the length of the vector, computing an average over the elements within each window.
  2. Time differencing: If fi(t) denotes the given cumulative number of confirmed cases for department i on day t, then the growth rate per day at time t is given by gi(t) = fi(t)−fi(t−1).
  3. Rescaling: Each curve is rescaled by dividing each gi by the total number of confirmed cases.
  4. Smoothing: We then smooth the normalized curves again over a 5-day span using the smooth function in the MATLAB.

To identify the clusters by comparing the curves, we used a simple metric. For any two rate curves, hi and hj, we compute the norm ||hi −hj||, where the double bars denote the L2 norm of the difference function, that is,

To perform clustering of thirty six curves into smaller groups, we apply the dendrogram function in the MATLAB using the “ward” linkage as explained in reference [76]. The number of clusters is determined empirically based on the display of overall clustering results. After clustering the departments and administrative divisions into different groups, we derive the average curve for each cluster after using a time wrapping algorithm as performed in previous studies [76,77]. All analysis is conducted in MATLAB R2020b.

Twitter data analysis

To observe any relationship between the COVID-19 cases based on the dates of symptom onset and the frequency of tweets indicating stay-at-home orders we used a public dataset of 9,233,913,414 unique tweets of the COVID-19 chatter [45]. The frequency of tweets indicating stay-at-home order is used to infer people’s compliance with the stay-at-home-orders implemented to avoid spread of the virus by practicing social distancing. Tweets indicate the number of the people in favor of lockdowns and show how these numbers have varied throughout the pandemic. We removed all retweets and tweets not in the Spanish language to obtain the plotted data. We also filtered the tweets using the following hashtags: #quedateencasa (stay-at-home), and #trabajardesdecasa (work-from-home), which are two of the most used hashtags when users refer to the COVID-19 pandemic and their engagement with health measures. Lastly, our analysis was restricted to the tweets that originated from Colombia. A set of 493,276 unique tweets were gathered between March 12, 2020, and October 31, 2021. The usage of Twitter in Colombia is less compared to other Latin American countries like Mexico and Chile [78], hence we observe a lower proportion of filtered tweets. We overlay the curve of filtered tweets and the epidemic curve in Colombia to observe any relation between the shape of the epidemic trajectory and the shape of curve for the frequency of filtered tweets during the established time-period. The correlation coefficient between the daily cases based on the dates of symptom onset and frequency of filtered tweets is also estimated using the "corrcoef” function in MATLAB version R2020b. This coefficient explains the linear dependency of the daily cases based on the dates of symptom onset and frequency of filtered tweets.

Mobility data analysis

We utilize the R code developed by Healy [79] to analyze the time series data for Colombia from March 6, 2020 to October 31, 2021, for two modes of mobility: driving and walking derived from Apple’s mobility reports. We analyze the mobility trends to discover any commonality in pattern of the mobility curves with the epidemic curve of COVID-19. The time series for mobility requests is decomposed into trends, weekly and remainder components. The trend is obtained by applying locally weighted regression fitted to the data and the remainder is any residual leftover on any given day after accounting for the underlying trend and normal daily fluctuations. The analysis is performed using ‘covmobility’ package in R software, version 4.0.3.

We analyze the Google mobility trends from March 6, 2020, to October 31, 2021 for visits to the grocery stores and pharmacy, parks, transit stations, workplaces, and residential areas compared to the baseline which is the median value, for the corresponding day of the week, during the five weeks from January 3, 2020 to February 6, 2020. For each region category, the baseline is not a single value; it comprises of seven individual values. The same number of visitors on two different days of the week results in different percentage changes. The analysis is performed using ‘tidycovid19’ package and R software, version 4.0.3.

Results

A timeline showing the major events during the COVID-19 pandemic in Colombia is presented in Fig 1. As of October 31, 2021, Colombia has reported 4,537,370 cases based on the dates of symptom onset. The highest number of cases have been reported in the Andean region (2,935,282 cases) followed by the Caribbean region (888,914 cases), the Pacific region (518,203 cases), the Orinoquía region (140,549 cases) and the Amazon region (54,431 cases). The COVID-19 epidemic curve in Colombia shows a five-modal pattern with the first peak occurring in mid-July 2020 after the phased reopening of the country. It was followed by a small surge that occurred in mid-October 2020 after the implementation of selective isolation and responsible social distancing interventions. The second peak occurred at the beginning of January 2021 and the third protracted surge of the COVID-19 cases can be observed from March through June 2021 (Fig 2). The mortality curve (S2 Fig) for COVID-19 in Colombia also shows a three-modal pattern. The peaks for the mortality curve coincide with the three epidemic waves observed for the case incidence data. The first peak occurred in July 2020 followed by a second peak in January 2021 and a third larger peak from April through June 2021.

thumbnail
Fig 1. Timeline of the COVID-19 pandemic in Colombia as of October 31, 2021.

https://doi.org/10.1371/journal.pntd.0010228.g001

thumbnail
Fig 2. Epidemic curve of the COVID-19 pandemic in Colombia as of October 31, 2021.

https://doi.org/10.1371/journal.pntd.0010228.g002

Model calibration and forecasting performance

We compare the results from model calibration and 30-day ahead forecasting across three models: GLM, Richards and the sub-epidemic wave model. Model calibration, using 90 days of the epidemic data, between July 4, 2021 and October 1, 2021 shows that at the regional and national levels, the sub-epidemic model outperformed the GLM and Richards growth model in terms of the all five performance metrics (RMSE, MAE, MIS, WIS and 95% PI coverage) (Table 2) using the case incidence data. Therefore, the sub-epidemic wave model can be declared the most accurate model for the calibration period. The model fits exhibited sub-exponential growth dynamics [48,59] for three models (p~0.6–0.8) at the national and regional levels. The calibration performances for each region and the national data are listed in Table 2. Calibrating the three models to the mortality data also shows that the sub-epidemic model outperforms the other two models (S1 Table) and exhibits sub-exponential growth dynamics.

thumbnail
Table 2. Comparison of model performance metrics by calibrating the GLM, Richards and the sub-epidemic model for 90 epidemic days (July 4, 2021 to October 1, 2021) at the national and regional level.

Higher 95% PI coverage and lower RMSE, MAE, WIS and MIS represent better performance. Best performing model is given in bold with the superscript “a”.

https://doi.org/10.1371/journal.pntd.0010228.t002

In terms of the forecasting performance, again the sub-epidemic wave model performed better than the GLM and Richards model for the national and regional case incidence data (Table 3). The sub-epidemic model also outperformed the GLM and Richards model in the forecasting phase of the mortality curve (S2 Table). Hence, the sub-epidemic model can be considered the most accurate model to forecast the epidemic trajectory.

thumbnail
Table 3. Comparison of 30-day ahead forecasting performance (October 2, 2021 to October 31, 2021) by calibrating the GLM, Richards and the sub-epidemic model for 90 epidemic days (July 4, 2021 to October 1, 2021) at the national and regional level.

Higher 95% PI coverage and lower RMSE, MAE, WIS and MIS represent better performance. Best performing model is given in bold with the superscript "a”.

https://doi.org/10.1371/journal.pntd.0010228.t003

30-day ahead forecasts

Calibrating our models from July 04, 2021, to October 1, 2021 and generating the 30-day ahead forecasts from October 2, 2021 to October 31, 2021 utilizing the GLM and Richards growth model indicates a downwards trend for the national and regional case incidence data (Figs 3 and 4). On the other hand, the sub-epidemic wave model captures the multiple sub-epidemics comprising the coarse aggregated COVID-19 epidemic wave of Colombia. The sub-epidemic model predicts a downward trend for the Amazon and Orinoquía region, consistent with the findings of the GLM and Richards model. Whereas, for the national, Andean, Caribbean and the Pacific region, the sub-epidemic model predicts a stable case incidence pattern (Fig 5). According to the GLM and Richards model, the COVID-19 pandemic in Colombia would decline to zero during the month of October whereas the sub-epidemic wave model predicts an accumulation of 24,525 (95% PI: 13,677, 44,515) cases at the national level for the month of October 2021.

thumbnail
Fig 3. 30-days ahead forecasts of the national and regional COVID-19 epidemic curves in Colombia by calibrating the GLM model from July 4, 2021 to October 1, 2021.

The blue circles correspond to the data points; the solid red line indicates the best model fit, and the red dashed lines represent the 95% prediction interval. The vertical black dashed line represents the time of the start of the forecast period.

https://doi.org/10.1371/journal.pntd.0010228.g003

thumbnail
Fig 4. 30-days ahead forecast of the national and regional COVID-19 epidemic curves in Colombia by calibrating the Richards model from July 4, 2021 to October 1, 2021.

The blue circles correspond to the data points; the solid red line indicates the best model fit, and the red dashed lines represent the 95% prediction interval. The vertical black dashed line represents the time of the start of the forecast period.

https://doi.org/10.1371/journal.pntd.0010228.g004

thumbnail
Fig 5. 30-days ahead forecast of the national and regional COVID-19 epidemic curves in Colombia by calibrating the sub-epidemic wave model from July 4, 2021 to October 1, 2021.

The blue circles correspond to the data points; the solid red line indicates the best model fit, and the red dashed lines represent the 95% prediction interval. The vertical black dashed line represents the time of the start of the forecast period.

https://doi.org/10.1371/journal.pntd.0010228.g005

At the regional level, the Richards and the GLM model predict zero cases for the month of October (Figs 3 and 4). However, the sub-epidemic model predicts 59 (95% PI: 0, 507) cases for the Amazon region in the month of October. The sub-epidemic model also predicts 12,186 (95% PI: 7107, 19,235) cases for the Caribbean region, 18,165 (95% PI: 9205, 31,205) cases for the Andean region, 5039 (95% PI: 2517, 8725) cases for the Pacific region and 510 (95% PI: 87, 1456) cases for the Orinoquía region (Fig 5). The 30-day ahead forecast of the national mortality data generated by the GLM and Richards model indicate a decline in the number of deaths whereas the sub-epidemic wave model indicates an upward trend in the mortality curve with ~2893 (95% PI: 1860, 5325) deaths that can accumulate in the month of October 2021 (S3 Fig).

Reproduction number

Estimate of the effective reproduction number, Rt from case incidence data.

The reproduction number for the early ascending growth phase of the epidemic from the case incidence data (February 27, 2020 to March 27, 2020) using GGM was estimated at Rt~1.30 (95% CI:1.20, 1.50) at α = 0.15 for the national data. The growth rate parameter, r, was estimated at 1.40 (95% CI: 0.91, 2.0) and the deceleration of growth parameter, p, was estimated at 0.64 (95% CI: 0.56, 0.71), indicating early sub-exponential growth dynamics of the COVID-19 pandemic in Colombia (Fig 6). Simultaneously, the estimates of Rt for all the regions remained consistently above 1.20 (between ~1.20–2.22) for the early ascending phase of the pandemic with the Amazon region showing the highest estimate of reproduction number, at Rt ~2.2 followed by the Orinoquía region with Rt~1.8. The estimates of Rt for the Andean, Pacific and Caribbean regions remained between Rt~1.2–1.4. All regions except the Amazon region depict sub-exponential growth dynamics for the COVID-19 pandemic in Colombia with the deceleration of the growth parameter, p, estimated between 0.54–0.86. The Amazon region shows almost exponential growth dynamics with the deceleration of growth parameter, p ~0.95 (95% CI: 0.74, 1.00). In contrast, the Andean and the Caribbean regions show an almost linear pattern of the epidemic trajectory with the deceleration of growth parameter estimated at, p ~0.58 (95% CI: 0.48, 0.69) and p ~0.59 (95% CI: 0.34, 0.87) respectively. The sensitivity analysis shows that the estimates of reproduction numbers do not vary significantly at α = 0.15 and α = 1.00 (Tables 4 and 5). Moreover, setting α = 0.00, thereby assuming that there is no contribution of imported cases to the disease transmission process or generation of secondary cases also does not substantially change the estimates of reproduction number.

thumbnail
Fig 6.

Upper panel: Reproduction number for Colombia with 95% CI estimated using the GGM model. The estimated reproduction number of the COVID-19 epidemic in Colombia as of March 27, 2020, is 1.30 (95% CI: 1.20, 1.50). The growth rate parameter, r, is estimated at 1.40 (95%CI: 0.91, 2.0) and the deceleration of the growth parameter, p, is estimated at 0.64 (95%CI: 0.56, 0.71) at α = 0.15. Lower panel: The lower panel shows the GGM fit to the case incidence data for the first 30 days from February 27, 2020 to March 27, 2020. The blue circles correspond to the data points; the solid red line indicates the best model fit, and the red dashed lines represent the 95% confidence interval. The cyan lines are the model fits obtained via bootstrapping.

https://doi.org/10.1371/journal.pntd.0010228.g006

thumbnail
Table 4. Estimates of reproduction number (R), growth rate parameter (r) and deceleration of the growth parameter (p) obtained from the renewal equation method utilizing the GGM for the early ascending phase of the epidemic (30 days) at the national and regional level at α = 0.15.

https://doi.org/10.1371/journal.pntd.0010228.t004

thumbnail
Table 5. Estimates of reproduction number (R), growth rate parameter (r) and deceleration of the growth parameter (p) obtained from the renewal equation method utilizing the GGM for the early ascending phase of the epidemic (30 days) at the national and regional level at α = 1.00.

https://doi.org/10.1371/journal.pntd.0010228.t005

Estimate of effective (instantaneous) reproduction number, Rt, from the Cori et al. method.

The early estimate of effective (instantaneous) reproduction number at the national level for the first 30 epidemic days exhibits a steep decline. The reproduction number was estimated at, Rt~2.85 (95% CrI: 1.91, 4.09) on March 6, 2020, which declined to Rt~0.81 (95% CrI: 0.76, 0.85) by March 27, 2020 (Figs 7 and 8). At the national level the effective reproduction number has remained consistently above Rt~1.00 between April and July 3, 2020, after which the reproduction number declined to less than 1.00 (Rt~0.9) until September 22, 2020. Since the end of September 2020, Rt has fluctuated around ~1.00 (Rt~ 0.8–1.4) until the end of April 2021. From May till mid-June 2021, the estimates of Rt remained above ~1.0 indicating continued disease transmission. Since then, the estimates of Rt have mostly remained below ~1.0 with the most recent estimate of Rt~0.84 (95% CrI: 0.82, 0.86) as of October 31, 2021 (Fig 8).

thumbnail
Fig 7. Effective (instantaneous) reproduction number with 95% CrI estimated using the Cori et al. method and the estimate of reproduction number with 95% CI utilizing the GGM for simulating the parameters employed in the renewal equation method for the COVID-19 epidemic in Colombia as of March 27, 2020 (first 30 days).

The red solid line is the mean reproduction number estimated from the renewal equation method and the red dashed lines are the 95% CI coverage around the mean reproduction number. The green solid line is the median effective reproduction number estimated from the Cori et al. method and the green dashed lines are the 95% CrI coverage around the median instantaneous reproduction.

https://doi.org/10.1371/journal.pntd.0010228.g007

thumbnail
Fig 8.

Upper panel: Effective reproduction number estimated from the Cori et al. method with 95% credible intervals for the COVID-19 pandemic in Colombia as of October 31, 2021. The red solid line represents the mean reproduction number for Colombia and the red shaded area represents the 95% credible interval around it. The blue solid line represents the mean reproduction number in the Caribbean region and the blue shaded area represents the 95% credible interval around it. The yellow solid line represents the mean reproduction number in the Pacific region and the yellow shaded area represents the 95% credible interval around it. The brown solid line represents the mean reproduction number in the Orinoquía region and the brown shaded area represents the 95% credible interval around it. The pink solid line represents the mean reproduction number in the Andean region and the pink shaded area represents the 95% credible interval around it. The green solid line represents the mean reproduction number in the Amazon and the green shaded area represents the 95% credible interval around it. Lower panel: The percentage of the COVID-19 cases in Colombia, nationally and regionally as of October 31, 2021. The red solid line represents the percentage of cases in Colombia, the blue solid line represents the percentage of cases in the Caribbean region, the yellow solid line represents the percentage of cases in the Pacific region, the brown dotted line represents the percentage of cases in the Orinoquía region, the pink dashed line represents the percentage of cases in the Andean region and the green dotted line represents the percentage of cases in the Amazon region.

https://doi.org/10.1371/journal.pntd.0010228.g008

At the regional level, majority of the fluctuations in Rt were observed during the first 130 epidemic days (until July 5, 2020) after which estimates of Rt remain at around ~1.0. For the Caribbean region the early estimate of reproduction number was Rt~3.42 (95% CrI: 1.85, 5.69) on March 6, 2020. The reproduction number peaked on March 16, 2020, with an estimate of Rt~3.46 (95% CrI: 2.69, 4.36) and declined thereafter to remain consistently above 1.00 (Rt~1.00–3.21) until July 7, 2020. The reproduction number then fluctuated at ~1.00 until February 11, 2021. Since then, the estimates of Rt have remained between ~0.8–1.5 with the most recent estimate of Rt~0.78 (95% CrI: 0.75, 0.81) as of October 31, 2020. In the Pacific region, the early estimate of reproduction number as estimated on March 5, 2020, was Rt~3.04 (95% CrI: 1.28, 5.96). The reproduction number remained above 1.50 in the next 15 days. This was followed by the estimates of Rt consistently above 1.00 until the end of July 20, 2020, after which Rt fluctuated ~1.00 until February 2021. From March to October 31, 2021, the estimates of Rt remained between 0.9–1.5 with the most recent estimate of Rt~1.01 (95% CrI: 0.09, 1.08) as of October 31, 2021. The reproduction number in the Orinoquía region was estimated at Rt~3.46 (95% CrI: 0.25, 14.97) from March 5, 2020, to March 11, 2020, and declined thereafter to fluctuate ~1.00 until the end of May 2020. From June 1, 2020, till the end of August 2020 the estimates of Rt remained above 1.00 followed by fluctuations in Rt around 1.00. From April 9, 2021, the estimates of Rt have remained below 1.0 with the most recent estimate of Rt~0.73 (95% CrI: 0.59, 0.88) as of October 31, 2021.

The reproduction number in the Andean region peaked at Rt~3.28 (95% CrI: 2.27, 4.55) on March 6, 2020. This was followed by the estimates of reproduction number above 1.00 until July 27, 2020. Since then, Rt has fluctuated ~1.00 until mid-June 2021 after which the estimates of Rt remained consistently below 1.0 until October 10, 2021. The most recent estimate of Rt is at ~0.86 (95% CrI: 0.83, 0.88) as of October 31, 2021. In contrast, the reproduction number in the Amazon region was estimated at Rt~3.46 (95% CrI: 0.25, 14.9) on March 5, 2020. This was followed by the estimates of Rt consistently above 1.00 until May 11, 2020 which have since fluctuated ~1.00 until July 2021. The estimates of Rt remained consistently below 1.00 from July 10, 2021, until September 30, 2021, with the most recent estimate of Rt~0.64 (95% CrI: 0.07, 0.77) as of October 31, 2021 (Fig 8). The estimates of Rt obtained from the mortality curve remain higher than 1.00 for the first fifty days after which Rt fluctuates around 1.00 till October 31, 2021 (S4 Fig). As can be observed in both, the national and the regional level case incidence data and the national-level mortality data, the Cori et al. method tends to overestimate Rt early in the time series as infections occurring before the first date in the time series remain missing terms in the denominator [80].

Estimate of reproduction number, R from genomic data analysis.

Between February and April 2020, 136 analyzed Colombian sequences were sampled. These sequences were spread along the whole global SARS-CoV-2 phylogeny (Fig 9) and split into multiple clusters. This indicates multiple introductions of SARS-CoV-2 to the country during the initial pandemic stage (February 27, 2020 to April 5, 2020). For the largest cluster of size 35, growth rate for the early ascending phase of the COVID-19 pandemic in Colombia was estimated to be ~12.718. Subsequently, the reproduction number was estimated at, R~1.2 (95% HPD interval:[1.03,1.44]) indicating sustained disease transmission during the initial phase of the pandemic.

thumbnail
Fig 9. Global Maximum Likelihood (ML) tree for SARS-CoV-2 genomic data analyzed in the paper.

The leaves of the tree are Colombian sequences of SARS-CoV-2 sampled between February 27, 2020, and April 5, 2020 (highlighted in red) and background sequences sampled from around the globe for genomic context. Colombian sequences are distributed among different lineages, indicating multiple virus introductions. The largest estimated intra-Colombian cluster (i.e., a monophyletic clade consisting solely of Colombian sequences) was analyzed using an exponential growth coalescent model to estimate the intra-country basic reproduction number.

https://doi.org/10.1371/journal.pntd.0010228.g009

Spatial analysis.

Fig 10 shows the result from pre-processing COVID-19 data into growth rate functions. The results of clustering are shown in Fig 11. The four predominant clusters identified include the following departments and districts:

  1. Cluster 1: Antioquia, Caquetá, Cesar, Choco, Magdalena, Norte Santander, Putumayo, Quindío, Risaralda, Sta Marta D.E., Valle del Cauca
  2. Cluster 2: Arauca, Atlántico, Barranquilla, Bogotá, Bolívar, Boyacá, Caldas, Cartagena, Casanare, Cauca, Córdoba, Cundinamarca, La Guajira, Guaviare, Huila, Meta, Nariño, Santander, Sucre, and Tolima
  3. Cluster 3: Vichada
  4. Cluster 4: Amazonas, Guainía, San Andrés, and Vaupés
thumbnail
Fig 10. Pre-processing COVID-19 data into incidence rate functions.

From left to right: the original lab-confirmed COVID-19 cases, curve of daily new cases, smoothed and scaled rate curves, and average of rate curves before scaling and smothing.

https://doi.org/10.1371/journal.pntd.0010228.g010

thumbnail
Fig 11.

Panel A. Clustering of states according to the shapes of their rate curves. Cluster 1 is shown in blue, cluster 2 is shown in red, the smallest cluster, cluster 3 is shown in black whereas cluster 4 is shown in green. One can see that states with similar shapes of rates curves are geographically close to each other. The panel shows the geographic distribution of the clusters created in PaintMaps.com, https://paintmaps.com/map-charts/51/Colombia-map-chart [81]. Panel B. Shows the average shape of the incidence rate curves in each cluster and the overall average. Panel C. Shows the dendrogram plot- a hierarchal clustering of states, which shows that there are four predominant clusters of states.

https://doi.org/10.1371/journal.pntd.0010228.g011

Fig 12 shows the mean growth rate curves and one standard-deviation bands around it, in each cluster. The average incidence patterns in these four clusters are very distinct and clearly visible (Fig 12). For cluster 1, the rate increases rapidly from April 2020 to July 2020 and then declines to increase again at the end of November 2020 followed by another incline at the start of January 2021. The most recent upsurge in growth rate can be observed by the two-peak shape from May-July 2021. For cluster 2, we can observe a three modal growth pattern. There is rapid increase in growth rate in July 2020 followed by a decline and continuous fluctuations. This pattern is followed by two more peaks, one in January 2021 and the other peak can be observed in March and April 2021. Cluster 3 shows a slow growth rate until July 2020 followed by a rapid rise in growth curve until mid-September 2020. This is followed by a gradual decline of the growth curve with fluctuations and another large peak between June to July 2021. For cluster 4, the slow growth rate is followed by a spike in September 2020 and multiple fluctuations. This is followed by another peak in June 2021 (Fig 12).

thumbnail
Fig 12. Average shapes of the COVID-19 incidence rate curves, along with a one standard-deviation band around the average, in each of the clusters.

https://doi.org/10.1371/journal.pntd.0010228.g012

Mobility data

The curve of mobility trends from Apple tracked in the form of driving and walking declined in April 2020 to less than 60% corresponding to multiple interventions focused on the social distancing mandates as implemented by the government of Colombia. Mobility then started to increase in June, peaking in July 2020. Soon after, a decline in mobility was observed in August and September 2020 which was followed by fluctuations in mobility trends around the baseline until the end of June 2021. Thereafter, fluctuations in mobility can be seen trending away from the baseline until October 31, 2021 (Fig 13). Correspondingly it can be observed that as the mobility increased in July 2020, the number of cases also peaked.

thumbnail
Fig 13.

Panel A: Apple mobility trends data showing the variation in walking and driving among the population in Colombia. Orange curve shows the driving trend, and the blue curve shows the walking trend. Panel B: Google mobility trends data showing the variation in movement of people in Colombia across the following categories: grocery and pharmacy (green solid curve), parks (pink solid curve), transit stations (purple solid curve), workplaces (red solid curve) and residential areas (golden solid curve). Panel C: The COVID-19 incidence curve in Colombia by the dates of symptoms onset as of October 31, 2021.

https://doi.org/10.1371/journal.pntd.0010228.g013

The curves of mobility from Google data tracked in the form of visits to grocery and pharmacy, parks, transit stations and workplaces all follow the same pattern; declining in April 2020 to less than 60% incoherence with the Apple mobility data and corresponding to the government’s interventions to contain the spread of the virus. The mobility started to increase gradually thereafter but remained below the baseline until December 2020. The greatest fluctuations in mobility were observed in January 2021, the same time when the cases also peaked. Mobility has since fluctuated around the baseline with an inclining trend in the mobility for transit stations and grocery and pharmacy. The residential category is measured in duration rather than the number of visitors, thus we do not compare it with other five categories. Moreover, since people spend a larger amount of time at home, changes in the residential category remain inconspicuous (Fig 13B).

Twitter data analysis: Social media trends

The epidemic curve for Colombia is overlaid with the curve of tweets indicating the stay-at-home orders in Colombia. In Colombia the engagement of people with the #quedateencasa hashtag (stay-at-home order hashtag) fluctuated at a steady level during the course of the pandemic with a slight decline in the frequency of tweets observed after June 2020 while the number of cases continued to increase, which could point towards the apathy or frustration of the public towards the lockdowns and restrictions. This could also imply that the population does not follow the government’s stay-at-home orders. The decline in the number of tweets indicating stay-at-home orders was followed by a sharp increase in the frequency of tweets in November 2020 which gradually declined until April 2021. A short spike in the frequency of tweets was observed in April 2021 which have since remained around ~500 tweets per day. On the contrary, the cases peaked in July 2020 levelling off at ~7000 cases per day until December 2020 (Fig 14). Another peak in case incidence was observed in January 2021 followed by a sharp decline. This was followed by two upsurges in case incidence in November 2020 and April 2021. The correlation coefficient between the epidemic curve of cases based on the dates of symptom onset and the curve of tweets representing the stay-at-home orders was estimated at R = 0.1797 from March 12, 2020, to October 31, 2021.

thumbnail
Fig 14.

The daily number of tweets indicating stay at home orders (blue) and the daily number of new COVID-19 cases based on the dates of symptom onset (orange) as of October 31, 2021.

https://doi.org/10.1371/journal.pntd.0010228.g014

Discussion

In this study, we characterize the forecasting efforts and spatio-temporal transmission dynamics of the COVID-19 pandemic in Colombia by fitting mathematical models to the national and spatially distributed time series data. Our results point towards the sub-epidemic model as the most accurate model in terms of the calibration and forecasting performances. More importantly, the regional and national level GLM and Richards forecasts point towards a continuous declining trend in the epidemic trajectory in comparison to the sub-epidemic model that is able to reproduce the sustained growth pattern particularly distinguishable for the national, Caribbean, Andean and the Pacific region. Overall, the transmission dynamics show sustained disease transmission during the early phase of the COVID-19 pandemic exhibiting sub-exponential growth dynamics at the regional and national levels. These findings are consistent with the estimates of reproduction number obtained from the genomic data. As the epidemic progressed, fluctuations in Rt were observed with the most recent estimates of Rt<1.00 indicating disease containment. Regarding the geographical distribution in the national territory, Colombian departments can be segregated into four distinct clusters based on the growth rate patterns in each cluster. Moreover, while the increase in mobility patterns in June and July 2020 correspond to the observed peak in case incidence in July 2020, a spike in the number of tweets indicating the stay-at-home orders was observed in November 2020 when the case incidence had levelled off.

Appropriate short-term forecasts at the national and regional levels can help guide the intensity and magnitude of public health interventions required to contain the epidemic. In general, the national and regional short-term forecasts from the Richards model and the GLM point towards a sustained decline in the over-all case counts similar to the forecast produced by the sub-epidemic wave model for the Amazon and Orinoquía region. However, the sub-epidemic model predicts the stabilization in case incidence for the national, Andean, Caribbean, and the Pacific region. On the contrary, the mortality curve forecast predicted by the sub-epidemic wave model shows an increase in the death counts as is projected by the IHME model [82]. In general, the forecasts generated by the GLM, and Richards model predict that national and regional outbreaks have nearly reached extinction (Figs 3 and 4) in comparison to the forecasts generated by the sub-epidemic wave model. Therefore, the projections made using these models should be interpreted with caution given the instability in the reporting patterns and reporting delays. It can be appreciated that the sub-epidemic wave model is able to capture the underlying structure of the sub-epidemic building blocks comprising the coarse scale epidemic [46]. In our analysis the sub-epidemic wave model performs better than the GLM and Richards model in short term forecasts based on the performance metrics (Tables 3 and S2) that account for uncertainty in predictions similar to the findings in Mexico where the sub-epidemic wave model outperformed the GLM and Richards model in short term forecasting the COVID-19 pandemic [49]. The sub-epidemic wave model is also a better fit to the epidemic trajectories during the calibration period compared to the other two models (Tables 2 and S1).

The scaled COVID-19 case incidence based on the dates of symptom onset for all regions and the national data shows a coherent pattern with fluctuations in the percentage of cases around ~0.001 between April 2020 and October 2021 (Fig 8). The epidemic curve of the COVID-19 pandemic displays multimodal peaks as of October 31, 2021. As observed the COVID-19 epidemic peaked in mid-July 2020 followed by a decline in case incidence in mid-September 2020, only to rise again. The national COVID-19 curve declined at the beginning of year 2021. However, by the end of April 2021 another surge in the COVID-19 cases was observed that continued until the end of June 2021, with estimates of Rt ~1.00, indicating continued virus transmission in the country.

The early transmission dynamics of SARS-CoV-2 exhibit similarity at the national and regional levels. The COVID-19 pandemic in Colombia exhibited sub-exponential growth dynamics (0<p<1.00) during the early ascending phase of the outbreak at the national and regional levels. The sub-exponential growth pattern of the COVID-19 pandemic in Colombia can be attributed to a myriad of factors including non-homogenous mixing, spatial structure, population mobility, behavior changes and control interventions such as mask mandates and social distancing to contain the spread of the virus [83]. The results of this study are consistent with the sub-exponential growth patterns of the COVID-19 outbreaks observed in other Latin American countries including Mexico [84] and Chile [39] which also implemented mask mandates and social distancing interventions along with restricting mobility during the early growth phase of the pandemic. Simultaneously, the estimates of early transmission potential (Rt) indicate sustained disease transmission in Colombia at the national and regional levels with Rt>1.00. The results of our genomic analysis for Colombia agree with the sustained disease transmission in the country during the early ascending phase of the pandemic with R>1.00. These estimates indicate that although containment strategies were implemented during the first thirty days to mitigate the impact of the pandemic (Fig 1), additional interventions should be prioritized such as compulsory social distancing and intensified case surveillance. The results of our analysis are compatible with the estimates of early reproduction numbers retrieved from other countries, including Peru, Chile, Brazil and Mexico which followed similar COVID-19 outbreaks around the same time period [39,49,50,85].

We also employ the Cori et al. method to estimate the overall transmission potential (effective reproduction number) throughout the COVID-19 pandemic in Colombia to observe the spatiotemporal variation in Rt across different time points. Indeed multiple factors can be attributed to influence the reproduction number including the transmissibility of an infectious agent, individual contact rates, individual susceptibility and mitigation strategies [57,86]. Our results indicate that the national reproduction number remained substantially high (Rt>1) until the end of July 2020, after which it declined and fluctuated around ~1.0 with no major deviations. The most recent estimates of Rt indicate a decrease in the transmission potential of the virus with Rt estimated at ~0.8, indicating disease containment. At the regional level, Orinoquía, Pacific and Caribbean regions displayed the greatest fluctuation in the transmission potential. Moreover, the estimates of Rt remained between 3.02–3.46 until March 6, 2020, for all regions. As the pandemic progressed and control interventions were put in place, the estimates of Rt declined to fluctuate at around ~1.0. The estimates of Rt remained consistently above 1.00 in the Caribbean, Andean and the Pacific regions until July 2020, after which Rt fluctuated at ~1.00. The most recent estimates of transmission potential, as of October 31, 2021, at the regional level show that the control interventions have a positive impact on the control of the spread of the virus.

In general, Colombia has observed sustained disease transmission of SARS-CoV-2 despite the implementation of social distancing interventions. As can also be seen from Twitter analysis, the number of cases based on the dates of onset are positively correlated with the tweets indicating the stay-at-home orders. A plausible explanation of this phenomenon could be that the people might have stopped following the government’s preventive orders to stay at home as a result of pandemic fatigue or due to confusion regarding the national and regional policies. There was a decline in the mobility profile in Colombia after the implementation of social distancing mandates and declaration of national emergency. However, as the pandemic progressed, the mobility of the public reached a normal level with fluctuations around the baseline. A study reported that the average mobility reduction in Colombia fluctuated during the pandemic, starting at ~80% in April 2020, and reaching ~30% by the end of September 2020 [87]. In comparison Argentina and Chile have reported the most steady mobility reductions of ~80% and 70% respectively during the course of the pandemic [87]. The most recent data shows an increase in mobility, although the case counts seem to be stabilizing at ~1307 cases per day for the month of October 2021.

Analyzing the data at a finer scale provides an opportunity to characterize the growth patterns and appreciate the spatially clustered pattern of COVID-19 that emerges at the departmental level. Our classification of epidemic patterns in Colombia at the department level, including the 4 main districts, shows distinct variations in growth rates across different departments. For instance, cluster 1 including Antioquia, Caquetá, Cesar, Choco, Magdalena, Norte Santander, Putumayo, Quindío, Risaralda, Sta Marta D.E., and Valle del Cauca shows a fluctuating yet multipeak rising pattern in growth rate. In contrast, cluster 3 that includes Vichada and cluster 4 that includes Amazonas, Guainía, and Vaupés show stable growth at lower rates. This information can be utilized by the departments, districts, and regions to guide their decision regarding the implementation of public health measures. For example, departments in cluster 1 show fluctuating increasing growth pattern during the first three months and may need strict public health measures to contain the epidemic compared to cluster 3 which displays slow growth during the first three months. Moreover, as the mobility increased in June and July 2020, an increase in the growth rate patterns can also be observed in cluster 1, 2 and 4 (Figs 12 and 13).

This study has some limitations. First, we excluded the last 11-day case counts from our study in to overcome the influence of reporting delays on our analysis. Specifically, we utilize the case counts based on the dates of onset in this study, because delays in case reporting, testing rates and factors related to the surveillance systems can influence our epidemic projections. Secondly, we relied on the daily updates of cases in the official surveillance system of Colombia, which can sometimes underreport the cases. Third, the phenomenological models applied in this study do not explicitly account for behavioral changes, and thus the results such as the predicted decline or stability in the epidemic trajectory should be interpreted with caution. Fourth, some authors [80] shifted the dates of symptom onset backwards to approximate the date of infection when applying the Cori et al. method to estimate the instantaneous reproduction number. However, in this study we stick to the date of symptoms onset for the sake of comparison across methods to estimate the reproduction number. Lastly, the unpredictable social component of the epidemic on ground was also a limiting factor for the study as we did not know the ground truth epidemic pattern when the forecasts were generated.

In conclusion, Colombia observed a surge in case counts as the mobility increased in July 2020; however, since the end of 2020, the national and regional reproduction numbers have fluctuated around ~1.00 indicating sustained virus transmission in the region. The most recent estimates of Rt<1.00 indicate disease containment as of October 31, 2021. Moreover, the population of Colombia has had mixed reactions towards the stay-at-home orders as can be inferred from the Twitter analysis, contributing to the virus transmission in the country. Importantly, the spatial analysis indicates that departments like Antioquia, Caquetá, Cesar, and Choco need stronger public health strategies to contain the rising patterns in growth rates. Although Richards model and GLM point towards a sustained decline in the case and death counts in general, the sub-epidemic model indicates an upward trend in death counts and a stable case incidence pattern for the national, Pacific, Caribbean, and Andean regions. Hence, the forecasts need to be interpreted with caution given the spatial heterogeneity in transmission rates as well as dynamic implementation and lifting of the social distancing measures. The phenomenological growth models employed in this study to forecast and estimate reproduction numbers are particularly valuable for providing rapid predictions of the epidemics in complex scenarios that can be used for real-time preparedness because these models do not require specific disease transmission processes to account for the interventions.

Supporting information

S3 Text. Model calibration and forecasting approach.

https://doi.org/10.1371/journal.pntd.0010228.s003

(PDF)

S1 Fig. Map of the geographical distribution of departments and the five Colombian regions: the Amazon, the Andean, the Orinoquía, the Caribbean, and the Pacific, edited by the author, scale:1:10000000 with QGIS.org 2020.

QGIS Geographic Information System. QGIS Association. http://www.qgis.org. Using the Colombian department’s division shapefile obtained from https://datosabiertos.esri.co/datasets/colombia-covid19-coronavirus-departamento/explore?location=4.621900%2C-74.297150%2C5.62 (Open Source information provided by the National Institute of Health of Colombia, https://www.ins.gov.co. Publication date: April 1, 2020).

https://doi.org/10.1371/journal.pntd.0010228.s005

(TIFF)

S2 Fig. National mortality curve of COVID-19 for Colombia as of October 31, 2021.

https://doi.org/10.1371/journal.pntd.0010228.s006

(TIF)

S3 Fig. 30-days ahead forecast of the national COVID-19 mortality curve in Colombia by calibrating the Richards, GLM and sub-epidemic wave model from July 04, 2021 to October 01, 2021.

Blue circles correspond to the data points; the solid red line indicates the best model fit, and the red dashed lines represent the 95% prediction interval. The vertical black dashed line represents the time of the start of the forecast period.

https://doi.org/10.1371/journal.pntd.0010228.s007

(TIF)

S4 Fig. Instantaneous reproduction number with 95% credible intervals for the COVID-19 mortality curve in Colombia as of October 31, 2021.

The black solid line represents the mean reproduction number for Colombia and the black shaded area represents the 95% credible interval around it.

https://doi.org/10.1371/journal.pntd.0010228.s008

(TIF)

S1 Table. Comparison of model performance metrics by calibrating the GLM, Richards and the sub-epidemic model for 90 days of mortality data (July 4, 2021 to October 1, 2021).

Higher 95% PI coverage and lower RMSE, MAE, WIS and MIS represent better performance. Best performing model is given in bold with the superscript "a”.

https://doi.org/10.1371/journal.pntd.0010228.s009

(XLSX)

S2 Table. Comparison of 30-day ahead forecasting performance (October 2, 2021 to October 31, 2021) by calibrating the GLM, Richards and the sub-epidemic model for 90 days of mortality data (July 4, 2021 to October 1, 2021).

Higher 95% PI coverage and lower RMSE, MAE, WIS and MIS represent better performance. Best performing model is given in bold with the superscript "a”.

https://doi.org/10.1371/journal.pntd.0010228.s010

(XLSX)

References

  1. 1. Read JM, Bridgen JRE, Cummings DAT, Ho A, Jewell CP. Novel coronavirus 2019-nCoV (COVID-19): early estimation of epidemiological parameters and epidemic size estimates. Philos Trans R Soc Lond B Biol Sci. 2021;376(1829):20200265– pmid:34053269
  2. 2. Reiner RC, Barber RM, Collins JK, Zheng P, Adolph C, Albright J, et al. Modeling COVID-19 scenarios for the United States. Nat Med. 2021;27(1):94–105. pmid:33097835
  3. 3. Schüler L, Calabrese JM, Attinger S. Data driven high resolution modeling and spatial analyses of the COVID-19 pandemic in Germany. PLoS ONE. 2021;16(8):e0254660. pmid:34407071
  4. 4. Burki T. Understanding variants of SARS-CoV-2. The Lancet. 2021;397(10273):462. pmid:33549181
  5. 5. Mallapaty S. Are COVID vaccination programmes working? Scientists seek first clues. Nature. 2021;589:504–5 pmid:33483685
  6. 6. WHO. Coronavirus disease (COVID-19) Weekly Epidemiological Update and Weekly Operational Update. WHO; October 31, 2021.[Accessed on November 2, 2021].Available from: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports.
  7. 7. ECDC. COVID-19 situation update worldwide, as of week 44, updated 11 November 2021. European Centre for Disease Prevention and Control; November 11, 2021.[Accessed on November 11, 2021].Available from: https://www.ecdc.europa.eu/en/geographical-distribution-2019-ncov-cases.
  8. 8. Chowell G, Sattenspiel L, Bansal S, Viboud C. Mathematical models to characterize early epidemic growth: A review. Phys Life Rev. 2016;18:66–97. pmid:27451336
  9. 9. Kapitsinis N. The underlying factors of the COVID-19 spatially uneven spread. Initial evidence from regions in nine EU countries. Reg Sci Policy Pract. 2020;12(6):1027–45.https://doi.org/10.1111/rsp3.12340
  10. 10. Miller IF, Becker AD, Grenfell BT, Metcalf CJE. Disease and healthcare burden of COVID-19 in the United States. Nat Med. 2020;26(8):1212–7. pmid:32546823
  11. 11. Roy S, Dutta R, Ghosh P. Towards Dynamic lockdown strategies controlling pandemic spread under healthcare resource budget. Appl Net Sci. 2021;6(1):2. pmid:33437862
  12. 12. Burki T. COVID-19 in Latin America. Lancet Infect Dis. 2020;20(5):547–8. pmid:32311323
  13. 13. Varela AR, Florez LJH, Tamayo-Cabeza G, Contreras-Arrieta S, Restrepo SR, Laajaj R, et al. Factors Associated With SARS-CoV-2 Infection in Bogota;, Colombia: Results From a Large Epidemiological Surveillance Study. Lancet Reg Health Am. 2021;2. pmid:34458886
  14. 14. Benítez MA, Velasco C, Sequeira AR, Henríquez J, Menezes FM, Paolucci F. Responses to COVID-19 in five Latin American countries. Health Policy Technol. 2020;9(4):525–59. pmid:32874863
  15. 15. OECD. Flattening the COVID-19 peak: Containment and mitigation policies OECD 2020 [Accessed on March 3, 2021].Available from: https://www.oecd.org/coronavirus/policy-responses/flattening-the-covid-19-peak-containment-and-mitigation-policies-e96a4226/.
  16. 16. Press TA. Colombia reaches 1 million confirmed coronavirus cases. ABC News. 2020 October 24;[Accessed on July 23, 2021].Available from: https://abcnews.go.com/International/wireStory/colombia-reaches-million-confirmed-coronavirus-cases-73809991.
  17. 17. Gonzalez E, Harrison C, Hopkins K, Horwitz L, Nagovitch P, Sonneland HK, et al. The Coronavirus in Latin America. AS/COA. 2021
  18. 18. Daniels JP. COVID-19 cases surge in Colombia. The Lancet. 2020;396(10246):227. pmid:32711785
  19. 19. Arellana J, Márquez L, Cantillo V. COVID-19 outbreak in Colombia: An analysis of its impacts on transport systems. J Adv Transp. 2020;2020:8867316.
  20. 20. Trejos-Herrera AM, Vinaccia S, Bahamón MJ. Coronavirus in Colombia: Stigma and quarantine. J Glob Health. 2020;10(2):020372–. pmid:33110564
  21. 21. Mercado L. On September 1, quarantine ends and selective isolation begins. El Tiempo. 2020 August 25;[Accessed on August 26, 2020].Available from: https://www.eltiempo.com/politica/gobierno/ivan-duque-anuncia-aislamiento-selectivo-desde-el-primero-de-septiembre-532872.
  22. 22. MOH. “We are Calling for Selective Isolation with Responsible Individual Distancing”. Minsalud; August 20, 2020.[Accessed on August 29, 2020].Available from: https://www.minsalud.gov.co/English/Paginas/%E2%80%9CWe-are-Calling-for-Selective-Isolation-with-Responsible-Individual-Distancing%E2%80%9D.aspx.
  23. 23. MOH. Ministry of Health Colombia 2021 [Accessed on February 20, 2021].Available from: https://coronaviruscolombia.gov.co/Covid19/index.html.
  24. 24. Laajaj R, De Los Rios C, Sarmiento-Barbieri I, Aristizabal D, Behrentz E, Bernal R, et al. COVID-19 spread, detection, and dynamics in Bogota, Colombia. Nat Commun. 2021;12(1):4726. pmid:34354078
  25. 25. Government defined date of start of vaccination and purchase of more doses. El Tiempo. 2021 January 29;[Accessed on March 20, 2021].Available from: https://www.eltiempo.com/salud/vacunacion-en-colombia-inicia-el-20-de-febrero-asi-lo-aseguro-el-presidente-ivan-duque-563557.
  26. 26. The health emergency in Colombia will last three more months. Portafolio. 2021 February 25;[Accessed on March 3, 2021].Available from: https://www.portafolio.co/economia/gobierno/gobierno-anuncia-que-se-prolonga-la-emergencia-sanitaria-en-colombia-549512.
  27. 27. Reuters. Colombia to extend COVID restrictions as cases rise. Reuters. 2021;[Accessed on April 7, 2021].Available from: https://www.reuters.com/article/us-health-coronavirus-colombia-idUSKBN2BR0MN.
  28. 28. Shultz JM, Berg RC, Bernal Acevedo OA, Ocampo Cañas JA, Escobar VAP, Muñoz O, et al. Complex correlates of Colombia’s COVID-19 surge. Lancet Reg Health Am. 2021;3. pmid:34541569
  29. 29. Daniels JP. ‘Everything is collapsing’: Colombia battles third Covid wave amid unrest. The Guardian. 2021 June 22;[Accessed on June 4, 2021].Available from: https://www.theguardian.com/global-development/2021/jun/22/colombia-covid-coronavirus-third-wave-unrest.
  30. 30. INS. COVID-19 in Colombia Instituto Nacional De Salud 2021 [Accessed on June 2, 2021].Available from: https://www.ins.gov.co/Noticias/Paginas/Coronavirus.aspx.
  31. 31. Covid Performance Index, deconstructing pandemic responses: Lowy Institute; 2021 [Accessed on February 10, 2021].Available from: https://interactives.lowyinstitute.org/features/covid-performance/#methodology.
  32. 32. Colombia MoHaSP. Colombia, the first country in Latin America to have diagnostic tests for the new coronavirus Minsitry of Health and Social Protection2020 [Accessed on February 10, 2021].Available from: https://www.minsalud.gov.co/Paginas/Colombia-primer-pais-de-Latinoamerica-en-tener-pruebas-de-diagnostico-para-el-nuevo-coronavirus.aspx.
  33. 33. Malta M, Vettore MV, da Silva CMFP, Silva AB, Strathdee SA. The foreseen loss of the battle against COVID-19 in South America: A foretold tragedy. E Clin Med. 2021;39. pmid:34377968
  34. 34. Total COVID-19 tests per 1,000 people: Our World in Data; 2020 [Accessed on September 24, 2021].Available from: https://ourworldindata.org/grapher/full-list-cumulative-total-tests-per-thousand?time=2020-02-21..latest&country=BRA~CHL~SLV~MEX~PER.
  35. 35. COVID-19 outbreak: 74 older adults were infected with coronavirus in Manizales. 2020 August 20.[Accessed on March 4, 2021].Available from: https://www.bluradio.com/salud/brote-de-covid-19-74-adultos-mayores-se-contagiaron-con-coronavirus-en-manizales.
  36. 36. Coronavirus: what is the impact in Colombia? Justice for Colombia; 2020 November 5.[Accessed on March 5, 2021].Available from: https://justiceforcolombia.org/news/coronavirus-what-is-the-impact-in-colombia/.
  37. 37. Chowell G, Hincapie-Palacio D, Ospina J, Pell B, Tariq A, Dahal S, et al. Using Phenomenological Models to Characterize Transmissibility and Forecast Patterns and Final Burden of Zika Epidemics. PLoS Curr. 2016;8. pmid:27366586
  38. 38. Pell B, Kuang Y, Viboud C, Chowell G. Using phenomenological models for forecasting the 2015 Ebola challenge. Epidemics. 2018;22:62–70. pmid:27913131
  39. 39. Tariq A, Undurraga EA, Laborde CC, Vogt-Geisse K, Luo R, Rothenberg R, et al. Transmission dynamics and control of COVID-19 in Chile, March-October, 2020. PLoS Neg Trop Dis. 2021;15(1):e0009070. pmid:33481804
  40. 40. Tariq A, Lee Y, Roosa K, Blumberg S, Yan P, Ma S, et al. Real-time monitoring the transmission potential of COVID-19 in Singapore, March 2020. BMC Medicine. 2020;18(1):166. pmid:32493466
  41. 41. Information about departments of Colombia 2018 [Accessed on March 6, 2021].Available from: http://www.colombiainfo.org/en-us/colombia/departments.aspx.
  42. 42. Global initiative on sharing avian influenza data [Internet]. 2021 [Accessed on June 4, 2021].Available from: https://www.gisaid.org/.
  43. 43. Apple. Mobility Trends Reports Apple 2020 [Accessed on November 6, 2021].Available from: https://covid19.apple.com/mobility.
  44. 44. Google. COVID-19 Community Mobility Reports Google 2020 [Accessed on November 6, 2021].Available from: https://www.google.com/covid19/mobility/.
  45. 45. Banda J.M.; Tekumalla R.; Wang G.; Yu J.; Liu T.; Ding Y.; Artemova E.; Tutubalina E.; Chowell G. A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration. Epidemiologia 2021, 2, 315 324. https://doi.org/10.3390/epidemiologia2030024
  46. 46. Chowell G, Tariq A, Hyman JM. A novel sub-epidemic modeling framework for short-term forecasting epidemic waves. BMC Med. 2019;17(1):164. pmid:31438953
  47. 47. Shanafelt DW, Jones G, Lima M, Perrings C, Chowell G. Forecasting the 2001 Foot-and-Mouth Disease Epidemic in the UK. EcoHealth. 2018;15(2):338–47. pmid:29238900
  48. 48. Bürger R, Chowell G, Lara-Díıaz LY. Comparative analysis of phenomenological growth models applied to epidemic outbreaks. Math Biosci Eng. 2019;16(5):4250–73. pmid:31499661
  49. 49. Tariq A, Banda JM, Skums P, Dahal S, Castillo-Garsow C, Espinoza B, et al. Transmission dynamics and forecasts of the COVID-19 pandemic in Mexico, March-December 2020. PLoS ONE. 2021;16(7):e0254826. pmid:34288969
  50. 50. Munayco CV, Tariq A, Rothenberg R, Soto-Cabezas GG, Reyes MF, Valle A, et al. Early transmission dynamics of COVID-19 in a southern hemisphere setting: Lima-Peru: February 29th–March 30th, 2020. Infect Dis Model. 2020;5:338–45. pmid:32399507
  51. 51. Bürger R, Chowell G, Lara-Díaz LY. Measuring differences between phenomenological growth models applied to epidemiology. Math Biosci. 2021;334:108558. pmid:33571534
  52. 52. Richards FJ. A Flexible Growth Function for Empirical Use. J Exp Bot. 1959;10(29):290–300
  53. 53. Kuhn M, Johnson K. Applied predictive modeling. 1st ed. New York: Springer; 2013.
  54. 54. Gneiting T, Raftery AE. Strictly Proper Scoring Rules, Prediction, and Estimation. J Am Stat Assoc. 2007;102(477):359–78.
  55. 55. Bracher J, Ray EL, Gneiting T, Reich NG. Evaluating epidemic forecasts in an interval format. PLoS Comp Biol. 2021;17(2):e1008618. pmid:33577550
  56. 56. Chowell G, Abdirizak F, Lee S, Lee J, Jung E, Nishiura H, et al. Transmission characteristics of MERS and SARS in the healthcare setting: a comparative study. BMC Med. 2015;13(1):210. pmid:26336062
  57. 57. Anderson RM, May RM. Infectious Diseases of Humans. Oxford, editor. Oxford Univeristy Press 1991.
  58. 58. Nishiura H, Chowell G, Heesterbeek H, Wallinga J. The ideal reporting interval for an epidemic to objectively interpret the epidemiological time course. J R Soc Interface. 2010;7(43):297–307. pmid:19570792
  59. 59. Viboud C, Simonsen L, Chowell G. A generalized-growth model to characterize the early ascending phase of infectious disease outbreaks. Epidemics. 2016;15:27–37. pmid:27266847
  60. 60. Ganyani T, Kremer C, Chen D, Torneri A, Faes C, Wallinga J, et al. Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020. Eurosurveillance. 2020;25(17):2000257.doi:https://doi.org/https://doi.org/10.2807/1560-7917.ES.2020.25.17.2000257
  61. 61. Chowell G. Fitting dynamic models to epidemic outbreaks with quantified uncertainty: A primer for parameter uncertainty, identifiability, and forecasts. Infect Dis Model. 2017;2(3):379–98. pmid:29250607
  62. 62. Tariq A, Roosa K, Mizumoto K, Chowell G. Assessing reporting delays and the effective reproduction number: The Ebola epidemic in DRC, May 2018-January 2019. Epidemics. 2019;26:128–33. pmid:30880169
  63. 63. Cori A, Donnelly CA, Dorigatti I, Ferguson NM, Fraser C, Garske T, et al. Key data for outbreak evaluation: building on the Ebola experience. Philos Trans R Soc Lond, B, Biolo Sci. 2017;372(1721):20160371. pmid:28396480
  64. 64. Paine S, Mercer G, Kelly P, Bandaranayake D, Baker M, Huang Q, et al. Transmissibility of 2009 pandemic influenza A(H1N1) in New Zealand: effective reproduction number and influence of age, ethnicity and importations. EuroSurveillance. 2010;15(24) pmid:20576236
  65. 65. Fraser C. Estimating Individual and Household Reproduction Numbers in an Emerging Epidemic. PLoS ONE. 2007;2(8):e758. pmid:17712406
  66. 66. Chong KC, Zee BCY, Wang MH. Approximate Bayesian algorithm to estimate the basic reproduction number in an influenza pandemic using arrival times of imported cases. Travel Med Infect Dis. 2018;23:80–6. pmid:29653203
  67. 67. He X, Lau EHY, Wu P, Deng X, Wang J, Hao X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat Med. 2020;26(5):672–5. pmid:32296168
  68. 68. Wallinga J, Teunis P. Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control Measures. Am J Epidemiol. 2004;160(6):509–16. pmid:15353409
  69. 69. Cori A, Ferguson NM, Fraser C, Cauchemez S. A New Framework and Software to Estimate Time-Varying Reproduction Numbers During Epidemics. Am J Epidemiol. 2013;178(9):1505–12. pmid:24043437
  70. 70. Shu Y, McCauley J. GISAID: Global initiative on sharing all influenza data—from vision to reality. Eurosurveillance 2017;22(13):30494. pmid:28382917
  71. 71. Wu F, Zhao S, Yu B, Chen Y-M, Wang W, Hu Y, et al. Complete genome characterisation of a novel coronavirus associated with severe human respiratory disease in Wuhan, China. bioRxiv.2020.01.24.919183 [preprint].
  72. 72. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7. pmid:15034147
  73. 73. Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol. 2016;33(7):1870–4. pmid:27004904
  74. 74. Suchard MA, Lemey P, Baele G, Ayres DL, Drummond AJ, Rambaut A. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 2018;4(1):vey016. pmid:29942656
  75. 75. Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, et al. Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus–Infected Pneumonia. New Eng J Med. 2020. pmid:31995857
  76. 76. Srivastava A, Chowell G. Understanding Spatial Heterogeneity of COVID-19 Pandemic Using Shape Analysis of Growth Rate Curves. Preprint medRxiv. 2020:2020.05.25.20112433. pmid:32511500
  77. 77. Srivastava A, Klassen EP. Functional and shape data analysis: Springer; 2016.
  78. 78. StatCounter. Stats Counter GlobalStats 2021 [Accessed on June 20, 2021].Available from: https://gs.statcounter.com/social-media-stats/all/chile/#monthly-202006-202106.
  79. 79. Healy K. Apple’s COVID Mobility Data Github: Github; 2020 [Accessed on July 1, 2021].Available from: https://github.com/kjhealy/apple_covid_post/blob/master/apple.Rmd.
  80. 80. Gostic KM, McGough L, Baskerville EB, Abbott S, Joshi K, Tedijanto C, et al. Practical considerations for measuring the effective reproductive number, Rt. PLoS Comp Biol. 2020;16(12):e1008409. pmid:33301457
  81. 81. PaintMaps, cartographer Color Colombia Map2021.[Accessed on November 6, 2021]. Available from: https://paintmaps.com/map-charts/51/Colombia-map-chart.
  82. 82. IHME. COVID-19 Projections 2020 [updated June 10][Accessed on June 10, 2021].Available from: https://covid19.healthdata.org/mexico.
  83. 83. Chowell G, Viboud C, Hyman JM, Simonsen L. The Western Africa ebola virus disease epidemic exhibits both global exponential and local polynomial growth rates. PLoS Curr. 2015;7. pmid:25685633
  84. 84. Mendoza CI. Inhomogeneous Transmission and Asynchronic Mixing in the Spread of COVID-19 Epidemics. Front Phys. 2021;9(324).
  85. 85. Felix FHC, Fontenele JB. Instantaneous R calculation for COVID-19 epidemic in Brazil. medRxiv.2020.04.23.20077172 [preprint].
  86. 86. Shim E, Tariq A, Choi W, Lee Y, Chowell G. Transmission potential and severity of COVID-19 in South Korea. Int Infect Dis. 2020;93:339–44. pmid:32198088
  87. 87. Martinez-Valle A. Public health matters: why is Latin America struggling in addressing the pandemic? J Public Health Policy. 2021;42(1):27–40. pmid:33510400