Recommendation

The importance of model assumptions in estimating the dynamics of the COVID-19 epidemic

Valery Forbes based on reviews by Bastien Boussau and 1 anonymous reviewer

A recommendation of:

Estimating dates of origin and end of COVID-19 epidemics

Thomas Bénéteau, Baptiste Elie, Mircea T. Sofonea, Samuel Alizon (2021), medRxiv, 2021.01.19.21250080, ver. 3 peer-reviewed and recommended by Peer Community in Mathematical and Computational Biology https://doi.org/10.1101/2021.01.19.21250080

Read preprint in preprint server Now published in Peer Community Journal

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

Estimating dates of origin and end of COVID-19 epidemics

Estimating the date at which an epidemic started in a country and the date at which it can end depending on interventions intensity are important to guide public health responses. Both are potentially shaped by similar factors including stochasticity (due to small population sizes), superspreading events, and ‘memory effects’ (the fact that the occurrence of some events, e.g. recovering from an infection, depend on the past, e.g. the number of days since the infection). Focusing on COVID-19 epidemics, we develop and analyse mathematical models to explore how these three factors may affect early and final epidemic dynamics. Regarding the date of origin, we find limited effects on the mean estimates, but strong effects on their variances. Regarding the date of extinction following lockdown onset, mean values decrease with stochasticity or with the presence of superspreading events. These results underline the importance of accounting for heterogeneity in infection history and transmission patterns to accurately capture early and late epidemic dynamics.

COVID-19, lockdown, SARS-Cov2, stochastic, non-markovian, epidemy modeling

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

تقدير تاريخ ظهور ونهاية أوبئة كوفيد-19

إن تقدير التاريخ الذي بدأ فيه الوباء في بلد ما والتاريخ الذي يمكن أن ينتهي فيه اعتمادًا على شدة التدخلات أمر مهم لتوجيه استجابات الصحة العامة. من المحتمل أن يتشكل كلاهما بعوامل مماثلة بما في ذلك العشوائية (بسبب صغر حجم السكان)، وأحداث الانتشار الفائق، و"تأثيرات الذاكرة" (حقيقة أن حدوث بعض الأحداث، مثل التعافي من العدوى، يعتمد على الماضي، على سبيل المثال عدد الحالات) أيام منذ الإصابة). من خلال التركيز على أوبئة كوفيد-19، نقوم بتطوير وتحليل النماذج الرياضية لاستكشاف كيفية تأثير هذه العوامل الثلاثة على ديناميكيات الوباء المبكرة والنهائية. أما بالنسبة لتاريخ المنشأ فنجد تأثيرات محدودة على متوسطات التقديرات، ولكن تأثيرات قوية على تبايناتها. فيما يتعلق بتاريخ الانقراض بعد بداية الإغلاق، تنخفض القيم المتوسطة مع العشوائية أو مع وجود أحداث الانتشار الفائق. تؤكد هذه النتائج على أهمية مراعاة عدم التجانس في تاريخ الإصابة وأنماط انتقال العدوى من أجل التقاط ديناميكيات الوباء المبكر والمتأخر بدقة.

كوفيد-19، الإغلاق، سارس-Cov2، العشوائية، غير الماركوفية، النمذجة الوبائية

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Estimación de fechas de origen y fin de las epidemias de COVID-19

Estimar la fecha en la que comenzó una epidemia en un país y la fecha en la que puede terminar dependiendo de la intensidad de las intervenciones es importante para guiar las respuestas de salud pública. Ambos están potencialmente determinados por factores similares, incluida la estocasticidad (debido al pequeño tamaño de la población), eventos de superpropagación y "efectos de memoria" (el hecho de que la ocurrencia de algunos eventos, por ejemplo, la recuperación de una infección, dependa del pasado, por ejemplo, el número de días desde la infección). Centrándonos en las epidemias de COVID-19, desarrollamos y analizamos modelos matemáticos para explorar cómo estos tres factores pueden afectar la dinámica epidémica temprana y final. En cuanto a la fecha de origen, encontramos efectos limitados sobre las estimaciones medias, pero fuertes efectos sobre sus varianzas. En cuanto a la fecha de extinción tras el inicio del confinamiento, los valores medios disminuyen con la estocasticidad o con la presencia de eventos de superpropagación. Estos resultados subrayan la importancia de tener en cuenta la heterogeneidad en el historial de infección y los patrones de transmisión para capturar con precisión la dinámica epidémica temprana y tardía.

COVID-19, confinamiento, SARS-Cov2, estocástico, no markoviano, modelado de epidemias

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Estimation des dates d’origine et de fin des épidémies de COVID-19

Estimer la date à laquelle une épidémie a commencé dans un pays et la date à laquelle elle peut se terminer en fonction de l'intensité des interventions est important pour orienter les réponses de santé publique. Les deux sont potentiellement façonnés par des facteurs similaires, notamment la stochasticité (due à la petite taille des populations), les événements de grande propagation et les « effets de mémoire » (le fait que la survenue de certains événements, par exemple la guérison d'une infection, dépend du passé, par exemple le nombre d'événements). jours depuis l’infection). En nous concentrant sur les épidémies de COVID-19, nous développons et analysons des modèles mathématiques pour explorer comment ces trois facteurs peuvent affecter la dynamique épidémique précoce et finale. Concernant la date d’origine, on retrouve des effets limités sur les estimations moyennes, mais des effets forts sur leurs variances. Concernant la date d’extinction après le début du confinement, les valeurs moyennes diminuent avec la stochasticité ou avec la présence d’événements de super-propagation. Ces résultats soulignent l'importance de tenir compte de l'hétérogénéité de l'histoire de l'infection et des modes de transmission pour capturer avec précision la dynamique épidémique précoce et tardive.

COVID-19, confinement, SARS-Cov2, stochastique, non markovien, modélisation épidémique

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

किसी देश में महामारी शुरू होने की तारीख और हस्तक्षेप की तीव्रता के आधार पर यह समाप्त होने की तारीख का अनुमान लगाना सार्वजनिक स्वास्थ्य प्रतिक्रियाओं का मार्गदर्शन करने के लिए महत्वपूर्ण है। दोनों संभावित रूप से समान कारकों से आकार लेते हैं जिनमें स्टोचैस्टिसिटी (छोटी आबादी के आकार के कारण), सुपरस्प्रेडिंग घटनाएं, और 'मेमोरी प्रभाव' (तथ्य यह है कि कुछ घटनाओं की घटना, उदाहरण के लिए संक्रमण से उबरना, अतीत पर निर्भर करती है, उदाहरण के लिए संख्या) संक्रमण के बाद से दिन)। COVID-19 महामारी पर ध्यान केंद्रित करते हुए, हम यह पता लगाने के लिए गणितीय मॉडल विकसित और विश्लेषण करते हैं कि ये तीन कारक प्रारंभिक और अंतिम महामारी की गतिशीलता को कैसे प्रभावित कर सकते हैं। उत्पत्ति की तारीख के संबंध में, हम औसत अनुमानों पर सीमित प्रभाव पाते हैं, लेकिन उनके भिन्नताओं पर मजबूत प्रभाव पाते हैं। लॉकडाउन की शुरुआत के बाद विलुप्त होने की तारीख के संबंध में, स्टोचैस्टिसिटी के साथ या सुपरस्प्रेडिंग घटनाओं की उपस्थिति के साथ औसत मूल्य कम हो जाते हैं। ये परिणाम प्रारंभिक और देर से महामारी की गतिशीलता को सटीक रूप से पकड़ने के लिए संक्रमण के इतिहास और संचरण पैटर्न में विविधता के लिए लेखांकन के महत्व को रेखांकित करते हैं। 0037961बीफ248229एफसीई6एफएएफ3ई72539ए COVID-19 महामारी की उत्पत्ति और समाप्ति की तारीखों का अनुमान लगाना ab73ebc2f03b44c5ad8ccb61ee2031ba COVID-19, लॉकडाउन, SARS-Cov2, स्टोकेस्टिक, गैर-मार्कोवियन, महामारी मॉडलिंग

COVID-19, लॉकडाउन, SARS-Cov2, स्टोकेस्टिक, गैर-मार्कोवियन, महामारी मॉडलिंग

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

新型コロナウイルス感染症の流行の発生日と終息日の推定

ある国で流行が始まった日と、介入の強度に応じて流行が終息する日を推定することは、公衆衛生上の対応を導く上で重要です。どちらも潜在的に、確率性（人口サイズが小さいため）、超拡散事象、および「記憶効果」（感染からの回復などのいくつかの事象の発生が過去に依存するという事実、例えば感染者の数など）を含む同様の要因によって形成される可能性があります。感染から数日）。私たちは、新型コロナウイルス感染症の流行に焦点を当て、これら 3 つの要因が初期および最終的な流行のダイナミクスにどのような影響を与えるかを調査するために、数学的モデルを開発および分析しています。起源の日付に関しては、平均推定値への影響は限定的ですが、その分散には強い影響があることがわかりました。ロックダウン開始後の絶滅の日付に関しては、平均値は確率論または超拡散現象の存在とともに減少します。これらの結果は、初期および後期の流行のダイナミクスを正確に把握するために、感染履歴と感染パターンの不均一性を考慮することの重要性を強調しています。

COVID-19、ロックダウン、SARS-Cov2、確率的、非マルコヴィアン、疫病モデリング

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Estimativa de datas de origem e fim das epidemias de COVID-19

Estimar a data em que uma epidemia começou num país e a data em que pode terminar, dependendo da intensidade das intervenções, é importante para orientar as respostas de saúde pública. Ambos são potencialmente moldados por fatores semelhantes, incluindo estocasticidade (devido ao pequeno tamanho da população), eventos de superpropagação e “efeitos de memória” (o fato de que a ocorrência de alguns eventos, por exemplo, recuperação de uma infecção, depende do passado, por exemplo, o número de dias desde a infecção). Com foco nas epidemias de COVID-19, desenvolvemos e analisamos modelos matemáticos para explorar como esses três fatores podem afetar a dinâmica inicial e final da epidemia. Em relação à data de origem, encontramos efeitos limitados nas estimativas médias, mas fortes efeitos nas suas variâncias. Em relação à data de extinção após o início do confinamento, os valores médios diminuem com a estocasticidade ou com a presença de eventos de superpropagação. Estes resultados sublinham a importância de ter em conta a heterogeneidade no histórico de infeções e nos padrões de transmissão para capturar com precisão a dinâmica epidémica precoce e tardia.

COVID-19, bloqueio, SARS-Cov2, estocástico, não-markoviano, modelagem de epidemias

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Оценка дат возникновения и окончания эпидемий COVID-19

Оценка даты начала эпидемии в стране и даты, когда она может закончиться, в зависимости от интенсивности мер вмешательства важна для определения ответных мер общественного здравоохранения. Оба потенциально формируются схожими факторами, включая стохастичность (из-за небольших размеров популяции), события сверхраспространения и «эффекты памяти» (тот факт, что возникновение некоторых событий, например, выздоровления от инфекции, зависит от прошлого, например, количества дней с момента заражения). Сосредоточив внимание на эпидемиях COVID-19, мы разрабатываем и анализируем математические модели, чтобы изучить, как эти три фактора могут повлиять на раннюю и конечную динамику эпидемии. Что касается даты происхождения, мы обнаруживаем ограниченное влияние на средние оценки, но сильное влияние на их дисперсию. Что касается даты исчезновения после начала карантина, средние значения уменьшаются по мере стохастичности или при наличии событий сверхраспространения. Эти результаты подчеркивают важность учета неоднородности истории инфекции и моделей передачи для точного отражения ранней и поздней динамики эпидемии.

COVID-19, локдаун, SARS-Cov2, стохастический, немарковский, моделирование эпидемии

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

估计 COVID-19 流行病的起源和结束日期

根据干预强度估计流行病在一个国家开始的日期和结束的日期对于指导公共卫生应对措施非常重要。两者都可能受到类似因素的影响，包括随机性（由于人口规模小）、超级传播事件和“记忆效应”（某些事件的发生，例如从感染中恢复，取决于过去，例如感染的数量）感染后的天数）。我们专注于 COVID-19 流行病，开发并分析数学模型，以探讨这三个因素如何影响早期和最终的流行病动态。关于起始日期，我们发现对均值估计的影响有限，但对其方差的影响很大。关于封锁开始后的灭绝日期，平均值会随着随机性或超级传播事件的存在而降低。这些结果强调了考虑感染史和传播模式的异质性以准确捕捉早期和晚期流行动态的重要性。

COVID-19、封锁、SARS-Cov2、随机、非马尔可夫、流行病模型

Submission: posted 23 February 2021
Recommendation: posted 04 July 2021, validated 27 July 2021

Cite this recommendation as:
Forbes, V. (2021) The importance of model assumptions in estimating the dynamics of the COVID-19 epidemic. Peer Community in Mathematical and Computational Biology, 100004. https://doi.org/10.24072/pci.mcb.100004

Recommendation

In “Estimating dates of origin and end of COVID-19 epidemics”, Bénéteau et al. develop and apply a mathematical modeling approach to estimate the date of the origin of the SARS-CoV-2 epidemic in France. They also assess how long strict control measures need to last to ensure that the prevalence of the virus remains below key public health thresholds. This problem is challenging because the numbers of infected individuals in both tails of the epidemic are low, which can lead to errors when deterministic models are used. To achieve their goals, the authors developed a discrete stochastic model. The model is non-Markovian, meaning that individual infection histories influence the dynamics. The model also accounts for heterogeneity in the timing between infection and transmission and includes stochasticity as well as consideration of superspreader events. By comparing the outputs of their model with several alternative models, Bénéteau et al. were able to assess the importance of stochasticity, individual heterogeneity, and non-Markovian effects on the estimates of the dates of origin and end of the epidemic, using France as a test case. Some limitations of the study, which the authors acknowledge, are that the time from infection to death remains largely unknown, a lack of data on the heterogeneity of transmission among individuals, and the assumption that only a single infected individual caused the epidemic. Despite the acknowledged limitations of the work, the results suggest that cases may be detected long before the detection of an epidemic wave. Also, the approach may be helpful for informing public health decisions such as the necessary duration of strict lockdowns and for assessing the risks of epidemic rebound as restrictions are lifted. In particular, the authors found that estimates of the end of the epidemic following lockdowns are more sensitive to the assumptions of the models used than estimates of its beginning. In summary, this model adds to a valuable suite of tools to support decision-making in response to disease epidemics.

References

Bénéteau T, Elie B, Sofonea MT, Alizon S (2021) Estimating dates of origin and end of COVID-19 epidemics. medRxiv, 2021.01.19.21250080, ver. 3 peer-reviewed and recommended by Peer Community in Mathematical and Computational Biology. https://doi.org/10.1101/2021.01.19.21250080

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Reviews

Evaluation round #1

DOI or URL of the preprint: 10.1101/2021.01.19.21250080

Version of the preprint: 1

Author's Reply, 23 Jun 2021

Download author's reply Download tracked changes file

Dear Dr Forbes,

We read the reviewer’s comments and suggestions with great interest. We thank the reviewers for their suggestions, which helped to further improve our manuscript in the following ways.

First, we clarified the mathematical writing of the model, removing the unnecessary forms, and further defining the terms used. Second, we included a new analysis looking at the time until incidence reaches a threshold below which local measures would be sufficient to control the epidemic, following Dr Rousseau’s suggestions. Third, we followed the reviewer’s recommendations to correct for the writing mistakes, and in particular we made available the gitlab repository where all codes and raw simulation results are available.

In the response file, you will find our detailed comments to the issues raised by the reviewers. We also attach a .pdf file where the changes made are highlighted.

We herewith resubmit our manuscript and we hope that it is now acceptable for publication.

Awaiting your decision,

Thomas Bénéteau, Baptiste Elie, Mircea T. Sofonea, Samuel Alizon

https://doi.org/10.24072/pci.mcb.100070.ar1

Decision by Valery Forbes, posted 22 Apr 2021

Dear Authors,

We have received two very thoughtful and detailed reviews of your manuscript. I would ask that you revise your preprint and indicate in a separate document how you have addressed each of the reviewers' comments. We look forward to receiving your revised preprint.

Sincerely,

Valery Forbes

https://doi.org/10.24072/pci.mcb.100070.d1

Reviewed by anonymous reviewer 1, 26 Mar 2021

Download the review https://doi.org/10.24072/pci.mcb.100070.rev11

Reviewed by Bastien Boussau, 22 Apr 2021

Bénéteau et al. investigate the estimations by several models of the dates of the beginning and the end of the SARS-CoV-2 epidemic in France. This is a difficult problem as the number of infected people on both tails of the epidemic is low, meaning that assumptions at the heart of commonly-used SIR-based deterministic models become inappropriate. They propose a new stochastic model, a version of which includes superspreaders, and compare the estimates of this model to a deterministic SIR-like model and to another published deterministic model that includes age stratification. They find that estimates of the end of the epidemic following lockdowns are more sensitive to the assumptions of the models used than estimates of its beginning.

General comments
The manuscript was most of the time clearly written and easy to follow. However, some figures were difficult to interpret, and in some cases the description of the results seemed to include mistakes (see specific comments). In spite of these mistakes, the results appeared convincing. I could not find links to the data or the implementation of the models to reproduce the results. Finally, I believe the discussion could be extended a bit as I explain below.

The reliance on several models allows for testing the influence of different factors, including superspreaders, age structure, and memory in the time from hospitalization to death. However, these models all rely on different implementations, and differ in several respects, making their comparisons difficult. It might have been cleaner to use one framework to implement all models and compare them by changing one parameter at a time; for instance, some Bayesian models that have been proposed in the literature on SARS-CoV-2 might be amenable to such an investigation. Nonetheless, the fact that the different models agree in a lot of their predictions suggests that the results would probably have been the same, and the reliance on several implementations also protects against implementation-specific bugs.

Among the results that stand out is the fact that several months of lock-down are necessary to reach extinction of the epidemic. This is not unexpected, but the relevance of it to public health is little discussed in the manuscript. In two places the authors mention "an audience not familiar with stochasticity"; if this means e.g. public health officials or the general public, then more discussion should be included. In particular, I believe that the relationship between the authors' result and the feasibility of the "zero-Covid" strategy should be discussed, as a cursory reading of the manuscript may be interpreted as an argument against the strategy.

Along similar lines, it seems a bit much to ask of a lock-down that it brings an epidemic to its extinction, especially when the epidemic is tackled a bit late. Would a different objective, i.e. that of reaching daily incidence levels that are compatible with a zero-covid-like strategy (control points, local lock-downs) also require several months of lock-down? Would the modeling approach proposed by the authors suffice to answer such a question, if the data are available?

Specific comments
p3: "Finally, we analyse a classical deterministic Markovian model, which is commonly used to analyse COVID-19 epidemics [? ]." : missing reference
p4: "(see Figure S6)" : this is the first reference to a figure; it would probably make sense that this is Fig. S1, not S6.
p4: "a value much higher than the outbreak threshold above which a stochastic fade out is unlikely [10]": the number of daily deaths is not directly comparable to the outbreak threshold values provided in the reference cited. It would be convenient for the reader to detail the computations that ensure that the value chosen is much higher than the outbreak threshold.
Table S1: "Shape parameter (Gamma distribution)" : in this table, could the reader be reminded that the Gamma distribution is used to model heterogeneity in infectivity and/or infection duration?
Supp mat p3: "where η n measures the public health intervention impacts on the disease spread at day n,": for consistency with the stochastic model, perhaps it would be clearer to use t for the day?
Fig. S1: the legend to this figure should at least explain the meaning of the compartments, and possibly the parameters.
Supp mat p3-4: "We compared this model to the discrete time non-markovian model, and a SEAIRH4D model in which memory in the delay from hospitalization to death is implemented" : I find this description too short to really understand what was done, and the meaning of the acronym SEAIRH4D should be provided.
Supp mat p3: "The set of ODE shown in the previous paragraph is solved
using ’odeint’ function from Numpy on Python 3.8.3.": Is the code for the deterministic models available? If so it could be stated here.
Supp mat p3: "We estimated the following parameters for the
SEAIRHD model using a maximum likelihood procedure" : could the authors provide the likelihood formula and specify what algorithm was used to maximize the likelihood?
Figure S4: "Generation time standard deviation impact on the starting date inference.": there is an inconsistency between the y axis that states "Serial interval standard deviation" and the legend.
Figure S5: I assume a serial interval of 2.3 was used? It would be useful to point it out.
Supp mat p7: "We can see that only the importation of
new infected individuals during the first days has an impact on the epidemic.": I do not understand how this conclusion is reached: is it by comparison of Figs. S4 and S5? I would need more details on the reasoning and possibly another figure to understand this.
p5: "with an estimated efficacy of 1 -\eta_{FR}= 76% [21]." : it would be good to define \eta_{FR} here rather than a few lines later.
p6: "finite lock-down extensions on the the probability": too many "the"s
p6: So \tau is defined per simulation, and p_0(t) is averaged over all simulations?
p6: "SEAIRHD" : This model does not include the possibility that asymptomatic individuals become recovered without ever becoming symptomatic, which is a big feature of Covid. Could the authors comment on the expected importance of the lack of such a feature?
p6: "Scripts for the SEAIRHD model can be found in the supplementary materials.": I have not found them.
p7: "the same as in our model" : the same as in our DS model
p7: "The likelihood of the deterministic SEAIRHD model was computed assuming a Poisson distribution of the daily mortality incidence data." : I think it would be good to explain how parameter inference was achieved with the non-Markovian deterministic model.
p7: "the time mortality incidence reaches" : I think it would help to remind the reader that this date is March 23.
p7: "67 days (equivalent to a first case on January 16 in France), with a 95% confidence interval (95% CI) between 62 and 79 days" : the numbers given in this section do not seem to match Fig. 1 "DS without heterogeneity". Was there an inversion in the names of the violin/boxplots between with and without heterogeneity?
p8: "However, consistently with earlier studies [21? ]" : missing reference
p8: "the median delay for daily incidence to reach 100 deaths is decreased by 5 days when the serial interval standard deviation is decreased by one third (Fig. S4).": isn't it the opposite?
p8: "However, when assuming a more realistic scenario where all those cases are not imported on the same day, we find a much more limited impact on the delay" : I find it hard to be convinced, looking at the figures and trying to compare the two panels of Fig. S5. Could the authors provide trends or numbers, or maybe an additional supplementary figure, that would precisely convey this information?
p9 "Time to eradication": in this section a few comments about the results of the SEAIRHD model would be useful.
p10: "The results are shown in Figures 3 for the case without host heterogeneity and Fig. S8 with super-spreading events." : it is not clear to me why the authors chose to show the results of the superspreading model in supplementary material and the results of the model without superspreading in main? I would have expected the reverse.
p12: "as stressed by earlier studies [21? ]." : missing reference
p13: "higher k parameter value that the one used here (0.30 versus 0.16 here)" : than instead of that

https://doi.org/10.24072/pci.mcb.100070.rev12

User comments

No user comments yet

or Register
Submit a preprint