Recommendation

Importance of age structure on modeling COVID-19 epidemiological dynamics

Chen Liao based on reviews by Facundo Muñoz, Kevin Bonham and 1 anonymous reviewer

A recommendation of:

Non-Markovian modelling highlights the importance of age structure on Covid-19 epidemiological dynamics

Bastien Reyné, Quentin Richard, Camille Noûs, Christian Selinger, Mircea T. Sofonea, Ramsès Djidjou-Demasse, Samuel Alizon (2022), medRxiv, 2021.09.30.21264339, ver. 3 peer-reviewed and recommended by Peer Community in Mathematical and Computational Biology https://doi.org/10.1101/2021.09.30.21264339

Read preprint in preprint server Now published in a journal

Data used for results

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

Non-Markovian modelling highlights the importance of age structure on Covid-19 epidemiological dynamics

The Covid-19 pandemic outbreak was followed by a huge amount of modelling studies in order to rapidly gain insights to implement the best public health policies. Most of these compartmental models involved ordinary differential equations (ODEs) systems. Such a formalism implicitly assumes that the time spent in each compartment does not depend on the time already spent in it, which is at odds with the clinical data. To overcome this “memoryless” issue, a widely used solution is to increase and chain the number of compartments of a unique reality (e.g. have infected individual move between several compartments). This allows for greater heterogeneity and thus be closer to the observed situation, but also tends to make the whole model more difficult to apprehend and parameterize. We develop a non-Markovian alternative formalism based on partial differential equations (PDEs) instead of ODEs, which, by construction, provides a memory structure for each compartment thereby allowing us to limit the number of compartments. We apply our model to the French 2021 SARS-CoV-2 epidemic and, while accounting for vaccine-induced and natural immunity, we analyse and determine the major components that contributed to the Covid-19 hospital admissions. The results indicate that the observed vaccination rate alone is not enough to control the epidemic, and a global sensitivity analysis highlights a huge uncertainty attributable to the age-structured contact matrix. Our study shows the flexibility and robustness of PDE formalism to capture national COVID-19 dynamics and opens perspectives to study medium or long-term scenarios involving immune waning or virus evolution.

epidemiology, infectious diseases modelling, contact matrix, partial differential equations, Covid-19

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

تسلط النمذجة غير الماركوفية الضوء على أهمية البنية العمرية في الديناميكيات الوبائية لـ Covid-19

أعقب تفشي جائحة كوفيد-19 كمية هائلة من دراسات النمذجة من أجل اكتساب رؤى سريعة لتنفيذ أفضل سياسات الصحة العامة. تتضمن معظم هذه النماذج المجزأة أنظمة المعادلات التفاضلية العادية (ODEs). تفترض هذه الشكلية ضمنًا أن الوقت الذي يقضيه في كل حجرة لا يعتمد على الوقت الذي يقضيه فيها بالفعل، وهو ما يتعارض مع البيانات السريرية. للتغلب على هذه المشكلة "عديمة الذاكرة"، يتمثل الحل المستخدم على نطاق واسع في زيادة عدد أجزاء الواقع الفريد وتسلسلها (على سبيل المثال، انتقال الفرد المصاب بين عدة أجزاء). وهذا يسمح بقدر أكبر من عدم التجانس وبالتالي يكون أقرب إلى الوضع المرصود، ولكنه يميل أيضًا إلى جعل النموذج بأكمله أكثر صعوبة في الفهم وتحديد المعلمات. لقد قمنا بتطوير صيغة بديلة غير ماركوفية تعتمد على المعادلات التفاضلية الجزئية (PDEs) بدلاً من المعادلات التفاضلية الجزئية، والتي، من خلال بنائها، توفر بنية ذاكرة لكل حجرة مما يسمح لنا بالحد من عدد الأجزاء. نحن نطبق نموذجنا على وباء سارس-كوف-2 الفرنسي لعام 2021، ومع مراعاة المناعة الطبيعية والناجمة عن اللقاح، نقوم بتحليل وتحديد المكونات الرئيسية التي ساهمت في دخول مرضى كوفيد-19 إلى المستشفيات. وتشير النتائج إلى أن معدل التطعيم المرصود وحده لا يكفي للسيطرة على الوباء، ويسلط تحليل الحساسية العالمية الضوء على قدر كبير من عدم اليقين الذي يعزى إلى مصفوفة الاتصال المنظمة حسب العمر. تُظهر دراستنا مرونة وقوة شكلية PDE لالتقاط ديناميكيات COVID-19 الوطنية وتفتح وجهات نظر لدراسة السيناريوهات المتوسطة أو الطويلة الأجل التي تنطوي على تراجع المناعة أو تطور الفيروس.

علم الأوبئة، نمذجة الأمراض المعدية، مصفوفة الاتصال، المعادلات التفاضلية الجزئية، كوفيد-19

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

El modelo no markoviano destaca la importancia de la estructura de edades en la dinámica epidemiológica de Covid-19

El brote pandémico de Covid-19 fue seguido por una gran cantidad de estudios de modelización con el fin de obtener rápidamente información para implementar las mejores políticas de salud pública. La mayoría de estos modelos compartimentales involucraban sistemas de ecuaciones diferenciales ordinarias (EDO). Tal formalismo supone implícitamente que el tiempo pasado en cada compartimento no depende del tiempo ya pasado en él, lo que contradice los datos clínicos. Para superar este problema de “falta de memoria”, una solución ampliamente utilizada es aumentar y encadenar el número de compartimentos de una realidad única (por ejemplo, hacer que un individuo infectado se mueva entre varios compartimentos). Esto permite una mayor heterogeneidad y, por lo tanto, estar más cerca de la situación observada, pero también tiende a hacer que todo el modelo sea más difícil de comprender y parametrizar. Desarrollamos un formalismo alternativo no markoviano basado en ecuaciones diferenciales parciales (PDE) en lugar de EDO, que, por construcción, proporciona una estructura de memoria para cada compartimento, lo que nos permite limitar el número de compartimentos. Aplicamos nuestro modelo a la epidemia francesa de SARS-CoV-2 de 2021 y, teniendo en cuenta la inmunidad natural e inducida por la vacuna, analizamos y determinamos los componentes principales que contribuyeron a las admisiones hospitalarias por Covid-19. Los resultados indican que la tasa de vacunación observada por sí sola no es suficiente para controlar la epidemia, y un análisis de sensibilidad global destaca una enorme incertidumbre atribuible a la matriz de contactos estructurada por edad. Nuestro estudio muestra la flexibilidad y solidez del formalismo PDE para capturar la dinámica nacional de COVID-19 y abre perspectivas para estudiar escenarios a mediano o largo plazo que involucran una disminución inmune o la evolución del virus.

epidemiología, modelización de enfermedades infecciosas, matriz de contactos, ecuaciones diferenciales parciales, Covid-19

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

La modélisation non markovienne met en évidence l’importance de la structure par âge sur la dynamique épidémiologique du Covid-19

L'épidémie de pandémie de Covid-19 a été suivie d'un grand nombre d'études de modélisation afin d'obtenir rapidement des informations permettant de mettre en œuvre les meilleures politiques de santé publique. La plupart de ces modèles compartimentés impliquaient des systèmes d’équations différentielles ordinaires (ODE). Un tel formalisme suppose implicitement que le temps passé dans chaque compartiment ne dépend pas du temps déjà passé dans celui-ci, ce qui est en contradiction avec les données cliniques. Pour surmonter ce problème de « sans mémoire », une solution largement utilisée consiste à augmenter et enchaîner le nombre de compartiments d'une réalité unique (par exemple, un individu infecté se déplace entre plusieurs compartiments). Cela permet une plus grande hétérogénéité et donc d'être plus proche de la situation observée, mais tend également à rendre l'ensemble du modèle plus difficile à appréhender et à paramétrer. Nous développons un formalisme alternatif non markovien basé sur des équations aux dérivées partielles (EDP) au lieu d'EDO, qui, par construction, fournit une structure mémoire pour chaque compartiment nous permettant ainsi de limiter le nombre de compartiments. Nous appliquons notre modèle à l’épidémie française de SRAS-CoV-2 en 2021 et, tout en tenant compte de l’immunité induite par le vaccin et naturelle, nous analysons et déterminons les principales composantes qui ont contribué aux hospitalisations liées au Covid-19. Les résultats indiquent que le taux de vaccination observé ne suffit pas à lui seul à contrôler l’épidémie, et une analyse de sensibilité globale met en évidence une énorme incertitude attribuable à la matrice de contact structurée par âge. Notre étude montre la flexibilité et la robustesse du formalisme PDE pour capturer la dynamique nationale du COVID-19 et ouvre des perspectives pour étudier des scénarios à moyen ou long terme impliquant un déclin immunitaire ou une évolution du virus.

épidémiologie, modélisation des maladies infectieuses, matrice de contact, équations aux dérivées partielles, Covid-19

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

गैर-मार्कोवियन मॉडलिंग कोविड-19 महामारी विज्ञान की गतिशीलता पर आयु संरचना के महत्व पर प्रकाश डालता है

कोविड-19 महामारी के प्रकोप के बाद सर्वोत्तम सार्वजनिक स्वास्थ्य नीतियों को लागू करने के लिए तेजी से अंतर्दृष्टि प्राप्त करने के लिए बड़ी संख्या में मॉडलिंग अध्ययन किए गए। इनमें से अधिकांश कंपार्टमेंटल मॉडल में साधारण अंतर समीकरण (ओडीई) सिस्टम शामिल थे। इस तरह की औपचारिकता स्पष्ट रूप से मानती है कि प्रत्येक डिब्बे में बिताया गया समय उसमें पहले से बिताए गए समय पर निर्भर नहीं करता है, जो कि नैदानिक डेटा के विपरीत है। इस "स्मृतिहीन" मुद्दे को दूर करने के लिए, एक व्यापक रूप से उपयोग किया जाने वाला समाधान एक अद्वितीय वास्तविकता के डिब्बों की संख्या को बढ़ाना और श्रृंखलाबद्ध करना है (उदाहरण के लिए कई डिब्बों के बीच संक्रमित व्यक्तिगत चाल)। यह अधिक विविधता की अनुमति देता है और इस प्रकार देखी गई स्थिति के करीब होता है, लेकिन पूरे मॉडल को समझना और पैरामीटराइज़ करना अधिक कठिन बना देता है। हम ओडीई के बजाय आंशिक अंतर समीकरणों (पीडीई) के आधार पर एक गैर-मार्कोवियन वैकल्पिक औपचारिकता विकसित करते हैं, जो निर्माण द्वारा, प्रत्येक डिब्बे के लिए एक मेमोरी संरचना प्रदान करता है जिससे हमें डिब्बों की संख्या सीमित करने की अनुमति मिलती है। हम अपने मॉडल को फ्रेंच 2021 SARS-CoV-2 महामारी पर लागू करते हैं और, वैक्सीन-प्रेरित और प्राकृतिक प्रतिरक्षा के लिए लेखांकन करते हुए, हम उन प्रमुख घटकों का विश्लेषण और निर्धारण करते हैं जिन्होंने कोविड -19 अस्पताल में प्रवेश में योगदान दिया। परिणामों से संकेत मिलता है कि देखी गई टीकाकरण दर अकेले महामारी को नियंत्रित करने के लिए पर्याप्त नहीं है, और एक वैश्विक संवेदनशीलता विश्लेषण आयु-संरचित संपर्क मैट्रिक्स के कारण एक बड़ी अनिश्चितता को उजागर करता है। हमारा अध्ययन राष्ट्रीय कोविड-19 गतिशीलता को पकड़ने के लिए पीडीई औपचारिकता के लचीलेपन और मजबूती को दर्शाता है और प्रतिरक्षा कमजोर होने या वायरस के विकास से जुड़े मध्यम या दीर्घकालिक परिदृश्यों का अध्ययन करने के लिए दृष्टिकोण खोलता है।

महामारी विज्ञान, संक्रामक रोग मॉडलिंग, संपर्क मैट्रिक्स, आंशिक अंतर समीकरण, कोविड-19

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

非マルコフモデル化は、新型コロナウイルス感染症の疫学動態における年齢構成の重要性を強調する

新型コロナウイルス感染症（Covid-19）のパンデミック発生後、最善の公衆衛生政策を実施するための洞察を迅速に得るために、膨大な量のモデリング研究が行われました。これらのコンパートメントモデルのほとんどには、常微分方程式 (ODE) システムが含まれています。このような形式主義は、各コンパートメントで費やされた時間はそのコンパートメントですでに費やされた時間に依存しないことを暗黙に想定しており、これは臨床データと矛盾します。この「記憶のない」問題を克服するために、広く使用されている解決策は、固有の現実のコンパートメントの数を増やして連鎖させることです (例: 感染者が複数のコンパートメント間を移動する)。これにより、より大きな異質性が可能になり、観察された状況に近づくことができますが、モデル全体の把握とパラメータ化がより困難になる傾向もあります。私たちは、ODE の代わりに偏微分方程式 (PDE) に基づいた非マルコフ代替形式主義を開発します。これは、その構造により、各コンパートメントにメモリ構造を提供し、それによってコンパートメントの数を制限することができます。私たちはモデルをフランスの 2021 年 SARS-CoV-2 流行に適用し、ワクチン誘発免疫と自然免疫を考慮しながら、Covid-19 入院に寄与した主な要素を分析して特定します。この結果は、観察されたワクチン接種率だけでは流行を制御するのに十分ではないことを示しており、世界的な感度分析は、年齢構造の接触マトリクスに起因する大きな不確実性を浮き彫りにしている。私たちの研究は、国家的な新型コロナウイルス感染症の動態を捉えるための PDE 形式主義の柔軟性と堅牢性を示し、免疫力の低下やウイルスの進化を伴う中長期的なシナリオを研究する展望を開きます。

疫学、感染症モデリング、接触行列、偏微分方程式、Covid-19

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

A modelagem não-Markoviana destaca a importância da estrutura etária na dinâmica epidemiológica da Covid-19

O surto pandêmico de Covid-19 foi seguido por uma enorme quantidade de estudos de modelagem, a fim de obter rapidamente insights para implementar as melhores políticas de saúde pública. A maioria desses modelos compartimentais envolvia sistemas de equações diferenciais ordinárias (EDOs). Tal formalismo pressupõe implicitamente que o tempo passado em cada compartimento não depende do tempo já passado no mesmo, o que está em desacordo com os dados clínicos. Para superar este problema de “sem memória”, uma solução amplamente utilizada é aumentar e encadear o número de compartimentos de uma realidade única (por exemplo, ter um indivíduo infectado se movendo entre vários compartimentos). Isto permite maior heterogeneidade e assim estar mais próximo da situação observada, mas também tende a tornar todo o modelo mais difícil de apreender e parametrizar. Desenvolvemos um formalismo alternativo não-Markoviano baseado em equações diferenciais parciais (EDPs) em vez de EDOs, que, por construção, fornece uma estrutura de memória para cada compartimento, permitindo-nos assim limitar o número de compartimentos. Aplicamos o nosso modelo à epidemia francesa de SARS-CoV-2 em 2021 e, ao mesmo tempo que contabilizamos a imunidade natural e induzida pela vacina, analisamos e determinamos os principais componentes que contribuíram para as internações hospitalares por Covid-19. Os resultados indicam que a taxa de vacinação observada por si só não é suficiente para controlar a epidemia, e uma análise de sensibilidade global destaca uma enorme incerteza atribuível à matriz de contacto estruturada por idade. Nosso estudo mostra a flexibilidade e robustez do formalismo PDE para capturar a dinâmica nacional da COVID-19 e abre perspectivas para estudar cenários de médio ou longo prazo envolvendo declínio imunológico ou evolução do vírus.

epidemiologia, modelagem de doenças infecciosas, matriz de contato, equações diferenciais parciais, Covid-19

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Немарковское моделирование подчеркивает важность возрастной структуры для эпидемиологической динамики Covid-19.

За вспышкой пандемии Covid-19 последовало огромное количество исследований по моделированию, чтобы быстро получить информацию для реализации наилучшей политики общественного здравоохранения. Большинство этих многораздельных моделей включали системы обыкновенных дифференциальных уравнений (ОДУ). Подобный формализм неявно предполагает, что время пребывания в каждом отсеке не зависит от времени, уже проведенного в нем, что противоречит клиническим данным. Чтобы преодолеть эту проблему «без памяти», широко используемое решение состоит в том, чтобы увеличить и связать количество отсеков уникальной реальности (например, инфицированное индивидуальное перемещение между несколькими отсеками). Это позволяет добиться большей неоднородности и, таким образом, быть ближе к наблюдаемой ситуации, но также имеет тенденцию усложнять понимание и параметризацию всей модели. Мы разрабатываем немарковский альтернативный формализм, основанный на уравнениях в частных производных (УЧП) вместо ОДУ, который по своей конструкции обеспечивает структуру памяти для каждого отсека, тем самым позволяя нам ограничить количество отсеков. Мы применяем нашу модель к эпидемии SARS-CoV-2 во Франции в 2021 году и, учитывая вакцино-индуцированный и естественный иммунитет, анализируем и определяем основные компоненты, которые способствовали госпитализации Covid-19. Результаты показывают, что одного лишь наблюдаемого уровня вакцинации недостаточно для контроля эпидемии, а анализ глобальной чувствительности подчеркивает огромную неопределенность, связанную с возрастной матрицей контактов. Наше исследование показывает гибкость и надежность формализма PDE для отражения национальной динамики COVID-19 и открывает перспективы для изучения среднесрочных и долгосрочных сценариев, включающих ослабление иммунитета или эволюцию вируса.

эпидемиология, моделирование инфекционных заболеваний, контактная матрица, уравнения в частных производных, Covid-19

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

非马尔可夫模型强调了年龄结构对 Covid-19 流行病学动态的重要性

Covid-19 大流行爆发后进行了大量的建模研究，以便快速获得见解以实施最佳的公共卫生政策。大多数这些分区模型涉及常微分方程 (ODE) 系统。这种形式主义隐含地假设在每个隔室中花费的时间并不取决于已经在其中花费的时间，这与临床数据不一致。为了克服这种“无记忆”问题，一种广泛使用的解决方案是增加和链接独特现实的隔间数量（例如，已感染的个体在多个隔间之间移动）。这允许更大的异质性，从而更接近观察到的情况，但也往往使整个模型更难以理解和参数化。我们开发了一种基于偏微分方程（PDE）而不是 ODE 的非马尔可夫替代形式主义，它通过构造为每个隔间提供了存储结构，从而允许我们限制隔间的数量。我们将我们的模型应用于法国 2021 年 SARS-CoV-2 疫情，在考虑疫苗诱导免疫和自然免疫的同时，我们分析并确定了导致 Covid-19 入院的主要因素。结果表明，仅观察到的疫苗接种率不足以控制疫情，全局敏感性分析凸显了年龄结构接触矩阵带来的巨大不确定性。我们的研究展示了 PDE 形式主义在捕捉国家 COVID-19 动态方面的灵活性和稳健性，并为研究涉及免疫减弱或病毒进化的中长期情景开辟了视角。

流行病学、传染病建模、接触矩阵、偏微分方程、Covid-19

Submission: posted 04 October 2021
Recommendation: posted 30 January 2022, validated 04 February 2022

Cite this recommendation as:
Liao, C. (2022) Importance of age structure on modeling COVID-19 epidemiological dynamics. Peer Community in Mathematical and Computational Biology, 100008. https://doi.org/10.24072/pci.mcb.100008

Recommendation

COVID-19 spread around the globe in early 2020 and has deeply changed our everyday life [1]. Mathematical models allow us to estimate R0 (basic reproduction number), understand the progression of viral infection, explore the impacts of quarantine on the epidemic, and most importantly, predict the future outbreak [2]. The most classical model is SIR, which describes time evolution of three variables, i.e., number of susceptible people (S), number of people infected (I), and number of people who have recovered (R), based on their transition rates [3]. Despite the simplicity, SIR model produces several general predictions that have important implications for public health [3].

SIR model includes three populations with distinct labels and is thus compartmentalized. Extra compartments can be added to describe additional states of populations, for example, people exposed to the virus but not yet infectious. However, a model with more compartments, though more realistic, is also more difficult to parameterize and analyze. The study by Reyné et al. [4] proposed an alternative formalism based on PDE (partial differential equation), which allows modeling different biological scenarios without the need of adding additional compartments. As illustrated, the authors modeled hospital admission dynamics in a vaccinated population only with 8 general compartments.

The main conclusion of this study is that the vaccination level till 2021 summer was insufficient to prevent a new epidemic in France. Additionally, the authors used alternative data sources to estimate the age-structured contact patterns. By sensitivity analysis on a daily basis, they found that the 9 parameters in the age-structured contact matrix are most variable and thus shape Covid19 pandemic dynamics. This result highlights the importance of incorporating age structure of the host population in modeling infectious diseases. However, a relevant potential limitation is that the contact matrix was assumed to be constant throughout the simulations. To account for time dependence of the contact matrix, social and behavioral factors need to be integrated [5].

References

[1] Hu B, Guo H, Zhou P, Shi Z-L (2021) Characteristics of SARS-CoV-2 and COVID-19. Nature Reviews Microbiology, 19, 141–154. https://doi.org/10.1038/s41579-020-00459-7

[2] Jinxing G, Yongyue W, Yang Z, Feng C (2020) Modeling the transmission dynamics of COVID-19 epidemic: a systematic review. The Journal of Biomedical Research, 34, 422–430. https://doi.org/10.7555/JBR.34.20200119

[3] Tolles J, Luong T (2020) Modeling Epidemics With Compartmental Models. JAMA, 323, 2515–2516. https://doi.org/10.1001/jama.2020.8420

[4] Reyné B, Richard Q, Noûs C, Selinger C, Sofonea MT, Djidjou-Demasse R, Alizon S (2022) Non-Markovian modelling highlights the importance of age structure on Covid-19 epidemiological dynamics. medRxiv, 2021.09.30.21264339, ver. 3 peer-reviewed and recommended by Peer Community in Mathematical and Computational Biology. https://doi.org/10.1101/2021.09.30.21264339

[5] Bedson J, Skrip LA, Pedi D, Abramowitz S, Carter S, Jalloh MF, Funk S, Gobat N, Giles-Vernick T, Chowell G, de Almeida JR, Elessawi R, Scarpino SV, Hammond RA, Briand S, Epstein JM, Hébert-Dufresne L, Althouse BM (2021) A review and agenda for integrated disease models including social and behavioural factors. Nature Human Behaviour, 5, 834–846 https://doi.org/10.1038/s41562-021-01136-2

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Reviews

Evaluation round #1

DOI or URL of the preprint: https://doi.org/10.1101/2021.09.30.21264339

Version of the preprint: 1

Author's Reply, 20 Jan 2022

Download author's reply Download tracked changes file https://doi.org/10.24072/pci.mcb.100111.ar1

Decision by Chen Liao, posted 30 Dec 2021

Dear Authors,

We have received three reviews of your manuscript, two of which are very thoughful and detailed. All three reviwers are positive and appreciate the mathematical approaches proposed in your study. Given that this is a solid work with rigorous methodology and well-structured texts, we are happy to recommend your article after some minor revisions according to the review recommendations. In particular, I would encourage the authors to improve the following two aspects: (1) literature review of other PDE approaches and (2) codes/documentation of the software package.

Please submit your revised manuscript within one month and let us know if you anticipate any delay.

When you are ready to resubmit, please provide a detailed list of your responses to all review comments and a desription of the changes you have made in the manuscript. I would have appreciated if two versions of the revised manuscript are provided: one clean version and the other denoting where the text has been changed (highlighted or in track-change).

We hope that our recommendation process has been constructive so far. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Chen Liao

https://doi.org/10.24072/pci.mcb.100111.d1

Reviewed by anonymous reviewer 1, 02 Dec 2021

Download the review https://doi.org/10.24072/pci.mcb.100111.rev11

Reviewed by Facundo Muñoz, 16 Nov 2021

# Review of "The importance of the population age-structure: insights from Covid-19 dynamics model structured by age, time since infection and acquired immunity"

https://doi.org/10.1101/2021.09.30.21264339

Facundo Muñoz, November 2021.

## General considerations

Main goal: Demonstrate the adaptation and extension of a recently published (by some of the same authors) approach which overcomes the Markovian hypothesis (lack of memory) of classical compartmental epidemiological models, with the objective of understanding the interplay between vaccination rates and age-structure in the Covid-19 pandemic in France.

Method: The methodology generalises the classical methodology based on Ordinary Differential Equations (ODEs) with respect to __time__, with Partial Differential Equations (PDEs) with respect to the __age__ and to the __time since infection__, in addition to __time__.
The authors deploy a specific compartment for vaccinated population and explore the use of alternative data sources to inform the age-structured contact patterns.

I think the title and the introduction could be improved to better identify the main topic of the manuscript.
The title focus on the scientific questions about some mechanisms at play in the French Covid-19 pandemic ("the importance of the population age-structure"), without any mention to the methodology.
By contrast, the abstract and introduction focus on the methodology, presenting the lack-of-memory issue as the knowledge gap to be addressed: "... we introduce an alternate formalism relying on partial differential equations..." (l. 35).

In my first read of the manuscript, I thought that the main topic was the methodology and the title was highlighting an application. It took me a second, more careful read to understand that the methodology had been introduced previously in Richard et al. (2021) and the present paper demonstrated how to tailor it to address different questions. But this is not clearly conveyed by neither the title nor the introduction (or the abstract).

## Introduction

The section is very well structured, first stating the context and quickly identifying the knowledge gap. Namely, the need to model memory effects.
It then explains the limits of two prior alternative approaches and makes a good case by arguing that multiplying compartments do not scale well and models quickly become very difficult to parameterise and interpret.

However, the first method is dismissed as a "workaround" which "artificially" increases the number of compartments.
I think these are inappropriate dismissive qualifiers. Every model could be ultimately considered as _artificial_ and used to _work around_ reality. The question is how _useful_ they are.

More specific statements about the relative merits would be much more informative. For instance, the authors could rather argue that modelling heterogeneities by age __continuously__ is more _parsimonious_ than introducing __artificial__ boundaries between age groups. This formulation explicitly specifies what exactly is being considered _artificial_, by contrast to the current proposal.

I would have appreciated further introductory references to epidemiological modelling with PDEs. The only reference is Richard et al. (2021), which in turn says that it is a "less common and much more challenging" approach, without further references.

## Materials and methods

The presentation of the model is condensed, but well structured, rigorous and sufficiently detailed. Especially given that the main ideas were presented previously in some more detail.

I have only missed one or two sentences to discuss the recovery rate $\gamma^{mv}(a, i)$ from compartment $I^{mv}_{aik}$, about line 70, where the need for this compartment is introduced. In particular, justifying the choice for recovered individuals returning back to the compartment $V_{ak}$ rather than $R_{aj}$. Stating explicitly that, in so doing, the time since vaccination $k$ is preserved, and possibly other consequences of the choice.

l. 75: « ... the number of [+newly] severely infected individuals of age $a$ at time $t$ [-is][+are] given by the boundary condition[+s] »

In point 6 of Assumption S1, I think it is missing the case $l \in d$, or is there a reason for leaving it out?

I must confess that I could not quite follow the demonstration of the well-posedness of the system in appendix A.2, nor the derivation of the basic reproduction number in appendix A.3. It's been a long time since I last revisited Banach spaces, and I am not familiar with the utilised methods and results. Nevertheless, both sections provide enough references and pointers for interested readers.

## Results

All the data and code were appropriately available for reproducing the results.
Providing cached intermediate results which are lengthy to compute is very much appreciated.
However, the documentation and comments are not sufficiently detailed.

For instance, the first script (`1_fit_vaccionation.R`) performs a calculation in parallel, which seems computationally demanding (I stopped it after a few minutes). It stores the results into an object called `results`.
Coincidentally, there is a cached data file named `results.RData` which, judging by the name, seems to correspond with said computation.
Yet there is no comment or indication confirming this, and loading such data file brings in a number of objects, none of them called _results_.
It takes some more investigation to figure out that `results.RData` is created by the 5th script, and used in the 7th. So, it seems related to something else.

Next, the second script warns from the beginning that it takes a few hours to run. Yet, it does not provide any pointer to the generated object (called `best` in the script), for which there is no cached results.

I don't pretend to be overly critic. It is apparent that the authors put some effort in cleaning up and commenting their code, and I truly appreciate it.
Still, making code available and __accessible__ to other people is difficult and takes a lot of time. Sometimes as much as producing the code itself.

The R package `modelvacc` is a wrapper around a set of C++ functions that implement the model equations and procedures.
However, its complete lack of documentation (code comments, help pages) and tests somewhat hampers its reliability and re-usability by other researchers. I believe that this package is of considerable scientific value as a companion to the paper and can be instrumental in the adoption and improvement of the approach proposed by the authors. As such, it should be subject to the same high standards as the manuscript itself.

In summary, I would encourage the editors and the authors to improve the code a bit before, or after, publication.

## Discussion

The discussion is well structured, placing the results in context, and stating the relevant scientific conclusions given the strengths and limitations of the approach.

https://doi.org/10.24072/pci.mcb.100111.rev12

Reviewed by Kevin Bonham, 20 Dec 2021

Reyne et. al. - The importance of the population age-structure: insights from Covid-19dynamics model structured by age, time since infection and acquired immunity

Review

In The importance of the population age-structure: insights from Covid-19 dynamics model structured by age, time since infection and acquired immunity, Reyné and colleagues present a SIR model based on partial differential equations (PDE) as opposed to the typical ODE-based models. The authors state that this provides the ability to more faithfully capture the time that individuals spend within model compartments (memory) without the need to artifically inflate the number of compartments modeled. This has the advantage of increasing interpretability and flexibility of the model at the cost of more up-front effort at parameterization.

Unfortunately, I fear I lack the mathematical expertise to comment directly on the construction of the model and on its outputs. I will instead focus on the clarity of the writing and on the software, in the hopes that this will be useful.

Writing

The authors do an admirable job explaining the construction of their PDE model, including how individual terms relate to real-world scenarios and the source of values for initial parameterization. Though I am not able to readily follow the math, the descriptions in the text are clear and sensible. Figure 1 provides a useful reference for the modeled compartments, and the pathways between them.

Many of the limitations that I perceived are mentioned in the main text or in the discussion, and are adequately explained. One exception here is regarding the waning of immunity after vaccination (lines 104-105).

Regarding the modelling of vaccine efficacy, for simplicity, we neglect immune waning, i.e. the decrease of immunity with time

The time-dependent changes in vaccine effectiveness strike me as a major source of uncertainty in this pandemic, and something for which models of this sort are well-suited to address (as claimed by the authors on line 34 as one motivation for this approach). In other portions of the manuscript, the authors imply that they are modeling this waning (eg ln 67 and ln 286). Perhaps it is clear from the equations, but I find myself unclear on whether this is actually accounted for or not.

Software

The authors make their software (written in C++ and R) available via an institutional gitlab repository. I was able to download a tarball of this code and follow the instructions to install dependencies on my laptop (Ubuntu xenial, R v4.0.1) Though (as mentioned in the README) many of the scripts take a long time to run, intermediate results are helpfully provided, and all of the code that I tried ran without errors until I interrupted it. The R portions of the code contain many helpful comments.

There are a few places where values that should perhaps be determined programmatically are hard-coded (eg here), and it might be nice if the parameters described in the paper could be found in a single configuration file (or something) rather than sprinkled throughout the scripts, as this would make it easier to tweak the assumptions of the model to see their effects, but these are very minor gripes.

I find it quite admirable that code is provided in a runnable state for review. A few additional steps could make this code availability even stronger (though I hesitate to demand any of these steps as necessary).

Register / archive the code via an independent institutional repository such as zenodo.org or osf.io. Especially one that provides a digital object identifier (DOI). As it stands, there is no guarantee that this code won't disappear tomorrow.
Provide additional instructions for installing specific versions of packages. The provided session_info.txt file is a great start - using the renv package allowed me to reproduce the environment, at least as regards R dependencies. Additional information about C++ versions and compilation would also be welcome.
Provide some kind of indication within the scripts (just comments would be fine) which portions of the code take approximately what amount of time. I would have like to try to run the code that only takes minutes or hours so that I could inspect the output, but without knowing which parts might take days (or be infeasible on my laptop), this isn't practically possible.
Descriptions in the code that reference specific parts of the paper. Especially given my difficulty with understanding the math, being able to link the code directly to descriptions in the paper would be immensely helpful.

The commit history on the publicly available project looks like it starts when the project was basically complete. Many people are uncomfortable sharing in-progress code (it's possible that version tracking was not even done earlier), and I don't think anything different is expected, which is why I'm not including it in my list of suggestions. But it's a shame.

Results

The results, so far as I understand them, are impressive and on the whole, clearly presented. I am a bit unclear about figure 2 - in particular, I wonder if it would make more sense to split the ages into more plausible units, rather than just 10 year increments. For example, infants and toddlers are likely to be more dissimilar from school age kids than eg 9 vs 11 year olds. One might also consider breaking up based on availability of vaccine (eg the youngest kids still can't get vaccinated).

For final publication, it might be nice to extend figures 5 and 6 with the most recent available data, as it currently ends in August. I don't know how feasible this is given the run-time of the code. Any further deviations from reality would not necessarily change the utility of this paper, but could be interesting fodder for discussion.

https://doi.org/10.24072/pci.mcb.100111.rev13

User comments

No user comments yet

or Register
Submit a preprint