Close printable page

Recommendation

Allowing gene transfers doesn't make life easier for inferring orthology and paralogy

Barbara Holland based on reviews by 2 anonymous reviewers

A recommendation of:

Consistency of orthology and paralogy constraints in the presence of gene transfers

Mark Jones, Manuel Lafond, Celine Scornavacca (2022), arXiv:1705.01240 [cs], ver.6 peer-reviewed and recommended by Peer Community in Mathematical and Computational Biology https://doi.org/10.48550/arXiv.1705.01240

Read preprint in preprint server

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

Consistency of orthology and paralogy constraints in the presence of gene transfers

Orthology and paralogy relations are often inferred by methods based on gene sequence similarity that yield a graph depicting the relationships between gene pairs. Such relation graphs frequently contain errors, as they cannot be explained via a gene tree that contains the depicted orthologs/paralogs while being consistent with the species evolution. Previous research has mostly focused on correcting such errors in some minimal way, for instance by changing a minimum number of relations to attain consistency. In this work, we ask: could the errors in the orthology predictions be explained by lateral gene transfer? We formalize this question by allowing gene transfers to behave either as a speciation or as a duplication, expanding the space of valid orthology graphs. We then provide a variety of algorithmic results regarding the underlying problems. Namely, we show that deciding if a relation graph R is consistent with a given species network N with known transfer highways is NP-hard, and that it is W[1]-hard under the parameter “minimum number of transfers”. During the process, we define a novel algorithmic problem called Antichain on trees, which may be useful for other reductions. We then present an FPT algorithm for the decision problem based on the degree of the gene tree associated with R. We also study analogous problems in the case that the transfer highways on a species tree are unknown.

orthology, phylogenetic network, algorithms, fixed-parameter tractability

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

اتساق قيود تقويم العظام والشلل في وجود عمليات نقل الجينات

غالبًا ما يتم استنتاج العلاقات المتعلقة بعلم العظام والشلل من خلال طرق تعتمد على تشابه تسلسل الجينات والتي تنتج رسمًا بيانيًا يصور العلاقات بين أزواج الجينات. تحتوي هذه الرسوم البيانية للعلاقات في كثير من الأحيان على أخطاء، حيث لا يمكن تفسيرها عبر شجرة الجينات التي تحتوي على المتعامدين/المشابهين المصورين بينما تكون متسقة مع تطور الأنواع. ركزت الأبحاث السابقة في الغالب على تصحيح مثل هذه الأخطاء بطريقة بسيطة، على سبيل المثال عن طريق تغيير الحد الأدنى من العلاقات لتحقيق الاتساق. في هذا العمل نتساءل: هل يمكن تفسير الأخطاء في تنبؤات تقويم العظام عن طريق نقل الجينات الجانبي؟ نحن نقوم بإضفاء الطابع الرسمي على هذا السؤال من خلال السماح لعمليات نقل الجينات بالتصرف إما كنوع جديد أو كتكرار، مما يؤدي إلى توسيع مساحة الرسوم البيانية الصالحة لتقويم العظام. ثم نقدم بعد ذلك مجموعة متنوعة من النتائج الخوارزمية فيما يتعلق بالمشاكل الأساسية. على وجه التحديد، نظهر أن تحديد ما إذا كان الرسم البياني للعلاقة R يتوافق مع شبكة أنواع معينة N مع طرق نقل سريعة معروفة هو أمر صعب NP، وأنه صعب W[1] تحت معلمة "الحد الأدنى لعدد عمليات النقل". أثناء العملية، قمنا بتعريف مشكلة خوارزمية جديدة تسمى Antichain على الأشجار، والتي قد تكون مفيدة لتخفيضات أخرى. نقدم بعد ذلك خوارزمية FPT لمشكلة القرار استنادًا إلى درجة شجرة الجينات المرتبطة بـ R. وندرس أيضًا مشكلات مماثلة في حالة عدم معرفة طرق النقل السريعة على شجرة الأنواع.

علم تقويم العظام، شبكة النشوء والتطور، الخوارزميات، قابلية تتبع المعلمة الثابتة

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Consistencia de las restricciones de ortología y paralogía en presencia de transferencias de genes.

Las relaciones de ortología y paralogía a menudo se infieren mediante métodos basados en la similitud de secuencias genéticas que producen un gráfico que representa las relaciones entre pares de genes. Estos gráficos de relaciones contienen frecuentemente errores, ya que no pueden explicarse mediante un árbol genético que contenga los ortólogos/parálogos representados y al mismo tiempo sea coherente con la evolución de las especies. Las investigaciones anteriores se han centrado principalmente en corregir dichos errores de alguna manera mínima, por ejemplo cambiando un número mínimo de relaciones para lograr coherencia. En este trabajo nos preguntamos: ¿podrían los errores en las predicciones ortológicas explicarse por la transferencia lateral de genes? Formalizamos esta cuestión permitiendo que las transferencias de genes se comporten como una especiación o como una duplicación, ampliando el espacio de los gráficos de ortología válidos. Luego proporcionamos una variedad de resultados algorítmicos con respecto a los problemas subyacentes. Es decir, mostramos que decidir si un gráfico de relaciones R es consistente con una red de especies determinada N con carreteras de transferencia conocidas es NP-difícil, y que es W[1]-difícil bajo el parámetro "número mínimo de transferencias". Durante el proceso, definimos un nuevo problema algorítmico llamado Antichain en árboles, que puede ser útil para otras reducciones. Luego presentamos un algoritmo FPT para el problema de decisión basado en el grado del árbol genético asociado con R. También estudiamos problemas análogos en el caso de que se desconozcan las autopistas de transferencia en un árbol de especies.

ortología, red filogenética, algoritmos, trazabilidad de parámetros fijos

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Cohérence des contraintes d'orthologie et de paralogie en présence de transferts de gènes

Les relations orthologiques et paralogiques sont souvent déduites par des méthodes basées sur la similarité des séquences génétiques qui donnent un graphique illustrant les relations entre les paires de gènes. De tels graphiques de relations contiennent souvent des erreurs, car ils ne peuvent pas être expliqués via un arbre génétique contenant les orthologues/paralogues représentés tout en étant cohérents avec l'évolution de l'espèce. Les recherches antérieures se sont principalement concentrées sur la correction de ces erreurs de manière minimale, par exemple en modifiant un nombre minimum de relations pour atteindre la cohérence. Dans ce travail, nous nous demandons : les erreurs dans les prédictions orthologiques pourraient-elles s’expliquer par un transfert latéral de gènes ? Nous formalisons cette question en permettant aux transferts de gènes de se comporter soit comme une spéciation, soit comme une duplication, élargissant ainsi l'espace des graphes orthologiques valides. Nous fournissons ensuite une variété de résultats algorithmiques concernant les problèmes sous-jacents. À savoir, nous montrons que décider si un graphe de relations R est cohérent avec un réseau d’espèces N donné avec des autoroutes de transfert connues est NP-difficile, et qu’il est W[1]-difficile sous le paramètre « nombre minimum de transferts ». Au cours du processus, nous définissons un nouveau problème algorithmique appelé Antichain sur les arbres, qui peut être utile pour d'autres réductions. Nous présentons ensuite un algorithme FPT pour le problème de décision basé sur le degré de l'arbre génétique associé à R. Nous étudions également des problèmes analogues dans le cas où les autoroutes de transfert sur un arbre d'espèces sont inconnues.

orthologie, réseau phylogénétique, algorithmes, traitabilité à paramètres fixes

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

जीन स्थानांतरण की उपस्थिति में ऑर्थोलॉजी और पैरालॉजी बाधाओं की संगति

ऑर्थोलॉजी और पैरालॉजी संबंधों का अनुमान अक्सर जीन अनुक्रम समानता पर आधारित तरीकों से लगाया जाता है, जो जीन जोड़े के बीच संबंधों को दर्शाने वाला एक ग्राफ उत्पन्न करता है। ऐसे संबंध ग्राफ़ में अक्सर त्रुटियां होती हैं, क्योंकि उन्हें जीन ट्री के माध्यम से समझाया नहीं जा सकता है जिसमें प्रजातियों के विकास के अनुरूप होते हुए चित्रित ऑर्थोलॉग/पैरालॉग शामिल हैं। पिछला शोध ज्यादातर ऐसी त्रुटियों को कुछ न्यूनतम तरीके से ठीक करने पर केंद्रित रहा है, उदाहरण के लिए स्थिरता प्राप्त करने के लिए न्यूनतम संख्या में संबंधों को बदलना। इस कार्य में, हम पूछते हैं: क्या ऑर्थोलॉजी भविष्यवाणियों में त्रुटियों को पार्श्व जीन स्थानांतरण द्वारा समझाया जा सकता है? हम जीन स्थानांतरण को या तो प्रजाति के रूप में या दोहराव के रूप में व्यवहार करने की अनुमति देकर, वैध ऑर्थोलॉजी ग्राफ़ के स्थान का विस्तार करके इस प्रश्न को औपचारिक बनाते हैं। फिर हम अंतर्निहित समस्याओं के संबंध में विभिन्न प्रकार के एल्गोरिथम परिणाम प्रदान करते हैं। अर्थात्, हम दिखाते हैं कि यह तय करना कि क्या संबंध ग्राफ आर ज्ञात स्थानांतरण राजमार्गों के साथ किसी दिए गए प्रजाति नेटवर्क एन के अनुरूप है, एनपी-हार्ड है, और यह "स्थानांतरण की न्यूनतम संख्या" पैरामीटर के तहत डब्ल्यू [1] -हार्ड है। इस प्रक्रिया के दौरान, हम पेड़ों पर एंटीचेन नामक एक नवीन एल्गोरिथम समस्या को परिभाषित करते हैं, जो अन्य कटौती के लिए उपयोगी हो सकती है। फिर हम आर से जुड़े जीन वृक्ष की डिग्री के आधार पर निर्णय समस्या के लिए एक एफपीटी एल्गोरिदम प्रस्तुत करते हैं। हम उस मामले में अनुरूप समस्याओं का भी अध्ययन करते हैं जब किसी प्रजाति के पेड़ पर स्थानांतरण राजमार्ग अज्ञात होते हैं।

ऑर्थोलॉजी, फाइलोजेनेटिक नेटवर्क, एल्गोरिदम, फिक्स्ड-पैरामीटर ट्रैक्टेबिलिटी

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

遺伝子導入の存在下でのオルソロジーとパラロジーの制約の一貫性

オルソロジーとパラロジーの関係は、多くの場合、遺伝子ペア間の関係を示すグラフを生成する遺伝子配列の類似性に基づく方法によって推論されます。このような関係グラフは、種の進化と一致しながら、描かれたオルソログ/パラログを含む遺伝子ツリーでは説明できないため、エラーを含むことがよくあります。これまでの研究では、一貫性を確保するために最小限の関係を変更するなど、最小限の方法でこのようなエラーを修正することに主に焦点が当てられてきました。この研究では、オーソロジー予測の誤差は遺伝子の水平伝達によって説明できるだろうか、と考えます。私たちは、遺伝子導入が種分化または重複として動作できるようにすることでこの疑問を形式化し、有効なオーソロジーグラフの空間を拡張します。次に、根本的な問題に関するさまざまなアルゴリズムの結果を提供します。すなわち、関係グラフ R が既知の移動ハイウェイを備えた特定の種ネットワーク N と一致するかどうかを判断することは NP 困難であり、パラメーター「最小移動数」の下では W[1] 困難であることを示します。そのプロセス中に、ツリー上のアンチチェーンと呼ばれる新しいアルゴリズムの問題を定義します。これは他の削減に役立つ可能性があります。次に、R に関連する遺伝子木の次数に基づいた決定問題の FPT アルゴリズムを提示します。また、種木の伝達ハイウェイが不明な場合の類似の問題も研究します。

オルソロジー、系統発生ネットワーク、アルゴリズム、固定パラメータの扱いやすさ

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Consistência das restrições de ortologia e paralogia na presença de transferências de genes

As relações de ortologia e paralogia são frequentemente inferidas por métodos baseados na similaridade de sequências genéticas que produzem um gráfico que descreve as relações entre pares de genes. Tais gráficos de relação freqüentemente contêm erros, pois não podem ser explicados por meio de uma árvore genética que contém os ortólogos/parálogos representados, embora sejam consistentes com a evolução das espécies. A investigação anterior centrou-se principalmente na correção de tais erros de uma forma mínima, por exemplo, alterando um número mínimo de relações para obter consistência. Neste trabalho, perguntamos: os erros nas previsões ortológicas poderiam ser explicados pela transferência lateral de genes? Formalizamos esta questão permitindo que as transferências de genes se comportem como uma especiação ou como uma duplicação, expandindo o espaço de gráficos ortológicos válidos. Em seguida, fornecemos uma variedade de resultados algorítmicos relacionados aos problemas subjacentes. Nomeadamente, mostramos que decidir se um grafo de relação R é consistente com uma determinada rede de espécies N com rodovias de transferência conhecidas é NP-difícil, e que é W[1]-difícil sob o parâmetro “número mínimo de transferências”. Durante o processo, definimos um novo problema algorítmico denominado Antichain em árvores, que pode ser útil para outras reduções. Apresentamos então um algoritmo FPT para o problema de decisão baseado no grau da árvore genética associada a R. Também estudamos problemas análogos no caso em que as rodovias de transferência em uma árvore de espécies são desconhecidas.

ortologia, rede filogenética, algoritmos, tratabilidade de parâmetros fixos

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Согласованность ограничений ортологии и паралогии при переносе генов

Отношения ортологии и паралогии часто выводятся с помощью методов, основанных на сходстве последовательностей генов, которые дают график, изображающий отношения между парами генов. Такие графы отношений часто содержат ошибки, поскольку их нельзя объяснить с помощью генного дерева, которое содержит изображенные ортологи / паралоги, но при этом согласуется с эволюцией вида. Предыдущие исследования в основном были сосредоточены на исправлении таких ошибок каким-либо минимальным способом, например, путем изменения минимального количества отношений для достижения согласованности. В этой работе мы задаемся вопросом: можно ли объяснить ошибки в ортологических предсказаниях латеральным переносом генов? Мы формализуем этот вопрос, позволяя переносу генов вести себя либо как видообразование, либо как дупликация, расширяя пространство действительных графов ортологии. Затем мы предоставляем различные алгоритмические результаты, касающиеся основных проблем. А именно, мы показываем, что решение о том, согласуется ли граф отношений R с заданной видовой сетью N с известными транспортными магистралями, является NP-трудным и W[1]-трудным при параметре «минимальное количество передач». В ходе этого процесса мы определяем новую алгоритмическую задачу под названием «Антицепь на деревьях», которая может быть полезна для других сокращений. Затем мы представляем FPT-алгоритм для решения проблемы, основанный на степени генного дерева, связанного с R. Мы также изучаем аналогичные проблемы в случае, когда пути передачи на дереве видов неизвестны.

ортология, филогенетическая сеть, алгоритмы, управляемость с фиксированными параметрами

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

存在基因转移时直系同源和旁系同源约束的一致性

直系同源和旁系同源关系通常通过基于基因序列相似性的方法来推断，这些方法产生描述基因对之间关系的图表。此类关系图经常包含错误，因为它们无法通过包含所描述的直向同源物/旁系同源物的基因树来解释，同时与物种进化保持一致。以前的研究主要集中在以某种最小的方式纠正此类错误，例如通过更改最少数量的关系来实现一致性。在这项工作中，我们问：同源预测中的错误可以用横向基因转移来解释吗？我们通过允许基因转移表现为物种形成或重复，扩展有效直系图的空间来形式化这个问题。然后，我们提供有关潜在问题的各种算法结果。也就是说，我们证明，决定关系图 R 是否与具有已知转移高速公路的给定物种网络 N 一致是 NP 困难的，并且在参数“最小转移次数”下是 W[1] 困难的。在此过程中，我们定义了一个称为树上反链的新颖算法问题，这可能对其他减少有用。然后，我们提出了一种基于与 R 相关的基因树的程度的决策问题的 FPT 算法。我们还研究了物种树上的转移高速公路未知的情况下的类似问题。

直系学、系统发育网络、算法、固定参数易处理性

Submission: posted 30 June 2021
Recommendation: posted 16 February 2022, validated 21 February 2022

Cite this recommendation as:
Holland, B. (2022) Allowing gene transfers doesn't make life easier for inferring orthology and paralogy. Peer Community in Mathematical and Computational Biology, 100009. https://doi.org/10.24072/pci.mcb.100009

Recommendation

Determining if genes are orthologous (i.e. homologous genes whose most common ancestor represents a speciation) or paralogous (homologous genes whose most common ancestor represents a duplication) is a foundational problem in bioinformatics. For instance, the input to almost all phylogenetic methods is a sequence alignment of genes assumed to be orthologous. Understanding if genes are paralogs or orthologs can also be important for assigning function, for example genes that have diverged following duplication may be more likely to have neofunctionalised or subfunctionalised compared to genes that have diverged following speciation, which may be more likely to have continued in a similar role.

This paper by Jones et al (2022) contributes to a wide range of literature addressing the inference of orthology/paralogy relations but takes a different approach to explaining inconsistency between an assumed species phylogeny and a relation graph (a graph where nodes represent genes and edges represent that the two genes are orthologs). Rather than assuming that inconsistencies are the result of incorrect assessment of orthology (i.e. incorrect edges in the relation graph) they ask if the relation graph could be consistent with a species tree combined with some amount of lateral (horizontal) gene transfer.

The two main questions addressed in this paper are (1) if a network N and a relation graph R are consistent, and (2) if – given a species tree S and a relation graph R – transfer arcs can be added to S in such a way that it becomes consistent with R?

The first question hinges on the concept of a reconciliation between a gene tree and a network (section 2.1) and amounts to asking if a gene tree can be found that can both be reconciled with the network and consistent with the relation graph. The authors show that the problem is NP hard. Furthermore, the related problem of attempting to find a solution using k or fewer transfers is NP-hard, and also W[1] hard implying that it is in a class of problems for which fixed parameter tractable solutions have not been found. The proof of NP hardness is by reduction to the k-multi-coloured clique problem via an intermediate problem dubbed “antichain on trees” (Section 3). The “antichain on trees” construction may be of interest to others working on algorithmic complexity with phylogenetic networks.

In the second question the possible locations of transfers are not specified (or to put it differently any time consistent transfer arc is considered possible) and it is shown that it generally will be possible to add transfer edges to S in such a way that it can be consistent with R. However, the natural extension to this question of asking if it can be done with k or fewer added arcs is also NP hard.

Many of the proofs in the paper are quite technical, but the authors have relegated a lot of this detail to the appendix thus ensuring that the main ideas and results are clear to follow in the main text. I am grateful to both reviewers for their detailed reviews and through checking of the proofs.

References

Jones M, Lafond M, Scornavacca C (2022) Consistency of orthology and paralogy constraints in the presence of gene transfers. arXiv:1705.01240 [cs], ver. 6 peer-reviewed and recommended by Peer Community in Mathematical and Computational Biology. https://arxiv.org/abs/1705.01240

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Funding:
no declaration

Reviews

Evaluation round #1

DOI or URL of the preprint: https://arxiv.org/abs/1705.01240

Version of the preprint: 4

Author's Reply, 02 Feb 2022

Download author's reply Download tracked changes file https://doi.org/10.24072/pci.mcb.100108.ar1

Decision by Barbara Holland, posted 10 Jan 2022

I am pleased to have (finally) managed to get two expert reviews for this paper. Apologies for how long this took!

You'll see that they both have constructive suggestions that you may like to consider in a revised version.

I haven't delved tinto the technical detail but noticed a few small things.

Page 1

Orthology and paralogy relations are often inferred by methods based on gene sequence similarity, which yield a graph depicting the relationships between gene pairs.

->

Orthology and paralogy relations are often inferred by methods based on gene sequence similarity that yield a graph depicting the relationships between gene pairs.

I always get confused about when to use which and that but here I think that is better (i.e. an essential clause)

Vertical descent with modification (speciation) constitutes only part of the events shaping a gene history;

I'd say that vertical descent with modification is different from speciation, i.e. it's evolution along an edge whereas speciation is a splitting event.

page 3

The authors ask, given a reconciled gene tree G that displays a given of relations, whether there is a species network N that be reconciled with G.

->

The authors ask, given a reconciled gene tree G that displays a given of relations, whether there is a species network N that can be reconciled with G.

page 7 (last sentence)

missing a reference

It is worth mentioning the question studied in ??

https://doi.org/10.24072/pci.mcb.100108.d1

Reviewed by anonymous reviewer 1, 09 Sep 2021

Download the review https://doi.org/10.24072/pci.mcb.100108.rev11

Reviewed by anonymous reviewer 2, 03 Jan 2022

The paper presents a new approach to several problems on the homology relations in the gene tree-network reconciliation approach. From the mathematical and algorithmic point of view, the results are correct and sound. In general, the submitted article presents interesting algorithmic and computational complexity results. However, it is formally quite technical and requires some effort to follow. My general recommendation is positive, but I think the article requires revision. Below, I present more detailed remarks on the submitted contribution.

Perhaps, the most tricky elements of the paper are definitions with plenty of symbols and sometimes confusing usage of notions (see comment on Sect. 5). That's both on the level of the definitions and examples. Therefore, I recommend providing some better illustrations with explanations.

Recommendations:

1. Instead of writing a paragraph with exemplary alpha mapping (in pg. 2, which seems to contain mistakes), I recommend providing a picture of G embedded into N with explanations. It would be beneficial in understanding the concept of gene tree-network reconciliations. The current approach might be too difficult for a reader without experience in such approaches.

2. Also, the labeling e^* should be explained directly in Definition 2.

3. In Fig. 2, a comment should be on the presence of edge (c2,b2), since the edge seems not to represent an orthology relation from the exemplary reconciliation of G and N (which is confusing given the definition of orthology relation; however, it is formally correct, since the authors do not claim that R represents the relations from N).

4. Related to the above comment. Pg. 6, Sect. 2.2. Clarify that R is not the orthology graph for N (from Figure) or correct Fig. 1.

5. To be checked on page 6 (in the example):

- in e(alpha_1(b_1)) repeated,

- e(alpha_1(b_2)) missing,

- e(alpha_1(g_5))=S (not T)

- e(alpha_2(g_5))=T missing

6. In Section 5.

Definition 5 is conflicted with Definition 3. If S is a species tree, it is also a network with k=0 transfers. Also, "using k transfers", allows using 0 transfers. Thus, the notion of S-consistency is conflicted with Definition 3, when N is a species tree. The tricky part is that both notions are connected (also in the proof of Lemma 5). A careful reader can understand which definition must be applied, but it took me a while to untangle this issue.

Suggestion: try to avoid using S-consistency and N-consistency, where N and S are defined as a network and a species tree, resp.; Maybe use "species tree-consistency"?

Other comments:

- pg. 11. MWACT instead of ACT (2nd problem definition)

- The conditions on the weight functions are repeated several times (Lemma 3, Lemma 4, proofs, and other parts); I suggest introducing a new notion for the properties and removing the repeated lists.

- pg 6. the last line missing reference ??

- pg 23. 7 line from the top, remove "edge"

- pg 28. 2nd line from the top, LAST subscript;

- pg 28. 3rd line "alpha ... are incomparable" - explain what does it mean in a network

- inconsistent notation of edges: xy or (x,y) in several places

Algorithms.

The presented algorithms are clear and easy to understand Algorithm 2 can be improved by adopting more refined techniques from algorithmic papers on HGT reconciliation (see suggested papers at the end of the review), where the factor O(|V|^2) can be replaced by O(1). Such an update requires the introduction of an additional formula, which for (g,s) returns the minimum cost under the assumption the g is mapped to s', where there is a path from s to s' in N, plus the cost of transfers on the best path from s to s' (note that t(s,s') will be not needed). I leave the decision to the authors on how to incorporate this observation into the results. Such an improvement is not crucial in the contribution (even if the improvement in the polynomial part of FPT algorithm is significant), so a comment would be sufficient.

Please provide space complexity analysis.

Proofs in the appendix.

I also analyzed the proofs in the appendix, focusing on the more demanding and non-trivial proofs on reductions. This part is nicely written and presented but requires the most effort. I did not analyze the proofs of correctness of Algorithms 1 and 2 (the algorithms were easy to follow) and the proof of Theorem 4 (due to reviewing deadlines).

Related work suggestions.

1. The suggested improvement in Alg. 2 is presented in several algorithmic papers on variants of reconciling a gene tree with a species tree with horizontal gene transfer e.g. by Bansal or Mykowiecka (DOI: 10.1109/TCBB.2017.2707083).

2. Modelling reconciliation with transfers: papers on H-trees by Gorecki et. al., which seems the most related to the reconciliation of (G,alpha) with the network N.

3. The question of consistency and existence of reconciliations relates to "reconciliation feasibility problems" which seem to be a simpler version of consistency problems: given sigma mapping and a species tree S, the question is whether there is a gene tree for sigma that reconciles with the S. Also, there are more related questions e.g., on minimizing costs etc. See algorithmic papers by Eulenstein and others.

4. Another feasibility-related paper: see M. Helmuth, 2017.

https://doi.org/10.24072/pci.mcb.100108.rev12