Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Transfer of knowledge from model organisms to evolutionarily distant non-model organisms: The coral Pocillopora damicornis membrane signaling receptome

  • Lokender Kumar,

    Roles Investigation, Writing – original draft

    Affiliation Department of Chemistry, Colorado School of Mines, Golden, CO, United States of America

  • Nathanael Brenner,

    Roles Investigation, Visualization, Writing – original draft

    Affiliation Department of Chemistry, Colorado School of Mines, Golden, CO, United States of America

  • Samuel Sledzieski,

    Roles Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing

    Affiliation MIT Computer Science & Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, MA, United States of America

  • Monsurat Olaosebikan,

    Roles Software

    Affiliation Department of Computer Science, Tufts University, Medford, MA, United States of America

  • Liza M. Roger,

    Roles Investigation, Validation, Writing – review & editing

    Affiliation Department of Chemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA, United States of America

  • Matthew Lynn-Goin,

    Roles Software

    Affiliation Department of Chemistry, Colorado School of Mines, Golden, CO, United States of America

  • Roshan Klein-Seetharaman,

    Roles Data curation, Methodology, Software, Writing – review & editing

    Affiliation Yale University, New Haven, CT, United States of America

  • Bonnie Berger,

    Roles Investigation, Methodology, Supervision, Writing – review & editing

    Affiliation MIT Computer Science & Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, MA, United States of America

  • Hollie Putnam,

    Roles Funding acquisition, Investigation, Resources, Validation, Writing – review & editing

    Affiliation Department of Biological Sciences, University of Rhode Island, South Kingstown, RI, United States of America

  • Jinkyu Yang,

    Roles Funding acquisition, Investigation

    Affiliation Department of Department of Aeronautics & Astronautics, University of Washington, Seattle, WA, United States of America

  • Nastassja A. Lewinski,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Department of Chemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA, United States of America

  • Rohit Singh,

    Roles Conceptualization, Investigation, Methodology, Software, Supervision, Visualization, Writing – original draft, Writing – review & editing

    Affiliation MIT Computer Science & Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, MA, United States of America

  • Noah M. Daniels,

    Roles Data curation, Investigation, Methodology, Software, Writing – original draft, Writing – review & editing

    Affiliation Department of Computer Science and Statistics, University of Rhode Island, South Kingstown, RI, United States of America

  • Lenore Cowen,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Writing – original draft, Writing – review & editing

    Affiliation Department of Computer Science, Tufts University, Medford, MA, United States of America

  • Judith Klein-Seetharaman

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    jkleinse@asu.edu

    Affiliation Department of Chemistry, Colorado School of Mines, Golden, CO, United States of America

Abstract

With the ease of gene sequencing and the technology available to study and manipulate non-model organisms, the extension of the methodological toolbox required to translate our understanding of model organisms to non-model organisms has become an urgent problem. For example, mining of large coral and their symbiont sequence data is a challenge, but also provides an opportunity for understanding functionality and evolution of these and other non-model organisms. Much more information than for any other eukaryotic species is available for humans, especially related to signal transduction and diseases. However, the coral cnidarian host and human have diverged over 700 million years ago and homologies between proteins in the two species are therefore often in the gray zone, or at least often undetectable with traditional BLAST searches. We introduce a two-stage approach to identifying putative coral homologues of human proteins. First, through remote homology detection using Hidden Markov Models, we identify candidate human homologues in the cnidarian genome. However, for many proteins, the human genome alone contains multiple family members with similar or even more divergence in sequence. In the second stage, therefore, we filter the remote homology results based on the functional and structural plausibility of each coral candidate, shortlisting the coral proteins likely to have conserved some of the functions of the human proteins. We demonstrate our approach with a pipeline for mapping membrane receptors in humans to membrane receptors in corals, with specific focus on the stony coral, P. damicornis. More than 1000 human membrane receptors mapped to 335 coral receptors, including 151 G protein coupled receptors (GPCRs). To validate specific sub-families, we chose opsin proteins, representative GPCRs that confer light sensitivity, and Toll-like receptors, representative non-GPCRs, which function in the immune response, and their ability to communicate with microorganisms. Through detailed structure-function analysis of their ligand-binding pockets and downstream signaling cascades, we selected those candidate remote homologues likely to carry out related functions in the corals. This pipeline may prove generally useful for other non-model organisms, such as to support the growing field of synthetic biology.

Introduction

A bioinformatics functional genomics/proteomics pipeline for a newly sequenced non-model eukaryotic organism can (and should) seek to leverage the wealth of known information and annotation available for model species that are evolutionarily related. Due to the urgency of tackling species loss through environmental damage, many new non-model organisms are currently being sequenced, especially corals. Once the likely constituent genes have been identified [1,2], predicting if the relevant evolutionary functions of the non-model species genes are conserved or have diverged from their orthologous counterparts in the model species becomes directly relevant. Of course, there may be genes in the non-model organism that have no homologues, also referred to as the dark genome [3]. However, the goal of this paper is to maximally exploit existing knowledge from model organisms for the mining of non-model organisms for function.This is because this step is often quite difficult for organisms such as corals, where the host is 700 million years distant from the closest well-annotated model organism, even for many of the genes in closely related species to the model organism [4]. It is particularly interesting that coral animals contain a surprising number of human homologs missing from fly and C. elegans [5].

Corals are complex organisms consisting of an animal host (cnidarian), endosymbiotic dinoflagellate algae and a microbiome with more than 20,000 species of bacteria, bacteriophages, fungi and viruses, collectively referred to as holobiont [6]. We consider here, as a case study, the membrane receptor proteins in corals (host cnidarian). These are important families of proteins to study in coral cnidarian hosts because they have large potential to mediate the host’s interactions with its symbiont, microbiome and environment. Understanding these mechanisms in corals has become urgent. As a result of anthropogenic activities, both local and global, coral reefs (i.e. coral holobiont assemblages) are declining rapidly. Mass coral bleaching, or the expulsion of the symbiotic algae due primarily to thermal and photophysiological stress driven by marine heatwaves and high irradiance, is resulting in substantial coral mortality [7]. The IPCC Sixth Assessment Report states that under the current climate agreements (1.5°C warming, Paris Agreement signed in 2015) 70–90% of corals worldwide will be lost by 2050, and as much as 99% if warming reaches 2°C [8]. Thus understanding the relevant parts of the coral animal genome that are relevant to interactions with their environment has become an urgent bioinformatics task, made possible by the recent availability of coral genomes for multiple species of both animal and symbiont. Fortunately, large classes of membrane receptor proteins have been preserved over time [9], so there is a large body of prior knowledge that we can extrapolate from.

Given a protein of interest that has not been studied before, how much can be learnt from better studied proteins is a common question in biology. The answer lies in the sequence-structure-function paradigm, namely that a similar sequence usually maps to a similar structure and conserved function. This is the premise for the use of sequence alignments, for which many methods of analysis exist, the choice of which typically depends on how similar the sequences to be compared are.

For many years, BLAST [10] was the de facto standard homology search tool, as it provided comparable sensitivity to exact alignment methods such as Smith-Waterman [11] and early heuristics such as FASTA [12] with much faster runtime performance. However, these methods, and even iterative methods such as PSI-BLAST, are unable to detect homologs in the “twilight zone” of homology, between 10 and 30% sequence identity [13,14]. The development of profile Hidden Markov Models (HMMs) such as HMMER [15] and SAM [16] improved remote homology detection performance, but as they still attempt to score a single query sequence against a profile-based HMM, they may miss homologs deep into the “twilight zone.” A more recent advancement came in the form of HMM-HMM alignment [17] with HHpred in 2005, with further improvements resulting in HHblits [18]. These methods build a library of HMMs (or use an existing library) from families of nucleotide or protein sequences, and next build a query HMM from a multiple sequence alignment based on a BLAST-like search for similar sequences to the query, and use a variant of the Viterbi algorithm [19] to align the two HMMs according to maximum likelihood. HHblits was our choice for remote homology detection in this pipeline, as it combines state-of-the-art sensitivity to remote homologs with fast runtime performance, and is able to detect homologs in the range of 10–30% sequence identity. Given the roughly 700 million years of evolutionary divergence between cnidarians and humans, we could not assume that homologs would be of higher sequence identity. In order to avoid confusion with other organisms, we built a custom HHblits database for Pocillopora damicornis, and one of its symbiotic algae, Cladocopium goreaui (C1), formerly known as Symbiodinium (S.) goreaui (Clade C, type C1) [20] which we were able to query with human sequences of known function.

We know that corals adjust their behavior in response to external and internal cues, as illustrated by the following examples. Corals contract or extend their tentacles in response to light intensity [21]. Coral larvae are known to prefer red over white surfaces for settling [22]. Corals can fight, so they must be able to sense and attack the enemy or competitor [23]. Corals can distinguish organisms to tolerate (symbiosis) versus subjecting them to an immune attack to prevent disease [24]. Corals prefer to eat plastic over copepods which may relate to a sense of taste [25]. Since corals manage 90% of their energy from symbiotic algae [25,26], they must also be able to measure and regulate nutrient balance. The key question that arises from these observations is: what are the molecular mechanisms underlying these behaviors?

Generally, signal sensing and response reactions in biological systems depend on membrane receptor signaling systems. Receptor activation involves the detection of the signaling molecule (ligand) outside the cell when the ligand binds to the receptor protein present on the surface of the cell (Fig 1). The signal transduction stage involves the activation of the receptor (conformational change) leading to the chain reaction of the activation of intracellular proteins. These signal transduction cascades trigger specific cellular responses. Thus, membrane receptors most generally are proteins that are coupled to intracellular signal transduction cascades. There are two types of membrane receptors, sometimes referred to as type I and type II receptors (Fig 1). Type I membrane receptors usually contain one or two transmembrane (TM) helices, and often carry enzymatic activity in their cytoplasmic domains, such as tyrosine kinase receptors like the epidermal growth factor receptor. Type II membrane receptors are G-protein coupled receptors (GPCR) which exclusively contain a seven-TM helical bundle. There are also ion channels which change their permeability in response to external signals, but are usually not classified as receptors [27]. Membrane receptors have three major structural domains: extracellular (EC), transmembrane (TM), and cytoplasmic (CP). We will discuss one example in depth for each of the two types of membrane receptor proteins. For the type II membrane receptor family, we have chosen the GPCR sub-family of opsins. As an example, for type I membrane receptors, we will discuss Toll-like receptors (TLRs).

thumbnail
Fig 1. Schematic representation of type I and type II membrane receptors in signal transduction.

Type I membrane receptors (left) typically possess one transmembrane helix per chain and usually form dimers. They may contain enzymatic activity in the cytoplasmic domain, such as a kinase, or they may recruit a soluble enzyme (blue circle) through protein-protein interaction. Type II membrane receptors are G protein coupled receptors (GPCR) which are characterized by seven transmembrane helices. All receptors (type I and type II) bind to ligands in the extracellular domain, transmit the signal via the transmembrane domain to the cytoplasmic side where they initiate appropriate signal transduction cascades.

https://doi.org/10.1371/journal.pone.0270965.g001

GPCRs respond to a large diversity of signals, from light in the opsin family to binding of hormones, neurotransmitters and even other proteins [28]. Correspondingly, in humans, these receptors take a premier role as pharmaceutical targets and have thus been extensively studied [29]. Signal transduction by GPCRs is via binding and activation of heterotrimeric G proteins composed of alpha, beta and gamma subunits [30]. GPCRs are responsible for the majority of cellular responses to external stimuli. Upon activation by a ligand, the receptor promotes the exchange of GTP for GDP in its partner heterotrimeric G protein complex, leading to the dissociation of the alpha and beta gamma complexes from the receptor and each other. The G proteins then interact with other downstream effector proteins to mediate a cell response. In this study, we have used our remote homology detection pipeline to find human GPCR as well as G protein homologs in P. damicornis and C. goreaui.

TLRs play a crucial role in innate immunity [31]. These membrane glycoprotein receptors recognize and respond to a variety of microbial (viral, fungal, and bacterial) components such as lipopeptides [32], peptidoglycans and lipopolysaccharides [33], flagellin [33,34], DNA [35], and RNA [36]. TLRs consist of leucine-rich repeats (LRR), and the Toll/interleukin-1 receptor (TIR) domain [37]. TLRs initiate signal transduction through interactions with TIR-domain containing adapters such as MyD88 [38], that in turn recruits interleukin-1 receptor-associated kinase (IRAK) via interactions through death domains [39]. After phosphorylation, IRAK family proteins interact with the TRAF6 adapter. TRAF6 activates TAK1, a member of the mitogen-activated protein (MAP) kinase family [40], leading to the activation of NF-kB via kinase dependent signaling cascade involving IkB kinase complex (IKK-⍺,-β, and -γ) and MAP kinases (ERK, p38, JNK), resulting in the expression of target genes [41]. Viruses and bacteria are abundant in seawater and live in close association with corals [42]. The chemical crosstalk between corals and microbes plays an important role in coral growth and development. Corals need TLRs to communicate with microbes and it has been proposed that TLR signaling is conserved in corals [43]. In the present study, we used our remote homology detection pipeline to find human TLR protein homologs in P. damicornis and C. goreaui to advance our understanding of coral innate immunity.

Using these two representative membrane receptor families, we will demonstrate that the stated problem of identifying remote homologues in non-model organisms is not just a remote sequence detection problem, but highlights the need for functional investigation of the putative proteins, which involves both systems and structural aspects of these proteins and their interaction partners. To this end, we focus on those positions in the alignments between model and non-model organism homologues that determine specificity of the respective protein’s functions. We will discuss below first the common elements of the pipeline, how we determine the pool of all membrane receptors in P. damicornis, how we subdivide this broad group into type I and type II membrane receptors, and how we refined these two multi-subfamily groups into functional classes by analysis of two of them, TLRs and opsins, by way of identifying the specificity determination positions, followed by analysis of the downstream signaling proteins.

Materials and methods

A github repository is available for this project accessible at https://github.com/judithks/corals

(a) Protein sequence retrieval

All non-P. damicornis and C. goreaui sequences were retrieved from the UniProt—Swiss-Prot Protein Knowledgebase, SIB Swiss Institute of Bioinformatics; Geneva, Switzerland (https://www.uniprot.org/). P. damicornis sequences [44] were downloaded from http://pdam.reefgenomics.org/download/. C. goreaui sequences [45] were obtained from http://symbs.reefgenomics.org/download/. P. damicornis sequences are referenced by their numbers only, so for example, pdam_00017423-RA will be referred to as 17423.

Specific subgroups of sequences were identified and retrieved as follows and subjected to HHblits analysis as described in section (b).

(i) Human membrane receptor list.

We have utilized two human membrane protein lists. The first one was published as the human membrane receptome in 2003 [46]. We subjected the original list to an updated search in UniProt and retrieved 978 current UniProt entries, including GPCRs, provided in S1 File. The second was generated by using three alpha helix prediction tools followed by filtering of splice variants and clustering of the remaining genes [47]. This list contains a total of 3,399 genes, including GPCRs, and is available from their supplement. The mapped P. damicornis entries (see below) are available in S2 File.

(ii) Human GPCR list.

A comprehensive list of human GPCRs was extracted from an updated UniProt list of multi-species GPCRs (release: 2020_03 of 17-Jun-2020: 825 proteins (https://www.uniprot.org/docs/7tmrlist.txt). This list contained a total of 3093 GPCR sequences including 825 human GPCRs. See S3 File for the extracted human GPCR sequences. The mapped P. damicornis entries (see below) are provided in S4 File. Note that this file only contains the top ranked HHblits hit in each case for clarity. In each search, there are usually multiple highly ranked hits and they have been included in the discussion of this paper where applicable.

(iii) G protein list.

Sequences of human G proteins were obtained from UniProt using keyword search, resulting in 16 alpha chains, 5 beta chains, and 12 gamma chains [30]. The complete list of these human proteins and their candidate P. damicornis homologues is provided in S5 File.

(iv) Toll-like receptor list.

Protein sequences of TLRs of human [31] and other organisms, namely Drosophila [48], chicken [49], frog [50], zebrafish [51] and C. elegans, were retrieved from UniProt. TLR downstream signaling molecules (MyD88, TIRAM, TIRAP and TRIF) were also acquired from UniProt. The complete list with all the protein sequences used is provided as S6 File. The extracted P. damicornis TLR list is provided as S7 File (note there is a separate sheet for each organism).

(b) Remote homology detection using Hhblits

To enable organism specific searches, we created an online HHblits coral protein remote homology search tool (https://hhblits.cs.tufts.edu/), an installation of HHblits [18] with genomic databases built for P. damicornis and separately for C. goreaui. Individual FASTA sequences were imported to this search tool and queries were run with an E-value cutoff of 10−3, single iteration, minimum probability of 20 (default), and with the minimum number of lines to show in the hit list expanded from 10 to 250. The jobID, email information, database information (e.g. P. damicornis) were submitted and the result output file was received by email or as batch predictions. Individual predictions contained the HHblits results of the submitted proteins in text format including protein sequence alignment, E-value, P-value, probability, column matched and score. In the case of GPCR, the result output was analyzed in a bidirectional fashion as follows. First, the top ranked P. damicornis hit was retrieved for each GPCR. Then a list of unique P. damicornis proteins were created from that, and the corresponding human GPCRs for which they appeared as top hits, provided as S4 File. For membrane receptors, the non-redundant lists of top ranked P. damicornis hits retrieved in the same way as for GPCRs are provided for both lists (S2 File). In the case of TLRs and G proteins, HHblits results were analyzed manually (see Results). For TLRs, top hits with 100% probability score were selected and analyzed separately for each model organism, i.e. human [31], Drosophila [48], chicken [49], frog [50], and zebrafish [51], before comparing them across organisms (Table 1). The list of P. damicornis hits for TLRs are provided as S7 File. A summary based on the frequency of occurrence of P. damicornis hits is provided in Table 1.

thumbnail
Table 1. Number of times a model organism shows homology to a given P. damicornis protein with 100% probability.

Columns highlighted in green show at least one representation in each model organism studied.

https://doi.org/10.1371/journal.pone.0270965.t001

(c) PROSITE analysis

The most highly ranked P. damicornis sequences retrieved through HHblits were subjected to PROSITE analysis [52] to identify the presence of conserved domains (https://prosite.expasy.org/). Protein sequences were submitted to the PROSITE user interface and the results were analyzed and grouped by combining the domain schematic provided by PROSITE.

(d) Transmembrane helix detection

The presence of transmembrane helices was predicted using TMHMM Server v. 2.0 (DTU bioinformatics, Department of Bio and Health Informatics) (https://services.healthtech.dtu.dk/service.php?TMHMM-2.0).

(e) Homology-modeling

Homology models were generated using Swiss Model (https://swissmodel.expasy.org/), an integrated web-based service dedicated to homology modeling of proteins [53]. We used the target-template alignment function of Swiss Model and provided the reconstruction of the full TLR5 (3J0A) combining crystallographic and cryo-electron microscopy data created by [54] as a template to model P. damicornis 9200. We have also modeled our potential matches for opsins (629, 2270, 12246, and 19775) using squid rhodopsin as a template (2ZIY) [55]. The models were evaluated for their global quality estimate and local quality score as per Swiss Model guidelines. The models were downloaded and analyzed using PyMOL (version-2.3.4, Schrodinger, LLC). These proteins were further structurally analyzed for ligand binding pocket and Ballesteros-Weinstein numbering system. Ligand binding pocket residues in GPCRs were extracted from previous chemogenomic analysis [56].

(f) Molecular docking studies of retinal with coral opsin homologs

Molecular docking was performed using retinal as the ligand with AutodockTools 1.5.6 [57]. We standardized our docking experiments using squid rhodopsin with the following parameters: center_x = 43.171, center_t = 6.216, center_z = 17.019, size_x = 16, size_y = 22, and size_x = 26. Docking was performed by extraction after aligning the homology model with the squid rhodopsin space coordinates. The ‘exhaustiveness’ option was set as 32.0. The binding pocket was analyzed using the Biovia discovery suite 2019 v19.1.0.18287 (Dassault Systemes Biovia Corp). Residues involved in the interactions for each model are listed in S1 Table.

(g) Multiple Sequence Alignment of P. damicornis

Potential P. damicornis members of respective protein families were aligned to each other using MUSCLE [58] (https://www.ebi.ac.uk/Tools/msa/muscle/) to examine if structurally relevant amino acids were conserved across family members. As an example, the alignment of opsin homologues in P. damicornis is shown in S1 Fig.

(h) D-SCRIPT analysis

We initially identified candidate homologs in P. damicornis for the human alpha, beta, and gamma G proteins using HHBlits [18] (hhblits.cs.tufts.edu). We took the union of top hits to identify 124 candidate alpha proteins, 207 candidate beta proteins, and 5 candidate gamma proteins. We used the human pre-trained D-SCRIPT [59] model to predict interaction between all pairs of alpha-beta, beta-gamma, and alpha-gamma subunits. We performed the same analysis in Montipora capitata [60], where we identified 184 candidate alpha proteins, 253 candidate beta proteins, and 4 candidate gamma proteins. We created a mapping between P. damicornis and M. capitata proteins using BLAST [10] and identified best-bidirectional-hits, i.e. a pair (P,M) map to each other if M is the best BLAST hit in M. capitata for P and vice versa. We overlay the P. damicornis and M. capitata networks with each other using the mapping and identify as network-evidence candidate proteins where pairwise interaction is predicted between an alpha, beta, and gamma subunit forming a triangle in the network.

Results

(a) Development of a pipeline: Addressing the challenges with existing methods

An overview of the pipeline we developed to address the goal of identifying the repertoire of membrane receptors in the non-model organism, the coral P. damicornis, using known information from the model organism, human, is shown in Fig 2. The first step is to create a list of human proteins representing the function of interest, here membrane receptors. A list of membrane receptors in human had been published previously [46], but when retrieving the sequences from the UniProt database, several entries were no longer valid. We manually retrieved the updated UniProt ID’s by searching the database through protein names. The list of human membrane receptors obtained is available as S1 File. This list contains 978 human proteins. We also used another published list of membrane receptors [47], which contained 1352 human proteins reported using an outdated identifier format. Finally, a list of human GPCRs was extracted from a GPCR list in any organism from the UniProt database available at (https://www.uniprot.org/docs/7tmrlist.txt).

thumbnail
Fig 2. In silico protein prediction methodology.

We employ a multi-step process which combines computational and manual filtering to identify putative functional candidates in coral. For a protein family of interest, a small number of human proteins are manually selected, and a larger number are computationally selected. We perform two remote homology database searches using HHblits—the manually selected human proteins against the database of coral proteins, and the coral hits against the human protein database. We consider a coral protein a candidate only if it is a best- bidirectional match—if the human protein it matched with was the highest likelihood hit in both directions of the search. The process to this point can be fully automated. However, further verification of the coral candidates requires domain-specific knowledge and involves an in-depth analysis of the structural viability of the candidate through both traditional (see Figs 3 and 4) and machine learning (Figs 5 and 6) methods.

https://doi.org/10.1371/journal.pone.0270965.g002

Initially, we used BLAST, as well as several multiple sequence alignment (MSA) based tools to retrieve P. damicornis homologues, but after inspection of the results we concluded that the alignments between human and P. damicornis sequences were poor as judged by the number of gaps and the fractions of sequences aligned (data not shown). We concluded that due to the low sequence similarity between human and P. damicornis, we required a remote homology detection tool. Hhblits is the accepted gold standard for remote homology detection [18]. Thus, all three human membrane receptor lists were searched against P. damicornis using our implementation of HHblits, available at https://hhblits.cs.tufts.edu/, where our implementation of HHblits contains genome-wide template libraries of HMM models of all the genes in several coral hosts and symbiont genomes (principally P. damicornis, M. capitata, plus the clade C1 symbioint).

When mapping these three sets of lists to the P. damicornis sequences using HHblits, it became clear that the same genes mapped to multiple human query sequences. This is because there are many members of membrane receptor superfamilies, e.g. receptor tyrosine kinases or GPCRs in each organism itself already. We therefore created non-redundant lists of the top ranked P. damicornis sequences. We obtained 374 unique P. damicornis sequences from the 978 human plasma membrane receptome (older list, based on [46]), 329 from the 1352 human membrane proteome (newer list, based on [47]) and 151 from the 825 human GPCRs (based on (https://www.uniprot.org/docs/7tmrlist.txt)). These lists and their respective overlap is provided in S2 File, labeled “old membrane”, “new membrane” and “GPCR”, respectively. Combining the three lists and eliminating duplicates appearing in more than one of the lists yielded a total of 446 unique P. damicornis sequences. While all 151 GPCRs were found in both membrane receptor lists, there were 111 and 50 non-GPCR receptors missed if we considered only the older human receptor list as compared to the newer one. Thus, we conclude that there are 295 non-GPCR and 151 GPCR candidates in the P. damicornis membrane receptome.

It is important to note that, in the majority of cases, none of the quantitative parameters of the sequence alignment was able to differentiate between the hits. In other words, it was not possible to automatically assign a best human homologue for any given P. damicornis sequence. This is because different members of protein families have diverged in humans, and are more similar to each other, than they are to any P. damicornis sequence, which is evolutionarily most distant to all of them.

Given the fact that many more human sequences map to a smaller set of sequences in P. damicornis raises the question which functional sub-categories of these superfamilies are present in P. damicornis. Clearly, sequence alone is not sufficient to answer this question, and therefore identifying which of the functionalities of a given sub-family of membrane receptors is present in corals requires an analysis of known functional properties. This requires domain expertise and can no longer be fully automated in contrast to all previous steps (Fig 2). This final step of the pipeline is demonstrated for two examples below and necessarily branches according to the functions of the proteins of interest. The opsin subfamily was chosen as a representative example for the type II membrane receptor, the GPCR pipeline (section (b) below). The TLRs were chosen to represent the type I membrane receptors (section (c), below).

(b) GPCR (type II receptor) pipeline branch

(i) Global analysis of GPCR families.

The large GPCR family has been divided into subclasses based on a combination of pharmacological and sequence considerations, a classification which has been revised a number of times over the years [56]. Here, we use the clusters obtained with a chemogenomics approach based on the alignment of 30 critical GPCR positions supposed to face the ligand binding cavity [56]. Major classes besides olfactory receptors are Frizzled, Glutamate, Secretin and Adhesion families with the Class A Rhodopsin family being split into 18 different clusters. When analyzing the human receptors for which P. damicornis sequences were found, these included chemokine, taste, glutamate, adrenergic, lipids, peptides, adenosine, amines, melanocortins, acids, chemoattractants, purines, frizzled, adhesion, prostanoids, and MAS-related receptors. Likely missing are melatonin receptors (with 2 low ranked options), vasopeptides, brain-gut peptides, SREB, secretin, opioid and glycoproteins. In all cases there are multiple human sequences that map to multiple (but less than humans) P. damicornis sequences. Detailed below is the evidence that P. damicornis corals have sensory receptors analogous with smell, taste and light perception in sections (ii), (iii) and (iv), respectively.

(ii) Odorant receptors in P. damicornis.

We found 208 human olfactory receptors in the list of 825 human GPCRs that map to unique pdam genes 17423, 20860, 17300, 5244, 5376 and 16463. This suggests that there are at least 6 olfactory receptors in P. damicornis.

(iii) Taste receptors in P. damicornis.

HHblits suggests that there are 15 remote homologs for the human taste receptors in the P. damicornis proteome: 04028, 09436, 00629, 02659, 13619, 21435, 22798, 17219, 01145, 13621, 10275, 04281, 02512, 20500, and 16973. 02512, and 09436 have a large N-terminal domain of approximately 500 amino acids preceding the 7 transmembrane helical GPCR motif. This architecture is similar to what is found in human metabotropic glutamate receptors, where the soluble glutamate binding domain precedes the 7 transmembrane helical motif, but the added domains in the above two P. damicornis putative GPCR sequences show no similarity with the human glutamate binding domain.

(iv) Vision receptors (opsins) in P. damicornis.

The HHblits result showed that the 11 members of the human opsin family mapped to 7 P. damicornis sequences. However, there were other members of the Class A GPCR family that also mapped to the same P. damicornis sequences. To find out which of the HHblits remote homology predictions were possiblerepresentations of visual functions, we used the structure-function paradigm and analyzed the ligand binding pockets of these proteins, and the results are displayed in Figs 3 and 4. Light detection is the main function of rhodopsin. Therefore, the homologous proteins must have an ability to bind with a light sensitive retinal ligand and should possess a conserved ligand binding pocket. This pocket should show sequence similarity with human opsin protein. Considering this hypothesis, we analyzed the human opsin ligand binding pocket and compared our analysis with P. damicornis opsin homologues. We obtained the profiles of 30 cavity-facing amino acids used for clustering of the human GPCRs including opsin [56], shown in Fig 3A. Thus, the approach outlined here for opsins can be extended to other GPCRs. The first step was to verify the presence or absence of these amino acids important for ligand binding in the P. damicornis sequences for the opsin family. The 30 residues are spread out throughout the sequence, and the Ballesteros-Weinstein numbering scheme used to identify aligned positions in GPCRs clearly showed that these residues are located across the 7 transmembrane helices (Fig 3A). We compared these 30 residues against the top hit for OPSD and OPN5, and the result showed that only 11 out of 30 residues are the same between OPSD and their P. damicornis homologue.

thumbnail
Fig 3. Sequence alignment and active site analysis of rhodopsin.

(A) Alignment of human rhodopsin with remotely homologous coral proteins, only showing the positions of those amino acids that are in proximity to the ligand in the opsin family (ref). TheBallesteros-Weinstein numbering scheme is used to enable comparison to other members of the GPCR family. Positions are colored according to the degree of similarity: White foreground/black background, 100%; white foreground/grey background, > 80%, black foreground/grey background, > 60%. (B) Active site of bovine rhodopsin crystal structure 1L9H (>99% homologous to human rhodopsin) showing the residues involved in the active site shown in (A). (C) Active site showing the interaction of the ligand and vitamin A derivative, retinal, which makes rhodopsin light-sensitive, with Lys296 and Glu113 (also labeled in A,B).

https://doi.org/10.1371/journal.pone.0270965.g003

thumbnail
Fig 4. Do the putative coral opsin proteins have a propensity to bind retinal which would make them light sensitive? To address this question, we created homology models and conducted molecular docking of the “rhodopsin” receptors with retinal.

(A) Crystal structure of squid rhodopsin (2ZIY). Squid is the evolutionarily closest organism to corals of those for which rhodopsin structures exist. (B) Expanded view of the retinal (shown in green in all structures A-H) in the squid crystal structure shown in (A). As a control, the same retinal was docked to the structure andthe docked conformation is shown in pink. (C, D) Molecular interactions of retinal in the crystal structure vs. docked conformation. (E, F, G, H) Homology modeling and docking studies with coral homology models 629, 2270, 12246 and 19775, respectively. In all cases, panels are labeled as follows: (i) homology model, (ii) retinal confirmations: Squid(green) vs. docking pose in coral opsin (magenta), (iii) molecular interaction of docking pose. Details of interactions and the docking scores are provided in S1 Table.

https://doi.org/10.1371/journal.pone.0270965.g004

Further, we had previously determined the minimal ligand binding pocket that is capable of exerting the function of a GPCR: transmission of the ligand binding signal to the downstream signaling proteins [61]. The approach used GREMLIN, Generative Regularized Models of Proteins, to identify longnge interactions from co-varying amino acid positions in multiple sequence alignments [62]. GREMLIN learns an undirected probabilistic graphical model known as a Markov Random Field (MRF). Unlike HMMs, which are also graphical models, MRFs can model longnge couplings (non-sequential residues). We performed GREMLIN analysis on GPCRs, statistically evaluated different sizes of ligand binding pockets and found that a pocket as small as 4 residues still shows significant enrichment of edges over null. This means that four residues connect maximally to the rest of the protein orchestrating the global conformational change inherent to GPCR function and are required for signal transduction. Our analysis showed that 3 of these 4 residues are conserved across human opsin and the P. damicornis homologue. This supports the conclusion that this protein is a functional GPCR.

Finally, GPCRs involved in vision require a lysine in a specific location of the ligand binding pocket to covalently attach via a Schiff base. The Schiff base is stabilized by another residue in the sequence, that is glutamic acid (E113), forming a counter ion to lysine K296, (highlighted by red arrows in Fig 3B). We can see that the P. damicornis homologue contains the lysine required for retinal binding, but has a tyrosine instead of glutamic acid in the counter ion position. This substitution is expected to cause a shift in the absorbance maximum but not a loss of function as a light-sensitive protein. This finding supports the hypothesis that this P. damicornis sequence represents a light-sensitive opsin protein, likely with an absorbance maximum different from that of rhodopsin and more like the OPN5 protein, which also has a tyrosine at this crucial position. Repeating this process for all of the P. damicornis homologues allowed us to eliminate proteins that did not fit the requirements of GPCR action or retinal binding ability and narrow down the most closely related opsin protein. We predict that there are 4 P. damicornis opsin proteins (in contrast to 11 human opsin proteins) and these are most similar to OPN5, OPSG, OPSX and OPSR.

(v) Homology modeling and molecular docking studies with opsin homologs.

To confirm the structure and functional relationship of rhodopsin with putative coral rhodoposin homologs, we have performed structural modeling of the top coral rhodopsin hits using the squid rhodopsin structure as a template (Fig 4). We carried out molecular docking to these homology models that suggested that the retinal was indeed able to bind with the putative ligand binding pocket in the coral rhodopsin homologous structures. This suggests that these are strong candidates to be considered as coral rhodopsin homologs and may play an important role in light sensing mechanisms. We have summarized the details of the retinal interactions with squid rhodopsin and coral rhodopsin homologs in S1 Table.

(c) GPCR binding partner: G Proteins

To gather further evidence that the proteins identified were opsins, we investigated their signal transduction function, namely binding and activation of the G proteins upon ligand binding (or light activation of the retinal ligand in the case of opsins). G proteins are heterotrimeric proteins consisting of alpha-, beta- and gamma-domains. There are 16 different alpha chains, 5 beta chains, and 12 gamma chains in humans [30], and their UniProt ID’s are provided in S5 File, tab 1. HHblits search of the alpha and gamma chains yielded six potential P. damicornis alpha chains and five potential gamma chains, see S5 File, tab 2. We were unable to retrieve potential beta chains reliably, due to the beta propeller structure of the beta chains being a very common structural motif found across multiple protein families.

  1. (i) Multiple sequence alignment of P. damicornis alpha chains. The potential P. damicornis alpha chains were aligned with the human G-alpha subunit that binds to rhodopsin, called transducin, with gene symbol GNAT1 using MUSCLE (see Methods) to examine if structurally relevant amino acids were conserved. Amino acids of GNAT1 which interact with rhodopsin, GNB1, and GTP were identified from co-crystal structures (PDB ID: 6OY9, 1TAD) by selecting residues within 5 Å of the relevant structure in PyMOL. These residues are highlighted in Fig 5A using boxes colored orange, blue, and green, respectively, corresponding to the structures shown in Fig 5B and 5C. The alignment shows that the GTP binding pocket residues are the most conserved, the receptor binding residues are the least conserved, and the beta chain binding residues are more conserved than the receptor binding residues but less conserved than the GTP binding residues. This is to be expected, as the GTP pocket will need to bind to the same structure in corals as it does in humans, whereas the receptor and beta chain binding residues will need to bind to coral receptor and beta chain homologues, which will be structurally different. Because the conservation patterns are similar across all 5 P. damicornis G-alpha candidates, we are not able to select a top candidate from this pool that is most likely representative of the rhodopsin-binding G-alpha subunit, transducin (GNAT1).
thumbnail
Fig 5. Identification of putative G protein subunits as candidates of immediate downstream signaling partners for GPCRs.

(A) MSA of GNAT1 with coral homologues and crystal structures of GNAT1. Amino acids of GNAT1 which interact with rhodopsin, GNB1 (beta chain), and GTP are boxed in blue, red, and green, respectively. The same amino acids are also listed in the table. In the crystal structures, GNAT1 (cyan) binding interfaces with rhodopsin (orange) and the beta chain (blue) are shown in (B) and the GTP binding pocket (green) is shown in (C). (D) Predicted interactions in P. damicornis between alpha (cyan), beta (blue), gamma (orange) chains and rhodopsin (orange). (E) Predicted candidate G protein interactions in P. damicornis, and the predicted interactions of the corresponding candidates in M. capitata (solid line: Predicted interaction; dashed line: Best-bidirectional BLAST hit).

https://doi.org/10.1371/journal.pone.0270965.g005

  1. (ii) Network analysis of putative P. damicornis G protein subunits. To overcome the challenge that sequence alignment alone was not sufficient to narrow down the choices of P. damicornis sequences related to G proteins, we leveraged computational prediction of protein-protein interactions (PPI) to characterize the relative likelihood of the top P. damicornis hits being truly involved in GPCR binding activity (Fig 5D and 5E). Applying D-SCRIPT [59], a recently introduced sequence-based deep learning model for PPI prediction, we performed two complementary PPI analyses. In the first, we performed an all-vs-all computational screen of interaction between all the candidate G-alpha, beta and gamma proteins in P. damicornis, reasoning that the true positive hits will display the expected interaction patterns of alpha-beta and beta-gamma binding. Our predicted PPIs broadly conformed to this expectation, with 17900, 11071, 23984, 07710, 14456, 11840 being the strongest candidates for alpha subunits, 00168 the strongest candidate for a beta subunit, and 00526 the strongest candidate for a gamma protein (see Fig 5D and S5 File, tab 2). To gain further confidence in our estimates, we also performed a similar screen of G protein subunits in M. capitata (Fig 5E) and assessed if homologous pairs across the two coral species (estimated by a bidirectional best hit analysis) had similar PPI patterns (see Methods). These results further supported our estimate, as the same connectivity patterns observed in P. damicornis are also observed between homologues in M. capitata (Fig 5E).
  2. (ii) Conservation of structural interfaces in G protein complexes. To further confirm the predicted G protein complex compositions, D-SCRIPT was then used to assess the structural plausibility of individual G-alpha candidates in P. damicornis. We performed an in silico mutagenesis study of each G-alpha candidate, evaluating these mutations by how they changed the D-SCRIPT score for interaction with candidate G-beta proteins 00168 and 14586 (Fig 6). Reasoning that a conservative test would be to require the same binding mechanism as seen in human G-alpha-beta interaction, we first aligned candidate G-alpha protein sequences in P. damicornis against human G-alpha proteins GNAT1, and used the multiple sequence alignment to identify the residues in coral G-alpha where the corresponding location in GNAT1 is known to play a role in binding to G-beta proteins. We tested if random in silico mutations at these locations had a particularly deleterious effect on the likelihood of PPIs with coral G-beta candidates (average of 50 trials). We compared these likelihoods to the original predicted probability of interaction, and the predicted probability of interaction after in silico mutations at random sites (average of 50 trials). Two candidates, 14456 and 11071, showed sharply decreased likelihood of a PPI occurring specifically when the sequences were mutated at putative binding locations—suggesting a binding mechanism similar to human G-alpha-beta interaction is conserved in determining the interaction likelihood of coral G-alpha-betas.
thumbnail
Fig 6.

Predicted interaction of candidate alpha proteins with beta candidate proteins 00168 (A), 14586 (B). Original: Original predicted probability. Permuted: Average predicted probability of 50 samples with 25 beta-binding residues randomly perturbed. Random: Average predicted probability of 50 samples with 25 randomly chosen residues randomly perturbed.

https://doi.org/10.1371/journal.pone.0270965.g006

(d) Toll-like receptor (TLR, type I receptor) pipeline branch

  1. (i) TLRs in P. damicornis. After our investigation of putative GPCR (aka type II receptor) proteins and pathways in P. damicornis, we next investigated a representative of the type I receptors, namely TLRs. Because there are large differences in the numbers of TLRs in different organisms, we retrieved TLR sequences not only from humans [31], but also zebrafish [31,51], frog [50], chicken [49] and Drosophila (Toll proteins) [63]. After subjecting these sequences to HHblits prediction, we extracted all coral proteins that showed a minimum of 100% probability, and grouped them by their homology to a given organism. The results are summarized in Table 1. P. damicornis proteins 22934, 22930, 11599, 9200, 14109, 17966, and 13021 were observed as homologues to at least one TLR from each of the five model organisms studied. Because many type I receptors are often composed of multiple domains, we then subjected these proteins to PROSITE analysis in addition to homology modeling.
  2. (iii) PROSITE analysis showed similar domain signatures in P. damicornis proteins. PROSITE analysis was used to identify domain structures within putative TLRs. Fig 7A illustrates that in human TLRs, there are multiple copies of leucine rich repeat (LRR) domains present on the extracellular side of the receptor and a Toll/interleukin-1 (IL-1) receptor (TIR) domain in the intracellular side in human TLRs. In contrast, P. damicornis 22934, 22930, 15883, 15877, 11734, and 11599 were devoid of LRRs and only displayed the TIR domain (Fig 7B). This suggests that these proteins may not belong to the typical TLR family. On the other hand, PROSITE analysis of P. damicornis 14109 and 17966 homologues showed a large number of LRRs, plus a cadherin domain, EGF_CA (Calcium-binding EGF domain), Thrombospondin, type 3 repeat (TSP3), Thrombospondin C-terminal domain profile (TSP_CTER) domains on the extracellular side and the TIR domain on the intracellular side. Because of these additional domains, it is possible that these are TLRs but with different or additional functions as compared to their endosomal human counterparts. A third domain composition was revealed by PROSITE analysis of P. damicornis 737 which only had LRRs and no TIR domain. Therefore, this homologue was rejected as a TLR candidate. The only domain composition similar to that of human TLRs was observed for P. damicornis homologue 9200, which included multiple LRRs in the extracellular domain and a TIR domain in the cytoplasmic domain. For further analysis, we selected 9200 for homology modeling due to its matching profile with human TLRs.
thumbnail
Fig 7. PROSITE analysis of human and coral TLRs.

(A) PROSITE domain mapping of human TLRs. (B) PROSITE domain mapping of possible P. damicornis TLR homologues.

https://doi.org/10.1371/journal.pone.0270965.g007

  1. (iii) Structural similarities between human TLR5 and P. damicornis TLR. Swiss Model matching for query P. damicornis protein 9200 retrieved human TLR5 (3j0a) as the top hit. We refined the model by removing several missing regions from the coral TLR model (Fig 8). Using TMHMM with 9200, we identified a transmembrane helix region (649–671) between the extracellular and the intracellular region, supporting the organization as a typical type I receptor.
thumbnail
Fig 8. Coral TLR homology model based on the full-length TLR5 structure model with pdb id 3j0a LRR: Leucine rich repeats.

(A) Structure reconstruction of the full-length human TLR5 protein (3j0a). (B) Homology model of P. damicornis 9200 protein using the human TLR5 structure reconstruction model as a template.

https://doi.org/10.1371/journal.pone.0270965.g008

  1. (iv) MyD88 homologue as the possible downstream partner. TLRs utilize multiple adapter proteins to transmit the signal to the inside of the cell, namely MyD88, TIRAM, TIRAP and TRIF adaptor proteins. Our HHblits results of these proteins showed presence of a coral homologue only for MyD88.

Discussion

We present a general pipeline applicable to any non-model organism to explore structures and functions based on detectable homology to critical proteins in humans or other model organisms (Fig 2). Using three case studies, we demonstrate customized, structure-based approaches to select functional candidates from the pool of homologs. In the case of GPCRs, we looked at specificity determining positions [64,65], which can then be confirmed using 3D structural modeling, to check that the active sites and binding pockets that are expected, should the functional role of the protein be conserved, are indeed present. In the case of G proteins, we derived function based on protein-protein interactions, where we have used our recent deep learning method [59] to perform in silico mutagenesis studies to further help us distinguish sequences which are likely to allow us to correctly transfer functional annotation from their human homologs from those that may have other functions. In the case of TLRs, we have used PROSITE to identify the presence or absence of entire domains within the sequences, and used expert knowledge to evaluate if the location of these domains matches the functional expectation (e.g. if the domain faces the inside or outside of the cell).

Generally, the availability of three-dimensional structures for one or more members of the protein family of interest opens the possibility for in-depth structure modeling using homology. Although the quality of homology models depends on the degree of sequence conservation, structurally aligning remote sequences goes a long way in interpretation of their structure conservation as long as we can have confidence in the alignment. For example, despite near negligible sequence conservation between rhodopsin and metabotropic glutamate receptors, the most distant members of the GPCR family, we were able to predict the pharmacological outcome of ligand binding based on structural alignment [66]. With the continuous improvements in de novo structure prediction [67], this task will become more and more feasible even for proteins that do not have any homologous structures available, i.e. for which no structural template can be found.

One challenge not currently addressed by our pipeline is the identification of de novo proteins with de novo functions. While homology modeling allows us to detect proteins that are present (with mutations) in both species, we acknowledge that just as some proteins novelly evolved in vertebrates and are not present in cnidarians, there is the possibility of complexes that novelly evolved in corals, which this approach cannot detect.

The results of our study have several biological implications. Cnidarians as evolutionarily early animals offer great opportunities to study the evolution of important biological functions, such as sensing and signaling, and understanding host-microbe communication.

TLRs are used for molecular communication with microbes. These receptors interact with specific microbial antigens and play a major role in innate immunity. We obtained strong evidence for TLR presence in P. damicornis, in line with several previous reports suggesting the presence of TLR mediated signaling in corals. Analysis of the Nematostella vectensis genome (sea anemone, cnidarian model system) indicated the presence of immune related genes, membrane attack proteins, complement pathway associated signaling molecules [68,69] and one TLR associated with stinging cells (cnidocytes [44,70,71]. RNA-Seq data also supports the ability for immune responses in corals [44,70]. One study showed that the muramyl dipeptide (MDP), a bacterial cell wall component, was found to trigger the up-regulation of GTPases of immunity-associated proteins in Acropora millepora [72]. Finally, the transcriptomic expression during pathogen challenge with Vibrio sp. in Acropora cervicornis showed two TLR2 homologs and the adaptor molecule TNF receptor-associated factor 3 (TRAF3) were up-regulated [73,74]. We now add detailed protein structural analysis to these reports, and also provide evidence that P. damicornis carries the gene for the TLR adapter protein, MyD88 (P. damicornis id 15711).

Of particular note is that our results suggest a much smaller number of proteins involved in TLR signaling than in the model organisms human, chicken, zebrafish, frog, and Drosophila. We predict that there may only be a single, unique TLR homolog in the coral P. damicornis that exhibits all the features expected for TLRs, and is most similar to TLR5 in human which could be trait common to all Anthozoa, not just P. damicornis and N. vectensis. Furthermore, we have found only one (MyD88) of the four known adapter proteins MyD88, TIRAM, TIRAP and TRIF. This suggests the presence of a less diversified and simpler TLR signaling pathway in coral as compared to higher eukaryotes.

A similar conclusion was reached in the analysis of the large and diverse GPCR family. The human genome alone encodes 825 GPCRs (https://www.uniprot.org/docs/7tmrlist.txt), while we only found 151 GPCRs in P. damicornis. Looking at reports of GPCR family size in evolutionarily early organisms suggest that there is a dramatic expansion in the number of GPCRs when transitioning from unicellular to multicellular organisms and the development of sensing-specific organs [75]. For example, choanoflaggelates have only 10 GPCR [76], while holothurians, one of the five Echinoderm classes, have 246 GPCRs [77], sea urchins, also echinoderms, have 979 [78] and the Placozoan Trichoplax adhaerens has 420 [79], sponges have 330 [80], N. vectensis have 890 [81] and hydra has 1200 (!) [82]. Evolutionary analysis of the sub-families suggests that the doubling that takes place between T. adhaerens and N. vectensis is roughly maintained and most sub-families remain relatively constant in their size distributions, with the Class A Rhodopsin-like family being the largest and most diverse in all species [83,84]. Invertebrates possess remarkable chemosensory capabilities to explore and detect minute biochemical compounds. Despite the low number of odorant receptors found in P. damicornis, corals have demonstrated high sensitivity to chemical cues (e.g. [85] and we postulate that this low number can be explained by the absence of organs dedicated to smell compared to higher organisms. Cnidarian polyp tentacles serve for prey capture and transport (to oral cavity), aggression and defense, and the cells they are made of (cnidocytes) also have various specialized functions [86] however, they constitute, with the oral cavity and the epidermis, the most specialized cnidarian sensing organ.

Generally, odors are used by organisms for survival purposes, such as homing, finding food, distinguishing between the same species and predators, predator avoidance and defense, and reproduction. In humans, the nose is a specialized organ to detect volatile compounds, and so we usually associate the ability to smell only with land animals. However, marine organisms can also smell. For example, fish have nostrils and pump water through them, and the compounds detected are decoded by their olfactory bulb. Fish can detect very low concentrations of compounds and use it for example to detect the direction of their home reef [87] Anemones have long been known for their production of the pyridinium compound, amphikuemin, that attracts its symbiotic fish which swims towards it [88]. In humans the detection and response to olfactory cues is so important that a remarkable gene expansion akin to that of antibody diversity has taken place giving rise to hundreds of olfactory receptors, which are highly diverse in different individuals underlying the differences between preferences and detection abilities in human populations [89]. Although corals emit large numbers of volatile compounds [90] for marine organisms, it is less important if compounds are volatile in air, as they would be dissolved in water, and so olfactory reception is better referred to as chemoreception in general. This can be well demonstrated with the sea urchin [78]. For example, genomic clustering and single-exon gene structures suggest rapid gene duplication creating a new class of GPCRs, the “group B surreal-GPCR’s in the sea urchin that are differentially expressed in pedicellariae and tube feet, suggesting that these organs take up an analogous role to the human nose in sensing chemical stimuli [78]. A similar rapid gene duplication gave rise to the diversity of olfactory receptors in humans, suggesting a similarity in the benefit of expansion of the organism’s capabilities to detect a larger diversity of ligands. These gene duplication events are highly organism specific, for example the holothurians are predicted to contain 57 olfactory receptors [77]. The next step after identification of the receptor repertoire is of course the identification and matching of corresponding ligands. Most progress to date with respect to ligands has been made with neuropeptides [91]. Interestingly, there are larger numbers of putative taste receptors as compared to odorant receptors in P. damicornis (15 coral for 28 human taste receptors), which might indicate that sensing chemicals as part of the food intake machinery is an important chemosensory part of P. damicornis.

Much simpler than differentiating the diversity of chemical compounds is sensing the color and intensity of light, a task that can be accomplished by humans with 11 members of the opsin subfamily of class A GPCR. Again, P. damicornis shows a reduced repertoire: based on our detailed sequence and structural analysis of opsin candidates we predict 4 GPCR that may be able to bind retinal and gain light sensitivity in P. damicornis. This is consistent with the hypothesis that P. damicornis can likely sense and respond to light and even distinguish colors. Previous research showed that coral larvae preferentially settle on red as opposed to white surfaces [92]. Consistent with this behavioral finding, is the inclusion of the red opsin protein (OPSR) in our list. On the other hand, there is also evidence for the ability to sense blue light—the action spectra of polyp retraction in the corals E. fastigiata and M. cavernosa have absorbance maxima of 482 and 478 nm, with half width at half height of 115 and 105 nm, respectively [93]. These narrow band widths support the conclusion of a single opsin involved in light sensing for polyp retraction in each coral, consistent with the small number of opsin candidates obtained for P. damicornis. The counter-ion of vertebrate opsin E is replaced by Y in invertebrate opsins in general [94], also observed for P. damicornis, with the exception of S in 19775. The 19775 protein maps to both medium wave and long wave sensitive human opsins, further suggesting a change in absorbance maxima. Thus, despite the absence of eyes, which are only needed when spatial vision is needed, detection of light is a fundamental ability shared with many other organisms, such as sea urchins [78]. Even when such organs are missing, the expression of specific opsin genes is not ubiquitous, providing a pathway towards encoding spatial information.

Taken together, both TLR and GPCR analysis suggest that P. damicornis represents a transition species that carries the minimal number of essential components for signaling and innate immunity, while additional functionality and fine tuning is achieved through diversification of this small pool of proteins in higher evolved organisms. Thus, being able to differentiate between early basic versions of a given function and finding the reasons for the need to diverge emphasizes the utility of this method assuming the best hit is the most functionally relevant hit. This may help to identify possible functions for currently not conclusively annotated proteins in P. damicornis and other non-model organisms. Although we restricted our detailed structure-function analysis to the GPCR subfamily of opsins in vision, G proteins, and TLRs, we also presented some initial analysis of possible odorant, taste and other receptors in corals. It is important to note, however, that the discovery of homologous receptors is done mainly through bioinformatics approaches, and thus can only be viewed as hypothesis generating work. Experimental approaches, such as pharmacological, cell biological and/or biochemical, are necessary to confirm these hypotheses. Thus, while further functional and especially experimental tests are needed to verify these predictions, the approach presented identifies potential new targets for studying coral biology and possibly draw connections between the general functions of the human proteins that the coral proteins mapped to and their potential role in coral sensing.

Supporting information

S1 Fig. Clustal omega multiple sequence alignment of opsin homologs.

Residues involved in active site formation are highlighted in blue.

https://doi.org/10.1371/journal.pone.0270965.s001

(DOCX)

S1 Table. The details of residues and their molecular interactions with the retinal active site in squid rhodopsin and coral putative rhodopsin proteins.

https://doi.org/10.1371/journal.pone.0270965.s002

(DOCX)

References

  1. 1. Ekblom R, Wolf JBW. A field guide to whole-genome sequencing, assembly and annotation. Evol Appl. 2014;7: 1026–1042. pmid:25553065
  2. 2. Campbell MS, Holt C, Moore B, Yandell M. Genome Annotation and Curation Using MAKER and MAKER-P. Curr Protoc Bioinformatics. 2014;48: 4.11.1–39. pmid:25501943
  3. 3. Oprea TI. Exploring the dark genome: implications for precision medicine. Mamm Genome. 2019;30: 192–200. pmid:31270560
  4. 4. Copper P. Evolution, Radiations, and Extinctions in Proterozoic to Mid-Paleozoic Reefs. Topics in Geobiology. 2001. pp. 89–119.
  5. 5. Kortschak RD, Samuel G, Saint R, Miller DJ. EST analysis of the cnidarian Acropora millepora reveals extensive gene loss and rapid sequence divergence in the model invertebrates. Curr Biol. 2003;13: 2190–2195. pmid:14680636
  6. 6. Thompson JR, Rivera HE, Closek CJ, Medina M. Microbes in the coral holobiont: partners through evolution, development, and ecological interactions. Front Cell Infect Microbiol. 2014;4: 176. pmid:25621279
  7. 7. Hughes TP, Kerry JT, Baird AH, Connolly SR, Dietzel A, Eakin CM, et al. Global warming transforms coral reef assemblages. Nature. 2018;556: 492–496. pmid:29670282
  8. 8. Pörtner HO, Roberts DC, Adams H, Adler C, Aldunce P, Ali E, et al. Climate change 2022: impacts, adaptation and vulnerability. 2022. Available: https://research.wur.nl/en/publications/climate-change-2022-impacts-adaptation-and-vulnerability?utm_source=twitter&utm_medium=social&utm_campaign=sharelink.
  9. 9. de Mendoza A, Sebé-Pedrós A, Ruiz-Trillo I. The evolution of the GPCR signaling system in eukaryotes: modularity, conservation, and the transition to metazoan multicellularity. Genome Biol Evol. 2014;6: 606–619. pmid:24567306
  10. 10. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25: 3389–3402. pmid:9254694
  11. 11. Smith TF, Waterman MS. Identification of common molecular subsequences. J Mol Biol. 1981;147: 195–197. pmid:7265238
  12. 12. Pearson WR. Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms. Genomics. 1991;11: 635–650. pmid:1774068
  13. 13. Yona G, Levitt M. Within the twilight zone: a sensitive profile-profile comparison tool based on information theory. J Mol Biol. 2002;315: 1257–1275. pmid:11827492
  14. 14. Rost B. Twilight zone of protein sequence alignments. Protein Eng. 1999;12: 85–94. pmid:10195279
  15. 15. Eddy SR. Profile hidden Markov models. Bioinformatics. 1998;14: 755–763. pmid:9918945
  16. 16. Karplus K, Barrett C, Hughey R. Hidden Markov models for detecting remote protein homologies. Bioinformatics. 1998;14: 846–856. pmid:9927713
  17. 17. Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 2005;33: W244–8. pmid:15980461
  18. 18. Remmert M, Biegert A, Hauser A, Söding J. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods. 2011;9: 173–175. pmid:22198341
  19. 19. Viterbi AJ. Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm. The Foundations of the Digital Wireless World. 2009. pp. 41–50.
  20. 20. LaJeunesse TC, Parkinson JE, Gabrielson PW, Jeong HJ, Reimer JD, Voolstra CR, et al. Systematic Revision of Symbiodiniaceae Highlights the Antiquity and Diversity of Coral Endosymbionts. Curr Biol. 2018;28: 2570–2580.e6. pmid:30100341
  21. 21. Levy O, Dubinsky Z, Achituv Y. Photobehavior of stony corals: responses to light spectra and intensity. J Exp Biol. 2003;206: 4041–4049. pmid:14555744
  22. 22. Mason B, Beard M, Miller MW. Coral larvae settle at a higher frequency on red surfaces. Coral Reefs. 2011. pp. 667–676.
  23. 23. Yosef O, Popovits Y, Malik A, Ofek-Lalzer M, Mass T, Sher D. A tentacle for every occasion: comparing the hunting tentacles and sweeper tentacles, used for territorial competition, in the coral Galaxea fascicularis. BMC Genomics. 2020;21: 548. pmid:32770938
  24. 24. Mansfield KM, Gilmore TD. Innate immunity and cnidarian-Symbiodiniaceae mutualism. Dev Comp Immunol. 2019;90: 199–209. pmid:30268783
  25. 25. Allen AS, Seymour AC, Rittschof D. Chemoreception drives plastic consumption in a hard coral. Marine Pollution Bulletin. 2017. pp. 198–205. pmid:28743368
  26. 26. Wooldridge SA. Is the coral-algae symbiosis really “mutually beneficial” for the partners? BioEssays. 2010. pp. 615–625. pmid:20517874
  27. 27. Nemecz Á, Prevost MS, Menny A, Corringer P-J. Emerging Molecular Mechanisms of Signal Transduction in Pentameric Ligand-Gated Ion Channels. Neuron. 2016;90: 452–470. pmid:27151638
  28. 28. Sakmar TP. Structure and Function of G-Protein-Coupled Receptors. Handbook of Cell Signaling. 2010. pp. 151–156.
  29. 29. Sriram K, Insel PA. G Protein-Coupled Receptors as Targets for Approved Drugs: How Many Targets and How Many Drugs? Molecular Pharmacology. 2018. pp. 251–258. pmid:29298813
  30. 30. Wettschureck N, Offermanns S. Mammalian G proteins and their cell type specific functions. Physiol Rev. 2005;85: 1159–1204. pmid:16183910
  31. 31. Akira S, Takeda K. Toll-like receptor signalling. Nat Rev Immunol. 2004;4: 499–511. pmid:15229469
  32. 32. Kang JY, Nan X, Jin MS, Youn S-J, Ryu YH, Mah S, et al. Recognition of lipopeptide patterns by Toll-like receptor 2-Toll-like receptor 6 heterodimer. Immunity. 2009;31: 873–884. pmid:19931471
  33. 33. Park BS, Song DH, Kim HM, Choi B-S, Lee H, Lee J-O. The structural basis of lipopolysaccharide recognition by the TLR4-MD-2 complex. Nature. 2009;458: 1191–1195. pmid:19252480
  34. 34. Song WS, Jeon YJ, Namgung B, Hong M, Yoon S-I. A conserved TLR5 binding and activation hot spot on flagellin. Sci Rep. 2017;7: 40878. pmid:28106112
  35. 35. Ohto U, Ishida H, Shibata T, Sato R, Miyake K, Shimizu T. Toll-like Receptor 9 Contains Two DNA Binding Sites that Function Cooperatively to Promote Receptor Dimerization and Activation. Immunity. 2018;48: 649–658.e4. pmid:29625894
  36. 36. Song W, Wang J, Han Z, Zhang Y, Zhang H, Wang W, et al. Structural basis for specific recognition of single-stranded RNA by Toll-like receptor 13. Nat Struct Mol Biol. 2015;22: 782–787. pmid:26323037
  37. 37. Narayanan KB, Park HH. Toll/interleukin-1 receptor (TIR) domain-mediated cellular signaling pathways. Apoptosis. 2015;20: 196–209. pmid:25563856
  38. 38. Dunne A O’Neill LAJ. Adaptor usage and Toll-like receptor signaling specificity. FEBS Lett. 2005;579: 3330–3335.
  39. 39. Chen H, Jiang Z. The essential adaptors of innate immune signaling. Protein Cell. 2013;4: 27–39. pmid:22996173
  40. 40. Dong W, Liu Y, Peng J, Chen L, Zou T, Xiao H, et al. The IRAK-1-BCL10-MALT1-TRAF6-TAK1 Cascade Mediates Signaling to NF-κB from Toll-like Receptor 4. Journal of Biological Chemistry. 2006. pp. 26029–26040. pmid:16831874
  41. 41. Tang J, Xu L, Zeng Y, Gong F. Effect of gut microbiota on LPS-induced acute lung injury by regulating the TLR4/NF-kB signaling pathway. Int Immunopharmacol. 2021;91: 107272. pmid:33360370
  42. 42. van Oppen MJH, Blackall LL. Coral microbiome dynamics, functions and design in a changing world. Nature Reviews Microbiology. 2019. pp. 557–567. pmid:31263246
  43. 43. Williams LM, Fuess LE, Brennan JJ, Mansfield KM, Salas-Rodriguez E, Welsh J, et al. A conserved Toll-like receptor-to-NF-κB signaling pathway in the endangered coral Orbicella faveolata. Dev Comp Immunol. 2018;79: 128–136.
  44. 44. Cunning R, Bay RA, Gillette P, Baker AC, Traylor-Knowles N. Comparative analysis of the Pocillopora damicornis genome highlights role of immune system in coral evolution. Sci Rep. 2018;8: 16134. pmid:30382153
  45. 45. Liu H, Stephens TG, González-Pech RA, Beltran VH, Lapeyre B, Bongaerts P, et al. Symbiodinium Genomes Reveal Adaptive Evolution of Functions Related to Coral-dinoflagellate Symbiosis. 2018.
  46. 46. Ben-Shlomo I, Yu Hsu S, Rauch R, Kowalski HW, Hsueh AJW. Signaling receptome: a genomic and evolutionary perspective of plasma membrane receptors involved in signal transduction. Sci STKE. 2003;2003: RE9. pmid:12815191
  47. 47. Almén MS, Nordström KJV, Fredriksson R, Schiöth HB. Mapping the human membrane proteome: a majority of the human membrane proteins can be classified according to function and evolutionary origin. BMC Biol. 2009;7: 50. pmid:19678920
  48. 48. Imler JL, Hoffmann JA. Toll receptors in Drosophila: a family of molecules regulating development and immunity. Curr Top Microbiol Immunol. 2002;270: 63–79. pmid:12467244
  49. 49. Nawab A, An L, Wu J, Li G, Liu W, Zhao Y, et al. Chicken toll-like receptors and their significance in immune response and disease resistance. Int Rev Immunol. 2019;38: 284–306. pmid:31662000
  50. 50. Ishii A, Kawasaki M, Matsumoto M, Tochinai S, Seya T. Phylogenetic and expression analysis of amphibian Xenopus Toll-like receptors. Immunogenetics. 2007;59: 281–293. pmid:17265063
  51. 51. Kanwal Z, Wiegertjes GF, Veneman WJ, Meijer AH, Spaink HP. Comparative studies of Toll-like receptor signalling using zebrafish. Dev Comp Immunol. 2014;46: 35–52. pmid:24560981
  52. 52. de Castro E, Sigrist CJA, Gattiker A, Bulliard V, Langendijk-Genevaux PS, Gasteiger E, et al. ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins. Nucleic Acids Res. 2006;34: W362–5. pmid:16845026
  53. 53. Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46: W296–W303. pmid:29788355
  54. 54. Zhou K, Kanai R, Lee P, Wang H-W, Modis Y. Toll-like receptor 5 forms asymmetric dimers in the absence of flagellin. J Struct Biol. 2012;177: 402–409. pmid:22173220
  55. 55. Shimamura T, Hiraki K, Takahashi N, Hori T, Ago H, Masuda K, et al. Crystal structure of squid rhodopsin with intracellularly extended cytoplasmic region. J Biol Chem. 2008;283: 17753–17756. pmid:18463093
  56. 56. Surgand J-S, Rodrigo J, Kellenberger E, Rognan D. A chemogenomic analysis of the transmembrane binding cavity of human G-protein-coupled receptors. Proteins. 2006;62: 509–538. pmid:16294340
  57. 57. Harris R, Olson AJ, Goodsell DS. Automated prediction of ligand-binding sites in proteins. Proteins. 2008;70: 1506–1517. pmid:17910060
  58. 58. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32: 1792–1797. pmid:15034147
  59. 59. Sledzieski S, Singh R, Cowen L, Berger B. D-SCRIPT translates genome to phenome with sequence-based, structure-aware, genome-scale predictions of protein-protein interactions. Cell Syst. 2021. pmid:34536380
  60. 60. Shumaker A, Putnam HM, Qiu H, Price DC, Zelzion E, Harel A, et al. Genome analysis of the rice coral Montipora capitata. Sci Rep. 2019;9: 2571. pmid:30796282
  61. 61. Moitra S, Tirupula KC, Klein-Seetharaman J, Langmead CJ. A minimal ligand binding pocket within a network of correlated mutations identified by multiple sequence and structural analysis of G protein coupled receptors. BMC Biophys. 2012;5: 13. pmid:22748306
  62. 62. Balakrishnan S, Kamisetty H, Carbonell JG, Lee S-I, Langmead CJ. Learning generative models for protein fold families. Proteins. 2011;79: 1061–1078. pmid:21268112
  63. 63. Valanne S, Wang J-H, Rämet M. The Drosophila Toll Signaling Pathway. The Journal of Immunology. 2011. pp. 649–656. pmid:21209287
  64. 64. Kalinina OV, Mironov AA, Gelfand MS, Rakhmaninova AB. Automated selection of positions determining functional specificity of proteins by comparative analysis of orthologous groups in protein families. Protein Sci. 2004;13: 443–456. pmid:14739328
  65. 65. Capra JA, Singh M. Predicting functionally important residues from sequence conservation. Bioinformatics. 2007;23: 1875–1882. pmid:17519246
  66. 66. Yanamala N, Tirupula KC, Klein-Seetharaman J. Preferential binding of allosteric modulators to active and inactive conformational states of metabotropic glutamate receptors. BMC Bioinformatics. 2008;9 Suppl 1: S16. pmid:18315847
  67. 67. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596: 583–589. pmid:34265844
  68. 68. Miller DJ, Hemmrich G, Ball EE, Hayward DC, Khalturin K, Funayama N, et al. The innate immune repertoire in cnidaria—ancestral complexity and stochastic gene loss. Genome Biol. 2007;8: R59. pmid:17437634
  69. 69. Nyholm SV, Graf J. Knowing your friends: invertebrate innate immunity fosters beneficial bacterial symbioses. Nat Rev Microbiol. 2012;10: 815–827. pmid:23147708
  70. 70. Anderson DA, Walz ME, Weil E, Tonellato P, Smith MC. RNA-Seq of the Caribbean reef-building coral Orbicella faveolata (Scleractinia-Merulinidae) under bleaching and disease stress expands models of coral innate immunity. PeerJ. 2016;4: e1616. pmid:26925311
  71. 71. Margolis SR, Dietzen PA, Hayes BM, Wilson SC, Remick BC, Chou S, et al. The cyclic dinucleotide 2′3′-cGAMP induces a broad antibacterial and antiviral response in the sea anemone Nematostella vectensis. Proceedings of the National Academy of Sciences. 2021. pmid:34903650
  72. 72. Weiss Y, Forêt S, Hayward DC, Ainsworth T, King R, Ball EE, et al. The acute transcriptional response of the coral Acropora millepora to immune challenge: expression of GiMAP/IAN genes links the innate immune responses of corals with those of mammals and plants. BMC Genomics. 2013;14: 400. pmid:23768317
  73. 73. Vidal-Dupiol J, Ladrière O, Meistertzheim A-L, Fouré L, Adjeroud M, Mitta G. Physiological responses of the scleractinian coral Pocillopora damicornis to bacterial stress from Vibrio coralliilyticus. Journal of Experimental Biology. 2011. pp. 1533–1545. pmid:21490261
  74. 74. Libro S, Kaluziak ST, Vollmer SV. RNA-seq profiles of immune related genes in the staghorn coral Acropora cervicornis infected with white band disease. PLoS One. 2013;8: e81821. pmid:24278460
  75. 75. Krieger J, Breer H. Olfactory reception in invertebrates. Science. 1999;286: 720–723. pmid:10531050
  76. 76. Hake KH. The microbiome of a colonial choanoflagellate from Mono Lake, CA. University of California, Berkeley; 2019.
  77. 77. Marquet N, Cardoso JCR, Louro B, Fernandes SA, Silva SC, Canário AVM. Holothurians have a reduced GPCR and odorant receptor-like repertoire compared to other echinoderms. Sci Rep. 2020;10: 3348. pmid:32098989
  78. 78. Raible F, Tessmar-Raible K, Arboleda E, Kaller T, Bork P, Arendt D, et al. Opsins and clusters of sensory G-protein-coupled receptors in the sea urchin genome. Dev Biol. 2006;300: 461–475. pmid:17067569
  79. 79. Srivastava M, Begovic E, Chapman J, Putnam NH, Hellsten U, Kawashima T, et al. The Trichoplax genome and the nature of placozoans. Nature. 2008;454: 955–960. pmid:18719581
  80. 80. Krishnan A, Dnyansagar R, Almén MS, Williams MJ, Fredriksson R, Manoj N, et al. The GPCR repertoire in the demosponge Amphimedon queenslandica: insights into the GPCR system at the early divergence of animals. BMC Evol Biol. 2014;14: 270. pmid:25528161
  81. 81. Schiöth HB, Nordström KJV, Fredriksson R. The adhesion GPCRs; gene repertoire, phylogeny and evolution. Adv Exp Med Biol. 2010;706: 1–13. pmid:21618822
  82. 82. Chapman JA, Kirkness EF, Simakov O, Hampson SE, Mitros T, Weinmaier T, et al. The dynamic genome of Hydra. Nature. 2010;464: 592–596. pmid:20228792
  83. 83. Jékely G. Global view of the evolution and diversity of metazoan neuropeptide signaling. Proc Natl Acad Sci U S A. 2013;110: 8702–8707. pmid:23637342
  84. 84. Nordström KJV, Sällman Almén M, Edstam MM, Fredriksson R, Schiöth HB. Independent HHsearch, Needleman–Wunsch-Based, and Motif Analyses Reveal the Overall Hierarchy for Most of the G Protein-Coupled Receptor Families. Mol Biol Evol. 2011;28: 2471–2480. pmid:21402729
  85. 85. Tebben J, Motti CA, Siboni N, Tapiolas DM, Negri AP, Schupp PJ, et al. Chemical mediation of coral larval settlement by crustose coralline algae. Sci Rep. 2015;5: 10803. pmid:26042834
  86. 86. Fautin DG. Structural diversity, systematics, and evolution of cnidae. Toxicon. 2009;54: 1054–1064. pmid:19268491
  87. 87. Dixson DL, Jones GP, Munday PL, Planes S, Pratchett MS, Srinivasan M, et al. Coral reef fish smell leaves to find island homes. Proceedings of the Royal Society B: Biological Sciences. 2008. pp. 2831–2839. pmid:18755672
  88. 88. Murata M, Miyagawa-Kohshima K, Nakanishi K, Naya Y. Characterization of compounds that induce symbiosis between sea anemone and anemone fish. Science. 1986;234: 585–587. pmid:17835564
  89. 89. Trimmer C, Keller A, Murphy NR, Snyder LL, Willer JR, Nagai MH, et al. Genetic variation across the human olfactory receptor repertoire alters odor perception. Proceedings of the National Academy of Sciences. 2019. pp. 9475–9480. pmid:31040214
  90. 90. Lawson CA, Raina J-B, Deschaseaux E, Hrebien V, Possell M, Seymour JR, et al. Heat stress decreases the diversity, abundance and functional potential of coral gas emissions. Glob Chang Biol. 2021;27: 879–891. pmid:33253484
  91. 91. Takahashi T. Comparative Aspects of Structure and Function of Cnidarian Neuropeptides. Front Endocrinol. 2020;11: 339. pmid:32528418
  92. 92. Foster T, Gilmour JP. Seeing red: Coral larvae are attracted to healthy‑looking reefs. Marine Ecology Progress Series. 2016. pp. 65–71.
  93. 93. Gorbunov MY, Falkowski PG. Photoreceptors in the cnidarian hosts allow symbiotic corals to sense blue moonlight. Limnol Oceanogr. 2002;47: 309–315.
  94. 94. Kojima K, Yamashita T, Imamoto Y, Kusakabe TG, Tsuda M, Shichida Y. Evolutionary steps involving counterion displacement in a tunicate opsin. Proc Natl Acad Sci U S A. 2017;114: 6028–6033. pmid:28533401