Differential genetic and functional background in inflammatory bowel disease phenotypes of a Greek population: a systems bioinformatics approach



Crohn’s disease (CD) and Ulcerative colitis (UC) are the two main entities of inflammatory bowel disease (IBD). Previous works have identified more than 200 risk factors (including loci and signaling pathways) in populations of predominantly European ancestry. Our study was conducted on an extended population-specific cohort of 573 Greek IBD patients (364 CD and 209 UC) and 445 controls.


To highlight the different genetic and functional background of IBD and its phenotypes, utilizing contemporary systems bioinformatics methodologies.


Disease-associated SNPs, obtained via our own 89-locus IBD risk GWAS panel, were detected with the whole-genome association analysis toolset PLINK. These SNPs were used as input for two novel and distinct pathway analysis methods to detect functional interactions. Specifically, PathwayConnector was used to create complementary networks of interacting pathways, whereas STRING, the online database of protein interactions, provided protein–protein association networks and their derived pathways. Network analysis metrics were employed to identify proteins of high significance and, subsequently, to rank the signaling pathways in which they participate.


The reported complementary pathway and enriched protein–protein association networks reveal several novel and well-known key players in the functional background of IBD, such as the Toll-like receptor, TNF, Jak-STAT, PI3K-Akt, T cell receptor, Apoptosis, MAPK and B cell receptor signaling pathways. IBD subphenotypes are found to have distinct genetic and functional profiles, which can contribute to their accurate identification and classification. As a secondary result, we identify an extended network of diseases that share a common molecular background with IBD.


IBD’s burden on patients’ quality of life and its intricate functional background constantly present us with new challenges. Our data and methodology provide researchers with new insights into a specific population, as well as into possible differentiation markers for disease classification and progression. This work not only provides new insights into the interplay among IBD risk variants and their related signaling pathways, elucidating the mechanisms underlying IBD and its clinical sequelae, but also introduces a generalized bioinformatics-based methodology that can be applied to studies of different disorders.


Crohn’s disease (CD) and ulcerative colitis (UC) are the two major manifestations of what is known as inflammatory bowel disease (IBD). They are chronic conditions characterized by prolonged inflammation of the digestive tract, and their exact cause is unknown. However, genetics and problems with the immune system have been associated with IBD. Although recent specific epidemiological data do not exist for Greece, which is the sample source of this work, it has been estimated that 2.5–3 million people in Europe are affected by IBD, with a direct healthcare cost of 4.6–5.6 bn Euros/year [1]. Over recent years, a significant number of trait-associated gene variants have been identified through genome-wide association studies (GWAS) in diverse populations, which has strengthened our understanding of complex diseases such as IBD [2]. In populations of European ancestry, approximately 200 genome-wide significant (GWS) IBD susceptibility loci [3] have been identified; however, IBD has been associated with significant geographic and ethnic differences in incidence and prevalence [4].

Generally, since GWAS focus on testing the association of disease with individual SNPs across the genome, and only top-ranked SNPs with the strongest statistical evidence for association are described, GWAS are underpowered to detect loci that have a small marginal effect but act jointly or interact to influence trait variability [4, 5]. Thus, more sophisticated analyses, such as network-assisted studies that integrate GWAS results, are very promising approaches for discovering functionally related genes, including those that have a small marginal effect but act jointly in disease susceptibility.

Computational approaches have become standard practice in the last decades for managing and analyzing biological data. Owing to the accumulating amount of information produced by biological experiments, also known as –omics data, the need arose for powerful computational analysis and storage. Biological databases and specialized tools, each targeting specific data types, had to be developed. Contemporary practices and literature [3, 6,7,8] focus on these approaches, producing more and more knowledge to be consumed. Systems bioinformatics [9] implementations try to combine all this newfound and/or newly appreciated knowledge into comprehensible interactions and provide insights into the patient-disease complex.

In the present study, we employed a bioinformatics pipeline to integrate IBD GWAS results with experimental and bibliographic data via two different approaches: one that informs on pathway-pathway networks and one that provides protein–protein association networks (via their respective genes). These allowed us to perform network analysis and clustering in order to identify sets of interconnected genes and functional pathways associated with each of the two IBD forms and their phenotypes.

More specifically, we use the results of our GWAS of an extended cohort of 573 Greek IBD patients (364 CD and 209 UC) and 441 controls, using 89 single nucleotide polymorphisms (SNPs) that showed moderate or strong association in previous studies [6, 10, 11], to perform various network analyses. The data and analysis of the CD samples are novel, whereas for UC we re-analyzed our previously published data using contemporary bioinformatics approaches. Our results were combined with pathway interaction, gene co-expression, co-localization, co-occurrence and fusion data to reveal biologically meaningful processes that underlie the risk of IBD. This work aims to have a two-fold impact: to provide scientists in the field with new information on the pathogenesis of IBD, and to propose and highlight new methodologies that can be applied to genetic data of different pathological origins.

Materials and methods

Study design

The overall experimental design is illustrated as a flowchart in Fig. 1 and is explained in detail below.

Fig. 1

Flow chart showcasing the experimental methodology and study design

Samples and DNA isolation

We conducted a GWAS using case–control datasets totaling 573 Greek IBD cases (364 CD and 209 UC) and 445 healthy controls from unrelated, self-identified Greek individuals, as previously described (Table 1) [12]. Our samples were stratified into disease sub-phenotypes according to the Montreal Classification [13]. More specifically, CD samples were categorized by their behavioral sub-phenotypes (B1: non-stricturing, non-penetrating; B2: stricturing; B3: penetrating), whereas UC samples were categorized by their extent sub-phenotypes (E1: ulcerative proctitis; E2: distal UC; E3: pancolitis). None of the patients or controls had a family history of autoimmune disease. The diagnosis of IBD was based on standard clinical, endoscopic, radiological and histological criteria. Before commencement of the study, the Ethics Committees at the participating centers approved the recruitment protocols. All participants were informed of the study. DNA was isolated from blood with the NucleoSpin blood kit (Macherey–Nagel, Germany).

Table 1 Characteristics of case/control sets used


Genome-wide SNP typing of a discovery panel, using the Affymetrix Genome-Wide Human SNP Array 5.0, was carried out previously at the Institute for Clinical Molecular Biology, Christian-Albrechts-University, Kiel, Germany [6, 10]. Part of this panel has been used in previous studies [12].

SNP quality control and association analysis

The inclusion criteria for the samples in our statistical analysis accounted for SNP missing rate, minor allele frequency and a Hardy–Weinberg equilibrium exact test p value, to rule out genotyping errors. Association analysis was performed on the included samples based on a pairwise comparison of the disease phenotypes and sub-phenotypes using a 1 df χ2 (chi-square) test. Estimated odds ratios (OR) with 95% confidence intervals (CI) were also calculated for allele 1 (minor) versus allele 2 (major) in our preselected SNPs. Only SNPs with an asymptotic p value ≤ 0.05 were considered for further analyses. Quality control and association tests were performed using PLINK [14] v1.90b4.9. The R package metafor [15] v2.0 was used to create OR plots from our test results, and VENNY [16] was used to identify SNPs common between IBD phenotypes and subphenotypes.
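To make the association step concrete, the following is a minimal sketch of a 1 df allelic chi-square test with an odds ratio and a Woolf-type 95% confidence interval. It is not PLINK's implementation; the function name `allelic_association` and the allele counts are hypothetical and chosen purely for illustration.

```python
import math

def allelic_association(case_a1, case_a2, ctrl_a1, ctrl_a2):
    """Allelic 1 df chi-square test with OR and 95% CI from a 2x2 table.

    Arguments are allele counts (not genotype counts): allele 1 (minor)
    and allele 2 (major) in cases and in controls. All must be > 0.
    """
    n = case_a1 + case_a2 + ctrl_a1 + ctrl_a2
    cases, ctrls = case_a1 + case_a2, ctrl_a1 + ctrl_a2
    a1_total, a2_total = case_a1 + ctrl_a1, case_a2 + ctrl_a2

    # Pearson chi-square for a 2x2 table, without continuity correction.
    cross = case_a1 * ctrl_a2 - case_a2 * ctrl_a1
    chi2 = n * cross ** 2 / (cases * ctrls * a1_total * a2_total)

    # Upper tail of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))

    # Odds ratio for allele 1 versus allele 2, with a Woolf (log-scale) CI.
    odds_ratio = (case_a1 * ctrl_a2) / (case_a2 * ctrl_a1)
    se_log_or = math.sqrt(1 / case_a1 + 1 / case_a2 + 1 / ctrl_a1 + 1 / ctrl_a2)
    ci_low = math.exp(math.log(odds_ratio) - 1.96 * se_log_or)
    ci_high = math.exp(math.log(odds_ratio) + 1.96 * se_log_or)
    return {"chi2": chi2, "p": p_value, "or": odds_ratio, "ci95": (ci_low, ci_high)}

# Hypothetical allele counts, for illustration only (not study data).
result = allelic_association(case_a1=30, case_a2=70, ctrl_a1=20, ctrl_a2=80)
print(result)
```

Under the filter described above, a SNP would be retained when `p` is at most 0.05; the toy table here would not pass.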

Signaling pathways enrichment and functional associations

Using the genes carrying the SNPs highlighted by our association analyses, gene-set lists were created as input to two platforms: PathwayConnector [17] (Method 1 of the flowchart) and the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING), a database of known and predicted protein–protein associations [18] (Method 2 of the flowchart).

In Method 1, KEGG [19] was selected as the default signaling pathway database; the top ten Enrichr pathways per set were used as the initial seed pathways in the complementary network analysis; and edge betweenness was selected as the community detection algorithm for clustering the complementary pathway network.
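Edge-betweenness community detection (the Girvan-Newman approach) repeatedly removes the edge that lies on the most shortest paths, so that densely connected groups separate into communities. The sketch below performs a single split on a toy network. How PathwayConnector implements this step internally is not described here, and the node labels are hypothetical placeholders rather than real KEGG pathways.

```python
from collections import deque

def edge_betweenness(adj):
    """Brandes-style edge betweenness for an unweighted, undirected graph."""
    eb = {frozenset((u, v)): 0.0 for u in adj for v in adj[u]}
    for s in adj:
        dist, sigma, preds = {s: 0}, {s: 1}, {s: []}
        order, queue = [], deque([s])
        while queue:                      # BFS, counting shortest paths
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w], sigma[w], preds[w] = dist[v] + 1, 0, []
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = dict.fromkeys(order, 0.0)
        for w in reversed(order):         # accumulate path dependencies
            for v in preds[w]:
                c = sigma[v] / sigma[w] * (1 + delta[w])
                eb[frozenset((v, w))] += c
                delta[v] += c
    return eb

def components(adj):
    """Connected components, each returned as a sorted list of nodes."""
    seen, comps = set(), []
    for s in adj:
        if s in seen:
            continue
        comp, queue = {s}, deque([s])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in comp:
                    comp.add(v)
                    queue.append(v)
        seen |= comp
        comps.append(sorted(comp))
    return comps

def split_once(edges):
    """Remove highest-betweenness edges until one more component appears."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    start = len(components(adj))
    while len(components(adj)) == start and any(adj.values()):
        eb = edge_betweenness(adj)
        u, v = max(eb, key=eb.get)
        adj[u].discard(v)
        adj[v].discard(u)
    return sorted(components(adj))

# Two triangles joined by one bridge edge (hypothetical node labels).
toy_edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D"),
             ("D", "E"), ("D", "F"), ("E", "F")]
communities = split_once(toy_edges)
print(communities)
```

A full Girvan-Newman run would keep splitting and then choose a partition, typically by modularity; only the first split is shown to keep the idea visible.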

For Method 2, each gene of our gene set was converted to a best-matched protein set. The networks were then created using an interaction score of 0.400 (medium confidence) with an enrichment of 30 interactors in total (no more than 20 1st shell and 10 2nd shell interactors), after testing various combinations for the most accurate results based on current knowledge. 1st shell interactors are proteins directly associated with our initial set, while 2nd shell interactors are those associated with the 1st shell interactors. All categories were selected as active interaction sources (Textmining: data extracted from the abstracts of scientific literature; Experiments: data extracted from other PPA databases; Databases: data extracted from curated databases; Co-expression: genes that are co-expressed in the same or in other species (transferred by homology); Neighborhood: genes that occur repeatedly in close neighborhood in (prokaryotic) genomes; Gene Fusion: gene fusion events per species; Co-occurrence: proteins linked across species). The Markov Cluster Algorithm (MCL) [20] with an inflation parameter of 3 was applied to the final network for cluster detection based on domain architecture. Edges were created by confidence levels, and disconnected nodes were hidden. Using Cytoscape [21], as well as the igraph [22] and centiserve [23] packages for R, we calculated various network analysis metrics in order to detect hubs (Degree Centrality), bottlenecks (Betweenness Centrality), shortest path topology (Latora harmonic closeness centrality) and, in general, nodes (proteins) that play an important role in the protein–protein association (PPA) networks. We devised a gene ranking score using a weighted function, giving Degree Centrality a factor of 0.2, Latora Closeness Centrality 0.3 and Betweenness Centrality 0.5. This score is intended to reflect the knowledge in the literature about the actual significance of those metrics in a protein network [24, 25].
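The MCL clustering step mentioned above can be illustrated with a minimal sketch of the algorithm's two alternating operations, expansion (matrix squaring) and inflation (element-wise powering followed by column normalisation), using the inflation value of 3. This is not STRING's implementation: the unweighted toy adjacency matrix, the added self-loops, the convergence tolerance and the membership threshold are all assumptions made for illustration.

```python
def normalise_columns(m):
    """Scale every column of a square matrix so that it sums to 1."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] for j in range(n)] for i in range(n)]

def matmul(a, b):
    """Plain square-matrix product."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mcl(weights, inflation=3.0, max_iter=100, tol=1e-9):
    """Markov clustering on a symmetric weight matrix (list of lists)."""
    n = len(weights)
    # Adding self-loops is a common MCL convention (an assumption here).
    m = [[float(weights[i][j]) + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    m = normalise_columns(m)
    for _ in range(max_iter):
        expanded = matmul(m, m)                               # expansion
        inflated = normalise_columns(
            [[x ** inflation for x in row] for row in expanded])  # inflation
        change = max(abs(inflated[i][j] - m[i][j])
                     for i in range(n) for j in range(n))
        m = inflated
        if change < tol:
            break
    # Each non-empty row is an attractor; its non-zero columns are members.
    clusters = {frozenset(j for j in range(n) if m[i][j] > 1e-6)
                for i in range(n)}
    clusters.discard(frozenset())
    return sorted(sorted(c) for c in clusters)

# Toy network: two triangles (0-1-2 and 3-4-5) joined by the edge 2-3.
toy = [[0, 1, 1, 0, 0, 0],
       [1, 0, 1, 0, 0, 0],
       [1, 1, 0, 1, 0, 0],
       [0, 0, 1, 0, 1, 1],
       [0, 0, 0, 1, 0, 1],
       [0, 0, 0, 1, 1, 0]]
clusters = mcl(toy, inflation=3.0)
print(clusters)
```

Higher inflation values generally produce finer-grained clusters, which is why the parameter is worth reporting alongside the cluster counts.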
Finally, pathway analysis was performed on the enriched networks of the disease phenotypes and sub-phenotypes, keeping the KEGG database as reference, and the resulting signaling pathway lists were compared using the VENNY online tool to detect and visualize commonalities between them with Venn diagrams. The average combined centrality score of the proteins contributing to a pathway was used to calculate a pathway ranking score.
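The weighted gene ranking score (0.2 for degree, 0.3 for Latora harmonic closeness, 0.5 for betweenness) and the pathway score derived from it can be sketched as follows. The text does not state how the centralities were scaled before weighting, so normalising each one to the range 0 to 1 is an assumption here; the node names and the pathway membership are hypothetical, and this is not the igraph or centiserve code used in the study.

```python
from collections import deque

WEIGHTS = {"degree": 0.2, "closeness": 0.3, "betweenness": 0.5}

def degree_centrality(adj):
    n = len(adj)
    return {u: len(adj[u]) / (n - 1) for u in adj}

def harmonic_closeness(adj):
    """Latora-style closeness: mean of 1/d(u, v); unreachable nodes add 0."""
    n, out = len(adj), {}
    for s in adj:
        dist, queue = {s: 0}, deque([s])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        out[s] = sum(1 / d for v, d in dist.items() if v != s) / (n - 1)
    return out

def betweenness_centrality(adj):
    """Brandes' algorithm (unweighted, undirected), normalised; needs n > 2."""
    bc = dict.fromkeys(adj, 0.0)
    for s in adj:
        dist, sigma, preds = {s: 0}, {s: 1}, {s: []}
        order, queue = [], deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w], sigma[w], preds[w] = dist[v] + 1, 0, []
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = dict.fromkeys(order, 0.0)
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    n = len(adj)
    # Each unordered pair is visited from both ends, hence (n-1)(n-2).
    return {v: b / ((n - 1) * (n - 2)) for v, b in bc.items()}

def gene_scores(adj):
    """Weighted combination of the three (normalised) centralities."""
    deg = degree_centrality(adj)
    clo = harmonic_closeness(adj)
    btw = betweenness_centrality(adj)
    return {v: WEIGHTS["degree"] * deg[v] + WEIGHTS["closeness"] * clo[v]
               + WEIGHTS["betweenness"] * btw[v] for v in adj}

def pathway_score(members, scores):
    """Average combined score of the network proteins in a pathway."""
    present = [scores[p] for p in members if p in scores]
    return sum(present) / len(present)

# Hypothetical three-protein chain: P1 - P2 - P3.
toy_adj = {"P1": {"P2"}, "P2": {"P1", "P3"}, "P3": {"P2"}}
scores = gene_scores(toy_adj)
toy_pathway = pathway_score(["P1", "P2"], scores)
print(scores, toy_pathway)
```

Because betweenness carries half the weight, the bridging node P2 ranks well above the two end nodes in this toy chain; changing `WEIGHTS` shifts the ranking toward hubs or toward well-connected nodes instead.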


As described previously, to elucidate the functional links between single nucleotide polymorphisms (SNPs) and IBD, we used the results from our GWAS analysis to investigate signaling pathways involved in IBD using 2 different computational methods.

The PLINK analysis pointed to 17 statistically significant SNPs specific for CD, 8 for UC and 13 for IBD in general compared to healthy individuals (HC), which were used as input in our pathway and enrichment analyses (Table 2). Figure 2a–c shows the OR diagrams (forest plots) of these SNPs with respect to their association with each disease phenotype and sub-phenotype, as endoscopically and clinically categorized. The statistical hypothesis here is tested against Allele1 and concerns whether the SNP must be a homozygote or a heterozygote to be associated with the disease. An OR score < 1 points to a disease association when the SNP is a homozygote, and an OR score > 1 points to a heterozygote SNP related to the disease phenotype.

Table 2 Overview of the SNPs included in the pathway and enrichment analyses
Fig. 2

Forest plots of odds ratios (OR) for the SNPs highlighted by the SNP analysis performed via PLINK. These refer to a IBD vs HC, b CD vs HC, and c UC vs HC. All depicted SNPs are statistically significantly associated with the corresponding disease phenotype (p value < 0.05; those marked with a star have a p value < 0.01). Furthermore, results with an OR score < 1 point to a disease association where the SNP is a homozygote with the minor allele and an OR score > 1 points to a heterozygote

Regarding CD, our results revealed 15 SNPs for B1, 9 for B2 and 1 for B3. Concerning UC, 7 SNPs were related to E1, 2 to E2 and 13 to E3 (Table 2). It is worth mentioning that the low count of SNPs associated with the B3 and E2 sub-phenotypes is heavily influenced by the rarity of these cases in our Greek samples and in the worldwide population in general. Figure 3a shows, in a Venn diagram, all the SNPs that are common between CD and UC from this initial analysis; Fig. 3b shows the common SNPs between B1 and B2 CD; and Fig. 3c shows that there are no common SNPs in our results between E1 and E3.

Fig. 3

Common SNPs found from the analysis on our datasets, between phenotypes and sub-phenotypes of IBD. a 4 common SNPs were found between CD and UC, b 3 common SNPs were found between B1 and B2, c no common SNPs were found between E1 and E3

Although our results clearly point to a specific and distinct genetic background for the disease phenotypes and sub-phenotypes, they also highlight the fact that our datasets contain only a handful of genes, which does not allow us to see the bigger picture. It is well known that gene products exert their functions through interactions with other cellular components, and the impact of a genetic perturbation can spread along the links of any functional network the gene product is involved in [26].

To study the role of specific signaling pathways in IBD pathogenesis, we employed Methods 1 and 2 on the gene sets inferred from these SNPs. Genes associated with the B3 and E2 sub-phenotypes yielded datasets too small to be analyzed, so they were disregarded.

Using Method 1 we identified the top 10 pathways after enrichment for all IBD phenotypes and subphenotypes. Moreover, 23 complementary pathways for CD, 11 for UC, 31 for B1, 15 for B2, 24 for E1 and 11 for E3 were detected as interacting with our original 10. The individual results along with visualizations of the complementary networks are included in Additional file 1.

Using Method 2, we constructed PPA networks and detected signaling pathways. The CD and UC risk genes interaction networks are presented in Fig. 4a, b respectively, whereas Fig. 5a, b showcases the networks created by the B1–B2 and E1–E3 sub-phenotype risk genes as those arose from our previous analyses. Different color groups signify clusters.

Fig. 4

Enriched protein–protein association networks created from the risk genes highlighted from previous analyses for a CD and b UC. STX7, STX8, VTI1B proteins were found to be common between the 2 networks. 4 distinct clusters detected for CD and 2 for UC

Fig. 5

Enriched PPA networks created from the risk genes highlighted from previous analyses for a B1 and B2 CD sub-phenotypes and b E1 and E3 UC sub-phenotypes. Only the protein NKX2-3 was found to be common between the CD sub-phenotypes, whereas, none were found for UC. 4 clusters were detected for B1, 2 for B2, 2 for E1 and 3 for E3

The PPA network constructed for CD has 38 nodes and 220 edges, and the MCL clustering algorithm identified 4 clusters, whereas the UC network has 33 nodes, 164 edges and 2 clusters. In total, using the enriched PPA networks, only 3 proteins were common between UC and CD: STX7, STX8 and VTI1B. The same process was applied to the B1 and B2 CD sub-phenotypes and the E1 and E3 UC sub-phenotypes. For B1 the enriched PPA network consists of 37 nodes, 187 edges and 4 clusters. For B2 the enriched PPA network consists of 34 nodes, edges and 2 clusters. Only the protein NKX2-3 was found to be common between the 2 enriched networks. The E1 PPA network consists of 32 nodes, 261 edges and 2 clusters, while the E3 network consists of 34 nodes, 146 edges and 3 clusters. No proteins were found in common between the 2 networks of the UC sub-phenotypes.

Network analysis using the three different centralities, and their subsequent transformation into a combined score, has provided, for each phenotype and its sub-phenotypes, a ranked list (Additional file 2) highlighting the proteins that are topologically most important in their protein–protein association networks.

The enrichment process via STRING, combined with centrality analysis, has also enabled us to study the functional pathways involving the proteins highlighted by the network using KEGG. In total, for the main IBD phenotypes, 26 signaling pathways were found exclusively for CD, 22 for UC and 27 were shared between them. Regarding CD sub-phenotypes B1 and B2, 13 pathways were found exclusively for B1, 21 exclusively for B2 and 15 in common between them. For the UC sub-phenotypes, 15 pathways were found exclusively for E1, 30 for E3 and 33 in common between them. Additional file 3 showcases the aforementioned group intersections. Finally, Additional file 4 provides a ranked listing of all the pathways for each phenotype and sub-phenotype, based on the previous combined scores for each protein, helping to identify pathways that might play a significant role in IBD pathogenesis/functional background.
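The exclusive and shared counts reported above correspond to the three regions of a two-set Venn diagram. A minimal sketch of that partition is given below; the helper name `venn_partition` and the pathway labels are hypothetical placeholders, not the study's pathway lists, and VENNY itself is an online tool rather than this code.

```python
def venn_partition(set_a, set_b):
    """Split two collections into A-only, shared and B-only members."""
    a, b = set(set_a), set(set_b)
    return {
        "only_a": sorted(a - b),
        "shared": sorted(a & b),
        "only_b": sorted(b - a),
    }

# Hypothetical pathway lists for two phenotypes, for illustration only.
phenotype_a = ["pathway_1", "pathway_2", "pathway_3"]
phenotype_b = ["pathway_2", "pathway_3", "pathway_4"]

parts = venn_partition(phenotype_a, phenotype_b)
counts = {region: len(members) for region, members in parts.items()}
print(parts)
print(counts)
```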

To better understand our findings and arrive at a consensus between our methodologies, we created Fig. 6, which shows the common and individually highlighted pathways between Methods 1 and 2 for the IBD phenotypes and subphenotypes. The common ones are four for CD, seven for B1, four for B2, two for UC, two for E1 and two for E3. Finally, using the data from these merged results, we constructed a Disease–Disease association network, as depicted in Fig. 7. This network allows us to visualize disorders that share molecular mechanisms with our IBD sub-phenotypes.

Fig. 6

a Final merged pathway results from the 2 methods for all CD sub-phenotypes, b final merged pathway results from the 2 methods for all UC sub-phenotypes

Fig. 7

Disease–Disease association network based on molecular background commonalities


Recent successes of large GWAS have had a major impact on identifying the variants of complex diseases, such as IBD [11, 27,28,29]. Here, using a pipeline of methodologies, we integrate GWAS data of a Greek IBD population with curated databases of fundamental human pathways, as well as gene and reaction-based functional networks, in order to obtain novel insights into the potential causal processes of IBD and its sub-phenotypes, hopefully leading to specific diagnostic and therapeutic targets.

A novel stride in our present work was the further examination of the main phenotypes of IBD and their sub-phenotypes using a combination of –omics data and network-based approaches. The specificity of the results regarding SNPs, proteins and signaling pathways involved in IBD allows us to sift through general literature findings and pinpoint those that apply exactly to the population under study. We acknowledge that the two approaches showcased in this paper provide us with only a few common results (as depicted in Fig. 6). This is to be expected, given the differences in the methodologies of the two approaches and their intermediate steps. It signifies that, when employing various omics methods to draw conclusions, especially about the functional role of genes, researchers should consider combinational approaches that complement each other rather than relying on a single method. We must also recognize the limitations of the databases, as highlighted by the KEGG pathway results from both methods, in identifying specific disorder pathways when provided with a limited set of genes. Many disorders share common pathophysiological mechanisms, such as inflammation, making it difficult for the database to distinguish the specific disorder under study. This highlights the importance of more specific, mechanism-oriented databases.

The use of pathway network connectivity and centrality analysis of the protein–protein association networks, as well as their rankings, not only allows for more unbiased/unmanaged results of important proteins and their role in IBD but also draws attention to specific pathways to be considered out of all those “discovered” by plain pathway analysis methods. By using a weighted approach to combine centralities as shown here, and by modifying the initial scheme presented according to the weight that is desired to be given each time to each centrality, researchers might find the answers to the questions about which nodes are important to a protein association network according to their biological significance/role.

The current analysis implicates a significant number of core pathways with an important role in IBD, among them components of the Toll-like receptor signaling, TNF signaling, Jak-STAT signaling, PI3K-Akt signaling, T cell receptor signaling, MAPK signaling and B cell receptor signaling pathways. The NF-kappa B signaling, NOD-like receptor signaling, regulation of autophagy, chemokine signaling and adherens junction pathways were found to be CD specific, whereas the intestinal immune network for IgA production, natural killer cell mediated cytotoxicity, Wnt signaling, cytokine-cytokine receptor interaction, colorectal cancer, VEGF signaling, cGMP-PKG signaling, cell adhesion molecules (CAMs) and Fc epsilon RI signaling pathways seem to be UC specific. When we stratified the cases according to disease sub-phenotypes, we identified distinct pathways for the B1 and B2 sub-phenotypes of CD and for the E1 and E3 sub-phenotypes of UC. Interestingly, the role of most of the identified pathways in IBD pathogenesis, and their clinical significance in IBD therapy and diagnostics, are well studied [30, 31]. Toll-like receptors are basic mediators of innate host defense in the intestine, involved in maintaining mucosal and commensal homeostasis [32]. Additionally, novel therapies have been developed for IBD that target alternative TNF and IL signaling pathways (i.e. IL-12/23 axis, IL-6), as well as Jak inhibitors [33]. It is also well known that the combination of disease-associated variants of ATG16L1 and NOD2/CARD15 leads to synergistically increased susceptibility to CD, indicating a possible crosstalk between NOD2- and ATG16L1-mediated processes in the pathogenesis of CD [34]. Notably, Kini et al. [35] indicated that changes in signaling through Wnt primarily affected colonic stem cells, whereas Notch affected progenitor function, providing new insights into the development of inflammation and relapse in UC.
As depicted in our results, the central role of all these pathways is highlighted.

In the present study, the protein–protein association network analysis revealed that 3 proteins were common between UC and CD: STX7, STX8 and VTI1B. This is expected, since the role of autophagy in the pathogenesis and progression of IBD is well documented [36]. Furthermore, SNARE complexes and their regulators have a key role during inflammation and may present potential therapeutic targets in a wide range of inflammatory diseases such as IBD [37]. SNAREs have recently been implicated in controlling autophagosome development in mammalian cells [38], and the SNAREs vesicle-associated membrane protein 7 (VAMP7), syntaxin-7 (STX7), syntaxin-8 (STX8) and VTI1B regulate the homotypic fusion of phagophore precursors [39]. These fusion events allow the growth of these structures into a tubular network, leading to the formation of phagophores and autophagosomes [40].

Our results further indicated that the B1 and B2 CD sub-phenotypes exhibit distinct protein and pathway profiles, and that the NKX2-3 gene is common to these two entities. These findings are in accordance with previous studies, which indicated that NKX2-3 is a susceptibility locus for IBD in Eastern European patients but has not been related to a specific sub-phenotype [41]. However, the B2 network presents two disjoint clusters, which might be attributed to the fact that a limited number of SNPs was used in the GWAS and that the possible links remain outside our initial targets. Regarding UC, the E1 and E3 sub-phenotypes were also found to have distinct pathways.

Our observations were also confirmed by the combined centralities network analysis. More specifically, for CD the proteins identified as having the strongest involvement with the disease are TLR4, SRC, NOD2, MYD88 and IL6. These results are not surprising, since it is well known that NOD2 is a major genetic risk factor for CD and that the NOD2 signal cascade is enhanced by toll-like receptor (TLR) agonists through NF-κB. NOD2 and TLR signaling collaborate to enhance immune responses [42]. TLR4 engages the adaptor MyD88 in combination with the adaptor TIRAP/Mal. Additionally, via the signal transduction pathways involving MyD88 and IRAK, a number of mediators are induced that could be implicated in CD pathogenesis, such as TNFa and IL6 [43]. The rest of the proteins identified are involved in pathways related to an inappropriate immune response to flora components, as well as in autophagy signaling pathways [44]. Examining the main implicated proteins in the CD sub-phenotypes, our results revealed some significant observations. The main proteins related to the B1 sub-phenotype are those implicated mainly in the TLR and NOD2 signaling pathways (i.e. TLR4, MyD88, NOD2). Regarding NOD2, a previous study suggested that the L1007fs mutation in central Europeans is associated with fibrostenotic disease [45], but this could not be confirmed by our results, which might be explained by the different ethnic population in our study. Other proteins correlated mainly with the B1 sub-phenotype are PRPF8, SNRPF and TRAF6. Reduced TRAF6 gene expression was found in IBD patients due to hypermethylation [46]. Regarding SNRPF, Wang et al. [47] recently identified an antibody against SNRPB as an autoantibody marker in CD, but there is no information related to disease sub-phenotypes. For PRPF8 there are no data available regarding its implication in CD pathogenesis. For the B2 sub-phenotype, the autophagy-related proteins seem to be more important (ATG12, ATG4B, ATG3 etc.).
Even though there are no data supporting the association of autophagy genes with a specific CD sub-phenotype, autophagy undoubtedly plays an important role in CD pathogenesis [48]. In conclusion, there are distinct protein patterns implicated in these two sub-phenotypes that can probably be used for CD progression prediction.

Interestingly, the proteins strongly implicated in UC pathogenesis are distinct from those of CD. IL2, STX3, NFATC2 and JUN seem to have a major role in UC. Regarding IL2, it has been shown that Il2−/− mice develop IBD most reminiscent of UC [49]. Regarding STX3, a novel mechanism regulating the intestinal serotonin transporter (SERT) via PI3K and STX3 was recently reported [50]. Sikander et al. [51] demonstrated that there may be an association between polymorphisms in the SERT gene promoter and UC; thus, STX3 seems to be important for UC pathogenesis. Considering NFATC2, we know that it is a transcription factor with pleiotropic roles [52]. Remarkably, the existing data suggest an important cell-intrinsic role for NFAT family transcription factors in intrinsic negative T cell regulation, and Weigmann et al. [53] reported that oxazolone-induced ulcerative colitis and progression to colon cancer are attenuated in NFATC2 KO mice due to ineffective production of IL-6. This suggests that NFATC2 can act as a more generalized modulator of inflammation. Regarding the sub-phenotypes of UC, we observed that E1 is mostly related to proteins such as TLR4, TNF, NFKB1, TNFRSF1A and others involved in the NF-kappa B signaling pathway. Interestingly, the E1 sub-phenotype also seems to be strongly associated with the Ras-related C3 botulinum toxin substrate 1 (RAC1) protein. It is known that disruption of Rac1 in the macrophages and neutrophils of mice protected them against dextran sulphate sodium (DSS)-induced colitis [54]. On the other hand, the E3 sub-phenotype is mostly related to the IL2 protein and also to autophagosome and inflammation-related proteins, i.e. syntaxins and NFATC2 [55, 56]. A strong association of the IL2/IL21 locus with UC is well known [49]. STX3 has a crucial role in the trafficking pathways of cytokines in neutrophil granulocytes [57].
Additionally, FASLG seems also to play a basic role in this sub-phenotype and has been documented in the attenuation of apoptosis response to Fas-ligand in active ulcerative colitis [58]. NFATC2 is involved in colitis by controlling mucosal T cell activation in an IL-6-dependent manner and seems to be a potential therapeutic target for UC [56]. Our data indicate that distinct pathways also characterize the UC sub-phenotypes.

Genetic variants and their role in functional changes, though, are important not only for understanding IBD pathophysiology but also for understanding treatment-related enigmas such as patient response. As previous works [59,60,61,62,63] have shown, traditional IBD treatments like glucocorticosteroids and azathioprine, but also newer approaches like anti-TNF, are all susceptible to inefficiency due to specific genetic polymorphisms. The IBD landscape is vast and includes many factors and pitfalls that should be considered when trying to identify “who” is responsible for disease onset, progression and treatment, by making use of various technical approaches, each targeting a different subsystem [64]. Highlighted among these factors, the microbiome has become a scientific trend in recent years due to its apparent implication in various diseases, especially IBD. Microbiota dysbiosis appears to either drive or uniquely classify aspects of IBD such as progression [65] and response to treatment [66].

Collectively, our approaches provide important insights into the interplay among IBD risk variants and their related signaling pathways. All this information is directly relevant to our understanding of the mechanisms underlying IBD and its clinical sequelae. Moreover, by applying these approaches to several disorders and then comparing the results, we might be able to understand how key pathophysiological mechanisms can lead to previously unknown comorbidities.

Availability of data and materials

All data and materials are available upon request.


  1. Burisch J, Jess T, Martinato M, Lakatos PL, EpiCom E. The burden of inflammatory bowel disease in Europe. J Crohn’s Colitis. 2013;7(4):322–37.

  2. Mesbah-Uddin M, Elango R, Banaganapalli B, Shaik NA, Al-Abbasi FA. In-silico analysis of inflammatory bowel disease (IBD) GWAS loci to novel connections. PLoS ONE. 2015;10(3):e0119420.

  3. Liu JZ, van Sommeren S, Huang H, Ng SC, Alberts R, Takahashi A, et al. Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations. Nat Genet. 2015;47(9):979–86.

  4. Ek WE, D’Amato M, Halfvarson J. The history of genetics in inflammatory bowel disease. Ann Gastroenterol. 2014;27(4):294–303.

  5. Liu Y, Brossard M, Sarnowski C, Vaysse A, Moffatt M, Margaritte-Jeannin P, et al. Network-assisted analysis of GWAS data identifies a functionally-relevant gene module for childhood-onset asthma. Sci Rep. 2017;7(1):938.

  6. Franke A, Balschun T, Sina C, Ellinghaus D, Hasler R, Mayr G, et al. Genome-wide association study for ulcerative colitis identifies risk loci at 7q22 and 22q13 (IL17REL). Nat Genet. 2010;42(4):292–4.

  7. Johnson SC, Gonzalez B, Zhang Q, Milholland B, Zhang Z, Suh Y. Network analysis of mitonuclear GWAS reveals functional networks and tissue expression profiles of disease-associated genes. Hum Genet. 2017;136(1):55–65.

  8. Ji S-G, Juran BD, Mucha S, Folseraas T, Jostins L, Melum E, et al. Genome-wide association study of primary sclerosing cholangitis identifies new risk loci and quantifies the genetic relationship with inflammatory bowel disease. Nat Genet. 2017;49(2):269.

  9. Oulas A, Minadakis G, Zachariou M, Sokratous K, Bourdakou MM, Spyrou GM. Systems bioinformatics: increasing precision of computational diagnostics and therapeutics through network-based approaches. Brief Bioinform. 2017.

  10. Franke A, Balschun T, Karlsen TH, Sventoraityte J, Nikolaus S, Mayr G, et al. Sequence variants in IL10, ARPC2 and multiple other loci contribute to ulcerative colitis susceptibility. Nat Genet. 2008;40(11):1319–23.

  11. Anderson CA, Boucher G, Lees CW, Franke A, D’Amato M, Taylor KD, et al. Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47. Nat Genet. 2011;43(3):246–52.

  12. Gazouli M, Mantzaris G, Kotsinas A, Zacharatos P, Papalambros E, Archimandritis A, et al. Association between polymorphisms in the Toll-like receptor 4, CD14, and CARD15/NOD2 and inflammatory bowel disease in the Greek population. World J Gastroenterol. 2005;11(5):681–5.

  13. Satsangi J, Silverberg M, Vermeire S, Colombel J. The Montreal classification of inflammatory bowel disease: controversies, consensus, and implications. Gut. 2006;55(6):749–53.

  14. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.

  15. Viechtbauer W. Conducting meta-analyses in R with the metafor package. J Stat Softw. 2010;36(3):1–48.

  16. Oliveros J. VENNY. An interactive tool for comparing lists with Venn Diagrams. http://bioinfogp.cnb.csic.es/tools/venny/index.html. 2007.

  17. Smyth GK. Limma: linear models for microarray data. Bioinformatics and computational biology solutions using R and Bioconductor. Berlin: Springer; 2005. p. 397–420.

  18. Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S, Simonovic M, et al. The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible. Nucleic Acids Res. 2017;45(D1):D362–8.

  19. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30.

  20. Enright AJ, Van Dongen S, Ouzounis CA. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002;30(7):1575–84.

  21. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.

  22. Csardi G, Nepusz T. The igraph software package for complex network research. InterJ Complex Syst. 2006;1695(5):1–9.

  23. Jalili M, Salehzadeh-Yazdi A, Asgari Y, Arab SS, Yaghmaie M, Ghavamzadeh A, et al. CentiServer: a comprehensive resource, web-based application and R package for centrality analysis. PLoS ONE. 2015;10(11):e0143111.

  24. Sharma P, Bhattacharyya DK, Kalita JK, editors. Centrality analysis in PPI networks. In: IEEE 2016 international conference on accessibility to digital world (ICADW); 2016.

  25. Estrada E, Ross GJ. Centralities in simplicial complexes. Applications to protein interaction networks. J Theor Biol. 2018;438:46–60.

  26. Barabasi AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011;12(1):56–68.

  27. de Lange KM, Moutsianas L, Lee JC, Lamb CA, Luo Y, Kennedy NA, et al. Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease. Nat Genet. 2017;49(2):256–61.

  28. Li P, Yang XK, Wang X, Zhao MQ, Zhang C, Tao SS, et al. A meta-analysis of the relationship between MYO9B gene polymorphisms and susceptibility to Crohn’s disease and ulcerative colitis. Hum Immunol. 2016;77(10):990–6.

  29. Li J, Wei Z, Chang X, Cardinale CJ, Kim CE, Baldassano RN, et al. Pathway-based genome-wide association studies reveal the association between growth factor activity and inflammatory bowel disease. Inflamm Bowel Dis. 2016;22(7):1540–51.

  30. Coskun M, Salem M, Pedersen J, Nielsen OH. Involvement of JAK/STAT signaling in the pathogenesis of inflammatory bowel disease. Pharmacol Res. 2013;76:1–8.

  31. Muraro D, Simmons A. An integrative analysis of gene expression and molecular interaction data to identify dys-regulated sub-networks in inflammatory bowel disease. BMC Bioinform. 2016;17:42.

  32. Cario E. Toll-like receptors in inflammatory bowel diseases: a decade later. Inflamm Bowel Dis. 2010;16(9):1583–97.

  33. Catalan-Serra I, Brenna O. Immunotherapy in inflammatory bowel disease: novel and emerging treatments. Hum Vacc Immunother. 2018.

  34. Billmann-Born S, Lipinski S, Bock J, Till A, Rosenstiel P, Schreiber S. The complex interplay of NOD-like receptors and the autophagy machinery in the pathophysiology of Crohn disease. Eur J Cell Biol. 2011;90(6–7):593–602.

  35. Kini AT, Thangaraj KR, Simon E, Shivappagowdar A, Thiagarajan D, Abbas S, et al. Aberrant niche signaling in the etiopathogenesis of ulcerative colitis. Inflamm Bowel Dis. 2015;21(11):2549–61.

  36. Ke P, Shao BZ, Xu ZQ, Chen XW, Liu C. Intestinal autophagy and its pharmacological control in inflammatory bowel disease. Front Immunol. 2016;7:695.

  37. Collins LE, DeCourcey J, Soledad di Luca M, Rochfort KD, Loscher CE. An emerging role for SNARE proteins in dendritic cell function. Front Immunol. 2015;6:133.

  38. Moreau K, Ravikumar B, Renna M, Puri C, Rubinsztein DC. Autophagosome precursor maturation requires homotypic fusion. Cell. 2011;146(2):303–17.

  39. Moreau K, Rubinsztein DC. The plasma membrane as a control center for autophagy. Autophagy. 2012;8(5):861–3.

  40. Moreau K, Renna M, Rubinsztein DC. Connections between SNAREs and autophagy. Trends Biochem Sci. 2013;38(2):57–63.

  41. Meggyesi N, Kiss LS, Koszarska M, Bortlik M, Duricova D, Lakatos L, et al. NKX2-3 and IRGM variants are associated with disease susceptibility to IBD in Eastern European patients. World J Gastroenterol. 2010;16(41):5233–40.

  42. Sidiq T, Yoshihama S, Downs I, Kobayashi KS. Nod2: a critical regulator of ileal microbiota and Crohn’s disease. Front Immunol. 2016;7:367.

  43. Newton K, Dixit VM. Signaling in innate immunity and inflammation. Cold Spring Harbor Perspect Biol. 2012.

  44. Hooper KM, Barlow PG, Stevens C, Henderson P. Inflammatory bowel disease drugs: a focus on autophagy. J Crohn’s Colitis. 2017;11(1):118–27.

  45. Protic MB, Pavlovic ST, Bojic DZ, Krstic MN, Radojicic ZA, Tarabar DK, et al. CARD15 gene polymorphisms in Serbian patients with Crohn’s disease: genotype–phenotype analysis. Eur J Gastroenterol Hepatol. 2008;20(10):978–84.

  46. McDermott E, Ryan EJ, Tosetto M, Gibson D, Burrage J, Keegan D, et al. DNA methylation profiling in inflammatory bowel disease provides new insights into disease pathogenesis. J Crohn’s Colitis. 2016;10(1):77–86.

  47. Wang H, Demirkan G, Bian X, Wallstrom G, Barker K, Karthikeyan K, et al. Identification of antibody against SNRPB, small nuclear ribonucleoprotein-associated proteins B and B’, as an autoantibody marker in Crohn’s disease using an immunoproteomics approach. J Crohn’s Colitis. 2017;11(7):848–56.

  48. Stappenbeck TS, Rioux JD, Mizoguchi A, Saitoh T, Huett A, Darfeuille-Michaud A, et al. Crohn disease: a current perspective on genetics, autophagy and immunity. Autophagy. 2011;7(4):355–74.

  49. Festen EA, Goyette P, Scott R, Annese V, Zhernakova A, Lian J, et al. Genetic variants in the region harbouring IL2/IL21 associated with ulcerative colitis. Gut. 2009;58(6):799–804.

  50. Nazir S, Kumar A, Chatterjee I, Anbazhagan AN, Gujral T, Priyamvada S, et al. Mechanisms of intestinal serotonin transporter (SERT) upregulation by TGF-beta1 induced non-Smad pathways. PLoS ONE. 2015;10(5):e0120447.

  51. Goldner D, Margolis KG. Association of serotonin transporter promoter polymorphism (5HTTLPR) with microscopic colitis and ulcerative colitis: time to be AsSERTive? Dig Dis Sci. 2015;60(4):819–21.

  52. May SL, Zhou Q, Lewellen M, Carter CM, Coffey D, Highfill SL, et al. Nfatc2 and Tob1 have non-overlapping function in T cell negative regulation and tumorigenesis. PLoS ONE. 2014;9(6):e100629.

  53. Ha SJ, Mueller SN, Wherry EJ, Barber DL, Aubert RD, Sharpe AH, et al. Enhancing therapeutic vaccination by blocking PD-1-mediated inhibitory signals during chronic infection. J Exp Med. 2008;205(3):543–55.

  54. Muise AM, Walters T, Xu W, Shen-Tu G, Guo CH, Fattouh R, et al. Single nucleotide polymorphisms that increase expression of the guanosine triphosphatase RAC1 are associated with ulcerative colitis. Gastroenterology. 2011;141(2):633–41.

  55. Kumar S, Jain A, Farzam F, Jia J, Gu Y, Choi SW, et al. Mechanism of Stx17 recruitment to autophagosomes via IRGM and mammalian Atg8 proteins. J Cell Biol. 2018.

  56. Weigmann B, Lehr HA, Yancopoulos G, Valenzuela D, Murphy A, Stevens S, et al. The transcription factor NFATc2 controls IL-6-dependent T cell activation in experimental colitis. J Exp Med. 2008;205(9):2099–110.

  57. Naegelen I, Plancon S, Nicot N, Kaoma T, Muller A, Vallar L, et al. An essential role of syntaxin 3 protein for granule exocytosis and secretion of IL-1α, IL-1β, IL-12b, and CCL4 from differentiated HL-60 cells. J Leukoc Biol. 2015;97(3):557–71.

  58. Seidelin JB, Nielsen OH. Attenuated apoptosis response to Fas-ligand in active ulcerative colitis. Inflamm Bowel Dis. 2008;14(12):1623–9.

  59. Matsukura H, Ikeda S, Yoshimura N, Takazoe M, Muramatsu M. Genetic polymorphisms of tumour necrosis factor receptor superfamily 1A and 1B affect responses to infliximab in Japanese patients with Crohn’s disease. Aliment Pharmacol Ther. 2008;27(9):765–70.

  60. Qasem A, Ramesh S, Naser SA. Genetic polymorphisms in tumour necrosis factor receptors (TNFRSF1A/1B) illustrate differential treatment response to TNFα inhibitors in patients with Crohn’s disease. BMJ Open Gastroenterol. 2019;6(1):e000246.

  61. Medrano L, Taxonera C, Márquez A, Barreiro-de Acosta M, Gómez-García M, González-Artacho C, et al. Role of TNFRSF1B polymorphisms in the response of Crohn’s disease patients to infliximab. Hum Immunol. 2014;75(1):71–5.

  62. Lee M-N, Kang B, Choi SY, Kim MJ, Woo SY, Kim J-W, et al. Impact of genetic polymorphisms on 6-thioguanine nucleotide levels and toxicity in pediatric patients with IBD treated with azathioprine. Inflamm Bowel Dis. 2015;21(12):2897–908.

  63. Yang QF, Chen BL, Zhang QS, Zhu ZH, Hu B, He Y, et al. Contribution of MDR1 gene polymorphisms on IBD predisposition and response to glucocorticoids in IBD in a Chinese population. J Digest Dis. 2015;16(1):22–30.

  64. Dovrolis N, Filidou E, Kolios G. Systems biology in inflammatory bowel diseases: on the way to precision medicine. Ann Gastroenterol. 2019;32(3):233.

  65. Dovrolis N, Drygiannakis I, Filidou E, Kandilogiannakis L, Arvanitidis K, Tentes I, et al. Gut microbial signatures underline complicated Crohn’s disease but vary between cohorts. Inflammatory bowel diseases: An In Silico Approach; 2018.

  66. Magnusson MK, Strid H, Sapnara M, Lasson A, Bajor A, Ung K-A, et al. Anti-TNF therapy response in patients with ulcerative colitis is associated with colonic antimicrobial peptide expression and microbiota composition. J Crohn’s Colitis. 2016;10(8):943–52.



Not applicable.


This study received no funding from any research or non-research organization.

Author information

Authors and Affiliations



All authors fulfil the ICMJE requirements and made considerable contributions to the present study. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Maria Gazouli.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1.

Analysis results via PathwayConnector for all our studied phenotypes except B3 and E2, which were excluded due to the limited number of statistically significant genes after the initial GWAS analysis. For each phenotype we report the top 10 statistically significant pathways after enrichment, the newly associated pathways identified via the construction of a complementary network, and finally the network’s visual representation. All the network visualization figures are high resolution and can be saved and viewed individually. (Index: Page 2: Crohn’s Disease; Page 3: B1 CD; Page 4: B2 CD; Page 5: Ulcerative Colitis; Page 6: E1 UC; Page 7: E3 UC).

Additional file 2.

The ranked proteins associated with each IBD phenotype and sub-phenotype after centrality analysis, in their respective sheets.

Additional file 3.

Unique and shared KEGG pathways between different phenotype groupings after enrichment via STRING: CD vs UC, B1 vs B2 and E1 vs E3. The results are shown in the respective sheets: a) CD vs UC, b) B1 vs B2, c) E1 vs E3.

Additional file 4.

The table represents all the KEGG pathways per IBD phenotype and sub-phenotype by utilizing the results in Additional files 2 and 3. These have all been ranked using the protein centrality scores for the proteins contributing to each one of them as explained in the manuscript.
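As a rough illustration of this kind of ranking, the sketch below orders pathways by the mean centrality of their member proteins. It is a minimal example under stated assumptions, not the procedure used to build Additional file 4: the aggregation by mean, the function name `rank_pathways`, and all protein names, pathway names and scores are hypothetical placeholders.

```python
from statistics import mean


def rank_pathways(centrality, pathway_members):
    """Rank pathways by the mean centrality of their member proteins.

    Assumption: the mean is used as the aggregate score purely for
    illustration; the manuscript's own scoring scheme may differ.
    Pathways with no scored member protein are skipped.
    """
    scores = {}
    for pathway, proteins in pathway_members.items():
        values = [centrality[p] for p in proteins if p in centrality]
        if values:
            scores[pathway] = mean(values)
    # Highest aggregate score first
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical centrality scores per protein (placeholder names and values)
centrality = {"P1": 0.9, "P2": 0.8, "P3": 0.6, "P4": 0.4}

# Hypothetical pathway membership (placeholder names)
pathway_members = {
    "pathway_A": ["P1", "P2"],
    "pathway_B": ["P3", "P4"],
    "pathway_C": ["P5"],  # no scored member, so it is left out of the ranking
}

ranking = rank_pathways(centrality, pathway_members)
for name, score in ranking:
    print(f"{name}\t{score:.2f}")
```

Other aggregations (for example the sum or the maximum of member scores) would slot into the same structure by replacing `mean`.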

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Gazouli, M., Dovrolis, N., Franke, A. et al. Differential genetic and functional background in inflammatory bowel disease phenotypes of a Greek population: a systems bioinformatics approach. Gut Pathog 11, 31 (2019).
