The KUPNetViz: a biological network viewer for multiple -omics datasets in kidney diseases

1471-2105-14-235 1471-2105 Software The KUPNetViz: a biological network viewer for multiple -omics datasets in kidney diseases MoulosPanagiotispmoulos@eie.gr KleinJuliejulie.klein64@gmail.com JuppSimonjupp@ebi.ac.uk StevensRobertrobert.stevens@cs.manchester.ac.uk BascandsJean-Loupjean-loup.bascands@inserm.fr SchanstraPJoostjoost-peter.schanstra@inserm.fr

Institut National de la Santé et de la Recherche Médicale (INSERM), U1048, Institute of Cardiovascular and Metabolic Disease, 1 avenue Jean Poulhès, 31432 Toulouse, France

Université Toulouse III Paul-Sabatier, 118 route de Narbonne, 31062 Toulouse, France

School of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester M13 9PL United Kingdom

BMC Bioinformatics

Networks analysis

1471-2105 2013 14 1 235 http://www.biomedcentral.com/1471-2105/14/235 10.1186/1471-2105-14-235 127201221720132472013 2013Moulos et al.; licensee BioMed Central Ltd.This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Constant technological advances have allowed scientists in biology to migrate from conventional single-omics to multi-omics experimental approaches, challenging bioinformatics to bridge this multi-tiered information. Ongoing research in renal biology is no exception. The results of large-scale and/or high throughput experiments, presenting a wealth of information on kidney disease are scattered across the web. To tackle this problem, we recently presented the KUPKB, a multi-omics data repository for renal diseases.

Results

In this article, we describe KUPNetViz, a biological graph exploration tool allowing the exploration of KUPKB data through the visualization of biomolecule interactions. KUPNetViz enables the integration of multi-layered experimental data over different species, renal locations and renal diseases to protein-protein interaction networks and allows association with biological functions, biochemical pathways and other functional elements such as miRNAs. KUPNetViz focuses on the simplicity of its usage and the clarity of resulting networks by reducing and/or automating advanced functionalities present in other biological network visualization packages. In addition, it allows the extrapolation of biomolecule interactions across different species, leading to the formulations of new plausible hypotheses, adequate experiment design and to the suggestion of novel biological mechanisms. We demonstrate the value of KUPNetViz by two usage examples: the integration of calreticulin as a key player in a larger interaction network in renal graft rejection and the novel observation of the strong association of interleukin-6 with polycystic kidney disease.

Conclusions

The KUPNetViz is an interactive and flexible biological network visualization and exploration tool. It provides renal biologists with biological network snapshots of the complex integrated data of the KUPKB allowing the formulation of new hypotheses in a user friendly manner.

Background

During the past decade major advances in biological research, mainly in the field of high throughput analysis (e.g. –omics), has led to an exponential increase in available experimental data, produced through a variety of techniques including DNA, miRNA 1 and antibody arrays 2 , next generation sequencing technologies 3 and mass spectrometry 4 . This switch in life sciences towards multi-omics approaches has created a gap in the provision of bioinformatics tools capable of combining this data. More importantly, there appears to be a shortage in efficient tools that would aid the bench biologist to i) simplify and categorize the results of a multi-omics approach which usually come into the form of long lists ii) visualize different information layers which can be mined from multi-omics approaches and aggregate useful pieces into biochemical pathways and/or biological function groups iii) combine steps (i) and (ii) in a repeatable and reusable fashion so as to extract meaningful outcomes regarding the biological system under investigation and iv) combine all the aforementioned steps to formulate plausible hypotheses, possibly applied to similar systems (e.g. systems functioning in the same tissue/organ/similar disease situation) and eventually design new experiments for hypothesis validation.

The renal biology field is facing similar problems. Although kidney diseases have been extensively studied in different species (i.e. human, mouse, rat), this wealth of information remains hidden across several layers of public data and/or literature repositories. Although considerable effort has been devoted in aggregating data from several resources 5 6 7 , the resulting databases fail to meet the multi-omics attribute and remain dispersed across the web. To address this problem, we developed the Kidney and Urinary Pathway Knowledge Base (KUPKB) 8 , a publicly available repository which organizes an important amount of existing knowledge regarding renal tissue, cell and disease categorization, using Semantic Web technologies 9 . The KUPKB can be queried through the user-friendly iKUP browser, 8 , accessible at http://www.kupkb.org.

The iKUP comprises a powerful tool in terms of speed, selectivity and descriptive power. Nevertheless, its nature restricts the user to viewing the query results in tabular format, only skimming the surface of the rich interconnected data otherwise available in the KUPKB. In addition, although it succeeds in displaying findings hidden in scattered repositories, such as the expression of a set of genes under a very specific combination of kidney tissue, cellular type and disease, it remains unable to map this information to interaction networks available as background information and in a multiple species manner.

The value of biological network representations has been extensively analyzed in bioinformatics and systems biology literature 10 11 . Some important aspects include the ability to capture fixed snapshots of cellular states 12 otherwise hidden in tables, to infer functional associations 13 , to reduce complexity by combining protein interaction, gene expression and metabolic profiles in a single image 14 , and perform pattern recognition in a network snapshot 15 .

In this article, we describe KUPNetViz, an interactive biological network querying and visualization application. Its main purpose is to assist renal scientists to extend their research by providing an alternative KUPKB data image depicting interactions among the queried molecules and their neighbors, coupled with functional and biochemical pathway annotation in both a species dependent and independent manner. The main tasks supported and promoted by KUPNetViz are: i) the display, exploration and manipulation of general protein-protein interaction as well as functional and biochemical pathway association networks for a number of widely studied mammalian species, ii) the transformation of these general networks to kidney specific networks through their association with related gene/protein/miRNA expression datasets, iii) the mining of possibly hidden relationships among co-regulated and/or directly interacting (“neighboring”) entities under several combinations of kidney anatomies and/or disease models and iv) the extrapolation of both expression data and network interactions across different diseases, anatomies but also different species facilitating the quick formulation and screening of biological hypotheses and the design of new experiments.

The molecule and interaction visual representations in KUPNetViz have been inspired by previous work in the field 16 with several additions and significant modifications to fit for use. To demonstrate its value and necessity in kidney research studies and its ability to complete the aforementioned tasks, we present two usage examples that exemplify that the KUPNetViz clearly extends the functionalities of the iKUP. Specifically, we found substantial additional evidence for a role of calreticulin in renal disease in humans 8 by showing that calreticulin is involved in a larger interaction network in renal graft rejection. In addition, we propose a novel association of the inflammatory axis interleukin-6 (IL6)/IL6 receptor with the progression of polycystic kidney disease.

Implementation and functionality

General architecture

The KUPNetViz is a web application developed using PHP for server side programming and jQuery ( http://www.jquery.org) coupled with static HTML for the client side. The backend of the application is a MySQL database hosting the background knowledge used to build the mappings among entities, the network interactions, annotation data and properly parsed experimental data that make up the most important part of the KUPKB. The backend database can be very easily updated and maintained as it can be rebuilt in an automated fashion with a single one-line command, through a series of Perl scripts that download, parse and import KUPKB data and the background knowledge resources into the schema, using a wrapper script and a simple YAML ( http://www.yaml.org) configuration file. The resulting interaction networks from user queries along with experimental data mappings (gene, protein, miRNA expression and statistical significance) are rendered, visualized and controlled using the Cytoscape Web graph visualization library 17 .

Background knowledge

The background knowledge integrated in KUPNetViz from KUPKB has been extensively described elsewhere 8 18 . Briefly, gene, protein and miRNA annotations are derived from NCBI gene, UniProt 19 and Ensembl 20 , and Microcosm 21 respectively. For the mappings among the various biological entities and accession numbers, we used the mapping files provided by NCBI (ftp.ncbi.nlm.nih.gov) as well as the Biomart web services 22 . Files provided by NCBI were also used for the mapping of genes to their respective GO terms. Molecule interactions are extracted from the STRING protein-protein interaction database 23 by parsing publicly available flat files and miRNA to gene interactions are extracted from the Microcosm miRNA target files ( http://www.ebi.ac.uk/enright-srv/microcosm/htdocs/targets/v5/). Biochemical pathway information was derived from KEGG 24 using its freely available web service API to download and construct mappings between genes and their respective pathways.

Experimental data

The experimental data integrated in KUPNetViz and used to map gene/protein/miRNA abundance to the biochemical networks constructed by querying the application has been extensively described elsewhere 8 . Briefly, the knowledge base currently contains over 220 experiments spanning several biological layers (gene, protein, miRNA and metabolite abundances) derived from published articles and public repositories (e.g. GEO). In some cases data are reported as extracted from the respective publication (coupled with statistical significance where available) while in other cases data had to be reanalyzed (mostly microarray data) before integration. All experimental data are derived from studies related to several kidney diseases and anatomies and are manually curated.

Usage and navigation

The KUPNetViz application consists of a single web page divided in three main parts: i) the interaction network canvas, ii) the annotation section with legends regarding the network components and a dynamic subsection which displays information and external links for the selected network elements, and iii) the application controls which are organized in four tabs containing data search and mapping options as well as more advanced application controls. This configuration allows the user to easily navigate through the application and explore and map KUPKB data without leaving the main network view. A snapshot of the application is presented in Figure 1. The usage of the application is self-guiding and can be summarized in three basic steps: i) search using a molecule list of interest, querying one or more of the supported species, ii) mapping of experimental data to the network, by navigating and selecting through multiple renal locations and diseases associated with the queried molecules and iii) association with biological functions, processes and cellular components, with biochemical pathways and with miRNAs targeting the entities of interest.

Figure 1

A snaphsot of KUPNetViz in action

A snaphsot of KUPNetViz in action. Under the network view, the KUPKB data mapping tab is displayed, where the researcher can map onto the network several gene/protein/miRNA expression datasets and apply different criteria and combinations of kidney location(s), disease(s) and data from published studies. This snapshot focuses on the down-regulation of TGFβ1 in kidney in two renal disease states, namely in acute renal allograft rejection and interstitial fibrosis and tubular atrophy, and the up-regulation of the miRNA that targets TGFβ1 in interstitial fibrosis and tubular atrophy, as derived from two published studies.

Data input

The user can query KUPNetViz using a variety of molecule identification types, including HUGO gene symbols (e.g. “CALR”, but the search is case-insensitive), Entrez accession numbers (e.g. 811), UniProt accessions (e.g. P27797), Ensembl gene and protein IDs (e.g. ENSG00000179218 or ENSP00000320866), miRBase accessions (e.g. hsa-miR-362-3p) or by simple free text (e.g. “calreticulin”). All the above accessions and search terms are equivalent to calreticulin and can be used in mixed identification types. Four different mammalian species are currently supported, namely human, mouse, rat and dog. A strong feature of KUPNetViz, which is rare among other application of the same kind, is the ability to query multiple species for protein-protein network reconstruction at the same time.

Extrapolation of protein–protein interactions across species

The extrapolation procedure followed by KUPNetViz is quite simplistic and is performed by assuming that a protein-protein interaction which has been recorded experimentally or predicted in e.g. humans but not in mice has a probability of being also present in mice (or any other of the supported by the application mammalian species). Thus, instead of aligning networks from different organisms (in similar taxonomies) the tool creates a meta-network with “super-nodes” and “super-edges” where protein-protein interactions are shared among species. Although this assumption involves certain risk regarding its biological validity, it allows the quick creation of hypotheses regarding those protein-protein interactions and the quick screening from the experimentalist. The extrapolation assumption to super-nodes and super-edges is also based on a certain “bias” in the number of biological studies regarding each organism, as there is a normal and understandable trend towards the study of human biomaterial (cell lines, healthy or diseased tissues) as compared to other organisms. For example, a simple search of the term “homo sapiens” in Pubmed yields a number of studies which is one order of magnitude greater (12547599) than the respective number of studies for “mus musculus” (1159658) at the time of search. This is reflected also in the number of protein-protein interactions: using a calreticulin-centered first level network (the first neighbors of calreticulin) for human, 365 protein-protein interactions are found (interaction score threshold: 0.2) while the same number for mouse is 8 (same threshold).

Running time

The running time of the KUPNetViz depends on two factors: i) the performance of the network layout algorithm and the visualization itself (e.g. supported network size) which depends on the Cytoscape Web library and ii) the querying performance of the application regarding the mapping of renal –omics datasets to the network. The latter depends on the number of queried molecules. Typical queries with a few molecule names complete in a few seconds. The only operations that are quite time consuming both in terms of querying and visualization time are the search for second level neighbors of selected nodes and the case of many queried molecules when working in multi-species mode. Finally, the speed of the network layout depends on the user’s machine as the network rendering takes place in the user’s browser.

Data export

Network export is available through the respective functions of the Cytoscape Web library. The user has the ability to export the resulting network after querying and processing, in a variety of text formats (SIF, GraphML, XGMML and Arena3D format 25 ) as well as high-quality image formats (PDF, PNG and SVG). The text formats allow the import of the resulting networks into other graph analysis applications (such as Cytoscape) for further explorations and network property analysis. The supported text formats are sufficient for compliance with most current biological network visualization tools.

Results

To demonstrate the added value of the KUPNetViz we present two case studies, revealing insights that would require significant effort to mine using the iKUP only. Specifically, we explored the role of calreticulin, a protein involved in renal disease in animals, in a larger interaction network in renal graft rejection (interstitial fibrosis and tubular atrophy) and its association with other functional and pathway elements and the association of the inflammatory axis interleukin-6 (IL6)/IL6 receptor with the progression of polycystic kidney disease.

Case study 1: the role of calreticulin in renal graft rejection

Calreticulin is a protein involved in renal disease in animals 26 . Using the iKUP browser to explore calreticulin entries in the KUPKB, we demonstrated for the first time that calreticulin expression was induced in human renal graft rejection, an in silico hypothesis that was then experimentally confirmed 8 . To better understand the role of calreticulin in renal graft rejection, we sought to investigate the following questions: 1) was calreticulin acting as an independent player or as a part of a bigger protein network and 2) what were the processes associated with calreticulin dysregulation. These questions could not be answered using the iKUP browser alone, as they required knowledge about protein-protein interactions and protein annotation with biological processes.

We used the KUPNetViz to search for “calreticulin” in human and visualized neighboring genes/proteins (first level neighbors, interaction threshold: 0.2). We next annotated the network with renal graft rejection data contained in the KUPKB, GO processes and KEGG pahways. To perform this procedure, we selected “interstitial fibrosis and tubular atrophy” from the kidney disease panel. This disease model was associated with, namely “Nakorchevsky, J Am Soc Nephrol, 2010”, “Rodder, Am J Transplant, 2009” and “Scherer, Nephrol Dial Transplant, 2009”. In addition to these datasets, we also selected “ECM-receptor interaction” pathway and “basement membrane” cellular component from the GO and KEGG panels respectively. The resulting network is presented in Figure 2.

Figure 2

Calreticulin network

Calreticulin network. Analysis of the calreticulin network shows that not only calreticulin expression is modified in renal graft rejection but also many of the genes/proteins (34%) with known interactions with calreticulin. In addition specific processes/pathways are enriched including extracellular matrix interaction and basement membrane proteins. These are processes and pathways known to be involved in renal graft rejection. Overall projection of the KUPKB data on the calreticulin interaction network clearly exemplifies a role of calreticulin in renal graft rejection.

Overall, we observed that calreticulin was at the centre of a network where many genes and proteins were also modified in this pathology, and that most of these biomolecules were part of the basal membrane and involved with cell to extracellular matrix interactions (ECM). Specifically, the ECM-receptor interaction pathway is mostly upregulated, as observed from the orange to red colour scale of the nodes connected to calreticulin (Figure 2).

ECM and basement membrane are known to be significantly altered in renal graft rejection 27 and the analysis performed using our tool links calreticulin to these pathological processes. Furthermore, basement membrane proteins identified as interacting with calreticulin such as laminins and fibronectin (LAMA2, LAMA4, LAMA5, LAMB1, LAMB2, LAMC1 and FN1 in Figure 2) have been identified to be upregulated in renal graft rejection 28 29 . Altogether, the use of KUPNetViz provides additional, pathway-based, evidence for calreticulin as a valuable target in renal graft rejection. Although it would not have been impossible to find these links otherwise, the use of the tool presented in this article significantly accelerated this discovery. For example, a search using Pubmed did not return any results when querying for “laminin fibronectin calreticulin”. Furthermore, querying classic network visualization tools such as the tool provided in the STRING database 23 website lack background knowledge on kidney diseases.

Case study 2: potential involvement of IL6 and the IL6 receptor in the progression of polycystic kidney disease

IL6 is a pro-inflammatory cytokine that has been previously described as a pro-fibrotic mediator in liver 30 , in lung and in skin 31 models of fibrosis. However, current evidence for a possible role of IL6 in the development of renal fibrosis is limited. The only available evidence of this link in the kidney was published recently in a study demonstrating that mice with genetic blockade of IL6 were protected against the development of renal fibrosis 32 . In addition, the authors also observed increased IL6 expression levels in kidney biopsies of chronic kidney disease patients compared to age-matched control biopsies. IL6 is a peculiar cytokine as, although it is expressed by a large variety of cells, it can only target a low number of those, due to the very limited expression of its receptor (IL6R) 33 .

To further investigate the possible role of IL6 and IL6R, we performed a search for these molecules in the iKUP which demonstrated that their renal expression in normal conditions seemed to be limited to the collecting duct, a tubular structure of the kidney 34 . The results also showed that IL6 and IL6R were up-regulated in both human polycystic kidney disease (PKD) and in a rat PKD model 35 36 . To our knowledge, there are few data linking IL6 to PKD but a recent study 37 demonstrated that patients with PKD had higher level of serum IL6 as compared to controls. In order to obtain further support for this link, we next searched for IL6 and IL6R in the KUPNetViz. In humans, the network associated with those two entities obtained by querying for and displaying their first level interacting neighbors was mostly up-regulated in patients with PKD (Figure 3A). This outcome was derived after mapping three KUPKB datasets related to PKD, namely “Lai, Proteomics Clin Appl, 2008”, “Mason, Proteomics Clin Appl, 2009” and “Song, Hum Mol Genet, 2009”, 35 38 39 (disease filter: “autosomic dominant polycystic kidney disease” and “PKD1 autosomic dominant kidney disease”). More importantly, in mouse and rat models of PKD, up-regulation of the network increased with the severity of the disease as levels of up-regulation increased (i.e. orange to red) and the p-values decreased with disease progression (thickening of node border). This outcome was derived after mapping the dataset “Chen, BMC Res Notes, 2008” for mouse 40 , and the dataset “Koupedidou, BMC Nephrol, 2010” for rat 36 . The latter is depicted in Figure 3B-D. To conclude, by using the KUPNetViz we were able to generate a novel hypothesis, suggesting for the first time that the IL6/IL6R axis might be an important player in PKD and a valuable target in this disease. In the future, experiments will be needed to confirm or refute this hypothesis but this could represent a major step forward in understanding and handling this common and life-threatening genetic disease.

Figure 3

The IL6/IL6R axis network for human and rat

The IL6/IL6R axis network for human and rat. The IL6/IL6R axis expression network created by querying the KUPNetViz for IL6 and IL6R and using the first level interaction neighbors of IL6. (A) The network associated with IL6/IL6R is mostly up-regulated in patients with PKD as observed after mapping the results of three relevant studies in human (namely “Lai, Proteomics Clin Appl, 2008”, “Mason, Proteomics Clin Appl, 2009” and “Song, Hum Mol Genet, 2009”), (B-D) Progressive upregulation of the IL6/IL6R network in PKD in rat model as derived from the study of Koupepidou et al.36. The three figures depict three distinct stages of the disease, specifically (A) day 0, (B) day 6 and (D) day 24, after mapping the three experimental conditions in the dataset “Koupepidou, BMC Nephrol, 2010”.

Discussion

The wealth of biological information regarding kidney disease that can be extracted from the current version of the KUPKB is currently presented in tabular format. Given the possibly long returning lists of molecules, the nature of the knowledge base and the fact that it contains hierarchical and protein interaction data, we sought to visualize these interactions through a gene network visualization module. Prior to its development we tried to map experimental data to some of the numerous existing software packages that have been developed for biological network visualization and exploration and at the same time suited at least partially our needs (mapping of possible multi-omics data, integrated background databases, customizable data sources). These tools included EGAN, 16 and Biological Networks, 41 . In the case of EGAN, the feedback included the excessive amount of displayed information, the view complexity derived from the multiple data sources, the inability to simultaneously display expression data from multiple experiments and the moderate performance of the application regarding big networks. Another major issue with EGAN is the lack of directionality in gene-to-gene edges. On the other hand, in the case of Biological Networks, the users were overwhelmed mostly by the amount of functionalities, the complex interface and the multiple background knowledge data sources, features that might fascinate a bioinformatician but often frustrates a bench biologist seeking clear-cut specific answers instead of heavy and complex multi-functionalities. In addition, Biological Networks requires a local installation which is often a disadvantage, mostly in terms of maintenance.

The above tools comprise only a small example out of a nowadays large pool of tools allowing the combined view of protein-protein interaction networks and abundance data 14 . Leaving aside the domain-specific nature of KUPNetViz, most of the packages presented in the first table of 14 meet the criteria of combined views of interaction and abundance data (e.g. VANTED, 42 , GENeVis, 43 ) and are equipped with high-level functionalities (e.g. network clustering algorithms, 44 dimensionality reduction methods, 45 , visualization of extremely large networks, 46 , and network module detection, 47 ). However, although the above tools excel in network-related functionalities and visualization quality, none of them comes with a complete and yet simple set of bundled background databases (protein-protein interaction, element annotation, functional and pathway associations) in one place, a fact which is essential for the researcher. Other solutions including Pajek, 48 , and yEd ( http://www.yworks.com/en/products_yed_about.html) comprise also excellent network analysis tools but are very generic and not focused to biological problems which may also confuse the biologist.

Another drawback with many current applications is that they are desktop-based while there is a current trend in software development to switch from desktop to web-based applications for a variety of reasons including flexibility, maintenance and platform independence. Thus, although there are certain web-based tools, based on Java^TM webstart technology (VANTED, 42 ) or related technologies (VisANT, 13 ), they still have limitations similar to the ones described for certain desktop applications, and mainly the lack of a small but adequate set of bundled background databases and a relatively simplistic interface which does not require prior training. The KUPNetViz application, although simplistic in its basic concepts, aspires to initiate a set of domain-specific integrative visualization tools that focus in comprehensive interfaces. These interfaces could be used to answer biological questions from the first usages, without demanding from the user to go through long manuals, time-consuming database and literature searches to annotate the various entities in the network and understand basic mathematical concepts of graph theory.

Even though the developers of several biological network visualization and data integration packages have put substantial effort in the simplicity and clarity of the outcome, some of them can only be fully exploited by trained bioinformaticians while others require prior training in the form of workshops or seminars in order to be used at their full potential. The development of the KUPNetViz application was a continuous interaction between computer scientists and renal biologists, where several features were added, removed or re-implemented in the basis of the arguments that the application should remain simple to use, self explanatory and it should provide simple biological network snapshots. As a result, the tool manages to keep the network display and the functionalities as simple as possible by hiding and/or automating several technical aspects and mathematical properties of the biological graph. At the same time, it does not restrain more advanced users from exploring other possible mathematical properties of the resulting networks as it allows their export in text formats that can be imported to numerous existing graph analysis packages for further analysis. Additionally, taking into account a recent trend in the development of bioinformatics tools 49 , which focuses to the application of user-centered design methods, the KUPNetViz comprises a good example of such a strategy, given the time that was required for the case studies to be completed and the feedback from biologists that used the application.

To our knowledge, one of the novelties of the KUPNetViz, rare among other biological network visualization tools, is the ability to visualize gene-to-gene relationships in a multispecies manner, while at the same time maintaining the simple minimalistic network display. The majority of current packages focuses either on the alignment of known biochemical networks among different species (e.g. Osprey, 50 or VANLO, 51 ), allowing the user to visualize but not automatically extrapolate possible interactions that may apply across species, or on the manual building of such inter-species interaction networks based on multiple available data sources (e.g. Biological Networks, 41 ). In addition, the multiple network alignment or the manual network building is usually performed with a certain cost in application and visualization simplicity, often discouraging the simple or the hurried user.

The KUPNetViz goes one step further by allowing the possibility of extrapolation of biomolecular relationships from one species to another while at the same time maintaining its straightforward visual interface and graph views (Additional file 1). This creates the unique possibility to combine –omics data from different animal models and/or human disease and quickly screen for reasonable hypotheses. As the KUPNetViz supported organisms are not equally extensively annotated in the literature (e.g. mouse is better annotated for developmental functions than human 52 ), the multi-species functionality allows the extrapolation of gene-to-gene relationships and gene/protein expression data from one organism to another. While some relationships between molecules may differ between species, most relationships and biological pathways can be considered evolutionary conserved and thus allow the easy formulation of extrapolated hypotheses, regarding both biomolecular relationships and gene/protein expression under certain pathological conditions across species. The researcher can then judge the fundamental validity of these hypotheses and proceed either by rejecting a hypothesis and formulate another, or by creating a list of plausible hypothesis which can ultimately be verified in the lab.

Additional file 1

An example of KUPNetVis multispecies view. Three snapshots of a simple KUPNetViz network presenting the multispecies view, either with ‘supergenes’ encompassing orthologs among the supported species, or with collapsed ‘supergenes’ for simplified view.

Click here for file

Furthermore, as the whole application can be easily built and deployed in several machines, this modularity renders it easily adaptable to other data sources (e.g. another organ). Thus, with minor modifications, the KUPNetViz may evolve to a more general framework, suitable for organ or tissue specific knowledge bases, which at the same time is not as generic as Cytoscape Web itself but includes the basic bundled knowledge to function as an independent biological network visualization application. Although such an application is not alone in its kind, it manages to keep simplicity, visual complexity and speed at acceptable levels, as compared with the two aforementioned applications ( 16 41 ). A schematic representation of the KUPNetViz modularization is depicted in Figure 4.

Figure 4

The modularization of KUPNetViz

The modularization of KUPNetViz. A simple description of the architecture behind the KUPNetViz which consists of three simple layers, present in many similar applications: i) the background database, ii) the server PHP code layer handling database querying and parsing iii) the client side of the application where the interaction with the user is taking place. Client–server data exchange is performed using the widely used JavaScript Object Notation (JSON) format. The vertical dashed lines in the data stored in the backend database layer and the PHP layer mark the distinction of the different modules handling the background knowledge and network reconstruction and the KUPKB datasets and their mapping to the network. This distinction allow the evolution of the KUPNetViz to a more general biological network web framework which combines bundled background knowledge on biomolecular interactions with a very simple interface designed for bench researchers wishing simple biological state snapshots and not to go deeper in other network properties.

The reasoning behind the construction of a MySQL backend database to support the application instead of using the KUPKB semantic web repository includes three main arguments: firstly, during the development we sought to maintain software modularity and re-usability even though in its current version, the visualization application is adjusted to kidney. Thus, we considered that an RDB model (especially regarding the background knowledge part) can be adjusted easier to other similar experimental data sources (e.g. a different organ) or data sources of different structure (e.g. multi-omics data from different types of cancer), requiring minor modifications only in the user interface part. Secondly, not all the background knowledge data sources that are used by the KUPNetViz were initially incorporated in the semantic web model (e.g. the protein-protein interactions or the reference pathways). This does not suggest a KUPKB model design problem but rather highlights the different initial purpose of the model which was not network visualization, but the detailed knowledge representation of kidney anatomies and disease models and the experimental data mapping and annotation to this knowledge representation model. Finally, during the design process, we realized that not all of the knowledge stored in the semantic web repository (e.g. the highly detailed localization and hierarchization of cell types and kidney tissues) was required for the minimalistic approach promoted by KUPNetViz. Thus, we used an RDB model to reduce query complexities and running speeds. In addition to the latter, the backend database building time, including the download and parsing of the background knowledge files and the re-parsing of the experimental data requires less than three hours while the rebuild of the semantic web repository requires over a day (the times were measured on the same computer). This allows the easier maintenance of the KUPNetViz and the more frequent synchronization of the backend database with the latest releases of the incorporated background knowledge sources.

Conclusions and future work

KUPNetViz is a user-friendly biological network visualization tool dedicated to renal research. This tool uses data gathered from multiple resources on several renal pathological states and background knowledge elements regarding biomolecular interactions as well as functional and biochemical pathway associations and creates biological network snapshots that can shed light on mechanisms involved in renal disease. It provides renal biologists with an alternative network data representation, complementary to the functionality of the iKUP exploration tool with a small cost regarding the display complexity. Moreover, maybe its most important feature is the biomolecular interaction extrapolation across species which allows the researcher to quickly formulate and screen several hypotheses in a simple manner. We have demonstrated the value of KUPNetViz in two case studies, further investigating the role of calreticulin as a key player in a gene network mostly up-regulated in renal graft rejection and newly investigating the potential involvement of IL6 and the IL6 receptor in the progression of polycystic kidney disease.

The value of the KUPNetViz in kidney research will be increased in parallel with the number of related public multi-omics datasets which will be also available in the KUPKB. The expertise of the KUPNetViz hosting group in renal biology guarantees its constant curation and addition of related datasets, as these are getting published. This will potentially lead to increased confidence in the observed networks as additional biological evidence will eventually pile up from those constantly added datasets. The addition of datasets is easy and straightforward, since data are entered in ready-to-fill Excel sheets by biologists and incorporated by a single command line. We are aiming to the inclusion of the latest omics data sets in the KUPKB at least once a year from which the KUPNetViz will directly benefit from. Future extensions include the incorporation of additional biomolecule annotation and interaction resources (KEGG compounds) and the addition of a module that will identify highly enriched and over-represented biological functions and biochemical pathways using the StRAnGER algorithm 53 .

Availability and requirements

Home page: Accessible through the website of KUPKB and the iKUP browser (tab “KUPKB Network Visualizer”) at http://www.kupkb.org or directly at http://www.kupkb.org/vis/index.php.

Operating system: KUPNetViz is a web-based application thus it is platform independent.

Requirements: KUPNetViz is best used and viewed under Internet Explorer 8 or higher, Mozilla Firefox, Google Chrome, Apple Safari and Opera. The use of Internet Explorer 7 and lower is not recommended.

License: KUPNetViz is free for academic use but requires a license from the authors for any commercial purposes. The software is available without user registration.

Further information: An analytical user’s guide coupled with usage examples is available at the application’s homepage and as supplementary material online (Additional file 2).

Additional file 2

The KUPNetVis user guide. An extensive user guide presenting all the features of the application with additional examples.

Click here for file

Abbreviations

KUP: Kidney and urinary pathways; KUPKB: Kidney and urinary knowledge base; KUPNetViz: Kidney and urinary pathway network visualizer; GO: Gene ontology; KEGG: Kyoto encyclopedia of genes and genomes; PKD: Polycystic kidney disease; CKD: Chronic kidney disease; IL6: Interleukin 6; IL6R: Interleukin 6 receptor; RDB: Relational database.

Competing interest

The authors do not declare any conflicts of interest.

Authors' contributions

PM conceived, designed and implemented the software as a whole, tested the software and drafted the manuscript. JK and JPS tested the software, suggested interface modifications, performed the case studies and contributed in drafting the manuscript. SJ and RS contributed to the database design and the integration of KUPNetViz to the iKUP browser. JB contributed in drafting the manuscript and data management. All authors have read, revised and approved the manuscript.

Acknowledgements

The authors want to express their acknowledgements to Dr Aristotle Chatziioannou for useful input at the early stages of the application design. This work is funded by the EU/FP7/ICT-2007.4.4 e-LICO project ( http://www.e-lico.eu) and by the FP7-IAPP program “Protoclin” (GA 251368, http://www.protoclin.org).

Cross-platform analysis of global microRNA expression technologiesYaukCLRowan-CarrollASteadJDWilliamsABMC Genomics20101133010.1186/1471-2164-11-330289056220504329Multiplexed protein measurement: technologies and applications of protein and antibody arraysKingsmoreSFNat Rev Drug Discov20065431032010.1038/nrd2006178025116582876Sequencing technologies - the next generationMetzkerMLNat Rev Genet2010111314610.1038/nrg262619997069Mass spectrometry-based metabolomicsDettmerKAronovPAHammockBDMass Spectrom Rev2007261517810.1002/mas.20108190433716921475The GUDMAP database–an online resource for genitourinary researchHardingSDArmitCArmstrongJBrennanJChengYHaggartyBHoughtonDLloyd-MacGilpSPiXRoochunYDevelopment2011138132845285310.1242/dev.063594318859321652655In-depth proteomic profiling of the normal human kidney glomerulus using two-dimensional protein prefractionation in combination with liquid chromatography-tandem mass spectrometryMiyamotoMYoshidaYTaguchiINagasakaYTasakiMZhangYXuBNametaMSezakiHCuellarLMJ Proteome Res2007693680369010.1021/pr070203n17711322High-throughput identification of IMCD proteins using LC-MS/MSPisitkunTBieniekJTchapyjnikovDWangGWuWWShenRFKnepperMAPhysiol Genomics200625226327610.1152/physiolgenomics.00214.2005143603616449382The KUPKB: a novel Web application to access multiomics data on kidney diseaseKleinJJuppSMoulosPFernandezMBuffin-MeyerBCasemayouAChaayaRCharonisABascandsJLStevensRFASEB J20122652145215310.1096/fj.11-19438122345404Biological knowledge management: the emerging role of the Semantic Web technologiesAntezanaEKuiperMMironovVBrief Bioinform200910439240710.1093/bib/bbp02419457869Learning biological networks: from modules to dynamicsBonneauRNat Chem Biol200841165866410.1038/nchembio.12218936750Getting connected: analysis and principles of biological networksZhuXGersteinMSnyderMGenes Dev20072191010102410.1101/gad.152870717473168Tools for visually exploring biological networksSudermanMHallettMBioinformatics200723202651265910.1093/bioinformatics/btm40117720984VisANT 3.5: multi-scale network visualization, analysis and inference based on the gene ontologyHuZHungJHWangYChangYCHuangCLHuyckMDeLisiCNucleic Acids Res200937Web Server issue115121Visualization of omics data for systems biologyGehlenborgNO'DonoghueSIBaligaNSGoesmannAHibbsMAKitanoHKohlbacherONeuwegerHSchneiderRTenenbaumDNat Methods201073 SupplS56S6820195258A survey of visualization tools for biological network analysisPavlopoulosGAWegenerALSchneiderRBioData Min200811210.1186/1756-0381-1-12263668419040716EGAN: exploratory gene association networksPaquetteJTokuyasuTBioinformatics201026228528610.1093/bioinformatics/btp656280430519933825Cytoscape Web: an interactive web-based network browserLopesCTFranzMKaziFDonaldsonSLMorrisQBaderGDBioinformatics201026182347234810.1093/bioinformatics/btq430293544720656902Developing a kidney and urinary pathway knowledge baseJuppSKleinJSchanstraJStevensRJ Biomed Semantics20112Suppl 2S710.1186/2041-1480-2-S2-S7323930722166306Reorganizing the protein space at the Universal Protein Resource (UniProt)ConsortiumTUNucleic Acids Res201240Database issue7175Ensembl 2012FlicekPAmodeMRBarrellDBealKBrentSCarvalho-SilvaDClaphamPCoatesGFairleySFitzgeraldSNucleic Acids Res201240Database issue8490miRBase: integrating microRNA annotation and deep-sequencing dataKozomaraAGriffiths-JonesSNucleic Acids Res201139Database issue152157BioMart Central Portal: an open database network for the biological communityGubermanJMAiJArnaizOBaranJBlakeABaldockRChelalaCCroftDCrosACuttsRJDatabase (Oxford)20112011bar04110.1093/database/bar041The STRING database in 2011: functional interaction networks of proteins, globally integrated and scoredSzklarczykDFranceschiniAKuhnMSimonovicMRothAMinguezPDoerksTStarkMMullerJBorkPNucleic Acids Res201139Database issue561568KEGG for integration and interpretation of large-scale molecular data setsKanehisaMGotoSSatoYFurumichiMTanabeMNucleic Acids Res201240Database issue109114Arena3D: visualizing time-driven phenotypic differences in biological systemsSecrierMPavlopoulosGAAertsJSchneiderRBMC Bioinformatics2012134510.1186/1471-2105-13-45336871622439608Altered expression of calreticulin during the development of fibrosisKypreouKPKavvadasPKaramessinisPPeroulisMAlbertiASiderasPPsarrasSCapetanakiYPolitisPKCharonisASProteomics20088122407241910.1002/pmic.20070083118563736Unique changes in interstitial extracellular matrix composition are associated with rejection and cyclosporine toxicity in human renal allograft biopsiesAbrassCKBerfieldAKStehman-BreenCAlpersCEDavisCLAm J Kidney Dis1999331112010.1016/S0272-6386(99)70252-09915262Establishing the molecular pathways involved in chronic allograft nephropathy for testing new noninvasive diagnostic markersMasVMalufDArcherKYanekKMasLKingAGibneyEMasseyDCotterellAFisherRTransplantation200783444845710.1097/01.tp.0000251373.17997.9a17318078Endothelin-mediated oncofetal fibronectin expression in chronic allograft nephropathySiddiquiIKhanZALianDJiangJZhongRGarciaBChakrabartiSTransplantation200682340641410.1097/01.tp.0000228905.44649.0616906041Attenuated liver fibrosis and depressed serum albumin levels in carbon tetrachloride-treated IL-6-deficient miceNatsumeMTsujiHHaradaAAkiyamaMYanoTIshikuraHNakanishiIMatsushimaKKanekoSMukaidaNJ Leukoc Biol199966460160810534116Immunization with DNA topoisomerase I and Freund's complete adjuvant induces skin and lung fibrosis and autoimmunity via interleukin-6 signalingYoshizakiAYanabaKOgawaAAsanoYKadonoTSatoSArthritis Rheum201163113575358510.1002/art.3053921792823Interleukin 6 underlies angiotensin II-induced hypertension and chronic renal damageZhangWWangWYuHZhangYDaiYNingCTaoLSunHKellemsREBlackburnMRHypertension201259113614410.1161/HYPERTENSIONAHA.111.17332822068875Interleukin-6, its role in fibrosing conditionsO'ReillySCiechomskaMCantRHugleTvan LaarJMCytokine Growth Factor Rev20122339910710.1016/j.cytogfr.2012.04.00322561547Transcriptional profiling of native inner medullary collecting duct cells from rat kidneyUawithyaPPisitkunTRuttenbergBEKnepperMAPhysiol Genomics2008322229253227665217956998Systems biology of autosomal dominant polycystic kidney disease (ADPKD): computational identification of gene expression pathways and integrated regulatory networksSongXDi GiovanniVHeNWangKIngramARosenblumNDPeiYHum Mol Genet200918132328234310.1093/hmg/ddp16519346236Cyst formation in the PKD2 (1–703) transgenic rat precedes deregulation of proliferation-related pathwaysKoupepidouPFelekkisKNKranzlinBStichtCGretzNDeltasCBMC Nephrol2010112310.1186/1471-2369-11-23293687320813037Inflammation, oxidative stress, and insulin resistance in polycystic kidney diseaseMenonVRudymDChandraPMiskulinDPerroneRSarnakMClin J Am Soc Nephrol20116171310.2215/CJN.04140510302225020829421Characterization of the renal cyst fluid proteome in autosomal dominant polycystic kidney disease (ADPKD) patientsLaiXBacallaoRLBlazer-YostBLHongDMasonSBWitzmannFAProteomics Clin Appl200827–811401152285734220411046The biomarker enriched proteome of autosomal dominant polycystic kidney disease cyst fluidMasonSBLaiXBacallaoRLBlazer-YostBLGattoneVHWangKCWitzmannFAProteomics Clin Appl20093101247125010.1002/prca.200800163288052220526430Gene expression in early and progression phases of autosomal dominant polycystic kidney diseaseChenWCTzengYSLiHBMC Res Notes2008113110.1186/1756-0500-1-131263266719099603BiologicalNetworks 2.0--an integrative view of genome biology dataKozhenkovSDubininaYSedovaMGuptaAPonomarenkoJBaitalukMBMC Bioinformatics20101161010.1186/1471-2105-11-610301922821190573VANTED: a system for advanced data analysis and visualization in the context of biological networksJunkerBHKlukasCSchreiberFBMC Bioinformatics2006710910.1186/1471-2105-7-109141356216519817Compressed adjacency matrices: untangling gene regulatory networksDinklaKWestenbergMAvan WijkJJIEEE Trans Visualization and Computer Graphics2012181224572466Construction, visualisation, and clustering of transcription networks from microarray expression dataFreemanTCGoldovskyLBroschMvan DongenSMazierePGrocockRJFreilichSThorntonJEnrightAJPLoS Comput Biol200731020322042204197917967053SpectralNET–an application for spectral graph analysis and visualizationFormanJJClemonsPASchreiberSLHaggartySJBMC Bioinformatics2005626010.1186/1471-2105-6-260127678716236170A huge graph visualization frameworkAuberDGraph Drawing SoftwareHeidelberg, Germany: SpringerMutzel P, Junger M2004105126Control of yeast filamentous-form growth by modules in an integrated molecular networkPrinzSAvila-CampilloIAldridgeCSrinivasanADimitrovKSiegelAFGalitskiTGenome Res200414338039010.1101/gr.202060435322314993204Pajek - Program for Large Network AnalysisBatageljVMrvarAConnections19982124757The Enzyme Portal: a case study in applying user-centred design methods in bioinformaticsde MatosPChamJACaoHAlcantaraRRowlandFLopezRSteinbeckCBMC Bioinformatics20131410310.1186/1471-2105-14-103362373823514033Osprey: a network visualization systemBreitkreutzBJStarkCTyersMGenome Biol200343R2210.1186/gb-2003-4-3-r2215346212620107VANLO–interactive visual exploration of aligned biological networksBraschSLinsenLFuellenGBMC Bioinformatics20091032710.1186/1471-2105-10-327276639119821976Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene listsda HuangWShermanBTLempickiRANucleic Acids Res200937111310.1093/nar/gkn923261562919033363Exploiting statistical methodologies and controlled vocabularies for prioritized functional analysis of genomic experiments: the StRAnGER Web applicationChatziioannouAAMoulosPFront Neurosci201158303237921293737