CancerGeneNet is a resource that aims at linking genes that are frequently mutated in cancers to cancer phenotypes. The resource takes advantage of a curation effort aimed at embedding a large fraction of the gene products that are found altered in cancers in the cell network of causal protein relationships. Graph algorithms, in turn, allow to infer likely paths of causal interactions linking cancer associated genes to cancer phenotypes thus offering a rational framework for the design of strategies to revert disease phenotypes. CancerGenNet bridges two interaction layers by connecting proteins whose activities are affected by cancer gene products to proteins that impact on cancer phenotypes. This is achieved by implementing graph algorithms that allow searching for graph path that link any gene of interest to the “hallmarks of cancer".
For questions, support, bug reports, proposals or any other communication, feel free to contact:
Download file description
The following files contain the relations of cancer MiniPathways and curated tumor pathways.
The files are tab-delimited and respect the SIGNOR data format except for the first two columns that provide the SIGNOR pathway ID and NAME.
Cancer Modules |
Curated Tumors |
Below is the Human Interactome used to calculate shortest paths to Metabolic Pathways
CancerGeneNET links links frequently mutated cancer genes to cancer phenotypes. The resource offers five options to browse its content:
This functionality allows the user to look for network paths linking any gene to the cancer hallmark-phenotypes. Results are visualized as a graph and listed in a table.
In the Shortest Path to Cancer Hallmarks submit field, you can enter a gene or a list of gene names. By selecting a phenotype (or all the phenotypes) in a drop-down menu one obtains a graph showing the shortest path to the selected cancer hallmark.
Press the Submit button.
The query could require a matter of tens of seconds, as the graph algorithms explores a variety of paths in the network.
The result page displays a graph showing the shortest path(s) linking the selected gene/s to the selected cancer hallmark-phenotypes. To give to the user a wider perspective of the possible connections between the query gene and the selected phenotype(s) the paths that are one step longer than the shortest ones are also shown in the graph and displayed in the table.
The figure below shows the result page of a query looking for functional connections between the AMPK kinase complex and the DNA Repair phenotype.
The query gene is highlighted in red in the graph and the resulting network displays all the connecting paths of minimal length. Each row of the summary table contains the following information path:
The Cancer Similarity Network offers a convenient entry point to the list of cancer genes annotated in the Cancer Gene Census of in DisGeNET. Each node in the graph is a tumor type. The node size is linked to the number of annotated cancer genes in each tumor.
The map also shows the connections between cancers that are most similar as evaluated by calculating the Jaccard index of the two tumor-specific lists of associated genes. Edge thickness is related to the value of the Jaccard index between the two linked tumors.
The graph can be explored by searching, via the “find tumor type” box, a specific tumor name, e.g.'Breast cancer'. The field also offers autocomplete functionality in order to facilitate searching by name. After submitting the query the corresponding node becomes larger and yellow for easy identification.
The Cancer Browser section offers an alternative way to access data and tools in the resource.
A drop-down menu allows the user to select a cancer type either from Cancer Census or from DisGeNET cancer lists. Clicking the ‘Draw’ button the user is lead to the results page showing an interactive graph viewer, as described in the 'Cancer Similarity Network' section. The graph viewer will display a schematic and detail-rich representation of the causal interactions between the cancer associated genes.
When exploring the Cancer Gene Census dataset, it is possible to consider only the most reliable associations (Tier 1) by checking the corresponding check boxes.
This functionality allows the user to enter a list of gene names (e.g. EPHA2, PTPN11, RAF1, BRAF) or UniprotKB IDs (e.g. P29317, Q06124, P04049, P15056) to obtain a graph linking the query proteins via causal interactions (see figure below).
CancerGeneNet stores a set of curated cancer-specific pathways and Minipathways that are frequently mutated in different cancers. The graph illustrating the pathways can be displayed by selecting them in two drop-down menus in the bottom right frame of the homepage. The dropdown menus allows to select either a specific cancer pathway (e.g. Colorectal Carcinoma) or to select a mini pathway (e.g. ErbB Signaling). A MiniPathway is a small, manually curated pathway that is frequently altered in different tumors.
The pathway page shows an interactive display of the selected pathway.
Pathways are manually curated and the number of entities considered in each pathway is arbitrarily limited to the ones that are considered central, in order to have an interpretable graph. Each edge and node is clickable to get more information.
Entities are automatically laid out and assigned to specific cell compartments (extracellular space, membrane, cytoplasm, nucleus).
"Stimuli" and "Phenotypes" are always placed upstream and downstream respectively. A series of filters at the top of the graph frame allow the user to modify the display of the pathway by including or excluding indirect interactions or relationships below a score threshold.
It is possible to add (or remove) entities to the displayed graph by using the dedicated box on the right of the graph. This same toolbox also allows the user to perform additional analyses (see next section).
The Toolbox section allows to perform different analysis:
The box allows the user to display interactions at different levels of complexity using three different query strategies to retrieve interactions from the SIGNOR database. Connect (Level 1) searches for relationships connecting any two entities in the query list; First Neighbors (Level 2) is a multi-step strategy that initially performs a search in SIGNOR for all interactions involving any of the seed entities, next prunes nodes with degree-one leaving in the resulting network only the query proteins and 'bridge' proteins that help connecting them; All (Level 3) retrieves all signaling interactions involving any of the seed entities and the proteins in the SIGNOR database without any further filtering. By default the visualizer displays Level 2 interactions (First Neighbours); if no interaction is retrieved at level 2,level 3 is shown; By clicking on Add physical interactions from Mentha the user can also include in the graph protein-protein interactions from the mentha database involving any seed entities and filtered by score. Only interactions whose reliability score is higher than 0.4 are shown. By clicking on Download Relations it is possible to download all the interaction visualized in the viewer (note: this does not include mentha interactions).
The edit nodes functionality allows the user to edit the list of the seed entities, by removing or adding nodes.
This functionality shows the overlap between the query genes list and the gene list in the curated MiniPathways.
It is also possible to search whether the list of nodes in the displayed network is enriched for genes that are also annotated in MiniPathways. For this task go to the Gene Enrichment Analysis frame and click the GO button. The algorithm compares the list of genes that are displayed in the graph with the lists of pathway genes as annotated in the MiniPathways datasets.
The tool returns a table showing the MiniPathways that are most represented in the query cancer genes list. For each pathway, in the Hits column, the tool returns the ratio of shared genes. The p-value column provides the p-value calculated as follows:
Data can be downloaded in two different ways:
General URL | https://signor.uniroma2.it/CancerGeneNet/getData.php | |
MiniPathway data | https://signor.uniroma2.it/CancerGeneNet/getData.php?type=miniPathways | This link allows to access the table containing the relations of cancer MiniPathways, the file is tab-delimited and respects the SIGNOR data format except for the first two columns that provide the SIGNOR pathway ID and NAME. |
Curated Pathway data | https://signor.uniroma2.it/CancerGeneNet/getData.php?type=curated | This link allows to access the table containing the relations of cancer Curated Tumour pathways, the file is tab delimited and respects the SIGNOR data format except for the first two columns that provide the SIGNOR pathway ID and NAME. |
Shortest Path Summary data | https://signor.uniroma2.it/CancerGeneNet/getData.php?type=shortestPath&proteins=PROTEINS&phenotype=PHENOTYPE&output=summary
Example: https://signor.uniroma2.it/CancerGeneNet/getData.php?type=shortestPath&proteins=AMPK&phenotype=DNA_Repair&output=summary |
This link allows to access the summary table provided as the output of the shortest path tool. The users have to specify the parameters proteins and phenotype, as shown in the example link. |
Shortest Path Relation data |
https://signor.uniroma2.it/CancerGeneNet/getData.php?type=shortestPath&proteins=PROTEINS&phenotype=PHENOTYPE&output=relations
Example: https://signor.uniroma2.it/CancerGeneNet/getData.php?type=shortestPath&proteins=AMPK&phenotype=DNA_Repair&output=relations |
This link allows to access the SIGNOR relations composing the graph provided as output of the shortest path tool. The users have to specify the parameter proteins and phenotype, as shown in the example link. The last column of the resulting table specify whether the protein relation belongs to a shortest path ("Shortest") or a path with one step more than the shortest one ("Longer"). |
Connect Proteins | https://signor.uniroma2.it/CancerGeneNet/getData.php?type=connect&proteins=LIST&level=LEVEL
Example: https://signor.uniroma2.it/CancerGeneNet/getData.php?type=connect&proteins=P29317+Q06124%2CP04049%2CP15056&level=2 |
This link allows to access the graph connecting every combination of proteins available in SIGNOR. The users have to specify the parameters proteins and level. The level parameter has to be set to 1 if there is a direct connection between desired proteins, 2 if there is a connection between desired proteins taking also into account the first neighbors and finally 3 that corresponds to the graph of the whole relations of desired proteins. |
CancerGeneNet’s data is generated by combining cancer-gene data extracted from the Cancer Census (v89) and Disgenet (v6.0) knowledge bases with the SIGNOR database.
The database is built using PostgreSQL (version 8.4). An extension to the original database was created for this resource to store externa cancer-gene data. The aim has been to produce efficient linking of imported data to the manually curated signaling interaction data within SIGNOR, in order to generate and visualize cancer networks.
Cancer Census data was imported into the database in full, including tier-related information. The information was stored following the structure shown in the table below.
Gene Name | As stated | Tumor Type | Name of associated tumor |
Tier | Tier described for gene-cancer association |
Type | gene-cancer association type as described (somatic or germline) |
Disgenet data was filtered according to the following parameters:
Gene Name | As stated | Disease ID | Disease as described by imported GDA |
UniprotID | Mapped UniprotID, from Disgenet |
Score | GDA score assigned by Disgenet |
Source | Source Disgenet originally imported the information from |
Release | Disgenet release the information was taken from |
The data is presented to the user through a web interface based on HTML5, integrated with JavaScript scripting to provide a more dynamic and responsive user interaction. Most data manipulation and display is handled through PHP (version 7.0.33). R scripting is used to extract further meaningful information. The network viewer is created using SPV (Signaling Pathway Visualizer, v1.0) (Calderone, 2018).
UMLS® Metathesaurus® data is used under the terms of the UMLS® Metathesaurus® license.
The DisGeNET database and Knowledge discovery platform are provided to advance the knowledge about human diseases and their associated genes, and are intended to be used only for research and education. The DisGeNET database is made available under the Open Database License whose full text can be found at http://opendatacommons.org/licenses/odbl/1.0/. Any rights in individual contents of the database are licensed under the Database Contents License whose text can be found at http://opendatacommons.org/licenses/odbl/1.0/.
The development of CancerGenNet was supported by a grant from the Italian Association for Cancer Research (AIRC, Project IG 2017 ID. 20322)