Help

  • Homepage

PrecocityDB is a comprehensive database dedicated to an endocrine disorder, Precocious Puberty (PP). The homepage provides a short description of the disease and database contents. The database has manually curated genes, SNPs associated with PP along with pathways and ontologies associated with these genes.

Data can be retrieved using the navigation tabs (menu bar) on the top of the webpage
1
Database statistics and our contact information is available on the bottom of the webpage
2

  • Search
    • Quick Search

Quick search option in PrecocityDB allows users to query the whole database based on keywords. The number of hits, from each section of the database along with section name, is listed as a result. The results are hyperlinked to the detailed search result of the respective section.
There are two search options: 1. Search, 2. Exact search. Exact search matches the precise keyword, displays specific hits for any id or symbol.

Note: Only the first keyword will be considered for search if multiple keywords are given separated by space. Any other delimiter is invalid.

3

54b

8

    • Advanced Search

Advanced search in PrecocityDB allows users to search the database section-wise by selecting fields and applying logical operators (AND, OR, NOT).

Title

Sub-headings

Description

Gene Information

Gene Symbol

Unique gene symbol provided/ assigned by HUGO Gene Nomenclature Committee (HGNC)

Aliases

Synonyms or known terms for genes. These are unapproved nomenclature used in old literature.

Entrez Gene ID

A unique identification number assigned to every new gene record by NCBI

Gene Name

A unique name provided/ assigned by HUGO Gene Nomenclature Committee (HGNC)

Chromosomal Location

It provides information regarding the location of genes on a chromosome.

Gene Summary

It provides a short description of the gene from the NCBI Gene database.

RefSeq DNA

RefSeq DNA ID

RefSeq mRNA

RefSeq mRNA ID

Gene Ontology

Gene Symbol

Symbol or aliases of the gene

GO ID

The Gene Ontology (GO) ID

Ontology

Biological domain with three aspects: Molecular function, Cellular components, and Biological processes.

Function

Sections under three main biological domains.

Protein Information

Protein Name

It reports the name(s) assigned to the protein in NCBI

Function

A short description of the protein from UniProt.

RefSeq Protein

RefSeq Protein ID

UniProt

A unique accession number assigned to the product. A protein can have more than one accession number.

RCSB PDB

A unique identifier assigned to the new entry of a structure reported in Protein Data Bank (PDB)

Associated Pathways

Gene Symbol

Symbol or aliases of the gene

Pathway Name

Name assigned to the process.

Pathway ID

A unique identifier assigned to a pathway in KEGG pathways and Reactome

SNPs

Gene Symbol

Symbol or aliases of the gene

SNP ID

A unique identification code assigned to the mutation in the gene sequence.

SNP

Change in the nucleotide

Reference

Pubmed ID

An identifier for published and freely- accessible articles

Title

Title of the article

Author

Author(s) of the article

Abstract

Abstract of the article

Source

The source of article

7
Note: Queries can be build combining options using 'AND', 'OR' from one section at a time.

8

  • Browse

Browse enables users to surf the database for genes, SNPs, pathways, and ontology associated with PP. The information available on each page is explained below:

Genes: The genes associated with PP can be browsed in the gene browse page. Links to complete information on manually curated genes are also provided and the rest are hyperlinked to respective NCBI page. Complete information on the manually curated genes has detailed information about protein, gene ontology, associated pathways, and associated literature references.

9
Meta-data of genes can be downloaded in a tab-separated file by selecting the checkboxes beside genes.

SNPs: This option lists all SNPs associated with PP, along with links to reference literature and the dbSNP database.

Pathways: The different metabolic/signaling pathways associated with genes in the database can be browsed. Links to the KEGG/Reactome pathway database is provided here.

Ontologies: The different ontology terms associated with genes in the database can be browsed here and ontologies are linked to GO page. Each ontology term is associated with many genes, and genes are also linked to pathways and diseases in the detailed information page of ontology.
All the above-mentioned pages can be filtered, queried using keywords, and browsed by alphabets. One or more of these can be clubbed to get more specific results.

  • Chromosomal location of SNPs associated with precocious puberty

This tab allows to visualize chromosomal coordinates of SNPs associated with PP. Chromosomes are shown as vertical bars, wherein SNPs are marked in red. For details of a particular SNP, the chromosome can be selected and the SNP ID, chromosome number and nucleotide position can be seen by hovering over the red triangles.

9
Multiple red triangles at a single location indicates presence of SNPs in less than 350000 nucleotide length.

  • Enrichment analysis

This tab displays the result of pathway and disease enrichment analysis using the data available in PrecocityDB.

    • Pathway enrichment – Online available tool Enrichr was used to check significant overlap of PrecocityDb genes and their transcription factors with KEGG pathway’s gene sets. Pathways were ranked based on adjusted P value computed by Fisher’s exact test. Adjusted P value is calculated by first computing P value for many random input gene lists. This is then used as a lookup table of expected ranks with their variances. Adjusted P value denotes the independence for probability of PrecocityDb genes belonging to KEGG pathway’s gene sets, i.e pathways will be assigned lower P value when the combination of input gene overlapping with the pathway genes are not by chance or less random.

    Enriched KEGG pathways were then mapped to their parent pathway groups using KEGG database. Cytoscape app ClueGO was used to cluster enriched pathways. The results were represented in 3 different types of plots 1. Multilayered pie (generated using Excel), 2. Bubble plot (generated in R; ggplot2 package ) and 3. Pathway network clusters ( ClueGO) .

    • Disease enrichment - A publically available web resource, Gene set to Diseases (GS2D) was used for performing disease enrichment. Similar to pathways, diseases were also ranked based on P value.

    Enriched disease terms were mapped to single parent group based on a rule based method used in PCOSKB R2. The results were represented in 3 different types of plots 1. Multilayered pie (generated using Excel), 2. Bubble plot (generated in R; ggplot2 package) and 3. Disease networks (generated using D3 js).

 

| © 2020, Biomedical Informatics Centre, NIRRH |
ICMR-National Institute for Research in Reproductive Health, Jehangir Merwanji Street, Parel, Mumbai-400 012
Tel: 91-22-24192104, Fax No: 91-22-24139412