Overview


Welcome to ViMIC 2.0 (A Database of Human Disease-related Virus Mutations, Integration sites and Cis-effects)


The ViMIC 2.0 is a resource of human disease-related virus information. It mainly covers four key features, including virus mutation sites (VMs), viral integration sites (VISs), target genes and multi-omics data. The database allows users to quickly query the information regarding curated VMs, VIS-Cistrome intersections and statistics of virus sequences data. Additionally, users can query gene changes in virus-infected samples, influenced by viral integration events or virus gene/region/protein, with data derived from multi-omics levels.
Homepage
  
Virus Mutation
  
Virus Integration
  
Target Gene
  
Virus Overview
  
Virus Associated Disease
  
Virus Omics Dataset
  
Search
  
Download
  
Contact us
  
 
Comparison of the data included in ViMIC 2.0 and ViMIC 1.0
Feature 1.0 2.0 Fold increase
Virus 8 28 3.5
Virus Mutation 31,712 64,168 2
Viral Integration 105,624 115,033 1.1
Virus Related Disease 77 177 2.3
Target Gene 16,310 17,604 1.1
Literature 2,539 6,433 2.5
Virus Sequence 1,110,015 1,283,063 1.2
Omics Data 28 255 9.1
Feature 1.0 2.0
Sequencing Type Data
RNA-seq/Array Yes Yes
scRNA-seq No Yes New
Methylation Profiling No Yes New
Chip-seq No Yes New
ChIP-on-chip No Yes New
ATAC-seq No Yes New
Variation No Yes New
Feature 1.0 2.0
Omics Data Analysis
Immune Infiltration Analysis Yes Yes
Differential Expression Analysis No Yes New
Single Cell Type Annotation No Yes New
Single Cell Gene Expression Analysis No Yes New
Differential Methylation Analysis No Yes New
Epigenetic Chipseeker Annotation No Yes New
Epigenetic Deeptools Analysis No Yes New
 

Help Video


  We recommend using google chrome to watch this video.

Homepage


The homepage of ViMIC 2.0 mainly gives a navigation of our database, including listing the modules of ViMIC 2.0 on menu bar and upper right panel, to help users to browse, search, visualize and download data. Main modules on upper left consist of Virus Mutation Site, Viral Integration Site and Target Gene. Other modules consist of Virus (virus information overview), Gene (target gene summary and analysis), Disease (virus infected disease information), Dataset (virus related omics dataset), Download (ViMIC 2.0 data download) and Help (ViMIC 2.0 data statistic and help document).


Return to Top

Virus Mutation


Overview of Mutation


The 'Overview of Mutation' page provides the curated mutation information for each virus. The table shows 'VM Count', virus associated 'Hot Disease' and 'Literature Count' in each virus which are curated by ViMIC 2.0. Click ‘Virus Name’ to show the information of mutations, VIS-Cistrome factor overlapped records (only for integrated virus) and target genes of selected virus. Click 'Hot Disease' to display the summary of literature related to selected virus associated disease. Click 'VM Count' to show the detailed mutation information of selected virus.

Virus Navigation and Mutation Statistics



Virus Mutation Site


The VM annotation module provides researchers who are interested in VMs with annotations of reported pathogenic mutations in virus related to human diseases easily. The curated mutation information includes mutation site, mutation level, virus gene/protein/region, virus-associated disease, genotype/subtype, literature evidence, etc. Three drop-down menu items above the table for users to choose different virus gene/protein, disease and description browsing options respectively. Users can also Click the button to quickly view the hot viral genes/regions/protein for mutation. The table below shows curated mutation sites of selected virus.
We designed a description column to indicate more detailed information for each mutation site using the following tags:
D: The study mentioned certain treatment method, such as drug-resistant caused by mutations.
G: The study mentioned the genotype information.
R: The study mentioned the reference sequence for comparison, or the original strain of the virus used for site-directed mutagenesis.
S: The study provided sequence information or accession numbers.
L: The study provided the geographic region/country of data origin.
C: The study provided clinical baseline information.
I: The study involved information related to immune.
T: The study involved host target genes.
When selecting the virus gene of interest, users can quickly search the keyword of interest in the search box on the top-right of the table. Click the "View" button to show the detail information of this mutation.


Statistics of Virus Sequences Data


The Viral Sequence module provides the geographical distribution of the source, viral gene, region and genotype of virus sequences. Users can browse the statistical data o for geographical distribution and download the annotation information to quickly obtain the sequence GeneBank IDs of interest.



Return to Top

Viral Integration


Overview of Virus Integration Site (VIS)-Cistrome Factor Feature


This feature provides information of the overlapped records between VIS and Cistrome factors as well as Cistrome factor-related samples, the associated factor and overlap amount could indicate that the virus integration site is a potentially functional DNA element. The cistrome refers to "the set of cis-acting targets of a trans-acting factor on a genome-wide scale." The table below provide virus navigation and VIS-cistrome data statistics which summaries about VIS we collected as well as VIS-Cistrome factor overlapped records in each virus. Click virus name to view virus overview of virus mutation, integration and target gene. Click VIS count number (green button) to view VIS-Cistrome factor data. Click Disease Information to display the summary of this disease and disease related literature. The user can click “Search VIS-Cistrome Overlap” to access advanced search options for finding overlapping records by combining various keywords or choose "Explore VIS batch (list)" to search VIS batch.


Search VIS-Cistrome Overlap


Advanced search option for sophisticated overlapped records searches by combining various keywords (e.g. virus name, chromosome, location, disease, Cistrome biological source or Cistrome type). Specifically, the user can combine keywords of related disease and biological resource to filter out overlapped records of interest.


Explore VIS batch (list)


This module supports user provided list of integration sites to search VIS batch.


VIS-Cistrome Factor Overlap in each human chromosome


• Overall Heatmap


Here, take HBV as an example. This heatmap is an overall statistic for HBV VIS-Cistrome factor overlapped records by chromosome. An overlapped record means a match between VIS and 1 of 3 Cistrome factors (histone modification, transcription factor binding site and chromatin accessibility), more specifically, this match includes at least 1 overlap between this VIS and 1 Cistrome factor-related sample from 1 of 3 Cistrome factor data. The darker the red, the greater the overlapped records number. The user can view specific number by moving mouse pointer.


• Browse Data by Chromosome or Factor


The user can browse HBV VIS-Cistrome factor overlapped records data by specific chromosome or statistics histogram for specific Cistrome factor when click the chromosome or Cistrome factor the user is interested in.

• VIS locations with VIS-cistrome overlap information and VIS details for each chromosome


This table shows VIS locations with VIS-Cistrome overlap information for each chromosome. When selecting the chromosome of interest, users can browse VIS information which indicates the numbers of VIS and three Cistrome factors’ overlapped records. Click ‘View’ button to show the detailed information of selected VIS.


By clicking “Show VIS with two points” button, ViMIC2.0 will return VIS-Cistrome table of VIS with two points. Columns with “(S)” are overlaps calculated by single start s ite and columns with “(B)” are overlaps calculated by both two sites. Column “Case/Length” shows the case classification of VISs with two sites based on the original literature and length between two sites.
1. Case 1: The deletion of human fragments during the integration event.
2. Case 2: Due to the experimental method/design, two viral insertion positions were provided in the literature.


VIS-Cistrome Factor Overlapped Record Details


This table at top of page is factor ranked by overlap number. The overlap number refers to the overlapping count between VIS selected and one specific Cistrome factor. The user can click biological source filter with blue button to keep the overlaps which are associated with virus-related diseases based on Cistrome sample's biological source.
An ‘Overlapped Records’ means a match between VIS and 1 of 3 Cistrome factor (histone modification, transcription factor binding site and chromatin accessibility), more specifically, this match includes at least 1 overlap between this VIS and 1 Cistrome factor-related sample from 1 of 3 Cistrome factor data. So, one VIS may have several overlaps with multiple Cistrome factor-related samples, the associated factor and overlap number of samples could indicate that the virus integration site is a potentially functional DNA element. The interface details are as follows:


We take the entry of “1000606” on chromosome 5 in HBV as an example. After users click the “chr5” and search the entry of “1000606” in HBV VIS-Cistrome Factor Overlap in each human chromosome page, ViMIC shows the overlaps result of three cistrome factors. The user can click green button to view VIS-Cistrome Factor Overlapped Record Details related to selected VIS, in addition, the user can inspect some other VIS detail information such as disease and literature by clicking "View" button at Detail column. By clicking the green button with number at “TF” (Transcription factor binding site) column, ViMIC shows the ranking of all transcriptional regulators that have overlaps with this VIS. In this case exploration, users can view cistrome samples by clicking the ‘View’ button of the transcription factor MYC, and then users can view the detail information of samples (GSMID, cell line, cell type, tissue type and factor name). Additionally, sample ID and factor name are crosslinked to the related resources, including GEO and ENCODE databases.


Virus Integration Site and Cistrome Overlapped in each chromosome on human genome


The histogram shows the overlap count of three factors (histone modification, transcription factor binding site, and chromatin accessibility) and virus integration fragments on the human chromosomes. The y-axis represents the number of overlaps, and the X-axis represents the integrated fragments. The table shows the Cistrome factor count rank of VIS on the chromosome user selected. The user can click green button to view details and view VIS detail information by clicking "View" button at Detail column.


Integration Site Overlapped Cistrome Factor Summary


This page summarizes the count of VIS-Cistrome factor overlapped records and the number of VIS which have at least one overlap with each Cistrome factor on the human chromosomes. It also provides top ranked Cistrome factors ordered by the overlap number in each chromosome using stacked barplot.

• Top 10 Cistrome factor ranked by the Overlap Sum


Histogram of factors by chromosome. The statistic factors are top 10 amount factors on all HBV VIS-Cistrome factor overlapped records data. Strategy: Firstly, rank the overlapped records by Cistrome factor, then only visualize the top 10 Cistrome factor statistics to histogram by Chromosome.

• Top 5 Cistrome factor ranked by the Overlap in each chromosome


Histogram of factors by chromosome. The statistic factors on are the sum of the top 5 amount factors on each Chromosome. Strategy: Firstly, rank the overlapped records by Cistrome factor for each Chromosome, then only visualize the sum of top 5 Cistrome factor statistics of each Chromosome to histogram by Chromosome.

• Factor Statistics


The table summarizes the number of VIS which have at least one overlap with each Cistrome factor on the human chromosomes.


Return to Top

Target Gene


Overview of Virus Target Gene


The Target Gene feature provides information for the target gene affected by viral insertion or virus gene/protein regulation. The Impact column in the table provides information on whether the gene has been mentioned in studies related to mutation or integration. The Description column in the table provides information on whether the gene provides virus-related omics data (which level of data is included) and whether a visualization graph is provided. A description of each tag is as follows:
I: Target gene is affected by virus integration.
M: Target gene is associated with the virus gene/protein/region.
E: Target gene is detected in the expression profiling data.
S: Target gene is detected in the single-cell transcriptome data.
C: Target gene is detected in the genome binding/occupancy profiling data.
M: Target gene is detected in the methylation profiling data.
F: Target gene provides figures to visualize the analysis results. F indicates that this gene has visualization results on at least one type of sequencing data.
Using the NOVA1 gene from HPV research as an example, we've segmented the data by initial letter to speed up access. Click the letter "N" to search for NOVA1 in the upper right corner of the table. Click the "View" button for the corresponding virus to view detailed information.


Target gene Detail Information


Gene information mainly shows gene name, gene id, gene type, resources' ID, etc, three tables (the mutation table, integration table and gene expression dataset table) and omics analysis plot. If previous studies reported that the target gene was altered by viral integration events or viral gene/region/protein, the table lists the overlap between VIS and Cistrome factors in the literature. The following is an integration example. The omics table provides basic information on the relevant viral omics data for the gene. ViMIC2.0 provides target gene analysis for 4 types of sequencing data.



1. If the gene is a differentially expressed gene, a volcano plot and a bar plot are displayed below.

2. If the gene is differentially expressed in a cluster of single-cell transcriptome data, the sample and cell annotation UMAP and gene feature plot are displayed below.

3. If the gene contains differentially methylated sites, a volcano plot and a bar plot are displayed below.

4. If a peak is detected in the virus-related Genome binding/occupancy profiling data for this gene region, the peak graph is shown below.

Return to Top

Virus Overview


Virus Info Summary


The Virus module provides an interface of each virus with a summary and links to ViMIC2.0 features to help users easy to query research questions on VMs, VISs-Factors overlap (only for integrated viruses) or target genes, the table below shows basic information of 28 viruses in ViMIC2.0, including Virus Name, Size, Classification and Brief Description, click the View button in table to see the overview of three modules.


Overview of Virus Mutation, Integration-Cistrome Factor Overlap and Target gene


This page gives an overview of Virus Mutation, Integration-Cistrome Factor Overlap and Target gene.




Return to Top

Virus Associated Disease


Virus Infected Disease Summary


The Disease module provides the curated information in each type of virus-related disease, including mutation and integration. Mutation Count means the total PMID number of virus mutation in virus associated diseases. Integration Count means the total PMID number of virus integration in virus associated diseases.


Overview of Disease


The table shows literature information of this disease.
Two related tags:
• Mutation Tag: literature about this virus mutation and click the eye icon to see detail mutation information in literature.
• Integration Tag: literature associated this virus integration and click the eye icon to see detail information of 3 cistrome factors (histone modification, transcription factor binding site and chromatin accessibility).





Return to Top

Virus Dataset


Virus Dataset Summary


This module is dedicated to providing researchers with comprehensive and detailed virus-related omics data from the public databases. The current version brings together Transcriptome, Epigenome and Genome data. The experiment types include: Expression profiling by high throughput sequencing/array, Methylation profiling by high throughput sequencing/array/genome tiling array, Genome binding/occupancy profiling by genome tiling array, Genome variation profiling by high throughput sequencing, etc.


Data Availability column indicates that the data provided by this dataset can be analyzed. In ViMIC 2.0, only basic information of the dataset is displayed for data that cannot be processed or analyzed. We also designed corresponding classification labels: X represents xCell analysis, E represents Differential expression analysis, M represents Differential methylation analysis, C represents Genome binding/occupancy profiling analysis, S represents scRNA analysis, and L represents Literature. Click the Detail button to the detailed information page of the dataset, which provides basic information about the dataset, including the experimental design of each sample. Click the corresponding Download button to download the data results of the corresponding omics analysis.



1. For bulk RNA-seq, ViMIC2.0 provides xCell's immune cell infiltration analysis results, heatmap, and microenvironment score cloud-rain map.


2. For Chip-seq or ATAC-seq data, ViMIC2.0 provides chipseeker annotation pie chart, deeptools analysis heatmap and profile maps for each GSM sample.



3. For scRNA-seq data, ViMIC2.0 provides UMAP of samples and cell types.


Return to Top

Quick Search


Input the virus related disease, histone mark, transcription regulator or gene/protein/region directly in the search box at homepage, popping out a window about results of searched words.


Advanced Search


Advanced search option for sophisticated searches by combining the various keywords (e.g., virus name, chromosome, disease, virus gene/protein/region or target gene). ViMIC 2.0 provides three search examples. Please note that if your search conditions are very complex, it might be difficult to find results that meet all criteria. In such cases, browsing through the modules on the homepage may be a more efficient option.




Return to Top

Download


The ViMIC 2.0 allows the user to download the various virus data files including mutation, integration, sequences annotation, omics and target gene.


Return to Top

Contact us

Emails

Ying Wang: nadger_wang@139.com

Address

Yueyang Hospital of Integrated Traditional Chinese and Western Medicine, Ganhe Road, No.110, Shanghai, P.R. China

Return to Top