Friday, February 6, 2015

Software: 3C-MTS/Capture-C analysis using 3C-analyzer


Abstract: The module of 'Many-to-all' in 3C-analyzer can be used for 3C-MTS/Capture-C data analysis.

      Viewpoints locking

The parameters of the module 'Viewpoint locking' are set in this window (Figure 2). This module allows users to quickly get the chromosome positions of many viewpoints at a time, which depend on the chromosome positions of the primary enzyme site you select.
  • Genome sequences in FASTA: Select reference genome sequences in FASTA format.
  • Chromosome: chromosome of the interested genomic region.
  • Start of genomic region: the start of base pair of the interested genomic region.
  • End of genomic region: the end of base pair of the interested genomic region.
  • Length of viewpoints: the length of viewpoints. Higher values would increase the counting number of co-localized regions. But avoid overlapped viewpoints each other if there were multiple viewpoints in a genomic region.


Figure 2. GUI of Viewpoints locking in the module 'Many-to-All'.
The viewpoints file known as site_info.csv would be generated by 3C-analyzer showed as Figure 2.
Figure 2. Viewpoints required for 3C-MTS/Capture-C
    1. Adapter trimming

Parameters of adapter trimming pipeline are set in this window (Fig. 1).
  • 3' end and 5' end adapter sequences: The default adapter is consistent with the Illumina's sequencing kit.
  • Adapter matching: 3C-analyzer identify adapter sequences based on 8 exact matching or 12 matching read sequences with at most one mismatched base allowed.
  • Minimum query length: Sequences less than the minimum query length after adapter removal will be removed from sequence alignment.
Fig. 1 GUI of reference genome mapping.
    1. Co-localization detection

The third-party aligner (the default is Bowtie2) is required in this module (Figure 3). The options in Bowtie2 integrated into 3C-analyzer are listed below.
Figure 3. GUI of genome mapping.
    1. Co-localization counting

Qualification is done in this step (Figure 4).
  • Reads background: Any read sequences determined by sequencing below the number of reads background is marked as none-detection and to be neglected.
  • Level of probability: The probability is determined by the exponential distribution density function.


Figure 4. GUI of Co-localization counting.


Writing date: 2014.11.20, 2015.02.06

No comments:

Post a Comment