Data processing & QC
Team leader: Yan Zhou
The major research objective of this team is Solexa sequencing data processing and quality control (QC). We also study the latest sequencing technologies. Data processing converts the sequencer images into sequence files and consists of three steps: image analysis, base calling and sequence analysis. We then apply quality control to the sequence files. QC is an important and effective measure of the quality of the sample libraries, and it also serves to indicate whether a sequencing run succeeded or failed. For instance, in paired-end sequencing, the insert size is one of the criteria for a successful library. We map our reads to the reference. In the alignment result, we call the distance between the coordinates of the two reads in a pair the span. The overwhelming majority of span values for mapped read pairs should be close to the expected insert size. So if the spans do not match the expected insert size, library construction has failed.
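The span check described above can be sketched in a few lines of Python. This is only an illustration, not the team's actual pipeline: the function names, the 20% tolerance and the 90% threshold are assumptions chosen for the example, and each read is represented by a single mapped coordinate on the same reference sequence.

```python
def span(pos1, pos2):
    """Distance between the mapped coordinates of the two reads in a pair."""
    return abs(pos2 - pos1)


def library_passes(pairs, expected_insert, tolerance=0.2, min_fraction=0.9):
    """Return True if most spans fall close to the expected insert size.

    pairs           -- list of (pos1, pos2) coordinates of mapped read pairs
    expected_insert -- insert size the library was designed for
    tolerance       -- allowed relative deviation (assumed value)
    min_fraction    -- share of pairs that must be in range (assumed value)
    """
    spans = [span(a, b) for a, b in pairs]
    if not spans:
        # No mapped pairs means there is nothing to support the library.
        return False
    low = expected_insert * (1 - tolerance)
    high = expected_insert * (1 + tolerance)
    in_range = sum(1 for s in spans if low <= s <= high)
    return in_range / len(spans) >= min_fraction


good_pairs = [(100, 600), (2000, 2510), (5000, 5490), (900, 1400)]
bad_pairs = [(100, 1600), (2000, 2100), (5000, 5500)]
print(library_passes(good_pairs, expected_insert=500))
print(library_passes(bad_pairs, expected_insert=500))
```

In practice the thresholds would be tuned per library type, and the spans would be read from the alignment output rather than typed in by hand.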
Projects and brief description
The sequence data for all projects of the BGI.SHENZHEN institute are produced by our team. We also implement strict quality control, in order to build a reliable channel for assessing whether the sequencing data are eligible and valuable for advanced analysis. These large projects include the following:
2) The International Giant Panda Genome Project
4) The 1000 Genomes Project