Download Next Generation Sequencing So ware User`s Manual Version 1.5

Transcript
The first way is to ignore mismatches occurring on read positions with low quality scores during
the mapping process.
A low confidence score denotes low sequencing quality at that position in the read. Therefore,
mismatches occurring at positions with low quality scores are more likely due to sequencing
error. Thus mismatches on positions with high quality scores are more meaningful than those at
low quality score positions. Use the following combo box or enter some value to assign the
threshold of high quality score.
The second way is to utilize quality scores to rank several possible mapping results of each read
and choose the best position as the mapping results. Since the process happens after all mapping
positions have been found, this way will be described in part “5. Collecting Results”.
In our experiments, we observed that more reads were uniquely mapped when quality scores
were included to help mapping.
Mapping Criteria
Mismatches and insertion/deletion
The following is an example that a read is mapped to the reference sequence with only
mismatches. The four red characters in the alignment are the mismatches between the read and
its target region on the reference sequence.
The following is an example that a read is mapped to the reference sequence with a deletion on
the read. The read deletes an “A” nucleotide. There is one deletion of length one.
67