Download Exercise 1

Transcript
Workshop on Bacterial Genomics
Module 1: Artemis
Artemis Exercise 2
This exercise will look at a section of the Malaria genome. You will need to close down
the last Artemis exercise if you haven’t already done so. Then start a new Artemis
Session, as before, using the file ‘Malaria.embl’ in the current directory
(Module_2_Artemis). Unlike the Salmonella exercise, in this instance the annotation and
sequence are contained within the same file ‘Malaria.embl’
The sequence you are going to look at is a small region of contrived sequence (~21 kb)
taken from Plasmodium falciparum chromosome 13. You will see 7 CDSs, some with
multiple exons. As a gentle introduction to splicing we would like you to look at the
genes named , PF13_0119, MAL13P1.294 and PF13_0061. They have only been
partially characterised and may in fact be missing exons. Have a look at these CDSs and
confirm, edit or dismiss the proposed gene models by using G+C content, database
searches and looking for splice sites (Appendix IX).
G+C content is a very good indicator of coding capacity in Malaria. On average, the
coding regions are ~23% G+C and the non-coding regions are ~19%. Have a look at the
G+C content for this region by selecting the appropriate graph. Left click within the
graph window and then select by clicking on the exons to see how this relates to the
G+C peaks on the graph.
Note, we will cover the principals and methods of gene prediction in much more
detail in a module 3.
fasta banner
-25-