SAGE identifies novel transcripts in Drosophila
Genes corresponding to SAGE tags
were identified using a Drosophila tag-to-gene mapping computer program
(E. Pleasance, M. Marra and S. Jones,
submitted). To facilitate the mapping of
our salivary gland SAGE tags, as well as aid in gene annotation and gene
discovery in Drosophila, we incorporated into the program sequence from
5,181 high quality salivary gland 3’ ESTs representing 1,696 different
transcripts (Supplementary Material).
2,866 (61.9%) of the 4,628 different SAGE tags were mapped to known or
predicted Drosophila genes, 289 tags (6.2%) were mapped to genomic DNA
and ESTs not associated with a predicted gene, 1170 (25.3%) were mapped to
genomic DNA and/or the reverse strand of an EST or predicted gene, and 303
(6.5%) could not be matched to existing sequence data. The 303 unmapped tags could be due to
sequence polymorphisms, common sequencing errors, or to lack of representation
in the available sequence resources (e.g. heterochromatic regions). It is also possible that unmapped tags span
adjacent exons that are currently not represented in the EST or cDNA data set.
Our tag-to-gene mapping
results highlight the main advantage of the SAGE method compared to other
expression profiling methods such as oligonucleotide or cDNA array based
analyses. SAGE has the potential to
reveal transcripts not previously identified.
More than 25% of the tags mapped only to genomic DNA and may represent
novel genes or novel 3’ ends or splice forms of already predicted genes. In at least 225 cases, our data suggests the
existence of previously unpredicted transcripts that likely represent
divergently transcribed overlapping gene sequences. Our detection of these 225 transcripts is not
surprising because current gene finding programs are unable to readily detect
overlapping genes [8]. A complete list
of salivary gland SAGE tag sequences, frequency, and mappings can be found in
Supplementary Material Table S1.
To
validate our SAGE data, we conducted real-time quantitative RT-PCR analyses to
verify differential expression of individual genes (Supplementary Material),
and we searched for SAGE tags corresponding to genes associated previously with
salivary gland cell death [3-6]. We
detected expression of the ecdysone-induced primary response genes, E74,
E75 and E93, and the cell death genes ark, dronc, crq,
rpr and iap2 (see Supplementary Material for details). In general, the gene expression profiles
generated by SAGE were consistent with previous reports and could temporally
distinguish known upstream ecdysone-induced transcriptional regulators from
downstream death effector molecules.
No comments