Global analysis of patterns of gene expression during Drosophila embryogenesis
A defining feature
of multi-cellular organisms is their ability to differentially utilize the
information contained in their genomes to generate morphologically and
functionally specialized cell types during development. Regulation of gene expression
in time and space is a major driving force of this process and is achieved
primarily by the regulation of transcription initiation.
A gene’s expression
pattern can be defined as a series of differential accumulations of its
products in subsets of cells as development progresses. Patterns of gene
expression are studied by two principal methods – microarray analysis (Brown and Botstein 1999) and in situ hybridization (Jones and Robertson 1970). Microarray
analysis provides the ability to quantitatively measure gene expression levels
for all genes in parallel, providing an overview of the temporal dynamics of
gene expression regulation (Arbeitman, Furlong et al. 2002). A major limitation
of microarray analysis is that obtaining spatial information depends on the
dissection or cell-sorting of specific tissues or cell types (Furlong, Andersen et al. 2001; Klebes, Biehs et al. 2002). RNA in situ hybridization has the potential
to reveal both spatial and temporal aspects of gene expression during
development. However, RNA in situ hybridization
is not highly quantitative (Wilcox 1993). For these reasons, we have used both methods
in parallel and integrated the analysis of the resultant data sets.
There are several
reasons for choosing Drosophila melanogaster as an organism for the
global study of gene expression during embryonic development. Genetic and
molecular analyses have led to a deep understanding of many embryonic processes
in this animal (St Johnston and Nusslein-Volhard 1992). Classical
embryology has provided a solid framework of anatomical description of all
embryonic stages (Hartenstein 1997) and robust
high-throughput methods for assaying gene expression by whole mount in situ hybridization have been
developed (Kopczynski, Noordermeer et al.
1998; Simin, Scuderi et al. 2002; Tomancak, Beaton et al. 2002). In many cases, the
wild-type gene expression pattern has informed the interpretation of the
phenotype produced by its mutation (Hafen, Kuroiwa et al. 1984). Such studies have
provided unprecedented insights into animal development; the process that
governs the early embryonic patterning of the Drosophila body plan is now the best understood example of a
complex cascade of transcriptional regulation during development (Carroll 1990; Rivera-Pomar and Jackle 1996).
We have assembled a
comprehensive atlas of gene expression patterns during Drosophila embryogenesis. Taking advantage of non-redundant gene
collections (Stapleton, Carlson et al. 2002; Stapleton, Liao et al.
2002), we performed an unbiased survey of gene expression
patterns by using RNA in situ hybridization
of gene specific probes to fixed Drosophila
embryos (Tomancak, Beaton et al. 2002). We documented the
patterns of each gene by set of digital photographs representing all stages of
embryonic development. The dataset was systematically validated with
independently derived Affymetrix microarray time-course measurements. We
describe the tissue specificity of gene expression at each stage using a selected terms from controlled vocabulary (CV) for
embryo anatomy (Grumbling and Strelets 2006). The CV integrates
the spatial and temporal dimensions of the gene expression patterns by linking
together intermediate tissues that develop from one another. It also integrates
morphological and molecular description of development by allowing for
structures that are morphologically indistinguishable and can be defined only
on the basis of gene expression. We present here the collection and analysis of
a highly curated and annotated dataset that describes the spatial and temporal
gene expression of 44% of the genes in the Drosophila
genome. We show that the gene sample interrogated in our study is largely representative
of the genome as a whole, allowing the global analysis of gene expression
during the embryonic development of a multicellular organism. We organized the
complex gene expression space by a hybrid fuzzy-clustering approach that uses
microarray profiles to supplement the CV annotation of in situ patterns. We divided the resulting clusters into two
categories, broad and restricted. Broad patterns are characterized by
quantitative enrichment in tissues that are related by specific cellular
states. Restricted patterns are highly diverse and provide a basis for defining
gene sets expressed in related tissues and with related predicted functions.
Post Comment
No comments