Breaking News

Annotation hierarchy


For describing spatial and temporal gene expression patterns in embryogenesis, we use only two types of relationships between CV terms; “part of” to cover spatial relations and “develops from” to cover temporal relationships among structures. Importantly, we link terms to 6 developmental stage-ranges and used “develops from” relationships exclusively to link terms that belong to consecutive stage-ranges. Our anatomical terms were organized into a hierarchical tree, starting with stage-range 1-3, which has only two CV terms (maternal, pole plasm), and progressively branching through the 6 stage ranges to reach the stage-range 13-16 with 126 anatomical terms (Figure 1). Anatomical structures that are contained within a larger structure are linked to the larger structure by the “part of” relationships (e.g. 13-16 midline glia is “part of” 13-16 midline). Anatomical structures that develop from one another across time are linked by “develops from “ relationship (for example 13-16 midline develops from 11-12 midline primordium (PR)). CV terms can have simultaneously part of and develops from relationship  (e.g. midline glioblast is part of midline PR and midline glia develops from the midline glioblast). Every term occurs in the hierarchy only once. When two terms develop into a single later structure (for example anterior and posterior midgut primordium forming midgut), the strictly hierarchical nature of the tree is broken, and both are linked to the child term (midgut) by “develops from” relationship. This fits within the directed acyclic graph (DAG) format which is used to capture many biological ontologies.
Many specific structures representing small subsets of tissues have very few or no associated genes. Throughout this article, we summarize the obtained data by focusing on subset of 131 structures that make up the most common and readily distinguishable structures in our in situ data. Genes annotated with more specific structures are collapsed into more general parent structures. For example, the terms dorsal epidermis, dorsal apodeme, dorsal histoblast nest abdominal, dorsal ridge and leading edge cell are collapsed into dorsal ectoderm. We distinguish two levels of collapsing. First, we collapse within a stage all “part of” relationships up to the parent term that the curators used frequently and therefore are readily distinguishable. The resulting blocks of terms represent the most relevant units of embryo anatomy for describing the RNA in situ results. Several such blocks may be defined within a single organ system for example trunk and head somatic and visceral musculature (TrunkSomMusc, HeadSomMusc, TrunkViscMusc, HeadViscMusc) in the muscle system.  Second, we collapse terms referring to the same organ system across a range of stages: structures from stage 4-6 are collapsed into early organ systems anlagen; structures from stages 7-8 and 9-10 into mid organ systems (suppl. Table 1); and structures from stages 11-12 and 13-16 into late organ systems (suppl. Table 2). For example, Endocrine_heart refers to the combined anatomical terms for all components of circulatory and endocrine related structures at stages 11-16 (combining blocks 11-12 CardioVAsc, 11-12 RingGland, 13-16 CardMeso, 13-16 RingGland).

No comments