OMIM Annotation Standards

From NCBO Wiki
Revision as of 13:50, 5 November 2007 by Nicolew (talk | contribs) (New page: This page is to describe the rules that the NCBO curators are employing for translating the OMIM gene/disease description pages into phenotype statements using EQ syntax. ==Basic Rules=...)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

This page is to describe the rules that the NCBO curators are employing for translating the OMIM gene/disease description pages into phenotype statements using EQ syntax.

Basic Rules

  1. Use the ncbo 'config'. This will include automatic updates to the latest ontologies from SF.
  2. No use of MP, though it can be used to look for good cross-products
  3. Use the same style for gene ids. ie. OMIM:601653.0005 (database:identifier dot allele designation)
  4. no postcomposition using GO:Biological_Process with a continuant
  5. limit the post-comp relation to part_of. we can discuss this- but this is what we do at zfin, so at least the zebrafish data is biased in this way already
  6. If its known that a process is affected (usually when there is more than one time point), annotate to the process in addition to the anatomy.
  7. Annotate to both general omim id, as well as the alleles.
    • Meaning, there is a general description of the phenotype for affected genes at the top of the OMIM page, so annotate that in addition to the phenotypes that are listed for the specific alleles. when annotating to the general description, use the omim identifier without an allele designation (ie OMIM:601653) only
  8. When a PATO term is missing, always annotate to the next most specific term. If a branch is missing, annotate to the general term "quality".
  9. use standard notation like OMIM:1234567.0001/OMIM:1234567.0001 to indicate known homozygosity
  10. use OMIM:1234567.0001/+ to indicate known heterozygosity
  11. use OMIM:1234567.0001/OMIM:1234567.0002 to indicate any known trans-het
  12. indicate dominance/recessiveness in the Genetic Context box.
  13. if OMIM:1234567.0001 is listed without the second allele indicated, this notation indicates unknown het/homo?
  14. mark annotations as 'abnormal', except in the case of 'remarkable normality'. (see below)

Remarkable Normality

There is a special case wherein a particular phenotype character is expected, and is observed NOT to occur, or what we call 'remarkable normality'. But this truly is a reflection of one 'normal' phenotype in comparison to the 'mutant' phenotype.

  1. in the first pass, be sure to list the phenotype as it is listed in the other mutants, but add the qualifier of 'normal'.

Comparative Statements