Infectious Disease Ontology 2008

From NCBO Wiki
Jump to navigation Jump to search

Background

A two-day IDO workshop for invited participants will be held in Buffalo, New York on September 16-17, 2008. This workshop is being organized with the generous support of the Burroughs Wellcome Fund.

The background to this meeting is an Infectious Disease Ontology workshop (IDO 2007), which was organized in Cold Spring Harbor Laboratories in 2007. The workshop had four primary outcomes:

  • Training of a core set of infectious disease researchers in ontology-development methods, facilitating their participation in ontology development;
  • Development of a core Infectious Disease Ontology (IDO) which is designed to serve as a consensus-based controlled vocabulary resource for annotation of data representing all entities relevant to infectious diseases generally;
  • Establishment of a method for creating, on the basis of IDO, a set of ontologies that can be developed in a distributed fashion yet together cover the entire infectious disease domain (the set consists of the above-described core IDO ontology plus sub-domain-specific extensions of the core, such as IDO-tuberculosis, IDO-malaria);
  • Formation of an Infectious Disease Ontology Consortium (IDOC) whose members have agreed to contribute towards continued development of the core IDO and to develop seven different sub-domain-specific ontologies.

IDO 2008 Goals

To capitalize on these outcomes and to sustain the momentum gained from the 2007 workshop, we have scheduled a second Infectious Disease Ontology workshop, to be held in Buffalo, NY on September 16-17, 2008. The goals of this meeting are:

1. to critically evaluate the IDO core ontology for biological and ontological correctness

2. to critically evaluate the ontology development test cases initiated at IDO 2007 for biological and ontological correctness

3. to identify specific research questions that the ontologies should help to provide answers to in domains such as:

  • analysis of high-throughput data (data mining)
  • analysis of literature (text-mining)
  • clinical decision support
  • case-control studies
  • disease surveillance
  • genetic susceptibility of infectious disease
  • design of prevention and treatment strategies

IDO 2008 Schedule

Day 1: Tuesday, September 16

  • 8:30am - Continental Breakfast
  • 9:00am - Introduction to the Workshop slides (Lindsay Cowell)
  • 9:15am - Introduction to Biomedical Ontology slides (Barry Smith)
  • 10:00am - Introduction to the Infectious Disease Ontology slides (Lindsay Cowell)
  • 11:00am - Refreshment Break
  • 11:30am - The Infectious Disease Portion of the Immune Epitope Database Ontology and its Relationship to IDO slides (Bjoern Peters)
  • 12:30pm - Lunch
  • 1:30pm - The Vaccine Ontology slides (Yongqun He)
  • 2:30pm - The Staphylococcus Aureus Ontology slides (Vance Fowler)
  • 3:30pm - Refreshment Break
  • 4:00pm - The Infective Endocarditis Ontology and SemanticDB slides (Sivaram Arabandi)
  • 5:30pm - Dinner
  • 7:30pm - Ontologies in the Future of Infectious Disease Research: A Discussion (Introduced and Moderated by Christos Louis)

Day 2: Wednesday, September 17

  • 8:30am - Continental Breakfast
  • 9:00am - The Vector-Borne Disease Ontology slides (Christos Louis)
  • 10:00am - The Dengue Fever Ontology slides (Saul Lozano-Fuentes)
  • 11:00am - Refreshment Break
  • 11:15am - The Influenza Ontology (Burke Squires) slides ; Ontology Evaluation: Methods and Metrics (Joanne Luciano)
  • 12:15pm - Working Lunch: Refining IDO and Its Extensions to Better Support Interoperability (Barry Smith)
  • 1:45pm - Controlled Vocabularies and Ontologies slides (Marlize Coleman)
  • 2:45pm - Leveraging Annotated Data with Query on the Semantic Web slides (Alan Ruttenberg)
  • 3:30pm - Goals for the Coming Year (Lindsay Cowell)
  • 4:00pm - Close

Format

One person will designated as moderator for each session. All sessions will emphasize group discussion over presentation. Moderators of the ontology evaluation sessions will be responsible for beginning the session with a brief presentation of the ontology and will be prepared to navigate and display the ontology throughout the discussion. Moderators for the remaining sessions will be responsible for jumpstarting discussion with a brief outline of discussion points.

IDO 2008 Venue

The venue for our meeting will be the Ramada Inn and Conference Center in Amherst, NY.

Participants

Sivaram Arabandi (Cleveland Clinic)

Robert Arp (NCBO / University at Buffalo)

Tiffani Bright (Columbia University)

Marlize Coleman (Colorado State University)

Lindsay Cowell (Duke University Medical Center)

Alexander Diehl (Gene Ontology / The Jackson Laboratory)

Vance Fowler (Duke University Medical Center)

Steve Gill (University at Buffalo)

Louis Goldberg (University at Buffalo)

Yongqun "Oliver" He (University of Michigan Medical Center)

Yentram Huyen (Lockheed Martin, OTIS/NIAID/NIH Contractor)

Carla Kuiken (Los Alamos National Laboratory)

Alan J. Lesse (University at Buffalo)

Christos (Kitsos) Louis (IMBB-FORTH, Crete)

Saul Lozano-Fuentes (Colorado State University)

Joanne Luciano (The MITRE Corp.)

Anna Maria Masci (Duke University Medical Center)

Simon Milton (University of Melbourne)

Chris Mungall (Gene Ontology / Lawrence Berkeley National Laboratory)

Darren Natale (Protein Ontology / Georgetown University)

Bjoern Peters (La Jolla Institute for Allergy & Immunology)

Alan Ruttenberg (Science Commons)

Richard Scheuermann (University of Texas Southwestern Medical Center at Dallas)

Lynn Schriml (University of Maryland)

Barry Smith (NCBO / University at Buffalo)

Burke Squires (University of Texas Southwestern Medical Center at Dallas)

Christian Stoeckert (Penn Center for Bioinformatics / Univ of PA)

Tod Strugnell (Sanofi Pasteur)

Pantelis Topalis (IMBB-FORTH, Crete)

Progress Since the IDO 2007

Progress has been made in the development of IDO and seven sub-domain-specific extensions of IDO. The sub-domain-specific extensions ontologies for the following diseases:

Tuberculosis (Carol Dukes-Hamilton, Duke University Medical Center)
Staphylococcus aureus bacteremia (Vance Fowler, Duke University Medical Center)
Infective endocarditis (Sivaram Arabandi, Cleveland Clinic Foundation)
Malaria and other vector-borne diseases (Christos Louis, Institute for Molecular Biology and Biochemistry – FORTH)
Dengue fever (Saul Lozano-Fuentes, Colorado State)
Influenza (Stuart Sealfon, Mount Sinai School of Medicine; Richard Scheuermann, University of Texas, Southwestern Medical Center)

Development of IDO has continued along two fronts, expansion of content driven by development of the subdomain-specific ontologies and refinement of the approach to representing infectious disease-relevant entities ontologically.

IDO is supplemented also by a Vaccine Ontology which is being developed by Yongqun He (University of Michigan) in collaboration with Lindsay Cowell and Barry Smith.

In collaboration with Dr. Carol Dukes-Hamilton at Duke University Medical Center, Drs. Cowell and Smith have begun developing a draft ontology of tuberculosis and a method for defining ISO 11179 data elements using logical constructs based on terms derived from ontologies. Dr. Dukes-Hamilton’s research group has defined eighty tuberculosis data elements and curated these into the National Cancer Institute’s metadata repository, caDSR. Definition of these data elements using ontology terms provides not only a formal method for data element definition, significantly improving the resulting definitions, but also interoperability between data elements (along with the data associated therewith) and the vast amount of biomedical data and information annotated with terms from the same or an interoperable set of ontologies.

In collaboration with Dr. Vance Fowler at Duke University Medical Center, Drs. Cowell and Smith have developed a draft ontology of Staphylococcus aureus bacteremia.

Dr. Sivaram Arabandi of the Cleveland Clinic Foundation is part of a large team developing SemanticDB technology, a semantic datastore with query functionality, having primary focus on Cardiology and Cardiothoracic Surgery. A portion of this work involves developing an IDO extension ontology for infective endocarditis.

Dr. Christos Louis’ research group at the Institute of Molecular Biology and Biochemistry (IMBB), one of the seven institutes of the Foundation for Research and Technology – Hellas (FORTH), based in Crete, is developing an IDO extension for malaria and other vector-borne diseases. The group is working in parallel to develop an ontology of the physiological processes of disease vectors that play a direct or indirect role in disease transmission. These ontology development efforts are being pursued within the context of VectorBase (http://www.vectorbase.org), an NIAID Bioinformatics Resource Center for invertebrate vectors of human pathogens, and embracing efforts to construct decision support systems for vector-borne diseases.

A collaborative group of researchers including Joanne Luciano (MITRE), Burke Squires (University of Texas, Southwestern Medical Center) and Lynn Schriml (University of Maryland, School of Medicine), have utilized the Ontology of Biomedical Investigations (OBI) components of materials/objects, qualities and processes to develop an influenza ontology and to map influenza virus sequence and surveillance terms to their respective materials and qualities.

The Influenza Ontology describes by category the Investigator, Event, Location, Strain Specimen, Amplified Strain Specimen, Virion RNA, Treatment, and Host. The groups from BHB, IGS and MITRE have consolidated influenza sequence and surveillance terms from resources such as the BioHealthBase (BHB), a Bioinformatics Resource Center (BRC) for Biodefense and Emerging and Re-emerging Infectious Diseases, the Centers for Excellence in Influenza Research and Surveillance (CEIRS), and the Gemina and Influenza Virus Genome Projects. The list of data fields that describe influenza virus isolates and surveillance data has been created by consolidating data fields from data contributors and separate CEIRS participants. The initial ontology of terms has been created with a cross reference of terms to existing OBO Foundry ontologies.

The CEIRS projects consist of two research areas: influenza virus surveillance and basic influenza virus sequence and genetic reassortment. Working in collaboration with Dr. Richard Scheuermann, University of Texas, Southwestern Medical Center, the immediate goal is to apply the Influenza Virus Ontology to data collected as part of the CEIRS projects in an effort to enable influenza researchers to more easily elucidate the causes of influenza virulence and pathogenesis. Once completed, a database schema based upon the OBI will serve as the repository for influenza sequence and surveillance data through the BHB portal.

For more information about IDO and its sub-domain extensions, see http://www.infectiousdiseaseontology.org.