Difference between revisions of "PATO Meeting"

From NCBO Wiki
Jump to navigation Jump to search
 
(17 intermediate revisions by 5 users not shown)
Line 3: Line 3:
  
 
[http://www.fruitfly.org/pato/pato-form.html Register here.]
 
[http://www.fruitfly.org/pato/pato-form.html Register here.]
 +
 +
==Agenda==
 +
[[Media:Pheno_workshop_Dec2006.pdf]]
 +
 +
==Minutes==
 +
 +
[http://www.bioontology.org/wiki/images/c/c6/Updated_Meeting_Notes.pdf Media: Updated Meeting Minutes]
 +
 +
==Presentations==
 +
 +
'''Day 1 presentations'''
 +
 +
Gully Burns: Ontology engineering approaches based on semi-automated curation of the primary literature [[Media:GullyBurns.ppt]]
 +
 +
Alexander Garcia: The use of concept maps and automatic terminology extraction during development of a domain ontology: Lessons learnt [[Media:AlexanderGarcia.ppt]]
 +
 +
Adrien Coulet: Suggested ontology for pharmacogenomics (SO-Pharm): modular construction and preliminary testing [[Media:Adrien Coulet.pdf]]
 +
 +
Tina Hernandez-Broussard: Challenges for representing phenotype in pharmacogenomics [[Media:TinaHernandez.ppt]]
 +
 +
James Reecy: A project for the Creation of a Unified Trait Vocabulary for Farm Animals [[Media:JamesReecy.ppt]]
 +
 +
Victoria Petri: Rat genome database disease portals: a platform for genetic and genomic research [[Media:VictoriaPetri.ppt]]
 +
 +
Carol A. Bastiani: WormBase phenotype ontology development, curation, and data integration [[Media:CarolBastiani.ppt]]
 +
 +
Janan Eppig: Ontologies and vocabularies supporting data integration: emphasis on mouse phenotypes and disease [[Media:JananEppig.ppt]]
 +
 +
Rex Chisholm: Phenotype curation tool and ontologies at dictyBase [[Media:RexChisholm.ppt]]
 +
 +
Minna Lehvaslaiho: HIV [[Media:MinnaLehvaslaiho.ppt]]
 +
 +
Eurie Hong: Yeast [[Media:EurieHong.ppt]]
 +
 +
Tanya Berardini: Arabidopsis [[Media:TanyaBerardini.ppt]]
 +
 +
Christopher Smith: Insect behavior [[Media:ChristopherSmith.ppt]]
 +
 +
Michael Ashburner: Drosophila (live demo, no slides)
 +
 +
Maryann Martone: BIRN
 +
 +
Jim Balhoff: NESCENT [[Media:JimBalhoff.pdf]]
 +
 +
 +
'''Day 2 presentations'''
 +
Chris Mungall [[Media:ChrisMungall.ppt]]
 +
 +
Monte Westerfield [[Media:MonteWesterfield.ppt]]
  
 
==Venue ==
 
==Venue ==
Line 22: Line 71:
 
*[http://www.starwoodhotels.com/sheraton/property/overview/index.html?propertyID=214 Sheraton]
 
*[http://www.starwoodhotels.com/sheraton/property/overview/index.html?propertyID=214 Sheraton]
 
*[http://www.stanford.edu/dept/hds/SLAC/ SLAC Guest House]
 
*[http://www.stanford.edu/dept/hds/SLAC/ SLAC Guest House]
 
==Draft agenda==
 
[[Image:Pheno_workshop_Dec2006.jpg]]
 
 
Please join us for dinner
 
*Thursday, November 30, 2006
 
** 7pm: Informal gathering for dinner (Gordon Biersch?)
 
*Fridayday, December 1, 2006
 
** 7pm:
 
 
==Abstracts for Presentations ==
 
 
===Challenges for Representing Phenotype in Pharmacogenomics (Russ Altman)===
 
Abstract: The PharmGKB (http://www.pharmgkb.org/) is an online resource devoted to comprehensive cataloguing of genetic variations relevant to variation in drug response.  We curate primary data (genotype, phenotype at molecular, cellular, clinical level) as well as knowledge (literature curation, pathways, human annotations of key genes).  We provide search and visualization tools for this information, in order to catalyze research in pharmacogenomics.  For both activities, we need to index the relevant phenotypes for the purposes of indexing, aggregation, search, and automatic summarization and data mining.  We need a flexible method for annotating phenotypes that are described in the literature (by curators).  We would prefer to adopt community-based standards that would allow PharmGKB to interoperate with other databases, both human and model organism.<br />
 
 
===Ontologies and Vocabularies Supporting Data Integration: Emphasis on Mouse Phenotypes and Disease (Janan Eppig)===
 
Abstract: The mouse is an exceptional model system for connecting knowledge from sequence-to-phenotype-to-disease. The Mouse Genome Informatics Database (MGI, http://www.informatics.jax.org) supports biological knowledge building for mouse by integrating genetic, genomic, and biological data and facilitating data mining and complex querying. Full access to integrated data is enabled by extensive use of structured vocabularies and ontologies including the Gene Ontology (GO), mouse Embryonic and Adult Anatomical Dictionaries (EMAP and MA), Mammalian Phenotype (MP) Ontology, and Online Mendelian Inheritance in Man (OMIM) disease and syndrome terms. In addition, MGI is the authoritative source for nomenclature for mouse genes, alleles, and strains. Many smaller vocabularies, such as mutation class, sequence type, genetic marker type, expression assay type, etc., also are key to MGI data integration. Phenotypic descriptions in MGI rely on the MP Ontology and definition of specific genotypes and strain backgrounds. The MP Ontology has been adopted successfully to describe mouse (MGI), rat (RGD), human (NBCI), and animal (OMIA) phenotypes. As of July 2006, MGI included >16,000 alleles representing phenotypic mutations in >6,600 genes. Over 65,600 phenotype annotations in MGI have been made using MP Ontology terms. The MP Ontology itself has, thus far, grown to >4,400 defined terms. Over 1,700 mouse models are associated with OMIM disease terms. Supported by NIH grant HG00330.<br />
 
 
===Rat Genome Database Disease Portals: A Platform for Genetic and Genomic Research (Victoria Petri)===
 
Abstract: The Disease Portals at RGD provide a comprehensive research platform
 
through the integration of heterogeneous datasets into the context of
 
the genome using multiple ontologies and tools for data mining and
 
visualization. The portals provide both the novice/experienced user
 
with easy access to a comprehensive, integrated
 
knowledgebase. Current/proposed components of the portals include: 1)
 
comprehensive rat, human and mouse gene sets associated with diseases,
 
related phenotypes, pathways and biological processes; 2) all rat QTLs
 
related to a disease, associated mouse/human QTLs; 3) strains used as
 
disease models; 4) phenotype data in a species-dependent manner; 5)
 
references; 6) expression data; 7) genome-wide view of genes/QTLs via
 
GViewer; 8) comparative maps of disease related regions, 10)
 
customization of datasets/download options; 11) analysis/visualization
 
of function and cellular localization makeup of gene sets.  The
 
portals are designed to highlight genetic/ genomic data generated from
 
rat research in diseases related to the cardiovascular, nervous,
 
musculoskeletal, digestive, endocrine and immune systems as well as
 
metabolic diseases, cancer. Disease data across the three species,
 
along with species-dependent phenotypic data provide the user with a
 
means to distinguish between subtle differences in disease
 
manifestations. Such differences could help elucidate the links
 
between events and conditions, the mechanisms that lead from the
 
normal to the diseased phenotype.<br />
 
 
 
===Ontology Engineering Approaches Based on Semi-Automated Curation of the Primary Literature (Gully Burns)===
 
Abstract: The process of knowledge curation from the primary literature is time-consuming, laborious, and specialized.  Fortunately, the similarities of the curation process to the annotation of text with semantic labels presents an opportunity to employ cutting-edge natural language processing techniques to facilitate ontology construction.  The result is that manual curation work can support both the development of an automated curation system and the semi-automated construction of a formal model of the domain. We present a general approach based on the use of active learning methods in conjunction with text-mining systems using the Conditional Random Fields model. Ultimately, we wish to construct annotation tools that fit seamlessly into scientists' everyday interaction with the primary literature. Our secondary, and complementary, focus is the creation of a domain ontology of the types of information identified for curation, which may encode formally the expert's knowledge and help pinpoint errors or vagueness in his or her understanding. We present preliminary data taken from information extraction experiments performed on the neuroanatomical connectivity literature. While this data is not normally considered a 'phenotype' within neuroanatomy, we argue that it (along with other non-genomic data) should be considered by the PATO community. This work is funded by the Information Sciences Institute and the National Library of Medicine (LM-07061).<br />
 
 
===Suggested Ontology for Pharmacogenomics (SO-Pharm): Modular Construction and Preliminary Testing (Adrien Coulet)===
 
Abstract: Pharmacogenomics studies the involvement of interindividual variations of DNA sequence in different drug responses (especially adverse drug reactions). Knowledge Discovery in Databases (KDD) process is a means for discovering new pharmacogenomic knowledge in biological databases. However data complexity makes it necessary to guide the KDD process by representation of domain knowledge. Three domains at least are in concern: genotype, drug and phenotype. The approach described here aims at reusing whenever possible existing domain knowledge in order to build a modular formal representation of domain knowledge in pharmacogenomics. The resulting ontology is called SO-Pharm for Suggested Ontology for Pharmacogenomics. Various situations encountered during the construction process are analyzed and discussed. A preliminary validation is provided by representing with SO-Pharm concepts some well-known examples of pharmacogenomic knowledge. [[http://www.loria.fr/~coulet/sopharm_description.html]] <br />
 
 
===Phenotype Curation Tool and Ontologies at dictyBase (Rex Chisholm)===
 
Phenotype is an important functional annotation available at dictyBase. Although in the past we have associated phenotypes to genes, it is more accurate to represent them as characteristics of a strain. Therefore we have transitioned towards associating phenotypes to strains and using PATO together with a species specific ontology to describe phenotypes. This involved the development of a species-specific phenotypic ontology, and software tools required to support the use of these ontologies for the phenotypic annotation of strains.  Our interim phenotype ontology contains 415 terms and is based on the Entity-Quality model, where each term consists of an entity, which is either a GO term or a Dictyostelium anatomy term, and a quality. Examples of Dicty Phenotype Ontology terms include: aberrant fruiting body morphology, decreased exocytosis, aberrant cytoskeleton organization. As PATO becomes more stable, we will compose these terms using PATO qualities.  We have developed a web-based software tool to curate phenotypes which runs on top of the Chado schema from GMOD. This tool has been developed in collaboration with NCBO. It is very similar in functionality to Phenote  and provides the curators the ability to capture data using the Entity-Quality model.  In addition to the phenotype described by the entity and quality, the tool allows the curators to capture assay, environment, reference, and genetic context. Supported by NIH grants GM064426 and HG00022.
 
 
===A Project for the Creation of a Unified Trait Vocabulary for Farm Animals (James Reecy)===
 
Ontologies help to identify and formally define the entities and relationships in specific domains of interest. Ontologies play a central role in annotating, integrating, analyzing, and interpreting biological data.  Animal Trait Ontology (ATO) project being carried out under the auspices of the USDA-Animal Genome Research Program is aimed at developing  a standardized trait ontology for farm animals and software tools to assist the research community in collaboratively creating, editing, maintaining, and using such ontology for annotating and querying data.  Towards this end, we have developed a Collaborative Ontology Building (COB) editor to allow ATO curation by multiple, geographically distributed individuals and research groups.  We have also developed database structures to maintain the ATO for cattle, pigs, chickens, and other livestock species.  The ATO project extends our work on Animal QTLdb by adding new capabilities for standardization, annotation, retrieval, integration, and analysis of animal trait information, including in particular,  QTL related traits.
 

Latest revision as of 10:58, 3 January 2007

General Information and Registration

The National Center for Biomedical Ontology will host a two-day meeting focused on the Phenotype and Trait Ontology (PATO) December 1-2, 2006 at Stanford University in Palo Alto, CA.

Register here.

Agenda

Media:Pheno_workshop_Dec2006.pdf

Minutes

Media: Updated Meeting Minutes

Presentations

Day 1 presentations

Gully Burns: Ontology engineering approaches based on semi-automated curation of the primary literature Media:GullyBurns.ppt

Alexander Garcia: The use of concept maps and automatic terminology extraction during development of a domain ontology: Lessons learnt Media:AlexanderGarcia.ppt

Adrien Coulet: Suggested ontology for pharmacogenomics (SO-Pharm): modular construction and preliminary testing Media:Adrien Coulet.pdf

Tina Hernandez-Broussard: Challenges for representing phenotype in pharmacogenomics Media:TinaHernandez.ppt

James Reecy: A project for the Creation of a Unified Trait Vocabulary for Farm Animals Media:JamesReecy.ppt

Victoria Petri: Rat genome database disease portals: a platform for genetic and genomic research Media:VictoriaPetri.ppt

Carol A. Bastiani: WormBase phenotype ontology development, curation, and data integration Media:CarolBastiani.ppt

Janan Eppig: Ontologies and vocabularies supporting data integration: emphasis on mouse phenotypes and disease Media:JananEppig.ppt

Rex Chisholm: Phenotype curation tool and ontologies at dictyBase Media:RexChisholm.ppt

Minna Lehvaslaiho: HIV Media:MinnaLehvaslaiho.ppt

Eurie Hong: Yeast Media:EurieHong.ppt

Tanya Berardini: Arabidopsis Media:TanyaBerardini.ppt

Christopher Smith: Insect behavior Media:ChristopherSmith.ppt

Michael Ashburner: Drosophila (live demo, no slides)

Maryann Martone: BIRN

Jim Balhoff: NESCENT Media:JimBalhoff.pdf


Day 2 presentations Chris Mungall Media:ChrisMungall.ppt

Monte Westerfield Media:MonteWesterfield.ppt

Venue

Stanford University Clark Center, room S360

Directions to Stanford Medical Center

Map of Stanford Medical Center--see "C", Clark Center

Directions on taking Free Marguerite Shuttle Bus--take Line A to Medical Center

Accommodations

We aren't reserving a block of rooms anywhere, but these local hotels are close by and have Stanford shuttle service.