Difference between revisions of "Annotator Dataset Workflow Howto"

From NCBO Wiki
(New page: == Chapter 1: Synchronizing Data with BioPortal == * The synchronization with BioPortal data should be performed regularly (currently scheduled bi-weekly or on-demand). * The synchron...)
 
(Replaced content with "As of Virtual Appliance v2.2, populating the Annotator Dataset is done automatically when an ontology is processed.")
 
(62 intermediate revisions by 3 users not shown)
== Chapter 1: Synchronizing Data with BioPortal ==
As of Virtual Appliance v2.2, populating the Annotator Dataset is done automatically when an ontology is processed.
 
* The synchronization with BioPortal data should be performed regularly (currently scheduled bi-weekly or on-demand). 
 
* The synchronization (Incremental Update) should be done in an environment separate from Staging and PROD.
 
<pre>
 
Environment for OBS Data Population
 
Instance of tomcat : ncbodev-obs
 
Instance of DB : ncbodev-obsdb1.sunet
 
</pre>
 
 
 
* The steps are as follows:
 
<ol>
 
<li>Remove outdated ontologies from the OBS database (e.g. older versions of ontologies that no longer exist in BioPortal). Invoking this Restlet removes all outdated ontology data and the associated entities, such as concepts, terms, relations, semantic types and hierarchy information.
 
 
 
<p><code>
 
Query: get the list of ontologies to be removed (old):
 
http://ncbodev-obs:8080/obs_hibernate/admin/ontologies/list/old<br/>
 
Run:
 
http://ncbodev-obs:8080/obs_hibernate/admin/ontologies/remove
 
</code></p></li>
 
<li>Add new ontologies from BioPortal to OBS. Invoking this Restlet adds all new ontology data and the associated entities, such as concepts, terms, relations, semantic types and hierarchy information.
 
 
 
<p><code>
 
Query: get the list of ontologies to be added (new):
 
http://ncbodev-obs:8080/obs_hibernate/admin/ontologies/list/new<br/>
 
Run:
 
http://ncbodev-obs:8080/obs_hibernate/admin/ontologies/add
 
</code></p></li>
 
 
 
<li>Populate Concepts (For details, please refer to Chapter 2.1)
 
<p><code>http://ncbodev-obs:8080/obs_hibernate/loaderBigConcepts/all</code></p></li>
 
<li>Populate Hierarchy (For details, please refer to Chapter 2.2)
 
<p><code>http://ncbodev-obs:8080/obs_hibernate/loaderBigPaths/all</code></p>
 
<p>To monitor progress and errors, refer to:</p>
 
<ol>
 
<li>The "status" field in the table obs_ontology in the OBS DB (ncbodev-obsdb1.sunet).</li>
 
<li>The Tomcat logs on ncbodev-obs (/usr/local/tomcat5/logs).</li>
 
</ol></li>
 
<li>Create Dictionary. Step 3, "Populate Concepts", must be complete before this step is run. (For details, please refer to Chapter 4)</li>
 
<li>Create Mapping Data (For details, please refer to Chapter 6)</li>
 
</ol>
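
The URL-based steps above (1 to 4) can be sketched as a small Python script that calls each Restlet in the documented order. This is only an illustration under stated assumptions: it assumes each endpoint answers a plain HTTP GET, and it does not know whether a call blocks until the work is finished, so the "status" field and the Tomcat logs still need to be checked. The names <code>STEPS</code>, <code>step_urls</code>, <code>http_get</code> and <code>run_sync</code> are hypothetical and not part of OBS.

```python
from typing import Callable, List, Tuple
from urllib.request import urlopen

# Base URL taken from the steps above.
BASE_URL = "http://ncbodev-obs:8080/obs_hibernate"

# (description, path) pairs, in the order the steps list them.
STEPS: List[Tuple[str, str]] = [
    ("List outdated ontologies", "/admin/ontologies/list/old"),
    ("Remove outdated ontologies", "/admin/ontologies/remove"),
    ("List new ontologies", "/admin/ontologies/list/new"),
    ("Add new ontologies", "/admin/ontologies/add"),
    ("Populate concepts", "/loaderBigConcepts/all"),
    ("Populate hierarchy", "/loaderBigPaths/all"),
]


def step_urls(base_url: str = BASE_URL) -> List[str]:
    """Return the full URL for every step, in execution order."""
    return [base_url.rstrip("/") + path for _, path in STEPS]


def http_get(url: str) -> str:
    """Fetch a URL with a plain GET (assumption: the Restlets accept GET)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def run_sync(
    fetch: Callable[[str], str] = http_get,
    base_url: str = BASE_URL,
) -> List[Tuple[str, str]]:
    """Call each step in order and collect (description, response body).

    Any exception raised by `fetch` propagates, so later steps are not
    attempted after a failed one.
    """
    results = []
    for (description, _), url in zip(STEPS, step_urls(base_url)):
        results.append((description, fetch(url)))
    return results


if __name__ == "__main__":
    # Dry run: print the URLs instead of calling the server.
    for url in step_urls():
        print(url)
```

Passing a different <code>fetch</code> function (for example one that only logs the URL) allows the sequence to be rehearsed without touching the server. The dictionary and mapping steps are left out because this chapter gives no URLs for them.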
 
* When the update is complete, a snapshot of the DB should be copied (or replicated) to Staging/PROD.
 

Latest revision as of 17:06, 14 January 2015

As of Virtual Appliance v2.2, populating the Annotator Dataset is done automatically when an ontology is processed.