-
Notifications
You must be signed in to change notification settings - Fork 168
CDM Configuration
WebAPI supports configuration of one or more databases transformed to the OMOP Common Data Model (CDM) v5. In this section, we'll review the organization of the CDM and how to add the additional schemas to support operations performed by WebAPI.
This guide assumes that you have installed WebAPI which will create the necessary tables in the steps below.
The WebAPI database created in the WebAPI Installation Guide allows you to configure one or more CDMs to use for performing various analyses through WebAPI. WebAPI's database contains two tables that hold this configuration: source
and source_daimon
. The JDBC connections to the CDM(s) are held in the source
table and defines the connection to the CDM database. This source entry is then related to the source_daimons
table using the source_id
and is used to configure the schemas where the CDM, vocabulary, results and temp schemas reside. These schemas support different daimon-type functionality for WebAPI functionality. In this guide, we'll cover how to establish the additional schemas in your CDM required by WebAPI and how to configure the required source
and source_daimon
entries required for WebAPI to function properly.
The OMOP CDM contains a set of tables that are used to hold patient level data which we will refer to as the CDM schema
. In addition, the CDM will contain tables that hold the OMOP vocabulary which we will refer to as the Vocabulary schema
. It is most likely that the CDM schema
and Vocabulary schema
refer to to the same schema in the CDM database.
If you have utilized Achilles to characterize your OMOP CDM, you will have created a Results schema
to hold the results of the characterization resulting in a number of tables prefixed with Achilles
in it. WebAPI utilizes the same Results schema
to perform transactional operations against the CDM so that tables that require write permission can be properly partitioned away from the read-only patient data. Additionally, WebAPI also utilizes a schema for temporary storage of information when performing certain analyses. To support this work, a temp schema
is established that enables the WebAPI to create/drop temporary tables.
The following sections will describe how to prepare your CDM for use with the OHDSI WebAPI and how to configure WebAPI to connect to the CDMs in your environment.
⭐️ We would highly recommend that you run Achilles to characterize your CDM data. As part of that process, you will establish a Results schema
to hold the results. These tables are utilized by WebAPI to supplement the vocabulary search experience to include information about record counts based on CDMs configured in WebAPI. In the event that you have not done this, we'll cover how to set this up.
Ask your database administrator to establish 2 new schemas in your CDM for the Results schema
and the temp schema
. We'll refer to these schemas as results
and temp
through the remainder of this guide. Also, you will need to establish a SQL account that is used by WebAPI to access the CDM and these various schemas. For this guide, we'll refer to this account as webapi_sa
.
To start, webapi_sa
account requires the following permissions on these schemas:
schema | permissions |
---|---|
cdm | read-only |
vocabulary | read-only |
results | insert/delete/select/update |
temp | full control (create/remove tables & data) |
Once the Results schema
is established, you will need to generate the SQL scripts for your CDM dialect to establish the tables that WebAPI will require to run. To do this, WebAPI provides a set of URLs that takes parameters via the query string parameters to generate the proper database script for your setup. The following sections assume you have followed the WebAPI Installation Guide and that you have a WebAPI URL in your enviornment that is accessible to you to gather the necessary SQL scripts.
The following WebAPI URL is used to generate the SQL script that you will need to create and set up your results schema tables:
http://<server:port>/WebAPI/ddl/results?dialect=<your_cdm_database_dialect>&schema=<your_results_schema>&vocabSchema=<your_vocab_schema>&tempSchema=<your_temp_schema>&initConceptHierarchy=true
You will need to modify the URL above to point your instance of WebAPI running on <server:port>
(default in this guide is localhost:8080
) and then substitute the values specific to your CDM setup:
-
<your_cdm_database_dialect>
: This is one of the following:oracle
,postgresql
,pdw
,redshift
,impala
,netezza
,bigquery
, orsql server
and is based on the platform you use to host your CDM. -
<your_results_schema>
: The schema containing your results tables -
<your_vocab_schema>
: The schema containing your vocabulary tables -
<your_temp_schema>
: The schema that it utilized for your temporary schema - The
initConceptHierarchy
value in the URL is set totrue
and is used to establish the concept_hierarchy which is a cached version of the OMOP vocabulary specific to the concepts found in your CDM. This table can take a while to build and only needs to be established one time. This value can be set tofalse
if you do not need to re-establish this table.
Once you have created the URL for your environment, open a browser and navigate to that URL. The resulting SQL will be displayed in the browser and your database administrator can use this script to establish the results schema.
The following script is required as of v2.13 of WebAPI to create the concept count tables in your CDM. This script assumes you have run Achilles and it will use those tables to create the concept count table that is used to power the vocabulary search:
http://<server:port>/WebAPI/ddl/achilles?dialect=<your_cdm_database_dialect>&schema=<your_results_schema>&vocabSchema=<your_vocab_schema>
-
<your_cdm_database_dialect>
: This is one of the following:oracle
,postgresql
,pdw
,redshift
,impala
,netezza
,bigquery
, orsql server
and is based on the platform you use to host your CDM. -
<your_results_schema>
: The schema containing your results tables -
<your_vocab_schema>
: The schema containing your vocabulary tables
The WebAPI source
and source_daimon
tables were created when you started the tomcat service with the WebAPI war deployed. These tables must be populated with a JDBC source
connection and corresponding source_daimon
that specify the location for the cdm
, vocabulary
, results
and temp
schemas associated to the source in order to use the OHDSI tools. For this example it is assumed that the CDM and Vocabulary exist as a separate schema in the same database instance.
Please note the following:
- The
source_id
must be > 0 and that the SQL below uses sequences to use the next availablesource_id
andsource_daimon_id
respectively. - If you are configuring a PostgreSQL connection and are facing difficulities connecting, it may require the addition of the
OpenSourceSubProtocolOverride
to yoursource_connection
connection string as described in WebAPI Issue #592.
INSERT INTO webapi.source (source_id, source_name, source_key, source_connection, source_dialect)
SELECT nextval('webapi.source_sequence'), 'My Cdm', 'MY_CDM', ' jdbc:postgresql://server:5432/cdm?user={user}&password={password}', 'postgresql';
INSERT INTO webapi.source_daimon (source_daimon_id, source_id, daimon_type, table_qualifier, priority)
SELECT nextval('webapi.source_daimon_sequence'), source_id, 0, 'cdm', 0
FROM webapi.source
WHERE source_key = 'MY_CDM'
;
INSERT INTO webapi.source_daimon (source_daimon_id, source_id, daimon_type, table_qualifier, priority)
SELECT nextval('webapi.source_daimon_sequence'), source_id, 1, 'vocab', 1
FROM webapi.source
WHERE source_key = 'MY_CDM'
;
INSERT INTO webapi.source_daimon (source_daimon_id, source_id, daimon_type, table_qualifier, priority)
SELECT nextval('webapi.source_daimon_sequence'), source_id, 2, 'results', 1
FROM webapi.source
WHERE source_key = 'MY_CDM'
;
INSERT INTO webapi.source_daimon (source_daimon_id, source_id, daimon_type, table_qualifier, priority)
SELECT nextval('webapi.source_daimon_sequence'), source_id, 5, 'temp', 0
FROM webapi.source
WHERE source_key = 'MY_CDM'
;
The above inserts creates a source with source_id = 1
with 4 daimon entries, one for each daimon type (0 = CDM, 1 = Vocabulary, 2 = Results, 5 = TEMP). If you'd like to configure more than 1 source for use in WebAPI, repeat the steps above and increment the source_id
used to distinguish the sources from one another.
👉 Note: To see the new sources, open a browser and navigate to <server>:port/WebAPI/source/refresh
The source_daimon
table contains a column called priority
which holds an integer value that is used by WebAPI depending on the context.
For vocabulary daimons (daimon_type = 1
), you'll want to specify at least 1 daimon with a priority >= 1. The value with the highest priority will be used as the default vocabulary provider and you must specify at least 1 daimon with a priority >= 1 for ATLAS to function properly.
For results daimons (daimon_type = 2
), a priority >= 1 indicates to WebAPI to cache the Achilles record counts that are used when doing a vocabulary search. Upon start up, WebAPI will read in the source_daimon
entries and for any results daimon with a priority >=1, it will attempt to load and cache the record counts into memory. For those daimons with a record count of 0, if/when their record counts are accessed, they will be cached after their initial load. Having the record counts loaded upon WebAPI's startup will help improve vocabulary search performance in ATLAS.
Once WebAPI is started, and the source/source_daimon inserts are complete, you should be able to open a browser to the following URL:
http://localhost:8080/WebAPI/source/sources
This should result in the following output:
[{
"sourceId": 1,
"sourceName": "My Cdm",
"sourceDialect": "postgresql",
"sourceKey": "MY_CDM",
"daimons":
[{
"sourceDaimonId": 1,
"daimonType": "CDM",
"tableQualifier": "cdm",
"priority": "0"
}, {
"sourceDaimonId": 2,
"daimonType": "Vocabulary",
"tableQualifier": "vocab",
"priority": "0"
}, {
"sourceDaimonId": 3,
"daimonType": "Results",
"tableQualifier": "results",
"priority": "0"
}, {
"sourceDaimonId": 4,
"daimonType": "Temp",
"tableQualifier": "temp",
"priority": "0"
}
]
}
]
WebAPI is now configured and ready to serve OHDSI tools!