You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We need nail down what we want our release artifacts to be. Currently, I've been focused on producing:
harmonized-table.tsv: the large pivot table of the biosample_set.xml file
harmonized-table.parquet.gz: a parquet version of harmonized-table.tsv
harmonized-attribute-value.ttl.gz: a turtle version of harmonized-table.tsv
harmonized_table.db.gz: a sqlite version of the harmonized-table.tsv
General question: Do you want change the name of the artifacts from "harmonized X" to something else? It would make sense to do this b/c we are using the term 'harmonized' in a way that differs from common usage.
Other questions:
@turbomam has been normalizing the harmonized_table.db.gz data. We need to add this to outputs produced. Do we want a this to be new sqlite database or a separate table with the database or have columns co-exist within the biosample table in the database?
After new normalized database has been produced, do we to dump out an updated tsv and parquet file? (I think yes)
Do want to keep the original non-normalized/raw tsv as a product? (I think yes)
As part of this, we need to add clean and release targets to the Makefile.
How about "biosample_harmatts X" to indicate files containing the harmonizable attributes about biosamples?
I see a building company in the UK called Harmatt https://harmatt.co.uk/ and three Turkish people named Harmat in Wikipedia, but no other widespread usage.
By the way, I would call this my most recent, most thorough mapping of INSDC annotations to OBO foundry terms. @cmungall and others have found some possible improvements, possibly to be implemented by choosing different target ontologies.
We need nail down what we want our release artifacts to be. Currently, I've been focused on producing:
General question: Do you want change the name of the artifacts from "harmonized X" to something else? It would make sense to do this b/c we are using the term 'harmonized' in a way that differs from common usage.
Other questions:
As part of this, we need to add
clean
andrelease
targets to the Makefile.cc @cmungall @hrshdhgd @realmarcin
The text was updated successfully, but these errors were encountered: