PR: Which terms outside the current term scopes are xref'd? #35
facea2f
to
80a5617
Compare
src/analysis/excluded_terms_in_mondo/excluded_terms_in_mondo.py
f6cabe7
to
94edb4a
Compare
Results (Last updated: 2022/08/12)
Codebook
I added one to:
Outputs
Edit: These are old. Now look in the
By ontology
Now merged and committed. Can find each of the files here: https://github.com/monarch-initiative/mondo-ingest/blob/main/src/ontology/reports/
All ontologies combined
Now merged and committed: https://github.com/monarch-initiative/mondo-ingest/blob/main/src/ontology/reports/excluded_terms_in_mondo_xrefs.tsv
68ddc49
to
5f54e8a
Compare
Before merging, can we take a look at the table with the actual results?
@matentzn wrote:
Yep. It's not ready to merge yet though. But I already uploaded results. I accidentally wasn't showing
7b7886d
to
7c69f41
Compare
@matentzn For some reason a lot of your comments are showing up 2-3 times. Is this a bug on GitHub's part, or are you commenting first and then responding to my open review comments afterwards? Ideally, I'd like all comments related to code changes to be review comments so that they can be marked resolved.
7c69f41
to
ac2ff9e
Compare
61fd2db
to
7123a5e
Compare
cb4cd43
to
d41cd2d
Compare
4b985e4
to
cf77334
Compare
# todo: the merged _summary.tsv has a column `filename` on the right. would be better if it was named `ontology` and was on the left, and was sorted by most->least terms.
$(REPORTDIR)/excluded_terms_in_mondo_xrefs.tsv $(REPORTDIR)/excluded_terms_in_mondo_xrefs_summary.tsv:: $(foreach n,$(ALL_COMPONENT_IDS), $(REPORTDIR)/$(n)_excluded_terms_in_mondo_xrefs.tsv)
	awk '(NR == 1) || (FNR > 1)' $(REPORTDIR)/*_excluded_terms_in_mondo_xrefs.tsv > $@
	@awk -v OFS='\t' 'NR == 1 { print $$0, "filename" } FNR > 1 { print $$0, FILENAME }' \
I now combine the summary statistics too. I'm not great at awk and similar unix-fu, though.
Very nice job. Need to bring this up at the next call!
	reports/*_excluded_terms_in_mondo_xrefs_summary.tsv \
	> $(REPORTDIR)/excluded_terms_in_mondo_xrefs_summary.tsv

excluded-xrefs-in-mondo: $(REPORTDIR)/excluded_terms_in_mondo_xrefs.tsv
New .PHONY target for convenience
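The todo in the diff above asks for the merged summary to carry an `ontology` column on the left, sorted from most to fewest terms. A minimal Python sketch of such a merge follows, as a possible alternative to the awk one-liner and not the code in this PR. The count column name `n_terms` and all function names are hypothetical; the filename suffix is taken from the recipe's glob, and every per-ontology summary is assumed to share the same columns.

```python
import csv
import io
from pathlib import PurePosixPath

# Suffix taken from the glob in the make recipe.
SUFFIX = "_excluded_terms_in_mondo_xrefs_summary.tsv"


def ontology_from_filename(path):
    """Derive an ontology name from a report path by stripping the known suffix."""
    name = PurePosixPath(path).name
    return name[: -len(SUFFIX)] if name.endswith(SUFFIX) else name


def merge_summaries(tables, count_col="n_terms"):
    """Merge per-ontology summary TSVs into one list of rows.

    tables: dict mapping file path -> TSV text.
    count_col: hypothetical name of the column holding the term count.
    """
    merged = []
    for path, text in tables.items():
        reader = csv.DictReader(io.StringIO(text), delimiter="\t")
        for row in reader:
            # Put `ontology` first so it ends up as the leftmost column.
            merged.append({"ontology": ontology_from_filename(path), **row})
    # Sort from most to fewest terms.
    merged.sort(key=lambda r: int(r[count_col]), reverse=True)
    return merged


def to_tsv(rows):
    """Serialize rows to TSV text with a single header line."""
    if not rows:
        return ""
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=list(rows[0]), delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()
```

In a make recipe this could be called from a small script that reads each `*_excluded_terms_in_mondo_xrefs_summary.tsv` file into the `tables` dict and writes the result of `to_tsv` to the merged summary path.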
… xrefed?'
- Added new Python script / sub-package
- Added make command
src/analysis/
- Created this and initialized w/ new feature: excluded_terms_in_mondo
.gitignore
- Added ignored/ dir so that I can download and store some inputs here.
… xrefed?'
- Bugfix: Added missing term_id to 'excluded_terms_xrefed_in_mondo.tsv'
… xrefed?'
- Bugfix: Added missing term_id to 'excluded_terms_xrefed_in_mondo.tsv'
- Update: Changed directory for temp/ignored files
- .gitignore: Removed ignored/ in favor of tmp/
… xrefed?'
- Update: makefile: Added `python-install-dependencies` and utilized this in several make goals.
… xrefed?'
- Update: makefile: (i) Added `python-install-dependencies` and utilized this in several make goals, (ii) now passing more paths explicitly, (iii) moved from .PHONY targets to real ones.
- Update: Changed from package architecture to a single file, as I am no longer storing cache/ in a directory relative to the script, but rather in $(TMPDIR).
General
- Update: Python requirements: (i) requirements.txt: updated several to try and resolve dependency version conflicts, (ii) requirements-unlocked.txt: Added `oaklib`, `pyyaml`.
- Update: cli_validate(): Removed a validation no longer needed, and added a new one.
- Update: Outputs are now TSV instead of CSV.
- Update: Silenced some false positive pandas warnings.
… xrefed?'
- Update: Heavy rebase from main.
- Update: Design: Heavy refactor: 'all ontologies' -> 'single ontology, then merge'
… xrefed?'
- Refactor: Continued
- uri_to_curie: Refactored to a combo of my code plus OAK's after all 3 failed: my code by itself, oaklib, bioregistry.
- Added labels to output
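The `uri_to_curie` step mentioned in the commit above can be illustrated with a longest-prefix match over a prefix map. This is only a standard-library sketch of the general technique, not the PR's implementation and not the API of `oaklib`, `bioregistry`, or `curies`; the function signature and the prefix map shown are illustrative assumptions.

```python
def uri_to_curie(uri, prefix_map):
    """Convert a URI to a CURIE using the longest matching URI prefix.

    prefix_map: dict mapping CURIE prefix -> URI prefix (assumed input shape).
    Returns None when no URI prefix matches.
    """
    best_prefix = None
    best_uri_prefix = ""
    for prefix, uri_prefix in prefix_map.items():
        # Prefer the most specific (longest) URI prefix that matches.
        if uri.startswith(uri_prefix) and len(uri_prefix) > len(best_uri_prefix):
            best_prefix, best_uri_prefix = prefix, uri_prefix
    if best_prefix is None:
        return None
    return f"{best_prefix}:{uri[len(best_uri_prefix):]}"


# Illustrative prefix map; a real one would come from the ontology's config.
EXAMPLE_PREFIX_MAP = {
    "obo": "http://purl.obolibrary.org/obo/",
    "DOID": "http://purl.obolibrary.org/obo/DOID_",
    "MONDO": "http://purl.obolibrary.org/obo/MONDO_",
}
```

Choosing the longest match matters here because a generic prefix such as `http://purl.obolibrary.org/obo/` would otherwise shadow the more specific per-ontology prefixes.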
… xrefed?'
- Add: README.md to src/ontology/reports: Describes files.
- Add: Target to combine each of these 'by ontology' reports.
- Update: The 'summary' file for each ontology generated is now part of the target.
- Delete: Anything to do with 'excluded terms in Mondo classes'. No longer in scope for this feature.
- Delete: Anything to do with 'exclusion table' input. Granularity of using the component and mirror signature files is necessary.
- Delete: Unused functions replaced by package 'curies'
- Update: Raise exception when SPARQL returns no results.
- Update: Utilizing package: curies
- Update: Misc: (i) Simplified things, removing parts of code that were not necessary or useful, (ii) Updated some code comments, (iii) Minor touch-ups to other parts of makefile.
- Update: doid.yml config: Added a prefix
- Add: Committed problematic exclusion xref tables to reports/.
- Update: Now filters to show only problematic exclusions, e.g. leaves only rows where `n_in1_notIn2_in3` is True.
- Bugfix: Now filters out prefixes not 'owned' by the ontology. Meant to do this earlier.
- Misc: (i) minor refactoring, (ii) Removed completed todo.
- Update: Make target for merged file now also makes a merged summary file.
Misc
- Update: .gitignore: Added exceptions reports/: (i) README.md, (ii) problematic exclusion xref tables.
- Update: requirements*.txt: Added package: curies
- Update: Signature file(s)
- Bugfix: Incorrect prefix in config/doid.yml
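The two filtering steps listed in the commit above (keeping only rows where `n_in1_notIn2_in3` is True, and dropping prefixes not owned by the ontology) could look roughly like the sketch below. It is a guess at the shape of the logic, not the script in this PR: the function name, the row representation as dicts, the string encoding of the flag, and the set of owned prefixes are all assumptions. The column names `n_in1_notIn2_in3` and `term_id` come from the commit messages.

```python
def filter_problematic(rows, owned_prefixes,
                       flag_col="n_in1_notIn2_in3", id_col="term_id"):
    """Keep rows flagged as problematic whose term prefix the ontology owns.

    rows: iterable of dicts, e.g. from csv.DictReader (assumed shape).
    owned_prefixes: set of CURIE prefixes owned by the ontology (assumed input).
    """
    kept = []
    for row in rows:
        # The flag may be a bool or the text "True" depending on how it was read.
        flagged = str(row[flag_col]).strip().lower() == "true"
        prefix = row[id_col].split(":", 1)[0]
        if flagged and prefix in owned_prefixes:
            kept.append(row)
    return kept
```

For example, with `owned_prefixes={"DOID"}`, a row for `DOID:1` flagged True is kept, while a flagged row for a term under another prefix is dropped.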
… xrefed?' - Add: Makefile: Added .PHONY target alias for convenience for feature
db148f0
to
ba140b9
Compare
Updates