-
Notifications
You must be signed in to change notification settings - Fork 28
Validity check v4 exomes #487
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Loggers seem to not be working for me right now |
|
Loggers are a gnomad_methods issue: https://the-tgg.slack.com/archives/C6BD1M01E/p1697463923095569 |
…to mw/validity_check_v4_exomes # Conflicts: # gnomad_qc/v4/resources/release.py
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is looking good, I only looked at it for logic, which is totally fine at this point.
I think we might want to add in some other checks, like a check for length of globals being the expected length, and maybe counts being the expected counts. Maybe check for any uses of 'pop' or 'oth' in the globals?
Are there any other checks that we thing should be done on the globals? @ch-kr ?
Then only a few small suggestions
|
I've updated this and it depends on broadinstitute/gnomad_methods#630. In the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Only very small changes. I love seeing the output from the new checks, thank you so much for adding!!
| NEW_SITE_FIELDS = [ | ||
| "monoallelic", | ||
| "only_het", | ||
| "transmitted_singleton", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add sibling singletons
| from gnomad_qc.v4.resources.basics import get_logging_path | ||
| from gnomad_qc.v4.resources.release import release_sites, validated_release_ht | ||
|
|
||
| # TODO: Check global lengths match the fields they reference |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| # TODO: Check global lengths match the fields they reference |
| "release_ht": [ | ||
| "gs://gnomad-tmp/gnomad.exomes.v4.0.qc_data/release/gnomad.exomes.sites.test.updated_101723.ht" | ||
| ] | ||
| }, # release_sites().ht()}, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I know this is just for testing, but don't forget to change it back
| return vep_csq_header | ||
|
|
||
|
|
||
| def check_globals_for_retired_terms(ht: hl.Table) -> None: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is great, thank you!!
|
Back to you @jkgoodrich -- agree on seeing the output of the globals and len checks as confirmations, good call on adding those |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
two small things then lgtm
Co-authored-by: jkgoodrich <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
This unfurls and runs gnomad_methods validate_release_t on the v4 exomes release HT. It depends on broadinstitute/gnomad_methods#625 and there is some room for improvements but I just wanted to get it in. It writes out the HT with unfurled annotations that will move to the next step.