-
Notifications
You must be signed in to change notification settings - Fork 31
Change filter_mt_to_trios to also filter on vds #739
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
gnomad/sample_qc/relatedness.py
Outdated
if isinstance(t, hl.MatrixTable): | ||
t = t.filter_cols(hl.is_defined(fam_ht[t.col_key])) | ||
elif isinstance(t, hl.vds.VariantDataset): | ||
t = hl.vds.filter_samples(t, fam_ht) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
will the loss of sites impact the trio stats?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You mean for the ones that are present in these trios?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I checked this variant, (chr1:12648 ["T","G"]), which has a de novo raw in 1 child, in the trio_stats from last year and the new one, an for parents and children as in raw and adj certainly changed using filter_samples
, but it was called as de novo raw either way.
As suggested by Chris, we're filtering VDS to trios by
hl.vds.filter_samples
, which will greatly drop the number of variants to the ones with at least one ALT allele in the trio samples; Also, this function has been only used once in gnomad_qc,annotate_adj
has been used twice before, it will be removed from this function.It will not pass the checks if I changed the function name, should I just write another function?