freebsd-ports/biology/vcf-split/pkg-descr
Jason W. Bacon 5a899ba8ed biology/vcf-split: Split a multi-sample VCF into single-sample VCFs
2021-03-21 15:17:27 +00:00

Vcf-split splits a multi-sample VCF into single-sample VCFs, writing thousands
of output files simultaneously. Parsing the TOPMed human chromosome 1 BCF
with bcftools takes two days, so extracting the 137,977 samples one at a time
or using thousands of parallel readers of the same file is impractical.
Vcf-split solves this by generating thousands of single-sample outputs during
a single sweep through the multi-sample input.
WWW: https://github.com/auerlab/vcf-split
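
The single-sweep idea described above can be illustrated with a minimal
Python sketch. This is not vcf-split's actual implementation; the function
name split_vcf and the one-file-per-sample naming scheme are hypothetical,
and the sketch assumes plain-text, tab-separated VCF input with nine fixed
columns before the sample columns.

```python
import os
from contextlib import ExitStack


def split_vcf(vcf_lines, out_dir):
    """Write one single-sample VCF per sample column in a single pass.

    Hypothetical helper for illustration only; returns the sample count.
    """
    with ExitStack() as stack:
        outputs = []  # one open output file per sample, in column order
        meta = []     # '##' meta lines seen before the column header
        for line in vcf_lines:
            line = line.rstrip("\n")
            if line.startswith("##"):
                meta.append(line)
            elif line.startswith("#CHROM"):
                fields = line.split("\t")
                fixed, samples = fields[:9], fields[9:]
                for sample in samples:
                    # Assumed naming scheme: <sample>.vcf in out_dir.
                    path = os.path.join(out_dir, sample + ".vcf")
                    f = stack.enter_context(open(path, "w"))
                    header = "\t".join(fixed + [sample])
                    f.write("\n".join(meta + [header]) + "\n")
                    outputs.append(f)
            elif line:
                # Each input record is read once and fanned out to
                # every sample's output file.
                fields = line.split("\t")
                fixed = "\t".join(fields[:9])
                for f, genotype in zip(outputs, fields[9:]):
                    f.write(fixed + "\t" + genotype + "\n")
        return len(outputs)
```

Because every output file stays open for the whole pass, a sketch like this
is bounded by the operating system's per-process open-file limit, so a very
large sample count would have to be processed in batches of columns.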