Vcf-split splits a multi-sample VCF into single-sample VCFs, writing thousands
of output files simultaneously. Parsing the TOPMed human chromosome 1 BCF
with bcftools takes two days, so extracting the 137,977 samples one at a time
or using thousands of parallel readers of the same file is impractical.
Vcf-split solves this by generating thousands of single-sample outputs during
a single sweep through the multi-sample input.

WWW: https://github.com/auerlab/vcf-split
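
The single-sweep idea described above can be illustrated with a minimal
sketch. This is not vcf-split's own code or interface: the function name
split_vcf, the one-file-per-sample naming, and the uncompressed text input are
all assumptions made for illustration. It relies only on the standard VCF
layout of nine fixed columns followed by one column per sample.

```python
import os
import tempfile
from contextlib import ExitStack

FIXED_COLS = 9  # CHROM POS ID REF ALT QUAL FILTER INFO FORMAT


def split_vcf(lines, out_dir):
    """Hypothetical helper: write one single-sample VCF per sample while
    reading the multi-sample input exactly once."""
    meta = []      # "##" meta-information lines, copied into every output
    outputs = []   # one open file per sample, in column order
    with ExitStack() as stack:
        for line in lines:
            line = line.rstrip("\n")
            if line.startswith("##"):
                meta.append(line)
            elif line.startswith("#CHROM"):
                cols = line.split("\t")
                fixed, samples = cols[:FIXED_COLS], cols[FIXED_COLS:]
                for name in samples:
                    path = os.path.join(out_dir, name + ".vcf")
                    f = stack.enter_context(open(path, "w"))
                    header = "\t".join(fixed + [name])
                    f.write("\n".join(meta + [header]) + "\n")
                    outputs.append(f)
            elif line:
                # Data line: the fixed columns go to every output, and each
                # sample column goes only to that sample's file.
                cols = line.split("\t")
                prefix = "\t".join(cols[:FIXED_COLS])
                for f, call in zip(outputs, cols[FIXED_COLS:]):
                    f.write(prefix + "\t" + call + "\n")
    return len(outputs)


# Tiny made-up input with two samples, S1 and S2.
demo = [
    "##fileformat=VCFv4.2\n",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2\n",
    "chr1\t100\t.\tA\tG\t.\tPASS\t.\tGT\t0/1\t1/1\n",
]
out_dir = tempfile.mkdtemp()
n = split_vcf(demo, out_dir)
```

The sketch keeps every output file open for the whole pass, so each input line
is parsed only once. With thousands of samples that would run into the
operating system's limit on open files, which a real tool has to take into
account.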