You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
INFO:root:Memory usage (Done variant stream): 270.7164 GB
Traceback (most recent call last):
File "/usr/local/bin/kage", line 8, in <module>
sys.exit(main())
File "/usr/local/lib/python3.10/site-packages/kage/command_line_interface.py", line 52, in main
run_argument_parser(sys.argv[1:])
File "/usr/local/lib/python3.10/site-packages/kage/command_line_interface.py", line 559, in run_argument_parser
args.func(args)
File "/usr/local/lib/python3.10/site-packages/kage/indexing/main.py", line 305, in make_index_cli
r = make_index(args.reference, args.vcf, args.out_base_name,
File "/usr/local/lib/python3.10/site-packages/kage/indexing/main.py", line 150, in make_index
haplotype_matrix_original_vcf = SparseHaplotypeMatrix.from_vcf(variant_stream)
File "/usr/local/lib/python3.10/site-packages/kage/indexing/sparse_haplotype_matrix.py", line 225, in from_vcf
return cls.from_vcf2(vcf_file_name, dtype=dtype)
File "/usr/local/lib/python3.10/site-packages/kage/indexing/sparse_haplotype_matrix.py", line 214, in from_vcf2
matrix.extend(submatrix)
File "/usr/local/lib/python3.10/site-packages/kage/indexing/sparse_haplotype_matrix.py", line 30, in extend
self.data = scipy.sparse.vstack([self.data, other.data])
File "/usr/local/lib/python3.10/site-packages/scipy/sparse/_construct.py", line 804, in vstack
return _block([[b] for b in blocks], format, dtype, return_spmatrix=True)
File "/usr/local/lib/python3.10/site-packages/scipy/sparse/_construct.py", line 944, in _block
blocks = [[_stack_along_minor_axis(blocks[:, b], 0) for b in range(N)]]
File "/usr/local/lib/python3.10/site-packages/scipy/sparse/_construct.py", line 944, in <listcomp>
blocks = [[_stack_along_minor_axis(blocks[:, b], 0) for b in range(N)]]
File "/usr/local/lib/python3.10/site-packages/scipy/sparse/_construct.py", line 672, in _stack_along_minor_axis
raise ValueError(f'Mismatching dimensions along axis {other_axis}: '
ValueError: Mismatching dimensions along axis 1: {1, 390}
I have a feeling that something specific in the vcf may be causing this error, probably something in the haplotype data. Would you by any chance be able to share the vcf with me? If not, I could try to add some more debugging output when this error happens that maybe could pinpoint the problem.
thanks, I did a few more tests and I suspect it's indeed a problem with my VCF, I think it comes from broken AN/AC fields. I've fixed my VCF and started another run of indexing. I'll let you know if the error is gone once the run finished!
unfortunately, I'm still getting the same error. I uploaded the VCF and reference file I'm using here.. I used the command mentioned above and I'm running version 2.0.7 of Kage.
I hope this is helpful and sorry that the data is so big - I tried reproducing the error on a smaller subset of this VCF, but unfortunately I wasn't successful so far (so the smaller regions I tested ran without errors).
Hi!
I'm running kage index on a biallelic VCF derived from a Minigraph-Cactus graph with 195 samples. This is the command I'm using:
kage index -r reference.fa -v variants.vcf -o results.npz -k 31 -t 24
However, I'm getting the following error:
This is the full log:
index.log
Do you have any idea what could be causing this error?
Thanks!
Best,
Jana
The text was updated successfully, but these errors were encountered: