unmatched chromosome name get error #5

worker000000 · 2020-11-06T07:52:52Z

Dear professor,
thansk for such a accurate software.

when I am using it, it raise errors like follows. MAYbe caused by chrM and chrMT.

what is more, since your genome.dict has many patch chromosome names, like

SN:GL000207.1
SN:GL000226.1
SN:GL000229.1
SN:GL000231.1
, but when users input bam, they may use a different version of genome, even for hg19, the patch chromosome seems to be different, so why not the input is a fastq, but a bam? ,can you give me some suggestions

ERRORS screenshot like following

worker000000 · 2020-11-06T08:10:44Z

abother question is we know wgs always give big fragments of cnv. so why here the configure file, the window size is 500, people seems to use 1M instead of 500bp

fanxinping · 2020-11-08T06:41:25Z

We recommend remap your samples to the ref genome provided by us to avoid some unexpected behaviour. Or, you can generate your own ref data according to https://www.yfish.org/display/PUB/Accucopy#Accucopy-3.7Makeyourownreferencegenomepackage

polyactis · 2020-11-08T12:03:45Z

You probably need to watch some videos or read some reviews/tutorials to understand how DNA is extracted from a cell, fragmented, and PCRed before it is put on a DNA sequencing machine. 500bp is NOT the CNA length. It is the average length of DNA fragments to be sequenced by a high-throughput DNA sequencer, i.e. Illumina HiSeq or NovaSeq.

These so-called next-gen sequencers can only sequence 100-150bp for one fragment, not from start to end of a chromosome. Anyhow, you need to get familiar with what a next-gen sequencer can and cannot do.

abother question is we know wgs always give big fragments of cnv. so why here the configure file, the window size is 500, people seems to use 1M instead of 500bp

worker000000 · 2020-11-08T12:08:37Z

We recommend remap your samples to the ref genome provided by us to avoid some unexpected behaviour. Or, you can generate your own ref data according to https://www.yfish.org/display/PUB/Accucopy#Accucopy-3.7Makeyourownreferencegenomepackage

thanks a lot, so can this tool accept fastq file instead of bam?

worker000000 · 2020-11-08T16:06:04Z

You probably need to watch some videos or read some reviews/tutorials to understand how DNA is extracted from a cell, fragmented, and PCRed before it is put on a DNA sequencing machine. 500bp is NOT the CNA length. It is the average length of DNA fragments to be sequenced by a high-throughput DNA sequencer, i.e. Illumina HiSeq or NovaSeq.

These so-called next-gen sequencers can only sequence 100-150bp for one fragment, not from start to end of a chromosome. Anyhow, you need to get familiar with what a next-gen sequencer can and cannot do.

abother question is we know wgs always give big fragments of cnv. so why here the configure file, the window size is 500, people seems to use 1M instead of 500bp

thanks a lot. so how to understand here is 500 for segmentation
window_size the window size in base pair for segmentation. The segmentation program (GADA) first calculates the number of reads for each window and then perform segmentation over the genome. A small window size often leads to a large number of small segments. The recommended window size is 500bp.

fanxinping · 2020-11-09T07:28:57Z

We recommend remap your samples to the ref genome provided by us to avoid some unexpected behaviour. Or, you can generate your own ref data according to https://www.yfish.org/display/PUB/Accucopy#Accucopy-3.7Makeyourownreferencegenomepackage

thanks a lot, so can this tool accept fastq file instead of bam?

No, Accucopy accepts bam file only.

You probably need to watch some videos or read some reviews/tutorials to understand how DNA is extracted from a cell, fragmented, and PCRed before it is put on a DNA sequencing machine. 500bp is NOT the CNA length. It is the average length of DNA fragments to be sequenced by a high-throughput DNA sequencer, i.e. Illumina HiSeq or NovaSeq.
These so-called next-gen sequencers can only sequence 100-150bp for one fragment, not from start to end of a chromosome. Anyhow, you need to get familiar with what a next-gen sequencer can and cannot do.

abother question is we know wgs always give big fragments of cnv. so why here the configure file, the window size is 500, people seems to use 1M instead of 500bp

thanks a lot. so how to understand here is 500 for segmentation
window_size the window size in base pair for segmentation. The segmentation program (GADA) first calculates the number of reads for each window and then perform segmentation over the genome. A small window size often leads to a large number of small segments. The recommended window size is 500bp.

500bp for segmentation is just a proper parameter base on our testing and you can set other value.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

unmatched chromosome name get error #5

unmatched chromosome name get error #5

worker000000 commented Nov 6, 2020 •

edited

Loading

worker000000 commented Nov 6, 2020

fanxinping commented Nov 8, 2020

polyactis commented Nov 8, 2020

worker000000 commented Nov 8, 2020

worker000000 commented Nov 8, 2020

fanxinping commented Nov 9, 2020

unmatched chromosome name get error #5

unmatched chromosome name get error #5

Comments

worker000000 commented Nov 6, 2020 • edited Loading

worker000000 commented Nov 6, 2020

fanxinping commented Nov 8, 2020

polyactis commented Nov 8, 2020

worker000000 commented Nov 8, 2020

worker000000 commented Nov 8, 2020

fanxinping commented Nov 9, 2020

worker000000 commented Nov 6, 2020 •

edited

Loading