diff --git a/README.md b/README.md index 233bca6..b6d5de3 100644 --- a/README.md +++ b/README.md @@ -1,7 +1,22 @@ -# Paraphase: HiFi-based SMN1/SMN2 variant caller - -SMN1, the gene that causes spinal muscular atrophy, is considered a 'dark' region of the genome due to high sequence similarity with its paralog SMN2. Paraphase is a Python tool that takes HiFi BAMs as input (whole-genome or enrichment), phases complete SMN1 and SMN2 haplotypes, determines copy numbers and makes phased variant calls for both genes. It also categorizes the haplotypes, enabling future haplotype-based screening of silent carriers (2+0). Please check out our paper for more details about the method and our population-wide haplotype analysis. - +# Paraphase: HiFi-based caller for highly homologous genes + +Many medically relevant genes fall into 'dark' regions where variat calling is limited due to high sequence homology with paralogs or pseudogenes. Paraphase is a Python tool that takes HiFi BAMs as input (whole-genome or enrichment), phases complete haplotypes for genes of the same family, determines copy numbers and makes phased variant calls. + +Paraphase supports the following genes: +- SMN1/SMN2 (spinal muscular atrophy) +- RCCX module + - CYP21A2 (21-Hydroxylase-Deficient Congenital Adrenal Hyperplasia) + - TNXB (Ehlers-Danlos syndrome) + - C4A/C4B (relevant in autoimmune diseases) +- PMS2 (Lynch Syndrome) +- STRC (hereditary hearing loss and deafness) +- IKBKG (Incontinentia Pigmenti) +- NCF1 (chronic granulomatous disease; Williams syndrome) +- NEB (Nemaline myopathy) +- F8 (intron 22 inversion, Hemophilia A) +- CFC1 (heterotaxy syndrome) + +Please check out our paper on its application to the gene SMN1 for more details about Paraphase. Chen X, Harting J, Farrow E, et al. Comprehensive SMN1 and SMN2 profiling for spinal muscular atrophy analysis using long-read PacBio HiFi sequencing. The American Journal of Human Genetics. 2023;0(0). doi:10.1016/j.ajhg.2023.01.001 For whole-genome sequencing (WGS) data, we recommend >20X, ideally 30X, genome coverage. Low coverage or short read length could result in less accurate phasing, especially when haplotypes are highly similar to each other in Exons 1-6. For hybrid capture-based enrichment data, a higher read depth (>50X) is recommended as the read length is generally shorter than WGS. @@ -10,7 +25,7 @@ Currently Paraphase only works on GRCh38. Support for GRCh37 will be adde in the ## Contact -If you need assistance or have suggestions, please don't hesitate to reach out by email or open a GitHub issue. +There is a need for building consensus on how to report variants in segmental duplication regions, which could be complicated due to the frequent presence of copy number changes. If you have suggestions or need assistance, please don't hesitate to reach out by email or open a GitHub issue. Xiao Chen: xchen@pacificbiosciences.com @@ -47,39 +62,68 @@ paraphase -l list.txt -o output_directory ``` Required parameters: -- `-b`: Input BAM file or `-l`: List of BAM files (one per line) +- `-b`: Input BAM file or `-l`: text file listing BAM files one per line - `-o`: Output directory Optional parameters: -- `-v`: If specified, Paraphase will produce VCFs for each haplotype. -- `-c`: Config file, default config file is `paraphase/data/smn1/config.yaml`. +- `-g`: Gene(s) to analyze. All supported genes will be analyzed if not specified. - `-t`: Number of threads, used when `-l` is specified. - `-d`: File listing average genome depth per sample, with two columns, sample ID and depth values, separated by tab or space. This saves run time by skipping the step to calculate genome depth. -- `--samtools` -- `--minimap2` - -The paths to samtools and minimap2 can be provided through the `--samtools` and `--minimap2` parameters or by modifying the `tools` section of the [config](paraphase/data/smn1/config.yaml) file. +- `--novcf`: no vcf output if specified. +- `--samtools`: path to samtools +- `--minimap2`: path to minimap2 -Note that currently only GRCh38 is supported. We will support GRCh37 in the future if there is request. +The paths to samtools and minimap2 can be provided through the `--samtools` and `--minimap2` parameters. ## Interpreting the output Paraphase produces a few output files in the directory specified by `-o`, with the sample ID as the prefix. - `_realigned_tagged.bam`: This BAM file can be loaded into IGV for visualization of haplotypes, see [haplotype visualization](docs/visualization.md). -- If `-v` is specified, Paraphase will generate VCF files. A VCF file is written for each haplotype, and there is also a `_variants.vcf` file containing merged variants from all haplotypes. -- `.json`: Main output file, summerizes haplotypes and variant calls for each sample. Details of the fields are explained below: - - `smn1_cn`: copy number of SMN1, a `null` call indicates that Paraphase finds only one haplotype but depth does not unambiguously support a copy number of one or two. - - `smn2_cn`: copy number of SMN2, a `null` call indicates that Paraphase finds only one haplotype but depth does not unambiguously support a copy number of one or two. - - `smn2_del78_cn`: copy number of SMN2Δ7–8 (SMN2 with a deletion of Exon7-8) - - `smn1_read_number`: number of reads containing c.840C - - `smn2_read_number`: number of reads containing c.840T - - `smn2_del78_read_number`: number of reads containing the known deletion of Exon7-8 on SMN2 - - `smn1_haplotypes`: phased SMN1 haplotypes - - `smn2_haplotypes`: phased SMN2 haplotypes - - `smn2_del78_haplotypes`: phased SMN2Δ7–8 haplotypes - - `two_copy_haplotypes`: haplotypes that are present in two copies based on depth. This happens when (in a small number of cases) two haplotypes are identical and we infer that there exist two of them instead of one by checking the read depth. - - `haplotype_details`: lists information about each haplotype - - `variants`: The variants contained in the haplotype, excluding those in homopolymer regions. For a complete set of variant calls, please use the `-v` option. - - `boundary`: The boundary of the region that is resolved on the haplotype. This is useful when a haplotype is only partially phased. - - `haplogroup`: The haplogroup that the haplotype is assigned to +- `.vcf`: A VCF file is written for each haplotype, and there is also a `_variants.vcf` file containing merged variants from all haplotypes. +- `.json`: Main output file, summerizes haplotypes and variant calls for each sample. Details of the fields are explained below for each gene. + +### SMN1 + +- `smn1_cn`: copy number of SMN1, a `null` call indicates that Paraphase finds only one haplotype but depth does not unambiguously support a copy number of one or two. +- `smn2_cn`: copy number of SMN2, a `null` call indicates that Paraphase finds only one haplotype but depth does not unambiguously support a copy number of one or two. +- `smn2_del78_cn`: copy number of SMN2Δ7–8 (SMN2 with a deletion of Exon7-8) +- `smn1_read_number`: number of reads containing c.840C +- `smn2_read_number`: number of reads containing c.840T +- `smn2_del78_read_number`: number of reads containing the known deletion of Exon7-8 on SMN2 +- `smn1_haplotypes`: phased SMN1 haplotypes +- `smn2_haplotypes`: phased SMN2 haplotypes +- `smn2_del78_haplotypes`: phased SMN2Δ7–8 haplotypes +- `two_copy_haplotypes`: haplotypes that are present in two copies based on depth. This happens when (in a small number of cases) two haplotypes are identical and we infer that there exist two of them instead of one by checking the read depth. +- `haplotype_details`: lists information about each haplotype + - `variants`: The variants contained in the haplotype, excluding those in homopolymer regions. + - `boundary`: The boundary of the region that is resolved on the haplotype. This is useful when a haplotype is only partially phased. + - `haplogroup`: The haplogroup that the haplotype is assigned to + +### RCCX, PMS2, NCF1, CFC1, STRC, IKBKG, NEB & F8 + +- `total_cn`: total copy number of the family (sum of gene and paralog/pseudogene) +- `gene_cn`: copy number of the gene of interest, when the gene and pseudogene can be easily distinguished with known sequence differences, as in PMS2/NCF1/STRC/IKBKG +- `final_haplotypes`: phased haplotypes +- `two_copy_haplotypes`: haplotypes that are present in two copies based on depth. This happens when (in a small number of cases) two haplotypes are identical and we infer that there exist two of them instead of one by checking the read depth. + +Multiple copies of the repeat are phased inito alleles with read-based phasing in the case of RCCX/IKBKG/NEB. Additional output entries include: +- `alleles_final`: haplotypes phased into alleles + +### RCCX + +More info fields on phasing haplotypes into alleles and annotation of CYP21A2: +- `annotated_alleles`: allele annotation for the CYP21A2 gene. This is only based on common gene-pseudogene (CYP21A2-CYP21A1P) conversions (P31L, IVS2-13A/C>G, G111Vfs, I173N, I237N, V238E, M240K, V282L, Q319X and R357W). Please refer to the vcfs for most thorough variant calling and annotation. +- `ending_hap`: the last copy of RCCX on each allele. Only these copies contain parts of TNXB (while the other copies contain TNXA) + +### IKBKG + +- `deletion_haplotypes`: haplotypes carrying the 11.8kb deletion + +### F8 + +Additional output is included to report SVs that occur between the repeat regions: +- `sv_called`: reports deletions/duplications between int22h-1 and int22h-2, or inversions between int22h-1 and int22h-3 + + + diff --git a/paraphase/data/cfc1/cfc1_config.yaml b/paraphase/data/cfc1/cfc1_config.yaml new file mode 100644 index 0000000..f480d3d --- /dev/null +++ b/paraphase/data/cfc1/cfc1_config.yaml @@ -0,0 +1,20 @@ +gene: cfc1 + +# data files +data: + reference: ref.fa + homopolymer: homopolymer_sites.txt + +# coordinates +coordinates: + hg38: + nchr: "chr2" + nchr_old: "chr2_130587000_130610000" + nchr_length: 242193529 + extract_region1: "chr2:130587000-130610000" + extract_region2: "chr2:130510753-130533741" + + pivot_site: 130593061 + + left_boundary: 130587000 + right_boundary: 130610000 \ No newline at end of file diff --git a/paraphase/data/cfc1/homopolymer_sites.txt b/paraphase/data/cfc1/homopolymer_sites.txt new file mode 100644 index 0000000..bafcee2 --- /dev/null +++ b/paraphase/data/cfc1/homopolymer_sites.txt @@ -0,0 +1,1152 @@ +chr2 130587001 C +chr2 130587002 C,G,T,A +chr2 130587003 0 +chr2 130587004 0 +chr2 130587005 0 +chr2 130587006 0 +chr2 130587007 C,1 +chr2 130587008 C,G,T,A +chr2 130587009 0 +chr2 130587010 0 +chr2 130587011 0 +chr2 130587012 0 +chr2 130587013 C,1 +chr2 130587265 C +chr2 130587266 C,G,T,A +chr2 130587267 0 +chr2 130587268 0 +chr2 130587269 0 +chr2 130587270 0 +chr2 130587271 C,1 +chr2 130587289 T +chr2 130587290 C,G,T,A +chr2 130587291 0 +chr2 130587292 0 +chr2 130587293 0 +chr2 130587294 0 +chr2 130587295 T,1 +chr2 130587458 G +chr2 130587459 C,G,T,A +chr2 130587460 0 +chr2 130587461 0 +chr2 130587462 0 +chr2 130587463 0 +chr2 130587464 G,1 +chr2 130587477 A +chr2 130587478 C,G,T,A +chr2 130587479 0 +chr2 130587480 0 +chr2 130587481 0 +chr2 130587482 0 +chr2 130587483 0 +chr2 130587484 0 +chr2 130587485 0 +chr2 130587486 0 +chr2 130587487 1,A +chr2 130587507 A +chr2 130587508 C,G,T,A +chr2 130587509 0 +chr2 130587510 0 +chr2 130587511 0 +chr2 130587512 T +chr2 130587513 C,G,T,A +chr2 130587514 0 +chr2 130587515 0 +chr2 130587516 0 +chr2 130587517 0 +chr2 130587518 T,1 +chr2 130587850 G +chr2 130587851 C,G,T,A +chr2 130587852 0 +chr2 130587853 0 +chr2 130587854 0 +chr2 130587855 0 +chr2 130587856 G,1 +chr2 130587880 C +chr2 130587881 C,G,T,A +chr2 130587882 0 +chr2 130587883 0 +chr2 130587884 0 +chr2 130587885 0 +chr2 130587886 C,1 +chr2 130588013 A +chr2 130588014 C,G,T,A +chr2 130588015 0 +chr2 130588016 0 +chr2 130588017 0 +chr2 130588018 0 +chr2 130588019 1,A +chr2 130588083 T +chr2 130588084 C,G,T,A +chr2 130588085 0 +chr2 130588086 0 +chr2 130588087 0 +chr2 130588088 0 +chr2 130588089 0 +chr2 130588090 0 +chr2 130588091 0 +chr2 130588092 0 +chr2 130588093 0 +chr2 130588094 0 +chr2 130588095 0 +chr2 130588096 0 +chr2 130588097 0 +chr2 130588098 0 +chr2 130588099 0 +chr2 130588100 T,1 +chr2 130588260 T +chr2 130588261 C,G,T,A +chr2 130588262 0 +chr2 130588263 0 +chr2 130588264 0 +chr2 130588265 0 +chr2 130588266 T,1 +chr2 130588388 T +chr2 130588389 C,G,T,A +chr2 130588390 0 +chr2 130588391 0 +chr2 130588392 0 +chr2 130588393 0 +chr2 130588394 0 +chr2 130588395 0 +chr2 130588396 0 +chr2 130588397 0 +chr2 130588398 T,1 +chr2 130588546 T +chr2 130588547 C,G,T,A +chr2 130588548 0 +chr2 130588549 0 +chr2 130588550 0 +chr2 130588551 0 +chr2 130588552 0 +chr2 130588553 0 +chr2 130588554 0 +chr2 130588555 0 +chr2 130588556 0 +chr2 130588557 0 +chr2 130588558 0 +chr2 130588559 0 +chr2 130588560 0 +chr2 130588561 0 +chr2 130588562 0 +chr2 130588563 0 +chr2 130588564 0 +chr2 130588565 0 +chr2 130588566 0 +chr2 130588567 T,1 +chr2 130588569 T +chr2 130588570 C,G,T,A +chr2 130588571 0 +chr2 130588572 0 +chr2 130588573 0 +chr2 130588574 0 +chr2 130588575 T,1 +chr2 130588795 C +chr2 130588796 C,G,T,A +chr2 130588797 0 +chr2 130588798 0 +chr2 130588799 0 +chr2 130588800 0 +chr2 130588801 C,1 +chr2 130589054 T +chr2 130589055 C,G,T,A +chr2 130589056 0 +chr2 130589057 0 +chr2 130589058 0 +chr2 130589059 0 +chr2 130589060 T,1 +chr2 130589450 T +chr2 130589451 C,G,T,A +chr2 130589452 0 +chr2 130589453 0 +chr2 130589454 0 +chr2 130589455 0 +chr2 130589456 T,1 +chr2 130589772 A +chr2 130589773 C,G,T,A +chr2 130589774 0 +chr2 130589775 0 +chr2 130589776 0 +chr2 130589777 0 +chr2 130589778 1,A +chr2 130590013 A +chr2 130590014 C,G,T,A +chr2 130590015 0 +chr2 130590016 0 +chr2 130590017 0 +chr2 130590018 0 +chr2 130590019 1,A +chr2 130590177 A +chr2 130590178 C,G,T,A +chr2 130590179 0 +chr2 130590180 0 +chr2 130590181 0 +chr2 130590182 0 +chr2 130590183 0 +chr2 130590184 1,A +chr2 130590185 C,G,T,A +chr2 130590186 0 +chr2 130590187 0 +chr2 130590188 0 +chr2 130590189 0 +chr2 130590190 0 +chr2 130590191 1,A +chr2 130590192 C,G,T,A +chr2 130590193 0 +chr2 130590194 0 +chr2 130590195 0 +chr2 130590196 0 +chr2 130590197 0 +chr2 130590198 0 +chr2 130590199 1,A +chr2 130590200 C,G,T,A +chr2 130590201 0 +chr2 130590202 0 +chr2 130590203 0 +chr2 130590204 0 +chr2 130590205 1,A +chr2 130590206 C,G,T,A +chr2 130590207 0 +chr2 130590208 0 +chr2 130590209 0 +chr2 130590210 0 +chr2 130590211 0 +chr2 130590212 1,A +chr2 130590943 G +chr2 130590944 C,G,T,A +chr2 130590945 0 +chr2 130590946 0 +chr2 130590947 0 +chr2 130590948 0 +chr2 130590949 G,1 +chr2 130591122 G +chr2 130591123 C,G,T,A +chr2 130591124 0 +chr2 130591125 0 +chr2 130591126 0 +chr2 130591127 0 +chr2 130591128 G,1 +chr2 130591154 G +chr2 130591155 C,G,T,A +chr2 130591156 0 +chr2 130591157 0 +chr2 130591158 0 +chr2 130591159 0 +chr2 130591160 G,1 +chr2 130591242 T +chr2 130591243 C,G,T,A +chr2 130591244 0 +chr2 130591245 0 +chr2 130591246 0 +chr2 130591247 0 +chr2 130591248 T,1 +chr2 130591588 T +chr2 130591589 C,G,T,A +chr2 130591590 0 +chr2 130591591 0 +chr2 130591592 0 +chr2 130591593 0 +chr2 130591594 0 +chr2 130591595 T,1 +chr2 130591609 A +chr2 130591610 C,G,T,A +chr2 130591611 0 +chr2 130591612 0 +chr2 130591613 0 +chr2 130591614 0 +chr2 130591615 0 +chr2 130591616 1,A +chr2 130591780 T +chr2 130591781 C,G,T,A +chr2 130591782 0 +chr2 130591783 0 +chr2 130591784 0 +chr2 130591785 0 +chr2 130591786 T,1 +chr2 130591935 T +chr2 130591936 C,G,T,A +chr2 130591937 0 +chr2 130591938 0 +chr2 130591939 0 +chr2 130591940 0 +chr2 130591941 0 +chr2 130591942 0 +chr2 130591943 0 +chr2 130591944 0 +chr2 130591945 0 +chr2 130591946 0 +chr2 130591947 0 +chr2 130591948 0 +chr2 130591949 0 +chr2 130591950 0 +chr2 130591951 0 +chr2 130591952 T,1 +chr2 130591954 T +chr2 130591955 C,G,T,A +chr2 130591956 0 +chr2 130591957 0 +chr2 130591958 0 +chr2 130591959 0 +chr2 130591960 T,1 +chr2 130592062 C +chr2 130592063 C,G,T,A +chr2 130592064 0 +chr2 130592065 0 +chr2 130592066 0 +chr2 130592067 0 +chr2 130592068 C,1 +chr2 130592079 A +chr2 130592080 C,G,T,A +chr2 130592081 0 +chr2 130592082 0 +chr2 130592083 0 +chr2 130592084 0 +chr2 130592085 0 +chr2 130592086 1,A +chr2 130592129 T +chr2 130592130 C,G,T,A +chr2 130592131 0 +chr2 130592132 0 +chr2 130592133 0 +chr2 130592134 0 +chr2 130592135 T,1 +chr2 130592184 T +chr2 130592185 C,G,T,A +chr2 130592186 0 +chr2 130592187 0 +chr2 130592188 0 +chr2 130592189 0 +chr2 130592190 T,1 +chr2 130592331 A +chr2 130592332 C,G,T,A +chr2 130592333 0 +chr2 130592334 0 +chr2 130592335 0 +chr2 130592336 0 +chr2 130592337 1,A +chr2 130592454 C +chr2 130592455 C,G,T,A +chr2 130592456 0 +chr2 130592457 0 +chr2 130592458 0 +chr2 130592459 0 +chr2 130592460 C,1 +chr2 130592603 C +chr2 130592604 C,G,T,A +chr2 130592605 0 +chr2 130592606 0 +chr2 130592607 0 +chr2 130592608 0 +chr2 130592609 C,1 +chr2 130592809 A +chr2 130592810 C,G,T,A +chr2 130592811 0 +chr2 130592812 0 +chr2 130592813 0 +chr2 130592814 0 +chr2 130592815 1,A +chr2 130593027 C +chr2 130593028 C,G,T,A +chr2 130593029 0 +chr2 130593030 0 +chr2 130593031 0 +chr2 130593032 0 +chr2 130593033 C,1 +chr2 130593080 A +chr2 130593081 C,G,T,A +chr2 130593082 0 +chr2 130593083 0 +chr2 130593084 0 +chr2 130593085 0 +chr2 130593086 1,A +chr2 130593326 G +chr2 130593327 C,G,T,A +chr2 130593328 0 +chr2 130593329 0 +chr2 130593330 0 +chr2 130593331 0 +chr2 130593332 G,1 +chr2 130593353 T +chr2 130593354 C,G,T,A +chr2 130593355 0 +chr2 130593356 0 +chr2 130593357 0 +chr2 130593358 0 +chr2 130593359 T,1 +chr2 130593632 C +chr2 130593633 C,G,T,A +chr2 130593634 0 +chr2 130593635 0 +chr2 130593636 0 +chr2 130593637 0 +chr2 130593638 C,1 +chr2 130594018 C +chr2 130594019 C,G,T,A +chr2 130594020 0 +chr2 130594021 0 +chr2 130594022 0 +chr2 130594023 0 +chr2 130594024 C,1 +chr2 130594232 C +chr2 130594233 C,G,T,A +chr2 130594234 0 +chr2 130594235 0 +chr2 130594236 0 +chr2 130594237 0 +chr2 130594238 0 +chr2 130594239 C,1 +chr2 130594569 T +chr2 130594570 C,G,T,A +chr2 130594571 0 +chr2 130594572 0 +chr2 130594573 0 +chr2 130594574 0 +chr2 130594575 T,1 +chr2 130594591 A +chr2 130594592 C,G,T,A +chr2 130594593 0 +chr2 130594594 0 +chr2 130594595 0 +chr2 130594596 0 +chr2 130594597 1,A +chr2 130594790 C +chr2 130594791 C,G,T,A +chr2 130594792 0 +chr2 130594793 0 +chr2 130594794 0 +chr2 130594795 0 +chr2 130594796 C,1 +chr2 130595089 A +chr2 130595090 C,G,T,A +chr2 130595091 0 +chr2 130595092 0 +chr2 130595093 0 +chr2 130595094 0 +chr2 130595095 0 +chr2 130595096 0 +chr2 130595097 1,A +chr2 130595129 T +chr2 130595130 C,G,T,A +chr2 130595131 0 +chr2 130595132 0 +chr2 130595133 0 +chr2 130595134 0 +chr2 130595135 T,1 +chr2 130595287 T +chr2 130595288 C,G,T,A +chr2 130595289 0 +chr2 130595290 0 +chr2 130595291 0 +chr2 130595292 0 +chr2 130595293 0 +chr2 130595294 T,1 +chr2 130595567 A +chr2 130595568 C,G,T,A +chr2 130595569 0 +chr2 130595570 0 +chr2 130595571 0 +chr2 130595572 0 +chr2 130595573 1,A +chr2 130595574 C,G,T,A +chr2 130595575 0 +chr2 130595576 0 +chr2 130595577 0 +chr2 130595578 0 +chr2 130595579 0 +chr2 130595580 1,A +chr2 130595581 C,G,T,A +chr2 130595582 0 +chr2 130595583 0 +chr2 130595584 0 +chr2 130595585 0 +chr2 130595586 0 +chr2 130595587 0 +chr2 130595588 1,A +chr2 130595737 A +chr2 130595738 C,G,T,A +chr2 130595739 0 +chr2 130595740 0 +chr2 130595741 0 +chr2 130595742 0 +chr2 130595743 0 +chr2 130595744 0 +chr2 130595745 0 +chr2 130595746 0 +chr2 130595747 0 +chr2 130595748 0 +chr2 130595749 1,A +chr2 130596365 C +chr2 130596366 C,G,T,A +chr2 130596367 0 +chr2 130596368 0 +chr2 130596369 0 +chr2 130596370 0 +chr2 130596371 C,1 +chr2 130596481 G +chr2 130596482 C,G,T,A +chr2 130596483 0 +chr2 130596484 0 +chr2 130596485 0 +chr2 130596486 0 +chr2 130596487 G,1 +chr2 130596590 G +chr2 130596591 C,G,T,A +chr2 130596592 0 +chr2 130596593 0 +chr2 130596594 0 +chr2 130596595 0 +chr2 130596596 G,1 +chr2 130596728 G +chr2 130596729 C,G,T,A +chr2 130596730 0 +chr2 130596731 0 +chr2 130596732 0 +chr2 130596733 0 +chr2 130596734 G,1 +chr2 130596878 G +chr2 130596879 C,G,T,A +chr2 130596880 0 +chr2 130596881 0 +chr2 130596882 0 +chr2 130596883 0 +chr2 130596884 G,1 +chr2 130597267 T +chr2 130597268 C,G,T,A +chr2 130597269 0 +chr2 130597270 0 +chr2 130597271 0 +chr2 130597272 0 +chr2 130597273 T,1 +chr2 130597456 G +chr2 130597457 C,G,T,A +chr2 130597458 0 +chr2 130597459 0 +chr2 130597460 0 +chr2 130597461 0 +chr2 130597462 G,1 +chr2 130597514 G +chr2 130597515 C,G,T,A +chr2 130597516 0 +chr2 130597517 0 +chr2 130597518 0 +chr2 130597519 0 +chr2 130597520 G,1 +chr2 130597675 C +chr2 130597676 C,G,T,A +chr2 130597677 0 +chr2 130597678 0 +chr2 130597679 0 +chr2 130597680 0 +chr2 130597681 C,1 +chr2 130597694 C +chr2 130597695 C,G,T,A +chr2 130597696 0 +chr2 130597697 0 +chr2 130597698 0 +chr2 130597699 0 +chr2 130597700 C,1 +chr2 130597715 C +chr2 130597716 C,G,T,A +chr2 130597717 0 +chr2 130597718 0 +chr2 130597719 0 +chr2 130597720 0 +chr2 130597721 0 +chr2 130597722 C,1 +chr2 130597757 C +chr2 130597758 C,G,T,A +chr2 130597759 0 +chr2 130597760 0 +chr2 130597761 0 +chr2 130597762 0 +chr2 130597763 C,1 +chr2 130597799 C +chr2 130597800 C,G,T,A +chr2 130597801 0 +chr2 130597802 0 +chr2 130597803 0 +chr2 130597804 0 +chr2 130597805 C,1 +chr2 130597833 C +chr2 130597834 C,G,T,A +chr2 130597835 0 +chr2 130597836 0 +chr2 130597837 0 +chr2 130597838 0 +chr2 130597839 C,1 +chr2 130598012 C +chr2 130598013 C,G,T,A +chr2 130598014 0 +chr2 130598015 0 +chr2 130598016 0 +chr2 130598017 0 +chr2 130598018 0 +chr2 130598019 C,1 +chr2 130598456 T +chr2 130598457 C,G,T,A +chr2 130598458 0 +chr2 130598459 0 +chr2 130598460 0 +chr2 130598461 0 +chr2 130598462 T,1 +chr2 130598681 C +chr2 130598682 C,G,T,A +chr2 130598683 0 +chr2 130598684 0 +chr2 130598685 0 +chr2 130598686 0 +chr2 130598687 C,1 +chr2 130598819 A +chr2 130598820 C,G,T,A +chr2 130598821 0 +chr2 130598822 0 +chr2 130598823 0 +chr2 130598824 0 +chr2 130598825 0 +chr2 130598826 1,A +chr2 130599031 A +chr2 130599032 C,G,T,A +chr2 130599033 0 +chr2 130599034 0 +chr2 130599035 0 +chr2 130599036 0 +chr2 130599037 1,A +chr2 130599117 T +chr2 130599118 C,G,T,A +chr2 130599119 0 +chr2 130599120 0 +chr2 130599121 0 +chr2 130599122 0 +chr2 130599123 T,1 +chr2 130599286 T +chr2 130599287 C,G,T,A +chr2 130599288 0 +chr2 130599289 0 +chr2 130599290 0 +chr2 130599291 0 +chr2 130599292 T,1 +chr2 130599497 G +chr2 130599498 C,G,T,A +chr2 130599499 0 +chr2 130599500 0 +chr2 130599501 0 +chr2 130599502 0 +chr2 130599503 G,1 +chr2 130599561 G +chr2 130599562 C,G,T,A +chr2 130599563 0 +chr2 130599564 0 +chr2 130599565 0 +chr2 130599566 0 +chr2 130599567 0 +chr2 130599568 G,1 +chr2 130599585 C +chr2 130599586 C,G,T,A +chr2 130599587 0 +chr2 130599588 0 +chr2 130599589 0 +chr2 130599590 0 +chr2 130599591 0 +chr2 130599592 C,1 +chr2 130599681 A +chr2 130599682 C,G,T,A +chr2 130599683 0 +chr2 130599684 0 +chr2 130599685 0 +chr2 130599686 0 +chr2 130599687 0 +chr2 130599688 0 +chr2 130599689 0 +chr2 130599690 0 +chr2 130599691 0 +chr2 130599692 0 +chr2 130599693 0 +chr2 130599694 0 +chr2 130599695 1,A +chr2 130599707 A +chr2 130599708 C,G,T,A +chr2 130599709 0 +chr2 130599710 0 +chr2 130599711 0 +chr2 130599712 0 +chr2 130599713 1,A +chr2 130599889 C +chr2 130599890 C,G,T,A +chr2 130599891 0 +chr2 130599892 0 +chr2 130599893 0 +chr2 130599894 0 +chr2 130599895 C,1 +chr2 130599934 G +chr2 130599935 C,G,T,A +chr2 130599936 0 +chr2 130599937 0 +chr2 130599938 0 +chr2 130599939 0 +chr2 130599940 G,1 +chr2 130599949 C +chr2 130599950 C,G,T,A +chr2 130599951 0 +chr2 130599952 0 +chr2 130599953 0 +chr2 130599954 0 +chr2 130599955 C,1 +chr2 130599993 C +chr2 130599994 C,G,T,A +chr2 130599995 0 +chr2 130599996 0 +chr2 130599997 0 +chr2 130599998 0 +chr2 130599999 C,1 +chr2 130600329 A +chr2 130600330 C,G,T,A +chr2 130600331 0 +chr2 130600332 0 +chr2 130600333 0 +chr2 130600334 0 +chr2 130600335 1,A +chr2 130600343 A +chr2 130600344 C,G,T,A +chr2 130600345 0 +chr2 130600346 0 +chr2 130600347 0 +chr2 130600348 0 +chr2 130600349 1,A +chr2 130600474 G +chr2 130600475 C,G,T,A +chr2 130600476 0 +chr2 130600477 0 +chr2 130600478 0 +chr2 130600479 0 +chr2 130600480 0 +chr2 130600481 G,1 +chr2 130600628 T +chr2 130600629 C,G,T,A +chr2 130600630 0 +chr2 130600631 0 +chr2 130600632 0 +chr2 130600633 0 +chr2 130600634 0 +chr2 130600635 T,1 +chr2 130600782 T +chr2 130600783 C,G,T,A +chr2 130600784 0 +chr2 130600785 0 +chr2 130600786 0 +chr2 130600787 0 +chr2 130600788 0 +chr2 130600789 0 +chr2 130600790 0 +chr2 130600791 0 +chr2 130600792 0 +chr2 130600793 T,1 +chr2 130600918 A +chr2 130600919 C,G,T,A +chr2 130600920 0 +chr2 130600921 0 +chr2 130600922 0 +chr2 130600923 0 +chr2 130600924 1,A +chr2 130600932 T +chr2 130600933 C,G,T,A +chr2 130600934 0 +chr2 130600935 0 +chr2 130600936 0 +chr2 130600937 0 +chr2 130600938 0 +chr2 130600939 0 +chr2 130600940 T,1 +chr2 130601088 T +chr2 130601089 C,G,T,A +chr2 130601090 0 +chr2 130601091 0 +chr2 130601092 0 +chr2 130601093 0 +chr2 130601094 0 +chr2 130601095 0 +chr2 130601096 T,1 +chr2 130601098 T +chr2 130601099 C,G,T,A +chr2 130601100 0 +chr2 130601101 0 +chr2 130601102 0 +chr2 130601103 0 +chr2 130601104 T,1 +chr2 130601222 A +chr2 130601223 C,G,T,A +chr2 130601224 0 +chr2 130601225 0 +chr2 130601226 0 +chr2 130601227 0 +chr2 130601228 1,A +chr2 130601383 T +chr2 130601384 C,G,T,A +chr2 130601385 0 +chr2 130601386 0 +chr2 130601387 0 +chr2 130601388 0 +chr2 130601389 0 +chr2 130601390 T,1 +chr2 130601491 C +chr2 130601492 C,G,T,A +chr2 130601493 0 +chr2 130601494 0 +chr2 130601495 0 +chr2 130601496 0 +chr2 130601497 C,1 +chr2 130601633 T +chr2 130601634 C,G,T,A +chr2 130601635 0 +chr2 130601636 0 +chr2 130601637 0 +chr2 130601638 0 +chr2 130601639 0 +chr2 130601640 T,1 +chr2 130601650 A +chr2 130601651 C,G,T,A +chr2 130601652 0 +chr2 130601653 0 +chr2 130601654 0 +chr2 130601655 0 +chr2 130601656 1,A +chr2 130601727 A +chr2 130601728 C,G,T,A +chr2 130601729 0 +chr2 130601730 0 +chr2 130601731 0 +chr2 130601732 0 +chr2 130601733 0 +chr2 130601734 0 +chr2 130601735 0 +chr2 130601736 0 +chr2 130601737 1,A +chr2 130602574 T +chr2 130602575 C,G,T,A +chr2 130602576 0 +chr2 130602577 0 +chr2 130602578 0 +chr2 130602579 0 +chr2 130602580 T,1 +chr2 130602775 T +chr2 130602776 C,G,T,A +chr2 130602777 0 +chr2 130602778 0 +chr2 130602779 0 +chr2 130602780 0 +chr2 130602781 T,1 +chr2 130602782 C,G,T,A +chr2 130602783 0 +chr2 130602784 0 +chr2 130602785 0 +chr2 130602786 0 +chr2 130602787 T,1 +chr2 130602818 T +chr2 130602819 C,G,T,A +chr2 130602820 0 +chr2 130602821 0 +chr2 130602822 0 +chr2 130602823 0 +chr2 130602824 T,1 +chr2 130602914 G +chr2 130602915 C,G,T,A +chr2 130602916 0 +chr2 130602917 0 +chr2 130602918 0 +chr2 130602919 0 +chr2 130602920 0 +chr2 130602921 G,1 +chr2 130603166 T +chr2 130603167 C,G,T,A +chr2 130603168 0 +chr2 130603169 0 +chr2 130603170 0 +chr2 130603171 0 +chr2 130603172 T,1 +chr2 130603262 A +chr2 130603263 C,G,T,A +chr2 130603264 0 +chr2 130603265 0 +chr2 130603266 0 +chr2 130603267 0 +chr2 130603268 1,A +chr2 130603278 A +chr2 130603279 C,G,T,A +chr2 130603280 0 +chr2 130603281 0 +chr2 130603282 0 +chr2 130603283 0 +chr2 130603284 1,A +chr2 130603318 G +chr2 130603319 C,G,T,A +chr2 130603320 0 +chr2 130603321 0 +chr2 130603322 0 +chr2 130603323 0 +chr2 130603324 G,1 +chr2 130603439 A +chr2 130603440 C,G,T,A +chr2 130603441 0 +chr2 130603442 0 +chr2 130603443 0 +chr2 130603444 0 +chr2 130603445 0 +chr2 130603446 0 +chr2 130603447 0 +chr2 130603448 0 +chr2 130603449 1,A +chr2 130603452 A +chr2 130603453 C,G,T,A +chr2 130603454 0 +chr2 130603455 0 +chr2 130603456 0 +chr2 130603457 0 +chr2 130603458 1,A +chr2 130603723 A +chr2 130603724 C,G,T,A +chr2 130603725 0 +chr2 130603726 0 +chr2 130603727 0 +chr2 130603728 0 +chr2 130603729 0 +chr2 130603730 0 +chr2 130603731 0 +chr2 130603732 0 +chr2 130603733 0 +chr2 130603734 1,A +chr2 130604335 T +chr2 130604336 C,G,T,A +chr2 130604337 0 +chr2 130604338 0 +chr2 130604339 0 +chr2 130604340 0 +chr2 130604341 T,1 +chr2 130604414 A +chr2 130604415 C,G,T,A +chr2 130604416 0 +chr2 130604417 0 +chr2 130604418 0 +chr2 130604419 0 +chr2 130604420 1,A +chr2 130604523 T +chr2 130604524 C,G,T,A +chr2 130604525 0 +chr2 130604526 0 +chr2 130604527 0 +chr2 130604528 0 +chr2 130604529 T,1 +chr2 130604875 G +chr2 130604876 C,G,T,A +chr2 130604877 0 +chr2 130604878 0 +chr2 130604879 0 +chr2 130604880 0 +chr2 130604881 G,1 +chr2 130605019 G +chr2 130605020 C,G,T,A +chr2 130605021 0 +chr2 130605022 0 +chr2 130605023 0 +chr2 130605024 0 +chr2 130605025 G,1 +chr2 130605334 A +chr2 130605335 C,G,T,A +chr2 130605336 0 +chr2 130605337 0 +chr2 130605338 0 +chr2 130605339 0 +chr2 130605340 0 +chr2 130605341 1,A +chr2 130605489 A +chr2 130605490 C,G,T,A +chr2 130605491 0 +chr2 130605492 0 +chr2 130605493 0 +chr2 130605494 0 +chr2 130605495 0 +chr2 130605496 0 +chr2 130605497 0 +chr2 130605498 0 +chr2 130605499 1,A +chr2 130605500 C,G,T,A +chr2 130605501 0 +chr2 130605502 0 +chr2 130605503 0 +chr2 130605504 0 +chr2 130605505 0 +chr2 130605506 0 +chr2 130605507 0 +chr2 130605508 0 +chr2 130605509 0 +chr2 130605510 0 +chr2 130605511 0 +chr2 130605512 0 +chr2 130605513 1,A +chr2 130605514 C,G,T,A +chr2 130605515 0 +chr2 130605516 0 +chr2 130605517 0 +chr2 130605518 0 +chr2 130605519 1,A +chr2 130605520 A +chr2 130605521 C,G,T,A +chr2 130605522 0 +chr2 130605523 0 +chr2 130605524 0 +chr2 130605525 0 +chr2 130605526 0 +chr2 130605527 1,A +chr2 130605545 T +chr2 130605546 C,G,T,A +chr2 130605547 0 +chr2 130605548 0 +chr2 130605549 0 +chr2 130605550 0 +chr2 130605551 0 +chr2 130605552 T,1 +chr2 130605577 A +chr2 130605578 C,G,T,A +chr2 130605579 0 +chr2 130605580 0 +chr2 130605581 0 +chr2 130605582 0 +chr2 130605583 1,A +chr2 130605712 A +chr2 130605713 C,G,T,A +chr2 130605714 0 +chr2 130605715 0 +chr2 130605716 0 +chr2 130605717 0 +chr2 130605718 1,A +chr2 130605889 A +chr2 130605890 C,G,T,A +chr2 130605891 0 +chr2 130605892 0 +chr2 130605893 0 +chr2 130605894 0 +chr2 130605895 0 +chr2 130605896 1,A +chr2 130606261 A +chr2 130606262 C,G,T,A +chr2 130606263 0 +chr2 130606264 0 +chr2 130606265 0 +chr2 130606266 0 +chr2 130606267 1,A +chr2 130606390 A +chr2 130606391 C,G,T,A +chr2 130606392 0 +chr2 130606393 0 +chr2 130606394 0 +chr2 130606395 0 +chr2 130606396 0 +chr2 130606397 1,A +chr2 130606594 A +chr2 130606595 C,G,T,A +chr2 130606596 0 +chr2 130606597 0 +chr2 130606598 0 +chr2 130606599 0 +chr2 130606600 0 +chr2 130606601 1,A +chr2 130607033 G +chr2 130607034 C,G,T,A +chr2 130607035 0 +chr2 130607036 0 +chr2 130607037 0 +chr2 130607038 0 +chr2 130607039 G,1 +chr2 130608012 A +chr2 130608013 C,G,T,A +chr2 130608014 0 +chr2 130608015 0 +chr2 130608016 0 +chr2 130608017 0 +chr2 130608018 1,A +chr2 130608122 G +chr2 130608123 C,G,T,A +chr2 130608124 0 +chr2 130608125 0 +chr2 130608126 0 +chr2 130608127 0 +chr2 130608128 G,1 +chr2 130608151 G +chr2 130608152 C,G,T,A +chr2 130608153 0 +chr2 130608154 0 +chr2 130608155 0 +chr2 130608156 0 +chr2 130608157 G,1 +chr2 130608501 G +chr2 130608502 C,G,T,A +chr2 130608503 0 +chr2 130608504 0 +chr2 130608505 0 +chr2 130608506 0 +chr2 130608507 G,1 +chr2 130608585 A +chr2 130608586 C,G,T,A +chr2 130608587 0 +chr2 130608588 0 +chr2 130608589 0 +chr2 130608590 0 +chr2 130608591 0 +chr2 130608592 0 +chr2 130608593 1,A +chr2 130608672 A +chr2 130608673 C,G,T,A +chr2 130608674 0 +chr2 130608675 0 +chr2 130608676 0 +chr2 130608677 0 +chr2 130608678 1,A +chr2 130608690 A +chr2 130608691 C,G,T,A +chr2 130608692 0 +chr2 130608693 0 +chr2 130608694 0 +chr2 130608695 0 +chr2 130608696 0 +chr2 130608697 0 +chr2 130608698 1,A +chr2 130609152 A +chr2 130609153 C,G,T,A +chr2 130609154 0 +chr2 130609155 0 +chr2 130609156 0 +chr2 130609157 0 +chr2 130609158 1,A +chr2 130609364 G +chr2 130609365 C,G,T,A +chr2 130609366 0 +chr2 130609367 0 +chr2 130609368 0 +chr2 130609369 0 +chr2 130609370 G,1 +chr2 130609394 C +chr2 130609395 C,G,T,A +chr2 130609396 0 +chr2 130609397 0 +chr2 130609398 0 +chr2 130609399 0 +chr2 130609400 C,1 +chr2 130609411 G +chr2 130609412 C,G,T,A +chr2 130609413 0 +chr2 130609414 0 +chr2 130609415 0 +chr2 130609416 0 +chr2 130609417 G,1 +chr2 130609635 A +chr2 130609636 C,G,T,A +chr2 130609637 0 +chr2 130609638 0 +chr2 130609639 0 +chr2 130609640 0 +chr2 130609641 1,A +chr2 130609786 G +chr2 130609787 C,G,T,A +chr2 130609788 0 +chr2 130609789 0 +chr2 130609790 0 +chr2 130609791 0 +chr2 130609792 G,1 diff --git a/paraphase/data/cfc1/ref.fa b/paraphase/data/cfc1/ref.fa new file mode 100644 index 0000000..dbff2b6 --- /dev/null +++ b/paraphase/data/cfc1/ref.fa @@ -0,0 +1,385 @@ +>chr2_130587000_130610000 +CACCCCCTCCCCCATCCCCTGCGGAGCTTTTCTCAGCCTGCGCCTTCCCGCGCAGGAAGA +GGACCATGTGCGGTCCTGACGCTCTTCCCTGAAGAAACCCTCATGTCGATTTTCATATTA +TAGTGTTTCTTGCTCACCACACAGGAATTATGGCGGGCAGGAGTGAAAGAGACAGACTTT +CCACACGATAACGGCATTTCCTATTCTTGGCAACATTGTTTAAAGCAAAATATTAGACCC +CAAATGGTGAAACTTGGAGGATGATTCCCCCAGCTCTGCCCAATGGCTCCTTTTTCCTGA +GGTTCTCAGCAGCACTTGGCGGATGTGGCTACACCGAGGTCTGACTGCAGAAGGCAGGAA +CCCTGTCCTGCCTTCCAGCTATTGAGATCCTGGGAGGGCTGACTGAGGCCGCCTGTAATA +CCAATGGGAATGAGAGAGGGGCTGGAGTGATGCTGCAGTGGGGGAAGTTTCTTTTATTAA +AAAAAAATATATTATTTAATTTTAATTTAAAAATTTTTGAGACAGGGTCTTGCTCTGTAG +CTTAGGCTGGAGTGCAGCGGTGGCATCATATTATATAGTTCACTGCAGCCTCTACCTCCA +GGGCTCAGTGATCCTCCTGCCTCAGCCTCCCAAAGTGTGGTGATTACAGGGGTGAGCCAC +TGTGTACCTGGCTGGAAGTTTTCATTTGATGCCAAAAGTGTCTGAGGGAGCAAAGACTCT +GGGATGACTGAAGTCTCAGGCCTGGCAAGCACCCCACACAGGGAGGCTCAGGAACAAGGG +TTAAAATGATAATTGACGTCAAACTTGTAGTGGAGGTTTACAGTCTCTCCTAGACTGCAG +AAAAGGTGCCTGGGGGTTGGAATCCAGGGGCCAAACTGCAGCCCCCTGATTTCCTGTTCG +CTGGCTTCCGGACCTGCTGAATTCTCCCACCTCCTCAAGCCTGGGTTTTCTACATAAGTC +AAAGACCGTAAAGGTGGCTTTTCTTTCATCATGATCTATCACATGCCATGAGGGAAAAAT +ACAAAGGTCTGCTTGGAGGTGAGATTTATTATTATTATTATTATTATTATTATTATTATT +ATTATTTTTTTTTTTTTTTTGAGAGGGAGTCTGGCTGTGTCGCCCAGGCTGGATGGAGTG +CAGTGGCCGAGCTTGGCTCACTGCAAGCTCTGCTTCCTGGGTTCACGCCATTCTCCCACC +TCAGCCTCCCGAGTAGCTAGGACTACAGGCGCCTGCCACCATGCCCGGCTAATTTTGTGC +ATTTTTAGTAGAGACGGGGTTTAACCATGTTAGCCAGGATGGTCTCGATCTCCTGACCTC +GTGATCCGCCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACAGCGCCT +GGCCTATTATTTTTTTTTGAGACAGAGTCTAGCTTTGTCGTCCAGGCTGGAGTGCAGTGG +CACAATCTCTGCTCACTGCAACCTCCGCCTCCCCGGTTCAAGCAATTCTCCTGCCTCAGC +CTCTGGAGTAGCTGACATTACAGGCATTTGCCACCACACCCGGCTGATTTTTTTTTTTTT +TTTTTTTGTATTTTTAGTAGAGTCGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACT +CCTGACCTTGTGATCCACCCGCCTCGGCCTCCCAAAGTGCAGGGATTACAGGCGTGAGCC +ACTGCACCCCGCCTGAGATTCATTATTAAATGAAGTCATTGATTCCGTGACAGATTCTGA +AGCAGGGTGTGTCCGTTTAGAACCCAGTGAGCATGTTCTGGGTGGTGATGAGACTTCCCC +CTAGGTTTACTGAGGTGTCCTTGCCAGAGAGTAAACCTCAGAGGCAGCAGCAGGCTGCTG +AGCACAATGCTGGGTCCGAGCAGATGCTTGTTACTCATCATCCTCGTCTCGGCTCTGGGG +TGTGCCCCGCCAACGAGACAGAAAGTCACCCCACCAGCATCAGGGGAGCTTTAGCAAGGA +CAGCCCAGAAGCAAGGCCTCAGTGGCTCCTTAGTCCATGGCGGTTCTAATGGTGTGAACA +GCTCTTGCCTATTTGTTTTTAAATGTGGGTTTTATTGTTATTTAGGTAGGACAAGAATGC +AGACATGGAAATGATGAAAAGATAGTTTATTATATTCATAGGGCAGGAGGACAGTCCACC +CCACCCCACACAGGGCCACATGGAGAGGCACCAGGGTTGGTCAGGAAGCAAAGGGAGCAA +GGGGAAAATGTACCAGGCAACTAGGGACAGCCCCTGTGCCCTGAAGCCTGCAGAAATGAC +TCACTAGCCAACCCTACACCATCTGCCCCTGCCTCACCTACTCCTTCCCACAGAAGCCAC +AGTACAGGCTCCTGCCCACAGTGCCCACTCCCTCTGCCTCCCGGCCCACCCTGGGGCTGC +CCCATGTGGCCCTGCATGACACGCCTCCTGTTTCTAAGTGAGTATACACACTTTTTCCTC +CTTGACATCATTTCCAGGTCTGCATTCTTACCATATCTGGTTAAAACGAATCTCATTTGA +AAACGAACTGACTCGGCAAATAAATCACTCGCGAACAAACTAGTAATGTTCTCACATATG +TAGTGCCAGAGTTTAGCTGTTTCAGGTAATTAACATAATTTGTATAAGGATTAGACATAG +TTCTACAGTAGATCACATTTCAGATTTAATGACAATAAATTTGTTGGAGAGTCAGTGAGT +TGGGATGCAACTCCACTTGTCAAATCATGGTTGCATCACAACCATAGGTTTGAGTACAGA +TAGAAGATCTGATAAAAATCAACAAAAGCATATAGGCTACATGGAGTAAAGAGTACTGTA +TATTTTGTTGTTACTTGAAATTCTGTGCTATAATGTACTTTATATCAGTAAAACTTATAA +TAAAGTGATATCGGCCGGGTGCAGAGGCTCAGGCCTGTAATCCCACCCAGCACTTAGGGA +GGCCGAGGCGGGTGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACACAGTGAA +ACCCCGTCTCTACTAAAAATACACAAAGTTAGCCGGGCATGGTGGCGGGTGCCTGTAGTC +CCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGTGTGAACCCGGGAGGTGGAGTTTGCAA +TGAGCCGAGATCGCACCACTGCACTTCCAGCCTGGGCGACAAAGCGTGATTCCATCTCAA +AAAATAAAAAATAAAAAAACAAAAACAAAAAACAAGTGATATCAGCATACTGATGCACCT +GTTTTGTTTCTGGAGCGCCAGTTCTTACACACTGAGCAGCACATCCTTCCCCTGGGAGTA +CTGCAGACATTTCCCAGCCTAGGGCCCCACTCACTTGGCAGCACTCAGCTCCTGGGGATC +CAGCTAGCTCTGTGGCCACAAACTTCCTTAGTACTGTGCTGGGGTTCTGAAGGGGTCCAG +GTGCTGGCCAGATCAATGTGAGTCGAGCAGCTGTAACCATGGGAGACACAGCCCCTTGTT +TTCCTTTCCCTATAACAGACAGTCCAAGTGAGATAAGTGCCTGCGCATAGAAAGCTTATC +AAACTTAATTTTAAATACTCAAAACTGAATTCTGTGGAAAAGTCACTCCCAGAAACTTTT +GTGGTGTGATGTATGGCTGTATGTGCCAGAGTGGCTTTGCAGACAATGGAAACTGGGAAA +ATATCCAGAGACTGCTTTTGAACGCTCACAGCCAGAAGAGTCGATTTGGTTCTAGGCTAG +AGTGGGGAAATTGTGATGTGGCTGAAAGGAACAAGAGTGCCCCTCAGCCGGCAGGCATTC +GCTGGGGCTCCGAAGTTAAACCCCAACATAGGAATGGGCTGAACATACCAGCCCTGTGCT +GGTACAAGTCCTTATGCATCCCCTGCGTCTCCTCCCAGTGTCGCTGGTGAGTGGCTCCCT +CGGAGAGGGGATGTGAGGACAGATTTTGCTCTTCTGGGATGTGTGGGGGATGACTGAGCC +AGGGCTAAGCTGGCAAGGAAAGGGCAGCTTCTATAAACACTGCAGAATGGACACAGCTGG +GGACTGCTCTGGGAAGGGAGATGCCCTCCCTCCATCCGGGTGAGCAAAGCCAGAGGTGTG +AACAGGCTCTTCCCAGCAAGGGGTGCTGGCAGATAGTGGCCTAGGGGGATTTCTCTATCT +CTCCTCCTCGACTGCGGGGGACAGATAAGATCTGGGACACATTTATGTGAAGCTAACAGC +CATGAGCCCAGCATTGCTCTTTTCTCTCCTGAGAAGTTTCTCCTTTTTAACTGTCTCCCT +GGACGCCTGGGCTGCCAAAAGAAAATGCCTTTTATCTTCAAATTCTCCTTGTCTTTTCAG +AGCTAGGTACTTAAGGGCTTCTCAGTTAGTAGACAGATAAATGGTAACTTTGGCTTTCTT +GCTTATAATCAAGAATCATTCCACTAGAAGGATCCATTCTAGTATTCTAGTAGAGAAATC +CATAGAGCCTGATTTCTAATTTTCTAATTAAAACGTATTTATACCCTATTCATACTAATA +TTGAACAGTTCTCAGTTTTCCTTTTAACCACAACTGTGACCTCTCAAGTTCCATTTCCTA +AGTGCACCTACTGAAAATAAGGCAGTGTCTTTTTTGCCTGCCAGTGTTGTAAAAAATAGG +CATTTTGTAAACACAAATTTTATGAATAGCATATACATATTCATCAAGCTCTTAGGCTAA +TTTGCAAAGTTCATCACTTGTTGATTCCTACAATAAACTGAAATAAGGAAAGGTACTGCT +CAGAACACACTTGAAACAACTTTGGTATAAATATATATATATTTTTGAGACGGAGTCTCG +CTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGCGAGCTGCGCTTCC +CGGGTTCACTCACACCATTCTGCCTCAGCCTCCCAAGTAGCTAGGACTACAGGCGCCCGC +CACCACGCCCAGCTAATTTTTTTTTTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACC +GTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCACCTCGGCCTCCCAA +AGTGCTGAGATTACAGGCGTGAGCCCCCGCGCCGGGTCTCAAAAAATATTAATGCCTAGA +AATATTTATGTCTCAATACTTGAATAATTATTTTTCTCATGAACCAATGAACTCCAAGCC +AGCCCTCTAATGAAGTTCTACTTTATTTTTCTTTAATACTCATTTGAATGTTACGTTAAA +TTCATATTTTCATAGCCAATAGTTGAAAAGATTGAAAATGCCAATAAACTAATTGACAAT +ATAAAATGTAAAAGTAATTATGTCACACAGGCTCATGAACTACAGTGGATTTAAAAAGCT +GAAGAGCTCAATGCAAGTGCCCTCTACCCGAAGTGTGTGCTCTTGAGGGCACATTCCGCA +TGGCCAAGCCTTTGTGGTGTCGCCGGGCCTGGCGTCCTCAGCTAGTGGCGCACATCCCCC +TGGACTGGTTGTGGACACGACAGCCTCCTCAGATTGGGAAGCGACCTGAGAAGCGGTGCT +CGGAAGACAGAGGAGGCTGGCCTGGCGCTCTTCAGGCTGAACTCTGCCTGCGTTGCCTTG +CGGTGGGTTCCACAGCAGACGCTTCCCCCTTAAGGGAGATGTTGTGAAAGGAAGGTGCTA +ACTGTCAGGGGAAGGTGGCGGTGCCAGGCTTTCTGGCTCCAGCTTTGGCTCCTTCTCGCT +GTTGTTCTGCTGTCTCTACCTACCGGTCACAGTGGTGCTGGGGACACACGTGTCCCTCTC +CCAAGGATCTGGAGCCAAAGGCGCTTTCATGAAAATATTTATTACCCAATAAAAATATTC +ACCCAGTGCTTCAGCTTACGGTAAACTAAACACATCTATTATTTACAACATAGAAAATTA +AAGGCGATGCCCAAGTCCCGGCCTTCCGCAGGGGCGCCGCTCCCGCTGGAGGACGGAAGG +GACCAGGGACCGAGGGTGCGCGGGCGCATCCGGGCGCAGGAGGCGGTGCAGGAGTGCGCA +GGGCAGCAAGAGTAGCAGGCTGGGCGCGCCCCCGGCGCTCGGCCCGTGAGCGTGGGAGGC +CAGGAAGTCTTTCGGGTCTGGAAAAACAGAGAAACCGGAGATGTATGAAATAAAGTGTTC +CGGCTCGGTTTGAGAGAGTGCAATAGAAACGCCTGCTCCCTGCCTTCTGCAGGCCCTTTT +CTCCTGCGCCCTCACTAGGCTCTGTCCCTGGGACGCGCTGCAGGGTGAGGTGTGCGGAGG +CCAGTCCCGGCCGGCTGCACAGAGGGAAGCCTCTAAATGTTTGTAGAACTCGGAGAGCTG +AAGGAGAATTACAGTACTGTGTTTAAAGGGGGAACGAAGAACAGCCCTTCCGTCTTTTTC +ATTTGGCCCGTGACCTTTCAGGGAGTCCCGCTCCAGGTCATTGAGCGATGGGGCACATAG +ACCCGATTTAAAACAGTGCTATAGTATATAAATCAGCTCGCTTCCGTCTGCTGCAACAAA +GGTTCCACTTTTCCCCAAATGAGTGCTCGAAATATTTCTTTGACGAGGGACAGCTGATAG +AGCAGGAGGGTGCAGCCATCCAAAGTCTGGCTGCTGTTCTGAGGAGGCCTAGCCCTGTGA +GTGGCGCCGCGCCCCATGGGACTCACCTGCCGGCCCCCTCGCCCACACCGCTGGTCATAG +GAGGCCATGCCTGAACGCCTAGTCCCTGTGTCCCCTCCCCACCCCTCACAGCAGGCTTCA +TTCATGCTGAACCCTGAGGGTCCCTGAGCCTTGGTCTTTGTGTAGCTGGGACCCCAGCCC +AGAACAGCGGAATGGGAATCTCTGGAGTGGTGGGGCTGGGCAGGGTCTTGGCTAATGCCC +TGGGTGGTGGTTGCCATGTGAAGCGGACAACCTTGCAGGCATGGCAACCAGCACCACTAG +AACATTCTCACAAGCGAGTCCAGGCCTGTCTTGGAGTGGGGTCAGGCTGGTAGAAGAGCA +TGCCCAACTACAGGGATCCTCTCCCCGGGGCCACTGAGCAAAGGCCAGCCACCTCCCTGC +CCCCTGACCAGGCCTGCCCTGCCCTCTGCTATACCTCCAGGAGATGTCCTCTGGACTTGG +GAACCTGGCATGAGCATGAAACTGGAAATTAACAAGGAATGATGCAGATGAACACCTGCA +GCAAGAACCACTCTACACCTGTCCACAGGTAGGCTGTGGTAGCATTGGCCAGGTCAGGCT +CACTCCAGGCTTGGGTGCCACCTCCCCATAGCACCCCCCACCTTGGGAGAGAGGATTGTT +CCAGGGGTGCCAGAATCCCCAAACAGGAATGCTAACGTCTAGGCTCAGACGAGGCTCCAG +AGAAGCTGGTGCTAAGACAGGCGCTCACAGCATGTGAAGCTATTCAGGTGAGAAAGGGGT +CAGCAGTACTGGGTCCGTGCTGGCCTTCCCCTGAGGGAAGGAAACAGGAGGGCTGGACAC +AGGGTGTATGCTTACAGGCTCAGAGTCAGCTGGTGCCAGGACCTGTGAGAGCCATTCCAA +TGGCATCCCAGAGGTCTGTAGACTGGGTCTGGCCCACGGATGTTTGTTTGCTTGTTGGTT +GGTTGGTGGGTTTTTGCCAGAATGGCATGTTTAAAAATGAGTTTGCAGCAGTGAGGTGAG +TCTTCACCGGGCCCCATCTCACTTCCGTGTTACCTGCCCATCTCTGAAGACACCTGAGTT +GGCTACATCACTGTGTCATTCCACAGACTCTGAATTTTAGAGTAGTAAACTGTCTCTTGA +TGCTTGTGAAAACTGATACTTGATGCTTTAGGGCCACAGTAGAAAGAACCACCCCCTTAG +ACTAACAAGGAGATCCTGGCTAGCTCAGGACTGCGCAAAAGTCCCTGCTACATGGAAGCT +TTTCATTCTTGGACCTGCAGAGCATCAGTCAGAGCAAGGAGGGCTCTTGGTGATCCCCAG +ACAGAGAGTCCCTGCAGCAGCCATGCTTTTGTAGCAACATATTCTATCTTCCACATACTC +TTCAAAACCATCATAATCTCCCATTAAAGGATTTAAAACATTTACTTATCTATTACTGTG +CTGTTGGCTTTGGAACATGACCAAATTTTCTATATTCTATACAACACTATAAAAAAATCT +TCATTTGTTTCATATTTATTCTTTCTTTTCTTTTTAAGAGACAGGGTCTCGCTCTGTCAC +CCAGGTTGGAGTGCAGTGGTGCAATCATAGCTCACTGTAACCTCACACTCCTAGGCTCAA +GTGATCCTTTTGCCTCTCAGCCTCCCAAGTAGTTGGGACCAGAGGTGCTCGTCACCACAC +CTGGCTAATTTTTTGTAGAGACAGGTCTTGCTATATTGCCCAGGCCGTTCTCGAACTCCT +GGCCTCAAGCAATCTTCCCACCTAAGCCTCCTAGAGTGCTGGGATTATAGGCATGAGCCA +CTGTACCCAGTCATATTGATAATTACTCTTATAGAAAGAGTACAAGAATGGGCCGGGCAC +GGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCACAAGGTC +AGGAGATCGAGACCATCCTGGCTAACATGGTTAAACCCCGTCTCTACTAAAAATAAAAAA +TAAAAAAAGTTAGCTGGGCGTGGTGGCAGGTGCCTGTAGTCCCAGCTACTCAGGAAGCTG +AGGCAGGAGAATGGCATGAACCCTGGAGGTGGAGCTTGCAGTGAGCCGAGACTGCGCCAC +TGCACCCCAGCCCGGGCGACAGAGTGAGACTCCATCTCAAAAAAAAAAAGACTACAGGAA +TGGCACAAGGTAAAGAATCAGACCCTGTTTACTATCTGTAGGAGTTGATCTCTCTGTGTC +TCAGCTTCCCCAAATGTAAAAGGTGTACCTACTTCATTGGCTTTCATAAGAGTTAAATGG +GATAATGGGTATATAAAGTGCCTGAAATACACTGAGGGCTCCTCTCATGCTGTAACACCC +AGAAGAGCTTGCCCTATTGCAATGGCACCTGCCACGGAGGATGTTCCATCTATCATCACT +GTTGTCAAACTTCAAGTCATCATAACCACCATCAGCACAGCCATCACCAGCATCACTGTA +ACTTGCCTTGAAGCAGGATGAATGGGTAAAAGAACACAAAGGTGTTACTGCCCTCTCTGC +ACCTGACAGATGACTCTGAAGGATGACTGCACTATTTTGTCTAGCTTTGATGAACAGAGG +CCACATTTTCAGAGCCATTATCATTGTCCTTGGCAAATAGCAGTCCCAGCACTGGATGTT +TCCAAGAATGTAAAATCATGACCAGATATCCTGGAAGTCATGACAGCTAGTCCCTTCCCC +ACATTCCTGCTGGGGCGGCCTAATGGCAGGTGGGAACTGACTTCATCTGTGAGGACCACA +CAAGCACCCCCTGTGATTCTGATGCCTGTCCATGAAATCTGTGTCCTCCTGTCTCTAGGG +CATAGAGGAGCAGCTGGTGACTTCAGAGGCAGAGGCCCAGCAAAAGGAGCACACATAGAA +GAGGGGGTGCTTTGGGAAAATGTGGTCTGCACGGCAGGCGGGGAGTGCTCTTCCCTGCTG +GCACCATGCAGAGCTAGCTGAGGAGAAGGGAGCAGGGTGGGAAGTGGGGTTGGGGGCTGT +GCCTCTATTAAGAGTGTCCCTTGACTGCTGGTCCTACCAGGGAGAGAAGAGAGGACAGGA +AGGTGAAGGAGAGCTAAGCATGCATTGTGTACTCCTCCATTGTGGGCCTCCAAGAGCAAG +AGGGAGGCTGGGGGAGCCGGGTTGAAGATCTTTGGGCCACTGGTGCTGCCCTCCATGCAG +CCCTAGCAGGACGGCCTGAGAGCGACAGGTGGGAAGTGGGGTTGCGGGGCTGTGCTCTTT +GACAGATGGAGGTGCCCAGGGCTGACTGAAGTGGAGGGAGGGGGCAGGAGAGTGGATCAG +GGGCCTTGGCTTCATCAGGTGTGCCTCTGGACACTGCACCACCAGCCCACCCGGGCCTGG +GAGAGGGCAGAGCTCGGGCTGGGGCCAAGAGGCTGGGCCTTCACTAGCTGTGCCCTCCAT +TGCTAGCCCCAGCCAGTGATCACCAAAGAGGAGAGGAGAGAAGGCACAGGATTCCAGAGG +GAAATGCGGCTCCTGCCTGTGTCCCTGCTGGTCCCACAGCTGAGCAGATGGTGCTGGCCT +AGTGCAGGGAAACCACCTGAGATTTACGCAGCTAACTTGTCTCCCAGCTTCCTCTCCGAT +TACCAGCTGGTCATCTCCTGAAATTCTGCTCACTGAAACGGACATGTGAGGACATGGTTT +GGAAATCATTTTTAAACTCAGCAATCTGGATATGAGAACTTCAAAACAAAATTTAATTCA +CTATATGATTCCCAATGAATGCGACTTTCTTCAATGATGAATCGATCTACTTTGCTAAAT +GCTATGTGGCCTGGCACTGGAAGATGCACTGTGGATCGGTATGGAGGGTGAGCCGCAATG +AAGGCGCAACCGCGTGCGGGGGTGAGCTGGGGACCGAATCACAGGGCCTCTTACCACAGC +GGTCAGGCGTCTGGAGGGGGAGGCAGTGCAGGGCCCCGAAGATGCACCTGCAGAGGTGGC +AGGCGCGGAGGGTCCAGGCTCCGTGCTCCAGGGCGCCGCATTCACTGAGAAGACGGGGAC +CTGCATCAATGCGGTGGCCCGCGCGCCCCTCGCCCGGCGCCCACTCCTGCGCCGCTCCCC +CACCCGCGCATCCCTCCCCCGGCGCCAACTCCCGCGCCCCCCAACCCGCGCGCCCATCGC +CCGGCGCCCACTCCCGCGCCCCCGCACCCGCGCGCCCCTCTCCTGACGCCTACTCCCGCG +CCCCCAGACCCGCAGGTCCCTCACCCGGCGCCCACCCCCGTCCCCACCCCGCGCACCCCT +GTGCCCACCTGCGCCTCTGGTCATGCTCGCAGTAGCGGCCGGTGAAGTGGGCCGGGCACA +CGCAGAAGCTGCCCAGCACGCAGGTACCGCCGTTCCTGCAGCAGCGCGGCCGCGCGGACG +CACCTGCGGGGTCACCCGGGCGTCAGCCCCGCGCCCCCCGCGGCCCGTGGCCCTCCCCGC +AGGGACGGGGCCGAACGTGGCCGCCCAGGGGAGGCAGTAAAATCAGCCTCCAGCCAAGTG +TGTTTGCTCAGGTAAACCCTGCTTGTTAGGGATGTGAAACAAACTATCAGGGGTGCGGCC +TGCAAGCCGGGCTTGCTCTCTCCCTTCCTTTCACATGTTAACCTCTGGCTGTGGCTTCAC +CTGACAGTGATTAAGCACAGGCCTCCTGGGCGGGACCACCCACGGGTGTGATGACGCCAA +ACCCACCGGCGAGGGGCCTCCTGGGGACGCGGTGTCTGGGAGAAGCCAAGCAAGAGTCCG +CTGTTATCTTGGCTTGGGATATTTATTTTCAACACATTTAAATCACAGTTCTCTAGATCT +GCAGCAAAATAACCATTGAAGAGCAAATACAACGTTTTACCAATGAAAGAGTAAAGCTTT +TTAACACCTCACTAGGAACTGTCTCTAGCTTTATAGCTTTGTAAGTGCATCATCATTTGA +AGATAACAAAATGCTCTCCTTGGATTTTATCCTGAATTTATCCTACATATTCTCAGGTCC +TAAACTCTGAGTCCGCAGACTGAGATGACGCCCCGGATTTTGTGGGCAGCGCTGCAACTT +ACCCTCTCCGAAAGCCCGGGAGTAGGGGAGCGGCTCCTCCGGCCCCCAGCCCTCGGCGCT +CCCAGTCACCTCTCCGAAATGACTGGAGGTCCAGTTGAGCGGTGACTGTCGGTGCTTCTG +AGTGGCAACCTTGGTGACTTCCTCTCTACCGCCGTTATGTTTCTCTCTTTGATAGCCTTT +AAAAAATATTGAAAATGTGAGACTGATAATGATTGAGAAACAAATATTTTATGCAATTAA +AACTGAAATTATGAATAAAATGTTTCTTACTGTTTCCCAAATTGATGATCTGTAATGCCA +AACTGACCGTAAACAGAAGCCTAGAATATTTAAAGAAAATCATTGAAGTTGAAAGTATCT +TTAGAAGTATAAGCAGAAATTCACATACACATAAAAATACAATGCCATCATTATATATAA +ATATAAAATTATGTAAAATAATTATAATGAATAAGGCAATCATTTTGTAACTTGTAGGTT +TTTGTTAATTTTGCTATGGCTTAGAAAGATTTTGAGACAGAAAGTGTACACCTCCTTTAA +TGAACAGTGGCTCATGCACTACGATTTATTCTGAGGCTCTTAAGACCCCTTCTTGGCTCT +AAGATCTCATGACCTCTGTCCTTACCTGACATGGTGCCTCCAGGTCATTTTTGGTTTGAG +AGTCTCCTTCTCTTAGTGCATTGAAGATCAGTTTCCAGGAAAGCTGAATCTGAAGAGTCT +CTGCATTTGAAGGTGTCTTTACTCCAGGCCCCGGGGACACAGAATGGACGAGTACAGAAG +ATGTGGATGTTCCCTTCCCTTCCTCAGCCAGGAGAAACGGGATGGTATCTTGAGTTCTGC +AGTCCCTGAGCTAGGGTCGGGGGCGCAGTGACAGGCAGGCCCCTCCGAGCCTGAAGAACA +ATTCCCTCTATGAGACCTGGCTGGGGGGTGGGGACTTGGGCTGACTCCCCCCAGCAGTGT +TTACAGTGCTGACAGCAGCAGGAGTGACCAATCCCTGATGTAAAACATTGGTATCCAAGG +TGACGAGCCATTTTCTCACTGGAAAAAAAAAAAAATCCCAACATGAACAAAAAGCTTTCA +CAGGAAGGTGGAGATTTTCTACTGCATACTACTTGTACTGGGAGATGGTGAGGGGCCCTG +TGGGAGAGGGAGCTCCTGTGTGGGGCACAGGCAACAGCCTGGGGTCCCAGCCGCCCTGAG +TGGGGAACACGAGGATAAGGGAAAGCACGTCACCCCTGCTTTAAGAGAGGCCCCCTTACT +CTGCAGATGGAGCCTCTTCTCTGTGCCCAGCAAGTGGGGGAGGGGCACTGCCCCCATTTA +CTATAGGACCCTCTAGGGTAGGCACTATTATTATCCCCCACGTGAAAAGAGAGAAAGGGA +GGCTTAGAGAGTTTAAGTCCTTTGGCAGTTAGATCTCTGAGAAACATGAATGGCACCGAT +ATGATGAGGAGATACCCACAGGTATTTAAGGCGAGCCCTTAGGAGCACAGGCGCAACTGC +GGCGCCCGCCCATCGCCAGAGGAGCCTGCTGAATCCAAACGGAACCCAAGGAAGGAGTGA +TTTGGGGTATTTTCTAGCACCTTCTAAATCTGCTTTTCTAACTACCAGAATAGGGACTGA +GCCCCTTTTATCAGCACTAAAGAAGACACTTTGCAACCCAGTGTTGCAAATTGTTCTGTG +ATTTTAATATAAAAACTCATTTCTAAAAATAGTATTCTTCCTCCAAGGCTGAAATAGTCT +TTAATGGTGATTTAATTTCTTACATCTTTACGTGTGAGGTTTATTAAAGAGGCTGTTAAA +TAAAGCAATAGGAAATTGGAGGACCGGGCTACCCTGGGGGGCCTCCCAGCAATGGGTCCT +GTGGGCTGCAAGGACCAGGTCAAGGCCAGCAGACCCGAGTGGGCAGAGCACATGGAGGAA +GGTGCCCCAGGACTTCATCAAGCATCTGTAAGTGCTGTAAGTCATGAGACAAACACCTTG +TTAATTTTGTTTTTTAGAGGCAAAGTGTCACTCTGTCAGCCAGGCTGGAGTGCAGTGGGC +GATCATAGCTCACCACATCCTCGAACTTCTGGGCTTAAGTGATCCTCCTACCCCAGCCTC +TGAGTGGCTAGGACTACAGGCTCGCGCCCCTATGCCCAGCTAATTTTTTTTTTATTTTGT +AGAGATAGGGATCTTGCTATGTTTCCCAGGCTGTCCTCTAACTTCTGGCCTTAAGCGATC +CTCCTGCCTCTGCCTCCCAAAGTGCTGGGATTACAGGCACAAACCCACCGTGTCCGGTGA +AAAACACACTTTATTTTTTTGAGACGGAGTCTAGCTCTGTGGCCCAGGCTGGAGTGCAGT +GGTGCAATCTCGGCTCACTGCAACCTCCGCCTCCCGGATTCACACCATTCTCCTGCCTCA +GCCTCCCGAGTAGCTGGGACTACAGGCGCCCACCACCATGCCTGGCTAATTTTTTTGTAT +TTTTAGTAGAGACGGGGTTTCACCGTGTTAGCCAGGATGGTCTTGATCTCCTGACCTCGT +GATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGATTACTGGCATGAGCCACTGCGCCCGG +CCGAAAAATACACTTTTAAACTCAGTGGTTACTACTGGTTAATTGCATCATTTAATTATA +TGTAAAAGTAAGTACATTTTATGACTTTAAGTTAATAGAATCTTTTGCATGAGCTAAATT +GTTTGTATACTAAAATGAAGATGTATATAAGCTAGATGATTGTATTTTTTGTATAAATAA +GTTTTAAAACTGTTCATTTTACAGAGTGATATTGAAACCAAACCTCTCTGACAATAGTAA +AGCTTTAAAAGCAGCCTCTAAAATCTAACTTTCCCCCACAGTTTATTTATTAAATTTTAC +CTTAGTGGACATTTCCAAACATACACAAAGGCTGGTAGAATGAGCCAATAAACCCAGGTT +CAACAATTAAGTATCAGCCTTTGCCAATTTTATTTCATTTGTTTCCTCCCTAACTTTTTT +CTTGAAGTTTTAAAAACTCCAATACATCATGACATTTCATATGTAGAGTAAGATTTTGAA +AATCTGCACTTTATAAGTGGAAACTAGGAAAAAAAAACGTCTTGCCAGCAAGACAGAACT +GAGAAGCAAGAAATAGCAGTGCGACGTGCTTAAGCTGTTTGCTTAAGCTCTGCAGTCGGC +TCTGAGGAGAAGGGCTGATGGCCAGGCAATAAAGCTTCTGATGACCTATTTTCGCCAAAC +TGTCACTTGCTCTCCACACAAAGCGGTCCCAGGTTGCCGCATTGACTCAAAGGTCTTATA +AAAGGGTTTTGCATTTTACTGCAGAGGCGCTGCCCTGTTTGCCATCAAGAGGCCCAGCTG +GAAGGGTTGCCTGGGAGCACTGGAAGGGAGGAGGTGGGTCCCTGCCAGGAAAGGGCCTCT +TTTCTCCTTATCAGCTGTGAACAGATAAGGCTGGAGCCGACCCACCCGTCAAAACTCCAG +AGGAATGTTCTGGCTCATAGAGTGTGTGGGACCGGAACCCCGTGCCAGGACTGGAGGCCC +CGTCAATGTTCTCAGCACATCCATCATGAAGGAAAATGCTCTGCAACAAATTAGGTGTGA +AGGGATTCTCTGAGGAGGACTTTAAAGCGATGGCTGTCCAAGTGGGCTGTCCCCGGCACC +TGTGGGGAGTGAGAAACACCTTTCTCCACCCTCCTGGCTTCTTGGGTGGGGCTGCTTTCT +GTCTCACACATTTTCCTCATCTAGAGATTGCATGTGAGGGGAATCATGCAGTATTTTCCT +TTTGCGTCCGGCCTATGCCACGTAGCATAATGTCTTTTAGGTCCATCTGTGCTGTAGCGC +ACATCAGAGCTTCACTCCTTTTGTGACTGAATAATATTTCACGAACACCCATCCACCGCA +TTTTGTCTGTTTGTCATCTGTTGATGCATATTTGGTTTTTCTACCTTTTGGAAGAGCACA +GTGAGATGGTAGCTATCCACATACCGATAAAGGAGGACTCCGGGGAACCCAAATCTGGTG +ACGCCTTGATTTCAGACTTCCAGCCTCCAGAACTGTGAGAGGAAATGTCTCTGTGGGTTA +GGCCTCCCAGCCTGCGGCATTTGGTTAGGGCAGCCCGAGCTGACAGATACACCAGGTTTT +TGTTTTTCCATGATTGTTATGTGTTATGTAATTTAGTAATTTTTGCTGCATTTAGAGTGC +ATTTGTACTGGTGGCTTTCTAGAATTACAGAGAGCATCTTTTACGCAGGGTCCTTACACA +GACATTTTCATTCACGGGGGGACCCTGTAGACAGTCCTCAAAGGCAATGGCGTGAGTCCG +GCTGATGTTCCTTTAAATCAAATGGTGAATCTGAACTGTTGTGTGGCCCTTCAGAAAGCC +CAGCACAGTCTGAGATGCTCATTTTGGCCTCCAGGCAGTGACCATGCTGGACAGAACATC +ACCTCCGCAAGCAGTCGGGCTCCAGGACCAAACTTCATTTTCCAGATGATGCTATGCTCA +GATAAGGACCTTTATTAATTATACCAATTTTTATTTAGCATAATTAGCAATGCTTTTAGT +CTCTAAATCATACAAAATACCTGCCAGAAAAGATGTACTTGAACAAAACATGGTGCAGAA +AGCAAAAATTCCCTGAAGTAAAAATGGAAGCTGAAAATAATGCACTTTAAGAGAGAGATG +GGGGCCGGGCACGGTGGCTCACACCAGTAATCCCAACACTTTGGGAGGCTGAGGCGGGTG +GATCACGAAGTCAGGAGATGGAGACCATTTTGGCTAACATGGTGAAACCCTGTCTCTACT +AAAAAAAAAGTACAAAAATTAGCCAGGCATGGTGGTGGGTGCCTGTAGTCCCAGCTACTT +GGGAGGCTGAGGCAGGAGAATTACTTGAACCCGGGAGGCGGGGTTGCAGTGAGCTGAGAT +TGCGCCACTGCACTCCAGCCTGGGTAACAGAGTGAGACTCTGTCTCAAGAAACAAACAAA +CAAACAAAAGCAATGAATCACATGAATCTGGGAGGCGGAGGTTGCAGTGAGCTGAGATTG +TGCCACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCTGTCTCAAAAAAAAAAGAGAGA +GATTGGATCTTTCAGGGAGGCAGGCAGTCACGTGCTGAAGGACAGGCCTGGCGTGCAGGG +GCCCTCCTGGGAGTGCAGGCTCAAGGAGGGGAATCAGCCTGTCCTCACCTCTGGGCTCAG +AGCTGGGTCAGGCATCAAGACCATTTTATCTTCCCATGGCAGTGCCAGTGCTGCTGCAGA +GCCCTGAGGACTGGGACACAGAGAAGCACCCTCCCTCAGGAGCTTCCCTTTCAACTGGGT +GGCATTCAGGAAACAGCCAGGACCCTGCAGTGCCTGAGTGCTTTGACCAGAAGCATTTGG +GCTTGGAAGGGTCTTTCCTGCCCTTTCCTCCACCCAGCCCCTACACCCTCCACTTCCAGC +CTGGTGGCCTCTCCCCTGCACCAGCACATATGGCCTCCCCTAGGGTAGGAGGGAGAGGGT +GAGAGAGTGGGAGTCCCCGCCTTTTGTGACCTGCTGTCTGTTAGAGGTGAGTCCAGGGCC +ACTCTAAGGGTGGAGGTTGCACATTGGCATGAAGGCCAGGAAGGACTCTTTGGGGCCCTG +TCAGAGACTTAGCCACACCCATAGATTAAACTCGTTTTGCCATATGTGCTATTTTGTTTT +TAGTGAAGAATTTTAAAGTAAATTACATACATCATGACATCTTCTTCCTGAATATTTTCA +TATTTAGGATGTCTGAAAAATGAGGCTGTGTTCCCACAAATCTGTATTAGATGGAACTGA +ATTAACAACAGTGCTCCAGAACCCTGGACCATCCAGCCTAAATTCCATTTGGGGATGAGA +TGCGTTTTTGCCTCAGAATCCCAAGCGGTAAGCACCATCAACCCCTCAGTTCATATCTCA +ATCTTCCTTTCTATTGGATTTTGTTCTCAATTTCTAACTTCTTACTAGATTATATCTCAT +TTAATTCTACTTGTTAAAGCTGATTTATTCTAAAATTTGATGAAGCAAAAGTCTGGATTT +TGCCTCAATATAATAAGAACCACTTAATTATAAACATTGATGCAGGGCTTTGCTAACCTG +CATTTTGTTCTTGGTATAGCCACATGTCTTTGGGGCCCCTTAAAAGGTAATTTTCCATAT +GAAGCACTTCTATATGGATCTACAATAAGAATTAACTTTAAAGTTAGTATTTTATTGGGG +GATGATGCCCTGCTGATGAAACTCTGTGAATCAACCTCATTTGTTCATAGATAGCACTTG +ATATATAAAGAGCTTAACACTAGTCCCTCCAGACTGCACACCAATGTTCTGGATTTCACA +GTGAGACTCAGAGGGTGCATGGGGGTTGTCACCTGCAGCTCTGGGATCCAGCCAGTTGGG +CTGGGGCTTCCTCCCTGTTTAAATTTGAAAAGGAAACCAGTTTGGAGTAAGAAGTAGATT +TTGCCTTGAAACAGCCAACCTACCAGTTTTAATGATTGACAACTGAGTTCACATAGAGAA +CTGAGCTCAATTTAAGAGTTAATAAAATGGGCCAGGAGTGGTGGCTCACGCCTGTAATCC +CAGCACTTTGGGAGCCCGAGGCAGGCGGATCATGAGGTCAGGAGATCGAGACCATCTTGG +CTAACATGGTGAAACCCCGTCTCTACTAACAATACAAAAAATTAGCTGGGCGCGGTGGCA +GGTGCTTGTAGTCCCAGCTGCTTGGGAGGCTGAGGCGGGAGAATGGCGTGAACCCAGGAG +GCAGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGACAACAGAGAGAG +ACTCTGTCTCAAAAAAAAACAAAAAAAAAAAAACAAAAACCAAAAAACTTAATAACATGA +ACAGCCTTTTTTGAGGGATAATTTCAAGGTTTTAAGATAAAAAGTAATAATCGGGCCAAG +TGCAGTGGCTCACACCTGTAATCCCAGCACTATGGGAGGCCAAGGCAGGCAGATCACTTG +AGGTCAGGAGTTGGAGACCAGCCTGGCCAACATGGTGAAACCCTGTTTCTACTAAAAATA +CAAAATTAGCCGGGCGTGGTGGCACATGCCTGTAATCCCTCCTACTCAGGAGGCTGAAGC +AGGAGAATCGCTGGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGATCATGCCATTGCA +CTCCAGCCGGGGCGACAAGAGCAAAACTCTGTCAAAACAAACAAACAAACAAAAAACAAA +CAACAACAACAAAACAAACCAGTAATAACCGTACTTTTGCATAAAATGCAGAGACAGATT +TTCTTTTCTTCAGGCCCAGGTCCAATAAGGGACCCAGAGGGGCTGTGTGGAGTAGAGCTC +TGCAGTTCTGAATGCCCACCCTGCTGGGAATAAGGTCATGCTTGATCCAAATGACTGCAG +ACCCAGCAAAGGACTCAGTTCTGATAAGACCTCAGCAATTTCCTGGGTGAATTTCAAGCC +ACTTCATTAATAAGTCAAAGCTTCACAATCGCAAAGATAAGGAATCGACCAAGGTGTCCA +TCATTGGATAAACGGATAAAGATAATGTGCCATATATACACAATGGTAAACTATTCAGCC +ATAAAAAGAACGAAATCATGTCTTTTGCAGCAACCTGGATGGAAATGGAAGCCATTATCT +TAAGTTCTTAAGTGAAATAAGCCAGGCATAGAAAGACAAACATTGCATGTGCTCACTCAT +AATTGGGAGCTAAAAAATATGTATGCTGAAAGTAAGATGAGAGACAATGGAGACAAAGGG +TGAAGGGGTGGCAGGGAGGAGGAGGAGGAGAAATTGGTTAATGGGTGCAATGTACATTAT +TCTGGTGACGGATATCCTAAAAGCCCCAACTTGGCCACTATGCAATCCATGCATGTAACA +AAATTGCACATGTACCCCATGAATGTGTACAAATGAAAAAATAAGCTTCCTATTTTCACA +AAGACAACAATCCCCTCATGCCACATAGTTTATTAATTGTTGACTTTGTGTTTGAGCTGC +AAATAACTATTGTCACACAACTGTTTATGAACAATAGCTGCTTAGGGATTTCCTGAAATC +TCTCCAGCAGGTGCACCTCTGTTGACATTGCTGCACGCTCCCACCCTCCTAATTCAGTCA +CCAGCACTCCTGAGAGCCTGCTCAGGCCCTGGCTCACTAAGTTAAGAAGAAAGCACCTTC +CCTCTGCCAGCTCCAGTGTGAGGTGTGATAGAGGTCACATGCTGGGAGACGGAAGGAGGG +GTACCTGATTTGAACAGGAAGGAGGGGTGTGTGATTTACGGCCATTCTGGCTGGAAAGAA +GAAGGAAGAGTGGGAGCCTTTCAGGAAGACTGGGTGGGACTCCTGAATGTGGCAGGGGGT +AGGAGGCCTTGGGGCTTGTGCCCTGTGGCTAAGTTGCAGGGTAGGCTGGGGTCCCAGCCT +GGGTGGCATGAGAAGAGGGGTGGCTATGATAGCAGTGTTTGTCCTGTCTGAAACTCCTGC +TGAAACTGAATCCCCAGTGTGGCAGTGTGAGAAGTTGGGTTCTTTAAGTGGCGATTGGGT +CCTGAGGGCTTCATTCTCACGAATGGACTCATCCACCATGAGTTAATGGATTAGTGCATT +ATGATATCAATGGGACTGAAGGCTTTAAAGGAAGAAGAAGAAGAGGTAGCATCAGCATGC +TCCACCTCCTTGCCATGGACTGTCCTACACTGCCTTGGGACTCTGCAGAGTCCTCACCAG +CAAGAAGTTCCTCACCAGATGTGCTCCCTCGACCTTGGGAGAGAAATGAAAATTTCTTTT +CTTTATCAATTACCCAGTTTCACGTTTTCTGCTATAAACAGCATATAACAGACTAAGACA +GGTGCTGCTTGGGTTCAATGCTAAACACGTGATTTCTCTAACTTAGCAAATCGCCAAAAG +CTAGGGTTCAGTCCAGTGTCTCTGATGCAGATTCACTGTTGGTGGCATTGGGCAAGGGCT +CAGGGTAGCTGGGCAGCCTCAGCAGGGTGCAGTGCCAGGTCTCTTTCCAGAGGTGGCAGG +TGTGAATGCACAGAGGTAGGGACCTCACTGGTTTTGGCAGACCGTTTAACCTCTCAGAGC +TGGTTTCTTCAAGCACTACTTCAGCTAGTCTTAGACAAGAGGCTGTGGGGCTAAGCTTAG +TGTGGAAGGGAGGGGAGATGAACACATCAATCATTAGTGTCCAAGAAGAGATGCGGAGGA +AAGACTGTGGAAACAGAAAGCAGGGAGGACATTGTCTGCTTGGAAGAGCCAGCAAAACTG +GGTGGAGGACACAGAGTGGTGGAGAGAGGGGCTGAGGTCAGCAGGTATAAGGTCAGCATG +AGCAGTGTTAAAGAAAAATTACTCAACGTTACTAGCATGGCAGGTGGACTTTATTCCAGA +CCATTGAGGTAGGTATAGGTGGATCGAGATAGATCAGGTTCAACTCAAATACAGCATGGG +CGAGGGGGAGCTGATAACCAAGGAGCAGGGTTGGGGGCAGTGGGTGGAAAATGACTAAAA +GGTGATTCAGAGTTAAAGGACTCTGGCTAATCTGGCCCAATAGGATTCTTGCTGACGACA +GGCCATGGTGGTCAGACTTCACCTTGGGGACGATGGAGGATGAGGAGCCTCATCAGATAT +GGAGGGCGATGAGGTATTGAGGGTAGGGGCTTCTTGCTAAACTGACTTACCAAGGTTCTT +GGTGAAACTGAATTTTACCAGGACGTGCACAGATGGACGTAGGAGGAGGTTCGGGAGCTG +GACTACCCTTTGGCCAAGCAGAGAATCTTTGTTAGCAGGAAGCTGAGGGCCCGTTGCAGG +GCAGCAGAGTGATAGGTAAATAGGGGGAGCATATTCATTCATTCTCTCAACAATTAAATA +TTAAATACGTCTGTGCTACCAGGGATACAAAGGTAAACAAAGCAGGAAAAAAAGTTCTGC +CTTTATGGCCAGTACATTCTGGTAGGAGAAGAAAGAAAATAAACAACATAAGTAAATAGT +ATATTAGAGGAAGAAAAATGCTATGGAGAGTAAAAAAATGAGGGCATAAAGGGACATTTG +GAGTGTTTCACAGAGAAAGTGACACTCTGGCAAGACCTCCTCTGCTGAGGGAGCCAGGAT +GCCTGCAAAGAGTGCGCTGGGCCCAGGAAGGAGCAGGGAGTGCTGGCCCCTGCTGTAGGC +CTCGCAGGCCTGCTCTCTGGTGGAGCCAGGGAGGCGCGATCTGACTTTTGAGCCTGTGTG +GAGCATGAACCGCAGGGCGCAGGGTGGAGGCGGCTGCGAGGGGACCCGGAGCCTAGGTCA +GGGCGGTGGAGGCTGGAAGCTTGCAAGAGTCGCAGGGATTAATGCTGGGTTGGATCTGGG +TGTGAGAGAAAGAGCACACGGGGAGGACTCGGGGTTTTCCGTCCTGAGGAGCCATGAACT +GAAACGGAGACTGTTTTGTTTGGAGCTGACTGTGAAGGAAGGGCGAGAGGTGGTGTCTAT +GAGCATCAAGTGGAAAAACGGAGGAGACGCTCCTAGCGAGGGAGAGGCCTGGGCGGGGCG +CGATGAGCAGGCAGGTGGCATTTGGAGCTGCGGGAATAAATGCAACCCCAGGGAGCAGGT +GTGGACAGAAAACAGAAGAGCGGCCTTGGGCAGCCAGGAGTCCCGAAATGTGCGCGGAGG +AGGGGCCTTGGAGCGGGAGAGGAGTGGTCCAGGAAGCCACGTGCAGGGGGTGCTGAGGAG +CAGCAGGGAGGGAGGCCCCCTGTCTTACAGCTGGGGGCTGCACTTCTCCATAGGCCATTG +CTGCCGGCAGGAGCAGGTTCAGGGTAGTGCAGAGGTGGAGAGCTGAACTGGAGTAGCTCA +CATAGCGAGAGGGAGGAGGGGTGTTCAAGGTAAGACGAGACCACGCTTTGGAGCCATTTT +GCTCTACAGGGGAACAGAGAAATGGGATGCAGCTGGAGAGCGAAGCATGAAGAGGATGTT +TTAAAGATGGGCCAGGAAAAAGCACGTTTGTACAACTGACTAGAGAGGAAGAAGTGGGTG +ATTCAGGAGTGAGGTGGAAGGCAGAGGTGTCCTTAAGAGGTCTAGTCCGCCAGGGCAGGG +GTGTGGCCATCCTTCTGCAGTAATGCAGGGAAGACACGTTGCCGTGAGGGGGCAGCCTGC +AGATCTATTTTATTTATTTATTTATTTATTTATTTATTTATTTATTTATTTATGAGATGG +ACTTTTGCTCTTGTTGCCCAGGCTGGAGTGCAATGGCAGTATCTTGGCTCACCGCAACCT +CCGCCTCTCGGGATCAAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGG +CGTGCACCACCACGCCTGGCT diff --git a/paraphase/data/cfc1/ref.fa.fai b/paraphase/data/cfc1/ref.fa.fai new file mode 100644 index 0000000..3701426 --- /dev/null +++ b/paraphase/data/cfc1/ref.fa.fai @@ -0,0 +1 @@ +chr2_130587000_130610000 23001 26 60 61 diff --git a/paraphase/data/f8/f8_config.yaml b/paraphase/data/f8/f8_config.yaml new file mode 100644 index 0000000..e94b4e2 --- /dev/null +++ b/paraphase/data/f8/f8_config.yaml @@ -0,0 +1,22 @@ +gene: f8 + +# data files +data: + reference: ref.fa + homopolymer: homopolymer_sites.txt + +# coordinates +coordinates: + hg38: + nchr: "chrX" + nchr_old: "chrX_155376007_155387000" + nchr_length: 156040895 + extract_region1: "chrX:155376007-155386059" + extract_region2: "chrX:154880316-154890327 chrX:155454650-155464706" + + clip_3p_positions: [155386059, 155386727] + + left_boundary: 155376507 + right_boundary: 155386059 + + diff --git a/paraphase/data/f8/homopolymer_sites.txt b/paraphase/data/f8/homopolymer_sites.txt new file mode 100644 index 0000000..05a1876 --- /dev/null +++ b/paraphase/data/f8/homopolymer_sites.txt @@ -0,0 +1,476 @@ +chrX 155376566 T +chrX 155376567 C,T,G,A +chrX 155376568 0 +chrX 155376569 0 +chrX 155376570 0 +chrX 155376571 0 +chrX 155376572 0 +chrX 155376573 1,T +chrX 155376669 T +chrX 155376670 C,T,G,A +chrX 155376671 0 +chrX 155376672 0 +chrX 155376673 0 +chrX 155376674 0 +chrX 155376675 1,T +chrX 155377328 G +chrX 155377329 C,T,G,A +chrX 155377330 0 +chrX 155377331 0 +chrX 155377332 0 +chrX 155377333 0 +chrX 155377334 1,G +chrX 155377335 C,T,G,A +chrX 155377336 0 +chrX 155377337 0 +chrX 155377338 0 +chrX 155377339 0 +chrX 155377340 1,G +chrX 155377341 C,T,G,A +chrX 155377342 0 +chrX 155377343 0 +chrX 155377344 0 +chrX 155377345 0 +chrX 155377346 1,G +chrX 155377369 A +chrX 155377370 C,T,G,A +chrX 155377371 0 +chrX 155377372 0 +chrX 155377373 0 +chrX 155377374 0 +chrX 155377375 1,A +chrX 155377470 A +chrX 155377471 C,T,G,A +chrX 155377472 0 +chrX 155377473 0 +chrX 155377474 0 +chrX 155377475 0 +chrX 155377476 1,A +chrX 155377519 C +chrX 155377520 C,T,G,A +chrX 155377521 0 +chrX 155377522 0 +chrX 155377523 0 +chrX 155377524 0 +chrX 155377525 C,1 +chrX 155377591 A +chrX 155377592 C,T,G,A +chrX 155377593 0 +chrX 155377594 0 +chrX 155377595 0 +chrX 155377596 0 +chrX 155377597 1,A +chrX 155377703 A +chrX 155377704 C,T,G,A +chrX 155377705 0 +chrX 155377706 0 +chrX 155377707 0 +chrX 155377708 0 +chrX 155377709 0 +chrX 155377710 0 +chrX 155377711 0 +chrX 155377712 0 +chrX 155377713 0 +chrX 155377714 0 +chrX 155377715 0 +chrX 155377716 1,A +chrX 155377812 G +chrX 155377813 C,T,G,A +chrX 155377814 0 +chrX 155377815 0 +chrX 155377816 0 +chrX 155377817 0 +chrX 155377818 1,G +chrX 155377824 C +chrX 155377825 C,T,G,A +chrX 155377826 0 +chrX 155377827 0 +chrX 155377828 0 +chrX 155377829 0 +chrX 155377830 C,1 +chrX 155377891 C +chrX 155377892 C,T,G,A +chrX 155377893 0 +chrX 155377894 0 +chrX 155377895 0 +chrX 155377896 0 +chrX 155377897 C,1 +chrX 155378491 A +chrX 155378492 C,T,G,A +chrX 155378493 0 +chrX 155378494 0 +chrX 155378495 0 +chrX 155378496 0 +chrX 155378497 1,A +chrX 155378566 A +chrX 155378567 C,T,G,A +chrX 155378568 0 +chrX 155378569 0 +chrX 155378570 0 +chrX 155378571 0 +chrX 155378572 0 +chrX 155378573 1,A +chrX 155379171 A +chrX 155379172 C,T,G,A +chrX 155379173 0 +chrX 155379174 0 +chrX 155379175 0 +chrX 155379176 0 +chrX 155379177 0 +chrX 155379178 1,A +chrX 155379453 G +chrX 155379454 C,T,G,A +chrX 155379455 0 +chrX 155379456 0 +chrX 155379457 0 +chrX 155379458 0 +chrX 155379459 1,G +chrX 155379559 G +chrX 155379560 C,T,G,A +chrX 155379561 0 +chrX 155379562 0 +chrX 155379563 0 +chrX 155379564 0 +chrX 155379565 1,G +chrX 155379873 A +chrX 155379874 C,T,G,A +chrX 155379875 0 +chrX 155379876 0 +chrX 155379877 0 +chrX 155379878 0 +chrX 155379879 1,A +chrX 155379923 G +chrX 155379924 C,T,G,A +chrX 155379925 0 +chrX 155379926 0 +chrX 155379927 0 +chrX 155379928 0 +chrX 155379929 1,G +chrX 155380214 G +chrX 155380215 C,T,G,A +chrX 155380216 0 +chrX 155380217 0 +chrX 155380218 0 +chrX 155380219 0 +chrX 155380220 1,G +chrX 155380221 G +chrX 155380222 C,T,G,A +chrX 155380223 0 +chrX 155380224 0 +chrX 155380225 0 +chrX 155380226 0 +chrX 155380227 1,G +chrX 155380243 G +chrX 155380244 C,T,G,A +chrX 155380245 0 +chrX 155380246 0 +chrX 155380247 0 +chrX 155380248 0 +chrX 155380249 1,G +chrX 155380487 C +chrX 155380488 C,T,G,A +chrX 155380489 0 +chrX 155380490 0 +chrX 155380491 0 +chrX 155380492 0 +chrX 155380493 C,1 +chrX 155380509 C +chrX 155380510 C,T,G,A +chrX 155380511 0 +chrX 155380512 0 +chrX 155380513 0 +chrX 155380514 0 +chrX 155380515 C,1 +chrX 155380832 G +chrX 155380833 C,T,G,A +chrX 155380834 0 +chrX 155380835 0 +chrX 155380836 0 +chrX 155380837 0 +chrX 155380838 1,G +chrX 155381195 C +chrX 155381196 C,T,G,A +chrX 155381197 0 +chrX 155381198 0 +chrX 155381199 0 +chrX 155381200 0 +chrX 155381201 C,1 +chrX 155381208 C +chrX 155381209 C,T,G,A +chrX 155381210 0 +chrX 155381211 0 +chrX 155381212 0 +chrX 155381213 0 +chrX 155381214 C,1 +chrX 155381228 C +chrX 155381229 C,T,G,A +chrX 155381230 0 +chrX 155381231 0 +chrX 155381232 0 +chrX 155381233 0 +chrX 155381234 0 +chrX 155381235 C,1 +chrX 155381330 G +chrX 155381331 C,T,G,A +chrX 155381332 0 +chrX 155381333 0 +chrX 155381334 0 +chrX 155381335 0 +chrX 155381336 1,G +chrX 155381337 C,T,G,A +chrX 155381338 0 +chrX 155381339 0 +chrX 155381340 0 +chrX 155381341 0 +chrX 155381342 0 +chrX 155381343 0 +chrX 155381344 1,G +chrX 155381357 G +chrX 155381358 C,T,G,A +chrX 155381359 0 +chrX 155381360 0 +chrX 155381361 0 +chrX 155381362 0 +chrX 155381363 0 +chrX 155381364 1,G +chrX 155381401 G +chrX 155381402 C,T,G,A +chrX 155381403 0 +chrX 155381404 0 +chrX 155381405 0 +chrX 155381406 0 +chrX 155381407 1,G +chrX 155381496 C +chrX 155381497 C,T,G,A +chrX 155381498 0 +chrX 155381499 0 +chrX 155381500 0 +chrX 155381501 0 +chrX 155381502 C,1 +chrX 155381701 G +chrX 155381702 C,T,G,A +chrX 155381703 0 +chrX 155381704 0 +chrX 155381705 0 +chrX 155381706 0 +chrX 155381707 0 +chrX 155381708 0 +chrX 155381709 0 +chrX 155381710 0 +chrX 155381711 0 +chrX 155381712 0 +chrX 155381713 0 +chrX 155381714 0 +chrX 155381715 0 +chrX 155381716 0 +chrX 155381717 0 +chrX 155381718 0 +chrX 155381719 1,G +chrX 155382033 G +chrX 155382034 C,T,G,A +chrX 155382035 0 +chrX 155382036 0 +chrX 155382037 0 +chrX 155382038 0 +chrX 155382039 1,G +chrX 155382786 C +chrX 155382787 C,T,G,A +chrX 155382788 0 +chrX 155382789 0 +chrX 155382790 0 +chrX 155382791 0 +chrX 155382792 C,1 +chrX 155383207 C +chrX 155383208 C,T,G,A +chrX 155383209 0 +chrX 155383210 0 +chrX 155383211 0 +chrX 155383212 0 +chrX 155383213 C,1 +chrX 155383255 T +chrX 155383256 C,T,G,A +chrX 155383257 0 +chrX 155383258 0 +chrX 155383259 0 +chrX 155383260 0 +chrX 155383261 1,T +chrX 155383274 T +chrX 155383275 C,T,G,A +chrX 155383276 0 +chrX 155383277 0 +chrX 155383278 0 +chrX 155383279 0 +chrX 155383280 1,T +chrX 155383354 A +chrX 155383355 C,T,G,A +chrX 155383356 0 +chrX 155383357 0 +chrX 155383358 0 +chrX 155383359 0 +chrX 155383360 1,A +chrX 155383488 A +chrX 155383489 C,T,G,A +chrX 155383490 0 +chrX 155383491 0 +chrX 155383492 0 +chrX 155383493 0 +chrX 155383494 1,A +chrX 155384120 T +chrX 155384121 C,T,G,A +chrX 155384122 0 +chrX 155384123 0 +chrX 155384124 0 +chrX 155384125 0 +chrX 155384126 0 +chrX 155384127 0 +chrX 155384128 0 +chrX 155384129 0 +chrX 155384130 0 +chrX 155384131 0 +chrX 155384132 0 +chrX 155384133 0 +chrX 155384134 0 +chrX 155384135 0 +chrX 155384136 C +chrX 155384137 C,T,G,A +chrX 155384138 0 +chrX 155384139 0 +chrX 155384140 0 +chrX 155384141 0 +chrX 155384142 0 +chrX 155384143 0 +chrX 155384144 0 +chrX 155384145 C,1 +chrX 155384293 T +chrX 155384294 C,T,G,A +chrX 155384295 0 +chrX 155384296 0 +chrX 155384297 0 +chrX 155384298 0 +chrX 155384299 1,T +chrX 155384301 T +chrX 155384302 C,T,G,A +chrX 155384303 0 +chrX 155384304 0 +chrX 155384305 0 +chrX 155384306 0 +chrX 155384307 1,T +chrX 155384529 C +chrX 155384530 C,T,G,A +chrX 155384531 0 +chrX 155384532 0 +chrX 155384533 0 +chrX 155384534 0 +chrX 155384535 C,1 +chrX 155384601 C +chrX 155384602 C,T,G,A +chrX 155384603 0 +chrX 155384604 0 +chrX 155384605 0 +chrX 155384606 0 +chrX 155384607 0 +chrX 155384608 C,1 +chrX 155384636 T +chrX 155384637 C,T,G,A +chrX 155384638 0 +chrX 155384639 0 +chrX 155384640 0 +chrX 155384641 0 +chrX 155384642 1,T +chrX 155384789 T +chrX 155384790 C,T,G,A +chrX 155384791 0 +chrX 155384792 0 +chrX 155384793 0 +chrX 155384794 0 +chrX 155384795 1,T +chrX 155384804 T +chrX 155384805 C,T,G,A +chrX 155384806 0 +chrX 155384807 0 +chrX 155384808 0 +chrX 155384809 0 +chrX 155384810 1,T +chrX 155385481 T +chrX 155385482 C,T,G,A +chrX 155385483 0 +chrX 155385484 0 +chrX 155385485 0 +chrX 155385486 0 +chrX 155385487 1,T +chrX 155385942 A +chrX 155385943 C,T,G,A +chrX 155385944 0 +chrX 155385945 0 +chrX 155385946 0 +chrX 155385947 0 +chrX 155385948 1,A +chrX 155386062 T +chrX 155386063 C,T,G,A +chrX 155386064 0 +chrX 155386065 0 +chrX 155386066 0 +chrX 155386067 0 +chrX 155386068 0 +chrX 155386069 0 +chrX 155386070 0 +chrX 155386071 0 +chrX 155386072 1,T +chrX 155386099 T +chrX 155386100 C,T,G,A +chrX 155386101 0 +chrX 155386102 0 +chrX 155386103 0 +chrX 155386104 0 +chrX 155386105 1,T +chrX 155386219 C +chrX 155386220 C,T,G,A +chrX 155386221 0 +chrX 155386222 0 +chrX 155386223 0 +chrX 155386224 0 +chrX 155386225 C,1 +chrX 155386538 T +chrX 155386539 C,T,G,A +chrX 155386540 0 +chrX 155386541 0 +chrX 155386542 0 +chrX 155386543 0 +chrX 155386544 1,T +chrX 155386722 T +chrX 155386723 C,T,G,A +chrX 155386724 0 +chrX 155386725 0 +chrX 155386726 0 +chrX 155386727 0 +chrX 155386728 1,T +chrX 155386730 T +chrX 155386731 C,T,G,A +chrX 155386732 0 +chrX 155386733 0 +chrX 155386734 0 +chrX 155386735 0 +chrX 155386736 1,T +chrX 155386819 T +chrX 155386820 C,T,G,A +chrX 155386821 0 +chrX 155386822 0 +chrX 155386823 0 +chrX 155386824 0 +chrX 155386825 0 +chrX 155386826 1,T +chrX 155386897 T +chrX 155386898 C,T,G,A +chrX 155386899 0 +chrX 155386900 0 +chrX 155386901 0 +chrX 155386902 0 +chrX 155386903 1,T +chrX 155386904 C,T,G,A +chrX 155386905 0 +chrX 155386906 0 +chrX 155386907 0 +chrX 155386908 0 +chrX 155386909 0 +chrX 155386910 0 +chrX 155386911 1,T diff --git a/paraphase/data/f8/ref.fa b/paraphase/data/f8/ref.fa new file mode 100644 index 0000000..642bbee --- /dev/null +++ b/paraphase/data/f8/ref.fa @@ -0,0 +1,185 @@ +>chrX_155376007_155387000 +CTGGGTAATTTATAAAGAAAAGAGGTTTATTTGGCTCATGATTCTGCAGGCTGTACAGGA +AACATGGTGCTGGCATCTGCTTGGTTTCTGGTGAGGGTCTCAGGCTGCTTCCACTCATGG +CATAAGGTGAAGGGGAGCCGGCAAGTGCAGAGATCACATGATGAGAGAGGAATCAAGGTG +GGGTGGGGGGGAGGTGCTACACTCTTTTTTAACAACCAGCTCTTGTGGGAATGAATCAAG +TGAGAACTCACTCATTACTGCGAGGCTGGCACCAAGCCATTCATGAAGGATCCACCCCAT +GACTCAAACACCTTCCATTAGACCCCACCTTCCAACACTGGGTATCAAATTTCCACATGA +GATTTGGAGGGGTCAAACAAACCAAACTATAACCAGCACCTTGAACTTCCCCTCTCATAG +CACTTATCACAGTATAGTGTTGTCATTGTCTGGCTCCTTGTCTGTATGTTCCGCCAGACT +GCATGCAGGGATCTGATCTATATTTCTCGTTGGTGTTGTCTCAATGCTTAACACAGTATA +CCTGGCACTACCCTTCAATATTTTTTGAAGAATGAGCACATGAATTATCATGTCTCCAGC +TAGGTTCGCCAGTATGAGTAATATGTAGTGAGTGTCTACTGTGTGCCATTTTACTACCTA +AACTTTTTATGGACTCTGTCCCCTAGTCTTATCTACCAATGCCATTGTCTTAGTTGAGAT +ATCTGTCATTTCTTGACTGGTATAATACAGCAGCTTTCTAATTCCAATCTCAGTCCTCTC +TGTTCTATTCTCCACACACAATCCTCTCTGTTCTATTCTCCACACAGCTTCCAGGGTGAT +ATTTCTAGAGGCAAATCAGACCACGTCACTCCCAAGCTGAAAGACCACAGTGGGTTCAGT +GTAGCCTACAGGGTATTCTAAAACCTGAGCATGGCATAATAGAACCTCCATGTCTTGGCT +CCTGCCTACCTCTTTCCCAGCCACATCAACCACTACTGCTCCACTGGCATCGTACTCTCC +AGCCTTAATGAATTGGTTGCAACTCCCCTCCTCCCACCGGTGCCTTGGCTTCTCATCATG +CTACAGATTCTTTAACACACACATTGTCCTCCAAGAAGCTTGCCAGATCCACTCCTATTC +TTTCAACGAAATGTTTATTGACCACTCTTTATACACTATGACCAAATAGGATTTATCCCA +CAAATGTAAGGTTGGTTCAACATGTGAAAATCAGTGTATATAATACACCATATTAATGGT +TGGTTCAACATGTGAAAATCAATTTATATAATACACCATATTAACAGAATGCATGGCGGA +GCGGGGGCGGGGGAGGGGGCTCATGGTCATCTTAATCATGTACAAAAAGGATCATTTGAC +AAAACGCAGTACCCTTTCATGAAAATAAATTCAACAAACCAGGAACAGAAAGAAACTTCT +TCAACCTGATAGAGGGCATCCATGAAAAACCCAGAGATAATATCATACTTAATGGTGAAA +GACTGGTTGATTTCCCCCTAAGATCAGGAATGCCACTTGTAACTCAACATTGTACTCAAA +GTTCTAGTCAGGGCAATTAGTCAAGAAAAAGAAATAAAAGGCATCCAGATCGGAAAGGAA +CAAGTAAAACCATCTCTATTCATAGATGCATAGATGGCATGTTCTTATATAGAGGACACT +AAAGAGTCCAACAAAAGAAAAAAAAAAAACTGAGACGGTTTGGCTCTGTGTCCCCACCCA +AATCTCATGTCGAATTGTAATCCCCACCTGTTGAAGGAAGGGCCTAGTGGGAGGTGATTG +AATCATGGGGGCGGACTTCCCCCTTGCTATTCTTCTGATAGAGCGCTCACAAGATCTGAT +GATTGTTTGAAAATGTGCAGCGCTTCCCCCTTTGCGTGCGTTCTCTCTCTCTTCCTCCTG +CTCCAACATGTGAAGAAGGTGCTTGCTTCCCCTTCGCCTTCCGCCATGATTGCACGTTTC +CTGAGGCCTGCCCAGAAGCAGAAGCCTTGTACAGCCTGCAGAACCATGAGGCAATTAAAC +CTCTTTTCTTTAGGAATTACCCATTCTCAGGCAGTTCTTTATAGCAGTGTAGGAACGGAC +TAATATGCCAACCCTGTCTGAGTATGCTCAGCAAAGCATATTCGGCTCAGCAAAGTTGCA +GTATCCAAGATCAATCTGCAACAGTCGGTTGTATTTCTACACCCTAGCAATGAATAACAT +GAAAATAAGATTAAGGAAATGATTCATTTTACAGTAGCATCGAAACAATAAAATGTCGAG +GAATAAATTTAACTAAAGGTGTACAAGATGTATACTCTGAAAATTTCAAGACGTTGCTAA +AATAAATTAAAGAAGACCTAAATAAATCGAAAGGCATTTTGTGTTAACGGATTGGGACAC +TTAATATTGTTAAGATGCCCATAATCTCCAAGGCAATATACAGATTCATCGTGATGCCTG +TCAGAATCCAACCTGCCTTCTTTGTAAAAACTGACAAGCTGATCCTAAAATTCATAAGAA +AATGTAAGGGGCTCAGATAGCCAAAACAATCTCAGAAAAGAAAAAAGTTGAAGGACTCAC +ACTTCCCAATTTCAAAACGTACTACAAAACCGCAGTAATCCGAAAAGGGTGGTACTGCCA +TCGTGATCGATACATTAGGTCAATGTTACAGAACTCAGAGTCAGGAATAAACCCATATAT +TTATGTAAAGTTTATTTTCAGTAAGAGTTCCGGCACCCCTCAATGGGTGGAGGGGAAGAA +TAATCCCCTCAACAAACTGCGCTGGGACAACTGGATAGCCACATGCAAAAGAAGGCCCCT +GGACCCCTACCTCACATCATGTAAAGAATTAATTCAAAATGGATCACAGATCTAAATGTA +AGGGCTGAAACTATAAAACTCTAAGAAGAAAACGGAGATGTAACTTTGCACGACCTTGGA +TTAGGCAACAGCTCCTTCAGTATGACACCAAAGGCAAAGCCACAGATATCTTTTCATTTG +GCAAGTTGGACTACATCCAAATGTAAATCTTTTGGGCAACAAAGGGCTCTATCCAGAAAG +TGAATGGGAGAAACATTTGCAAGTCACATATCTGATAAGAGTCTATTTTCCAGAATATAT +GAATAAGTTTTAGAAGCCCACAGCCAAAATACAATCAACCGAATTAAAAAATGGGCAAAG +GGCTTGAATAGACATTCCTCCAAAGCAGAGGTACAGAGGGCCAAGAAGCACATGAAAAGA +TGTCTTAGTCTGTTTTCTGTTGCCATAATTGAATACCCGACACTGAACAATTTATAAAGA +AAAGAAATTTATTTCTTACCGTTCTGGGGACTGGGAAGTGACCCGGCATCTGGGGACGGC +AGAGGGTATCACACGGCGAGAGGGCAAGAGCGTGTATGTCGTTTCAGGCCTCTCTTCCTA +TTCTTATAAAGCCACCAGTCGCATCCTGGGGGTCTCACCCCGATGACCTTATCTAATCCT +AATTACCGTCTCAGAGGCTGCGCCTCCAATCAACATGTGAATTTGGGGATTAAGTTTCCA +AGACACACAATTTGGGGGATGCACTCAAACCACAGCAACATCATTAGCCATTTGGGAAAT +ACGAATGAAAACCACCATGAGATAGCACTTGACACCCAGTAGGATGGCTACTGTAAACAA +AACAAAATGAGGCAAACAGAAAACAGGTTTTGATGAGGCTGTGAAGAAATTGGGACTCTC +ATGCACTACTGTTGGGAATGCGAAAGATTACAGCTGCTGTGGAAATCAGTTTGGTGGTTC +CTCAATAAGTTAAACAGAATTACCACACGAGCCAGCAGTTTGACTGCTAGAAATAGACCC +AAAAGAACTGAAAAGAGATATTCAAAGAAAAACCTGTAGAGAGATGGTCATGGCAGTGCC +ACTCACAATAGCCAGAAGGGGGAAATAACCTGTCCATCAACTGACGAGTGGGTAAACAAA +ATGTGGCCTCCTATTCATACAACAGAATCGTACTCAGTCATTCAAAGGAAGGAAGTAGTG +ATTCATGCCTCTACATGGATGCACCCTGATGAAACCATTACGCTAAGTGAAAGCAGCCAG +ACACACACGGCCTCACATTTGGTCGTATAATTCCATCTATAGGAAATCTCCAGAACAGGC +TAACCTCTATGGACAGAAAGCAGATGGACTAGGGGTTGCCAGGGACTATCGGGAACGTGT +GTATGGGCGGGGGCAGGGGGTGTGTGTGGGAGGGTGTGGGGGCTGGTAGTGGTGGTGTGG +AAGGAGGCGTGCGGCAAGTGACTGCTTATAGGTACAGGGGTTTCGGTTTCTTTTCGGGGT +GTTGAAAACGTTTTGGAACTAGGTGGTCGTGATGGATGTGCAACATTGTGAAGGCGTGGT +ACCCGTCCTCCCCTCGCCCACCCCTTTGTCTTTCCCACGGTGTTCCCATCCAGTAGAAAG +GGAAAGTTCCCACTGGCAAAGAGAGGGTCACCAGCGCGATTCCCCCTGCCCCGCCCCACC +CCACCCCCACGGCCCTCCAGCCACTTCTAGGCCCAGGCCTGCTCCCTCGAGGCCAATGGG +AAAGTGATGAGGTCACCTGACGGACGTGTGTCCCTTGTGAGGGGCCGTCACCACATGCGT +GCCCGTCGCCGGATGTGCGTGCCCATTGGTCTCGGCTCCCTAGCCAATGAGGGGCTGGGC +CCTGGTCGCTAGGATACACAGTACTGGACGCGGTAACTGTCCTCTTGAGAGCCATCTTGC +CTAGCTGGGCCAAGCCGAGATAGCACACTCAACGCCCAGCATGCCGAGGAGGAGGAGACG +CCGAGGGTCCTCCGGTGCTGGCGGCCGGGGGCGGACCTGCTCTCGCACCGTCCGAGCGGA +GCTTTCGTTTTCAGTGAGCCAGGTGGAGCGCAGTCTACGGGAGGGCCACTACGCTCAGCG +CCTGAGTCGCACGGCGCCGGTCTACCTCGCTGCGGTTATTGAGTACCTGACGGCCAAGGT +CCTGGAGCTGGCGGGCAACGAGGCCCAGAACAGCGGAGAGCGGAACATCACTCCCCTGCT +GCTGGACATGGTGGTTCACAACGACAGGCTACTGAGCACCCTTTTCAACACGACCACCAT +CTCTCAAGTGGCCCCTGGCGAGGACTAGCTTCTGACACCCGGCCCCTGGGACCTGACAGG +TCCACTCGTCCACCCACCCGGCCCCAAATCCCCCGGCCTGAACCCCCGGCCTTAAACACC +CTCCCCCCACAACCCAGGCCCCAAAGTCTTGGGCCTTCATTAATTCTGTCAATAAAATGT +TTCAAGGAAGCCATCTGCCCGCTTCAGTCACTTTGCCTTGGGGTGGGGGTGGGGGGGCGG +TGAGATGGGTTGGGGGGTTGTGAGACCTCCACAGCTGGAGGGATGGAGAGGTGGTGGGGG +TCTGGGGTTGGGTGGGAGCCAGGGATGGCACAGAGGAGCCGTCTCTCCGGGTGACAGTGC +AGGGGAGTGGCCCTGGGTGGGAAGGGGACACCCCCGCTGGTTCTGAGCAAGTCAGGGCCT +GCCCGGGAAAGTCCTCAGCATGATGGTGTTCGTTGGGGCGGGTGGGCCTCAAGACCATGA +CTGCAGTGTTGCGACAGGACGGACTTCTAAGGCAACCGGGGTGGGGACGGAGAGGTGGAC +CCGGTGCTTGGCAAGGGGCCCTGGACCGGAAGACGGAGGACTGAGTAGTCGGGGCGGGGG +GGGGGGGGGGGGACAACAGTCCAGAAATGCGCAGACATGGAATTAGGGTCACGAGTTTCC +TTTGGGTCATGTTCAGATGGAGAGGCTGTGGGATGCCTTTAGCAACCTAGACCACTTGCA +TTGCAGCCGAGCTAGCAGCAGCTGGTGCCCCGCCAGGCCCCGCGTACCTTCCCAGGGTCT +TGGATCCGCATCCCCGAGACCCTCGATTTGCATACGCCGCTCACAGCCCGGCCAGGCCCC +GCCTCCGATCCCGCTCTTTGCACCTCACCAGGGCAAGGGGCAGGCTCGGGCTGGGGAGCC +GGCGCGGGCACGGCGGCAGCCCGCCGAGGGGGACGCAGGGCACGTCGCCCCGCCCCGGCC +CGCCCGCCAGCTGCGGCGGCGCGTCGGGAAGTGCTTGGGGCGGCGAGCATGGCGGCAGCG +GCTGCAGGCCTGGGCGGCGGCGGCGCCGGCCCGGGACCCGAGGCCGGGGACTTCCTGGCC +CGCTACCGGCTGGTATCGAACAAGCTGAAGAAGCGGTTCCTGCGGAAGCCGAACGTGGCG +GAGGCCGGCGAGCAGTTCGGACAGCTGGGCCGGGAGCTGCGCGCCCAGGAGTGTCTGCCC +TACGCGGCCTGGTGCCAGCTGGCGGTGGCGCGCTGCCAGCAGGCGCTCTTCCACGGGCCC +GGGGAGGCGCTGGCCCTCACCGAGGCCGCCCGCCTCTTCCTGCGGCAGGAGCGCGACGCG +CGCCAGCGCCTGGTCTGCCCCGCCGCCTACGGGGAGCCGCTGCAGGCCGCCGCCAGCGCC +CTGGGCGCCGCGGTGCGTCTGCACCTCGAGCTGGGCCAGCCGGCCGCCGCCGCCGCCCTC +TGCCTCGAGCTGGCCGCCGCCCTGCGCGACCTGGGCCAGCCGGCCGCCGCCGCCGGTCAC +TTCCAGCGCGCCGCCCAGCTCCAGCTGCCCCAGCTGCCCCTGGCCGCGCTGCAGGCGCTT +GGCGAGGCCGCCTCCTGCCAGCTGCTGGCGCGCGACTACACCGGCGCCCTGGCGGTCTTC +ACGCGCATGCAGCGCCTGGCGCGGGAGCACGGCAGCCACCCGGTGCAGTCACTGCCGCCG +CCCCCGCCGCCGGCACCCCAGCCCGGGCCCGGGGCGACGCCCGCCCTACCGGCCGCGCTG +CTTCCTCCGAACTCCGGCTCGGCGGCGCCCTCTCCCGCCGCCCTGGGCGCCTTCTCGGAC +GTGCTGGTCCGCTGCGAGGTGTCCCGCGTGCTGCTGCTGCTCCTCCTGCAACCACCGCCC +GCCAAGCTGCTGCCGGAGCACGCCCAGACCCTGGAGAAGTACTCCTGGGAGGCTTTTGAC +AGCCACGGGCAGGAGAGCAGCGGCCAGCTTCCCGAGGAGCTCTTTCTGCTGCTCCAGTCT +TTGGTCATGGCTACCCACGAAAAGGACACGGAAGCCATCAAGTCGCTGCAGGTGGAGATG +TGGCCACTGTTGACTGCTGAGCAGAACCACCTCCTTCACCTCGTTCTGCAAGAAACCATC +TCCCCCTCAGGACAGGGAGTCTGATCCATCCCATTCACCCAGTGACTTCTTTTTGCCCAG +GCCGGGACTTTTTGCATCAGTCACGTTAACCAGATGACTTTGCCTGTTACCAAACCTCAT +GCATCCACGTTTGCGTCTGGGGAGGAATAAAAAGACATCGTTCCCGCTTCTGCGTTTTGT +TATTCCTACTGCCGCCATAGGAATTATTTCGTTGGCTGAACGTTACCAGCACCCCGAGAA +CACATTTTGATAGAATCAGAGTAGAGGACATGGCTGTCTTCTAAAAAGCCACGACATGAA +AATGACAATCCCTTTCGTCTCCTTCCTCCGCTGCTTCCACCCAACGCAGCCTCCTGCCTC +CGCCTTTGTTTCATAGTGAGGATTTTATTTTGCACGGCGCCCTCCCTCTAAATACCTACC +CTAGATGATTTCATCCTGCCCCTCACTTCTTAACATATTCCTGTGTCTTTGTAATGGCCA +AATTTCTCCTTCCACTCGTCCGCACGGTACGTCTTCATGGGAGTCATTTTATTCCTTACA +GCTTCCTTGTAACCACAGCCCTAAATCCATGATAAAGCTACTCTTGTACTCCTCATGTGT +GGTTTTGGTGTCCGTCGTCTATAAAATGTGACTCTCTTACCTACTTGTCAGGAGGCCAGC +AGCAGAACCAGATAGCTGTTTGGGCTCCAGGATGTATTCGTTCCTTTGAAATTTGCGCTA +TGCCTCCTATGTGGGCAACGAACCTGCAGAGATCATCTATGATGTGCTCCGTACTCGGGT +GACTGCAATGCAGATACATCCAGTGCAGCGGGTCTGGTGCCGATGCATCGATTTGCATTG +GAATTCACGGTTCTGGTTCCATTCGCAGAGGTATATGCAGTCTGGATCACAACAGCTTTT +ATTTGTAATAGGGATTTTTTTTTTTTTTTTCCCCCCCCGAGATGGAGTCTCACTCTGTCG +CCCAGGCAGGAGCGCAGTGGCGTGATCTCGGCTCACTGCAACCTCCGCCTCCTGGGTTCA +AGCGATTCTCCTACCTCAACCTCCAGGGTGCCTGGGATTACAGGCGTGCACCACCACGCT +GGGCTAATTTTTGTATTTTTAATAGAGATGGGGTCTCACCATGTTGGCCAAGCTGGTCTC +CAACTCCTGACCTCAAATGATCCACCCTCCTCGGCCTCCCAAAGTGCTGGGATTACAGGC +ATGAGCCACCACGCCTGGCCTGTAATAGGGATTTTGCTGCAACTTTTAGTTAAGTAGGTG +AAATCAGAGTGCGTGGCCCTTTGGTGTTGTGCGATCACGTGACTGCAAGGCAGCCTCCGT +GTTCCCCCAAAGCAGGATGGCGTTCCGTTTGAGAACACTTAAGTTTCCAAGGCTTTCTGC +CTCAAATGGAGCATGCCCCCCAACTACGCTGTTTTCTTATTTATTGCGTCTTTTTAATCC +TTTTGATGGTGACAACAAGAACTCTTCAGAATATAGCCTCTTTCTCAAACTTTCAACCCT +GGGGAAATGCATCCTTTGGGCCGAAGTTGCACACAATCCTGATCTTCGGGGCTGAGATGT +AAGGACACAGTTTGGAGTAGGTGTTTTTGTTTTGCGTCTTTTTGCAGGGGAGAGGTGTGG +AGGGTTGTCACTGTGAGTTGAGTGCCTCCAAATGGGTGGGACTTGTAGGGTGTTCTTTTC +TGCTTCTATTTCTATTTCGACAGTGTGGAAGTAATGGGAAGCACAGGAGAGGCCCATTCC +ACCATCATTCCTCTTTCTCTCACTTGGCTGCCAGGAAGGGTCCTACGGAGCCCTGAACGC +CTAGCCCAAATTATTGTAAAAGCGGTTGAAGGGCTCATGCTCTTTTGCTCTTTGGCTCTT +TCCTGGTATGCTCCTCTTAACCCGGGCTGCAGACCTGGCCGGCTGGCTTCTTGTCAGGTT +CAGGAGTTACGGAGCATGACTGGTGACCAGTTTGGGATTTGGGCTTCAGGAGAAGCCTCC +TTAGGCCTCTGGTCCAAACTCCTTGCGTCAGGGAGGTGAGGGGAAGGATAACCAGACGGT +CAGGTAGTGGGTATCTGAGCATAATTCTAGGAAACACTAAAACTCCTCGTTTTCCATTTG +CCGTGGGCTCATTCATGTATCTCCCTCAGAAGCTTAAGAGCAGTGATAACTGGGCCGAGG +CCAGATTTCAGCAGTTTTGAGAATTCCCTCGTAATGTATCACATCAGTATCAAAAGTCAA +CACTGATTTCTAGCATGCCAAAAGCAAATACTTTCAGAGGACATGATTAGTTTGCTTTTT +AACTTGAAAACAATTCCAAATTCACACTTAAAACAGTGTAAACCGTCGGAAATGGTGGCA +GCGAAGCAGTGAACCCTTAAAGGCTGTTGGGGTTCCCCTTCTCCCCATTGTGAGGTACTC +CCAAAAGGATAAATGTTGCTTATGAAAGAATATTTTATACGTGAATAGTACAAAGACAAA +GACGTTTGAGAACCGCTACGCCGGAGGAAGGTGGTCAAGACTGAAATTAGCGTGTTAGGC +AAGATCTGGTTCCTAGGTGCTGAGGATTCTGGTGCCAAGGATTAATGTTAACCGTGCAGG +CATCTCTACCACCGTTAGAGGAGACCAGCAGCCCACAGACTCTTTCTGTCAAGGGCGGGA +TAGAAATGTGTTAGTCTTTACAGACCACAGGTGGTCTCTGTTGCACAGTCTTCTTTGTTT +CTTTGTATGTTTGTCTTTGCAATCCTTTCAACGTGTAAAAACTATTCTTAACTCACAGTC +CGTACAAAAGCAGGCTATGGGCCAAATTCTGGGGCTATAGTTTGCCCACCCCTGAATTAG +AACCCAGGACATAGTAGTTGCTCAATAACTGTTTAATTTTTTTTTATTTCAATAACTTTT +GGCTTACAAGTGCTTTTTGGTTAAATGGATGAATTATATAGTGGTAAATTCTGAGATTTT +AGTGCACCCATCACCCGAGTAGTGTACATTGTACCCAATATGTAGTTTTCTTATCCCACA +CCCTCACACTCTTCCCCCTTGTGTGTCTCCAAAGTCCATTATATCACTCTGTATGTCTTT +GCATTCTCATAGCTTAGCTCCCACTTACAGGTGAGAACACAGGGTAATTTGGTCTTCCAT +TCCTGAGTTACTTCACTTAGAATAATGGCCTCCAGCTCCATCCAAGTTGCTGCAAATGAC +ATTATTCTGTTTCTCTTTATGGCTGAGTAGTTCTCCACGGTGTGTACATACCACATTTTC +TATATCTACTCATTGGTTGATTGGTGCTTCGGTTCGTTCTATATCTTTGCATTTGTGAAT +TGTGCTGAAATAAACATATGTGTGCATTTGTGTTTTTCATATAATGACTTCTTCTCCTTT +GGGCAGATACCCAGTAGTGGGATTGCTGGATCGCGTGGTAGATCTACTTTTAGTTCTTTA +AGGAATCTCCATACTGTTTCCCATAGAGGTTATACATTCCCACCTAGCAGTGTATGCACC +GTCCCTTTTCACCACATCCACGCCAACGTCTATTGCTTTTTGACTTTTTAATAATGGCCA +TTCTTGCAGGAGTAAGGTGGTATCTTATTGTGGTTTTAATGTGCATTTCCCTGATGATTA +GTGATGTTGAGTATTTTTTCATATATTTTGTGGCTATTTGTATATCTTTTGACCAATGTT +TATTCATGTTATTTGCTATTTGATGGGATTATTTTTATTTTTTTCTTGCTGATTTGTTGC +ATTCGTTGTAGATTCTATATACTAGTCCTTTGTTGGCTGCATAATTTGCAAATATTTTCT +CCCATTCTGTGGGT diff --git a/paraphase/data/f8/ref.fa.fai b/paraphase/data/f8/ref.fa.fai new file mode 100644 index 0000000..5f27da8 --- /dev/null +++ b/paraphase/data/f8/ref.fa.fai @@ -0,0 +1 @@ +chrX_155376007_155387000 10994 26 60 61 diff --git a/paraphase/data/ikbkg/homopolymer_sites.txt b/paraphase/data/ikbkg/homopolymer_sites.txt new file mode 100644 index 0000000..ec4a24e --- /dev/null +++ b/paraphase/data/ikbkg/homopolymer_sites.txt @@ -0,0 +1,720 @@ +chrX 154555617 C +chrX 154555618 A,G,C,T +chrX 154555619 0 +chrX 154555620 0 +chrX 154555621 0 +chrX 154555622 0 +chrX 154555623 C,1 +chrX 154555746 T +chrX 154555747 A,G,C,T +chrX 154555748 0 +chrX 154555749 0 +chrX 154555750 0 +chrX 154555751 0 +chrX 154555752 T,1 +chrX 154555754 T +chrX 154555755 A,G,C,T +chrX 154555756 0 +chrX 154555757 0 +chrX 154555758 0 +chrX 154555759 0 +chrX 154555760 T,1 +chrX 154556037 G +chrX 154556038 A,G,C,T +chrX 154556039 0 +chrX 154556040 0 +chrX 154556041 0 +chrX 154556042 0 +chrX 154556043 G,1 +chrX 154556450 C +chrX 154556451 A,G,C,T +chrX 154556452 0 +chrX 154556453 0 +chrX 154556454 0 +chrX 154556455 0 +chrX 154556456 C,1 +chrX 154556738 G +chrX 154556739 A,G,C,T +chrX 154556740 0 +chrX 154556741 0 +chrX 154556742 0 +chrX 154556743 0 +chrX 154556744 0 +chrX 154556745 G,1 +chrX 154556831 C +chrX 154556832 A,G,C,T +chrX 154556833 0 +chrX 154556834 0 +chrX 154556835 0 +chrX 154556836 0 +chrX 154556837 0 +chrX 154556838 C,1 +chrX 154557295 G +chrX 154557296 A,G,C,T +chrX 154557297 0 +chrX 154557298 0 +chrX 154557299 0 +chrX 154557300 0 +chrX 154557301 G,1 +chrX 154557473 G +chrX 154557474 A,G,C,T +chrX 154557475 0 +chrX 154557476 0 +chrX 154557477 0 +chrX 154557478 0 +chrX 154557479 G,1 +chrX 154557550 G +chrX 154557551 A,G,C,T +chrX 154557552 0 +chrX 154557553 0 +chrX 154557554 0 +chrX 154557555 0 +chrX 154557556 G,1 +chrX 154557894 G +chrX 154557895 A,G,C,T +chrX 154557896 0 +chrX 154557897 0 +chrX 154557898 0 +chrX 154557899 0 +chrX 154557900 G,1 +chrX 154558288 T +chrX 154558289 A,G,C,T +chrX 154558290 0 +chrX 154558291 0 +chrX 154558292 0 +chrX 154558293 0 +chrX 154558294 T,1 +chrX 154558392 G +chrX 154558393 A,G,C,T +chrX 154558394 0 +chrX 154558395 0 +chrX 154558396 0 +chrX 154558397 0 +chrX 154558398 G,1 +chrX 154558410 A +chrX 154558411 A,G,C,T +chrX 154558412 0 +chrX 154558413 0 +chrX 154558414 0 +chrX 154558415 0 +chrX 154558416 0 +chrX 154558417 A,1 +chrX 154558517 T +chrX 154558518 A,G,C,T +chrX 154558519 0 +chrX 154558520 0 +chrX 154558521 0 +chrX 154558522 0 +chrX 154558523 T,1 +chrX 154558657 G +chrX 154558658 A,G,C,T +chrX 154558659 0 +chrX 154558660 0 +chrX 154558661 0 +chrX 154558662 0 +chrX 154558663 G,1 +chrX 154558887 G +chrX 154558888 A,G,C,T +chrX 154558889 0 +chrX 154558890 0 +chrX 154558891 0 +chrX 154558892 0 +chrX 154558893 G,1 +chrX 154558982 A +chrX 154558983 A,G,C,T +chrX 154558984 0 +chrX 154558985 0 +chrX 154558986 0 +chrX 154558987 0 +chrX 154558988 A,1 +chrX 154559021 G +chrX 154559022 A,G,C,T +chrX 154559023 0 +chrX 154559024 0 +chrX 154559025 0 +chrX 154559026 0 +chrX 154559027 0 +chrX 154559028 G,1 +chrX 154559152 G +chrX 154559153 A,G,C,T +chrX 154559154 0 +chrX 154559155 0 +chrX 154559156 0 +chrX 154559157 0 +chrX 154559158 G,1 +chrX 154559485 G +chrX 154559486 A,G,C,T +chrX 154559487 0 +chrX 154559488 0 +chrX 154559489 0 +chrX 154559490 0 +chrX 154559491 0 +chrX 154559492 G,1 +chrX 154559606 G +chrX 154559607 A,G,C,T +chrX 154559608 0 +chrX 154559609 0 +chrX 154559610 0 +chrX 154559611 0 +chrX 154559612 G,1 +chrX 154559824 G +chrX 154559825 A,G,C,T +chrX 154559826 0 +chrX 154559827 0 +chrX 154559828 0 +chrX 154559829 0 +chrX 154559830 0 +chrX 154559831 G,1 +chrX 154559961 T +chrX 154559962 A,G,C,T +chrX 154559963 0 +chrX 154559964 0 +chrX 154559965 0 +chrX 154559966 0 +chrX 154559967 T,1 +chrX 154559986 C +chrX 154559987 A,G,C,T +chrX 154559988 0 +chrX 154559989 0 +chrX 154559990 0 +chrX 154559991 0 +chrX 154559992 C,1 +chrX 154560016 C +chrX 154560017 A,G,C,T +chrX 154560018 0 +chrX 154560019 0 +chrX 154560020 0 +chrX 154560021 0 +chrX 154560022 C,1 +chrX 154560027 C +chrX 154560028 A,G,C,T +chrX 154560029 0 +chrX 154560030 0 +chrX 154560031 0 +chrX 154560032 0 +chrX 154560033 C,1 +chrX 154560120 C +chrX 154560121 A,G,C,T +chrX 154560122 0 +chrX 154560123 0 +chrX 154560124 0 +chrX 154560125 0 +chrX 154560126 C,1 +chrX 154560235 C +chrX 154560236 A,G,C,T +chrX 154560237 0 +chrX 154560238 0 +chrX 154560239 0 +chrX 154560240 0 +chrX 154560241 C,1 +chrX 154560322 C +chrX 154560323 A,G,C,T +chrX 154560324 0 +chrX 154560325 0 +chrX 154560326 0 +chrX 154560327 0 +chrX 154560328 C,1 +chrX 154560329 A,G,C,T +chrX 154560330 0 +chrX 154560331 0 +chrX 154560332 0 +chrX 154560333 0 +chrX 154560334 0 +chrX 154560335 C,1 +chrX 154560363 C +chrX 154560364 A,G,C,T +chrX 154560365 0 +chrX 154560366 0 +chrX 154560367 0 +chrX 154560368 0 +chrX 154560369 0 +chrX 154560370 C,1 +chrX 154560569 G +chrX 154560570 A,G,C,T +chrX 154560571 0 +chrX 154560572 0 +chrX 154560573 0 +chrX 154560574 0 +chrX 154560575 G,1 +chrX 154560817 T +chrX 154560818 A,G,C,T +chrX 154560819 0 +chrX 154560820 0 +chrX 154560821 0 +chrX 154560822 0 +chrX 154560823 0 +chrX 154560824 T,1 +chrX 154560827 T +chrX 154560828 A,G,C,T +chrX 154560829 0 +chrX 154560830 0 +chrX 154560831 0 +chrX 154560832 0 +chrX 154560833 0 +chrX 154560834 0 +chrX 154560835 0 +chrX 154560836 0 +chrX 154560837 T,1 +chrX 154560993 A +chrX 154560994 A,G,C,T +chrX 154560995 0 +chrX 154560996 0 +chrX 154560997 0 +chrX 154560998 0 +chrX 154560999 A,1 +chrX 154561000 C +chrX 154561001 A,G,C,T +chrX 154561002 0 +chrX 154561003 0 +chrX 154561004 0 +chrX 154561005 0 +chrX 154561006 C,1 +chrX 154561008 C +chrX 154561009 A,G,C,T +chrX 154561010 0 +chrX 154561011 0 +chrX 154561012 0 +chrX 154561013 0 +chrX 154561014 0 +chrX 154561015 0 +chrX 154561016 C,1 +chrX 154561311 T +chrX 154561312 A,G,C,T +chrX 154561313 0 +chrX 154561314 0 +chrX 154561315 0 +chrX 154561316 0 +chrX 154561317 T,1 +chrX 154561319 T +chrX 154561320 A,G,C,T +chrX 154561321 0 +chrX 154561322 0 +chrX 154561323 0 +chrX 154561324 0 +chrX 154561325 T,1 +chrX 154561464 A +chrX 154561465 A,G,C,T +chrX 154561466 0 +chrX 154561467 0 +chrX 154561468 0 +chrX 154561469 0 +chrX 154561470 A,1 +chrX 154561607 G +chrX 154561608 A,G,C,T +chrX 154561609 0 +chrX 154561610 0 +chrX 154561611 0 +chrX 154561612 0 +chrX 154561613 G,1 +chrX 154561981 G +chrX 154561982 A,G,C,T +chrX 154561983 0 +chrX 154561984 0 +chrX 154561985 0 +chrX 154561986 0 +chrX 154561987 0 +chrX 154561988 G,1 +chrX 154562091 C +chrX 154562092 A,G,C,T +chrX 154562093 0 +chrX 154562094 0 +chrX 154562095 0 +chrX 154562096 0 +chrX 154562097 C,1 +chrX 154562319 G +chrX 154562320 A,G,C,T +chrX 154562321 0 +chrX 154562322 0 +chrX 154562323 0 +chrX 154562324 0 +chrX 154562325 0 +chrX 154562326 G,1 +chrX 154562355 G +chrX 154562356 A,G,C,T +chrX 154562357 0 +chrX 154562358 0 +chrX 154562359 0 +chrX 154562360 0 +chrX 154562361 G,1 +chrX 154562595 A +chrX 154562596 A,G,C,T +chrX 154562597 0 +chrX 154562598 0 +chrX 154562599 0 +chrX 154562600 0 +chrX 154562601 0 +chrX 154562602 0 +chrX 154562603 A,1 +chrX 154563013 T +chrX 154563014 A,G,C,T +chrX 154563015 0 +chrX 154563016 0 +chrX 154563017 0 +chrX 154563018 0 +chrX 154563019 0 +chrX 154563020 0 +chrX 154563021 0 +chrX 154563022 0 +chrX 154563023 0 +chrX 154563024 0 +chrX 154563025 0 +chrX 154563026 0 +chrX 154563027 0 +chrX 154563028 0 +chrX 154563029 T,1 +chrX 154563036 T +chrX 154563037 A,G,C,T +chrX 154563038 0 +chrX 154563039 0 +chrX 154563040 0 +chrX 154563041 0 +chrX 154563042 T,1 +chrX 154563195 T +chrX 154563196 A,G,C,T +chrX 154563197 0 +chrX 154563198 0 +chrX 154563199 0 +chrX 154563200 0 +chrX 154563201 T,1 +chrX 154563203 T +chrX 154563204 A,G,C,T +chrX 154563205 0 +chrX 154563206 0 +chrX 154563207 0 +chrX 154563208 0 +chrX 154563209 T,1 +chrX 154563445 G +chrX 154563446 A,G,C,T +chrX 154563447 0 +chrX 154563448 0 +chrX 154563449 0 +chrX 154563450 0 +chrX 154563451 G,1 +chrX 154563746 G +chrX 154563747 A,G,C,T +chrX 154563748 0 +chrX 154563749 0 +chrX 154563750 0 +chrX 154563751 0 +chrX 154563752 0 +chrX 154563753 G,1 +chrX 154563786 G +chrX 154563787 A,G,C,T +chrX 154563788 0 +chrX 154563789 0 +chrX 154563790 0 +chrX 154563791 0 +chrX 154563792 0 +chrX 154563793 G,1 +chrX 154563999 C +chrX 154564000 A,G,C,T +chrX 154564001 0 +chrX 154564002 0 +chrX 154564003 0 +chrX 154564004 0 +chrX 154564005 C,1 +chrX 154564007 C +chrX 154564008 A,G,C,T +chrX 154564009 0 +chrX 154564010 0 +chrX 154564011 0 +chrX 154564012 0 +chrX 154564013 0 +chrX 154564014 C,1 +chrX 154564164 C +chrX 154564165 A,G,C,T +chrX 154564166 0 +chrX 154564167 0 +chrX 154564168 0 +chrX 154564169 0 +chrX 154564170 C,1 +chrX 154564274 C +chrX 154564275 A,G,C,T +chrX 154564276 0 +chrX 154564277 0 +chrX 154564278 0 +chrX 154564279 0 +chrX 154564280 C,1 +chrX 154564361 C +chrX 154564362 A,G,C,T +chrX 154564363 0 +chrX 154564364 0 +chrX 154564365 0 +chrX 154564366 0 +chrX 154564367 0 +chrX 154564368 0 +chrX 154564369 C,1 +chrX 154564691 T +chrX 154564692 A,G,C,T +chrX 154564693 0 +chrX 154564694 0 +chrX 154564695 0 +chrX 154564696 0 +chrX 154564697 T,1 +chrX 154565000 T +chrX 154565001 A,G,C,T +chrX 154565002 0 +chrX 154565003 0 +chrX 154565004 0 +chrX 154565005 0 +chrX 154565006 T,1 +chrX 154565028 A +chrX 154565029 A,G,C,T +chrX 154565030 0 +chrX 154565031 0 +chrX 154565032 0 +chrX 154565033 0 +chrX 154565034 A,1 +chrX 154565121 C +chrX 154565122 A,G,C,T +chrX 154565123 0 +chrX 154565124 0 +chrX 154565125 0 +chrX 154565126 0 +chrX 154565127 C,1 +chrX 154565161 C +chrX 154565162 A,G,C,T +chrX 154565163 0 +chrX 154565164 0 +chrX 154565165 0 +chrX 154565166 0 +chrX 154565167 C,1 +chrX 154565212 T +chrX 154565213 A,G,C,T +chrX 154565214 0 +chrX 154565215 0 +chrX 154565216 0 +chrX 154565217 0 +chrX 154565218 T,1 +chrX 154565287 T +chrX 154565288 A,G,C,T +chrX 154565289 0 +chrX 154565290 0 +chrX 154565291 0 +chrX 154565292 0 +chrX 154565293 0 +chrX 154565294 T,1 +chrX 154565386 T +chrX 154565387 A,G,C,T +chrX 154565388 0 +chrX 154565389 0 +chrX 154565390 0 +chrX 154565391 0 +chrX 154565392 0 +chrX 154565393 0 +chrX 154565394 0 +chrX 154565395 0 +chrX 154565396 0 +chrX 154565397 0 +chrX 154565398 0 +chrX 154565399 0 +chrX 154565400 0 +chrX 154565401 0 +chrX 154565402 T,1 +chrX 154565551 T +chrX 154565552 A,G,C,T +chrX 154565553 0 +chrX 154565554 0 +chrX 154565555 0 +chrX 154565556 0 +chrX 154565557 T,1 +chrX 154565559 T +chrX 154565560 A,G,C,T +chrX 154565561 0 +chrX 154565562 0 +chrX 154565563 0 +chrX 154565564 0 +chrX 154565565 T,1 +chrX 154565686 T +chrX 154565687 A,G,C,T +chrX 154565688 0 +chrX 154565689 0 +chrX 154565690 0 +chrX 154565691 0 +chrX 154565692 T,1 +chrX 154565694 T +chrX 154565695 A,G,C,T +chrX 154565696 0 +chrX 154565697 0 +chrX 154565698 0 +chrX 154565699 0 +chrX 154565700 T,1 +chrX 154565708 G +chrX 154565709 A,G,C,T +chrX 154565710 0 +chrX 154565711 0 +chrX 154565712 0 +chrX 154565713 0 +chrX 154565714 G,1 +chrX 154565843 T +chrX 154565844 A,G,C,T +chrX 154565845 0 +chrX 154565846 0 +chrX 154565847 0 +chrX 154565848 0 +chrX 154565849 T,1 +chrX 154565975 T +chrX 154565976 A,G,C,T +chrX 154565977 0 +chrX 154565978 0 +chrX 154565979 0 +chrX 154565980 0 +chrX 154565981 0 +chrX 154565982 T,1 +chrX 154566157 T +chrX 154566158 A,G,C,T +chrX 154566159 0 +chrX 154566160 0 +chrX 154566161 0 +chrX 154566162 0 +chrX 154566163 0 +chrX 154566164 0 +chrX 154566165 0 +chrX 154566166 0 +chrX 154566167 T,1 +chrX 154566189 T +chrX 154566190 A,G,C,T +chrX 154566191 0 +chrX 154566192 0 +chrX 154566193 0 +chrX 154566194 0 +chrX 154566195 C +chrX 154566196 A,G,C,T +chrX 154566197 0 +chrX 154566198 0 +chrX 154566199 0 +chrX 154566200 0 +chrX 154566201 C,1 +chrX 154566278 T +chrX 154566279 A,G,C,T +chrX 154566280 0 +chrX 154566281 0 +chrX 154566282 0 +chrX 154566283 0 +chrX 154566284 T,1 +chrX 154566663 A +chrX 154566664 A,G,C,T +chrX 154566665 0 +chrX 154566666 0 +chrX 154566667 0 +chrX 154566668 0 +chrX 154566669 A,1 +chrX 154566699 G +chrX 154566700 A,G,C,T +chrX 154566701 0 +chrX 154566702 0 +chrX 154566703 0 +chrX 154566704 0 +chrX 154566705 G,1 +chrX 154566712 G +chrX 154566713 A,G,C,T +chrX 154566714 0 +chrX 154566715 0 +chrX 154566716 0 +chrX 154566717 0 +chrX 154566718 G,1 +chrX 154566837 T +chrX 154566838 A,G,C,T +chrX 154566839 0 +chrX 154566840 0 +chrX 154566841 0 +chrX 154566842 0 +chrX 154566843 T,1 +chrX 154566860 T +chrX 154566861 A,G,C,T +chrX 154566862 0 +chrX 154566863 0 +chrX 154566864 0 +chrX 154566865 0 +chrX 154566866 0 +chrX 154566867 0 +chrX 154566868 0 +chrX 154566869 0 +chrX 154566870 0 +chrX 154566871 T,1 +chrX 154567002 T +chrX 154567003 A,G,C,T +chrX 154567004 0 +chrX 154567005 0 +chrX 154567006 0 +chrX 154567007 0 +chrX 154567008 0 +chrX 154567009 T,1 +chrX 154567063 T +chrX 154567064 A,G,C,T +chrX 154567065 0 +chrX 154567066 0 +chrX 154567067 0 +chrX 154567068 0 +chrX 154567069 T,1 +chrX 154567097 T +chrX 154567098 A,G,C,T +chrX 154567099 0 +chrX 154567100 0 +chrX 154567101 0 +chrX 154567102 0 +chrX 154567103 0 +chrX 154567104 T,1 +chrX 154567685 A +chrX 154567686 A,G,C,T +chrX 154567687 0 +chrX 154567688 0 +chrX 154567689 0 +chrX 154567690 0 +chrX 154567691 A,1 +chrX 154567704 C +chrX 154567705 A,G,C,T +chrX 154567706 0 +chrX 154567707 0 +chrX 154567708 0 +chrX 154567709 0 +chrX 154567710 C,1 +chrX 154568050 C +chrX 154568051 A,G,C,T +chrX 154568052 0 +chrX 154568053 0 +chrX 154568054 0 +chrX 154568055 0 +chrX 154568056 C,1 +chrX 154568160 A +chrX 154568161 A,G,C,T +chrX 154568162 0 +chrX 154568163 0 +chrX 154568164 0 +chrX 154568165 0 +chrX 154568166 A,1 +chrX 154568256 G +chrX 154568257 A,G,C,T +chrX 154568258 0 +chrX 154568259 0 +chrX 154568260 0 +chrX 154568261 0 +chrX 154568262 G,1 +chrX 154568979 G +chrX 154568980 A,G,C,T +chrX 154568981 0 +chrX 154568982 0 +chrX 154568983 0 +chrX 154568984 0 +chrX 154568985 G,1 +chrX 154569157 G +chrX 154569158 A,G,C,T +chrX 154569159 0 +chrX 154569160 0 +chrX 154569161 0 +chrX 154569162 0 +chrX 154569163 G,1 +chrX 154569234 G +chrX 154569235 A,G,C,T +chrX 154569236 0 +chrX 154569237 0 +chrX 154569238 0 +chrX 154569239 0 +chrX 154569240 G,1 +chrX 154569578 G +chrX 154569579 A,G,C,T +chrX 154569580 0 +chrX 154569581 0 +chrX 154569582 0 +chrX 154569583 0 +chrX 154569584 G,1 diff --git a/paraphase/data/ikbkg/ikbkg_config.yaml b/paraphase/data/ikbkg/ikbkg_config.yaml new file mode 100644 index 0000000..3974f2f --- /dev/null +++ b/paraphase/data/ikbkg/ikbkg_config.yaml @@ -0,0 +1,32 @@ +gene: ikbkg +use_supplementary: True + +# data files +data: + reference: ref.fa + homopolymer: homopolymer_sites.txt + +# coordinates +coordinates: + hg38: + nchr: "chrX" + nchr_old: "chrX_154555600_154575000" + nchr_length: 156040895 + extract_region1: "chrX:154555883-154569698" + extract_region2: "chrX:154634729-154648555" + + clip_3p_positions: [154569698] + clip_5p_positions: [154557136] + + pivot_site: 154555882 + + left_boundary: 154555780 + right_boundary: 154568820 + + deletion1_size: 10806 + del1_3p_pos1: 154558000 + del1_3p_pos2: 154558030 + del1_5p_pos1: 154568800 + del1_5p_pos2: 154568840 + + noisy_region: [[154566998, 154567171], [154555869, 154555876], [154565356, 154565401]] \ No newline at end of file diff --git a/paraphase/data/ikbkg/ref.fa b/paraphase/data/ikbkg/ref.fa new file mode 100644 index 0000000..ae6c441 --- /dev/null +++ b/paraphase/data/ikbkg/ref.fa @@ -0,0 +1,325 @@ +>chrX_154555600_154575000 +GACAGAGTTTTGCTCTGTCCCCCAGGCTGGAGTGCAATGGTGTGATCTCGGCTCACTGCA +ACCTCCACCTCCCAGGTTCAAGTGCTTCTCCTGCCTTAGCCTCCCAAGTAGCTGGGATTA +CAGGCGTGTGCCACCACACCGGGCTAATTTTTGTATTTTTATTAGAGACGGGGTTTCACC +ATGTTGGCCAAGCTGGTCTGGAGCTCCTGACCTCAGGTGATCCACCCACCTTGGCCTCCC +AAAGTGCTGGGATTACAGGCATGAGCCACAGTGCCTGGCCAACACGTACTTTTAAGTGAA +GCTGATGTGTTTGGTGTTATTTTCTTGCAGAAAGTGAGGGGCATTAGTGTAAAGGATTTT +GGAAGTGTTTAAAGAAACAAAAGGGAGTGTTGAGACGCCATCCACCCCTGAGAGAAGCTG +CGTGGTATTATGGCGGGTGGGGGCACCAGGATGGGTGGCCCCACTTCTGGCCTCTGACTT +CCTGAGCCTCAGGCCCATGTGGGCCCAGGCAGGGCCCGGCAGGCCGGGCTGCCCAGCTCC +CCTCCACTGTCCCCTCTGCCACCAGATGCCATCCGGCAGAGCAACCAGATTCTGCGGGAG +CGCTGCGAGGAGCTTCTGCATTTCCAAGCCAGCCAGAGGGAGGAGAAGGAGTTCCTCATG +TGCAAGTTCCAGGAGGCCAGGAAACTGGTGGAGAGACTCGGCCTGGAGAAGCTCGATCTG +AAGAGGCAGAAGGAGCAGGCTCTGCGGGAGGTGGAGCACCTGAAGAGATGCCAGCAGGTA +GTCGGGGCAGGGCCAGGTTCTGAAAACCCGCGGTGACGCCAGTGTTCCACAAGGGAACCC +GTGGTCGGGGTCCCCCAAAGCACCCTGGGGCTCAGTGCTGTGCCGGGAGGGCTCGGAACT +CAGAAAAGCCGTCACACTCCCAGTTCCGGTTTATTACAAGGAAAGGACACAGGTTACGGT +GAGCGAAGGCTCAGGGCGCACAGGGCGGGCTCCAGGAGAGACCAGGCGTGAGCTTCAGCG +GCTCCTCGCCCAGGGGAGTTGTGCAGACGGCACCTGTTTCTTTCGGCAACAGTGTGGGAC +AGCGAGCACGGAGTCACAACCGGGAAGCTCACCCCAGCCGTGGCGGCCGGGGTTTTCACG +GGGGGTGGGCCGCGTGGGCACCGAGCGCCTGCGTGGCCAACCCTGGTCACTCGGCTGTAG +CCACCAGAGGTCCAGCTGTGTGGCCCAAGGCTCCCCCCATAAATCGTGTCATTAGCACAG +ACCGCCTGGTTTCAGGGTCTTTGTGTGTGGGCTTGGCTGATCGCAGGATCCTGGCGATGG +TAGTCAGGAAGGGGCCGTGCTCCCTTTGAGGGGCAAGGTGGAGAGAAGTGCTGGAGAGGA +GACTTGCTGGCGGGTACCTGGCACTTGCCACAGCCAGGCTCCACTCCCCTGGGGAAAGGC +GTGGATGGTGGGCTGTGCACGCCGCTCCACTCAGGGCTTAGAGCGCCTGGCTTAAGGCGT +TGATTTCCTGTGTGGGAAGTGGATGAGTTTTCTACAGCTGCCGTGACCAAGCACCACAGA +CTGCGGGGCCGAAGCCACAGAAACGCGTGGCCTCCCGCTTCTGGAGGCCTGGAGGCTGAG +CTAGCGGTGGTGTCGGCAGGGTGGGCTCCCCGCCAGGGCCGCGAGGGAGCTGCCTTCCAG +GCCTCTCCACGGCGCCGGGGGCCGCCGGCTGCACCTCTCCAGCCTCCATCTCCGTCATCC +TGTGGCCTTGTCCCCGCGGGCCTCTGTGCCTGTCCTCCTCTTTTGACAAGAACACCGGAG +ATACACAAAGGTACACAAAAGCGGGCCTTTGTTCAAGCTGGCAAAAGAGATCTTCTTCAG +AAACCCCTGCTTGCGGGGGAGAGAGCTGAGCTCCGTTCCCGCCCCAGCAGAGGCGGCCTG +GCCTTGCGAAGGGAGAAGGAGGGAGTCGGGAGGGGGCGAGTGCAGGCTCAGGTGAAAGAT +GACGGGGCAGCCAGCGTCCTTGCCGCGAGGCCAGCCGTGTGTGGGAGCTGCCGGTGCTTA +CCAAGGTTGGGATGCTTCCGTCCCGTGGAGACTGGGAGACTGGGCCCCGCGCCTCCTGAG +GTTTCCGTTTCCAAGGAGTGGCTGCGGGGCCCTCGGGAAAGCCCCTGGGTTGTGGGTGCT +ACACAGATGTCTCAAAGGGACAGGGTAAGCCCTTTGTAGTAAATGCTGTCAGAAAGGGAG +GTCAGGTGTTGGCCGGAACAGACAGTACATGCTCTGGGCAGCCCTGAGCGTTTCCAGACG +GGAACTCACTCAAAAGGGGGCTGGGGCGTCCCAGGGGCGCGGCCTTAGGCTCCCAGAGGC +CCCGCGAGGTGGTGGCCGGGTGTCTTCGGGCAGGGGTTTGAGTGCAGTGTGCCTGCCGAG +AGGTTCTGCAGTTCCCAGTGTTTAACAAAATTCAGTGTCCACTCTTGATCTGCACAAACT +CTCCCATCCTGGCGGCCCCGGGTGTGGACTGGGGCCTGTGTTTACTTTGCCCTATTCGTG +TCTGGCCTCCTTTTGTCCCAAGTCTCAGAGAGACGGAGAGAGATCCCCTGCTGGGGCTGT +AGCTGCAAGGCCACCGGGTTCAGCCCTCGAGGCCTGCTTGCCGGGGCAGTGACTAAGCCG +TTGACAACCTCAAGGCAGCTTTGTGCTCCTTCGTCTCTTTGGGGATCTCTTTTTGCCCCA +TCTGTGTGTCACCCTGTGGCAGAGGGTTAAGGTGGGCAGCTGGGGAGGGTTGGGTGGCCC +TTGGGCTCATGAGGCCCTAGGGCACCCAGGTTTGGGGGTGCCGAGGGCAGGAAAAAAGGC +CTCATGGCGCGCAGGCCTCAGCCGCTTGCGGGTTGCCCCGGGCTTGCGGATGGCAGGAGT +GGGCCGCTGGGGAGAAAGCAGTGCTGACAGGAAGTGGCTTTTTATCCTGCAGCAGATGGC +TGAGGACAAGGCCTCTGTGAAAGCCCAGGTGACGTCCTTGCTCGGGGAGCTGCAGGAGAG +CCAGAGTCGCTTGGAGGCTGCCACTAAGGAATGCCAGGCTCTGGAGGGTCGGTGAGTCGG +GGGAGCCGGCTCCGGAGACCCCTTCCAGGGTTTCCAAAAGCAATGAGGTGGGTTTAGGGG +CCTCCAGGGTGCTCCTTGATGAGGATAGACCGGGGCAGGCTGCGTAAAGACGTCGGGGCA +GACGTCGGGAGAGGTCTGGGCCAGGCATCCGGGACCTGGGTCCCAGCCGGCTCTCCGCAC +TCTGTGACCCTTTGATGGAGTTTGGATTATTTCCTTAGGAGGCATTCTGGGGGCCCCGAG +CCCACACCCACAGTGTCTAGTTCTCTGGAAGGACTTCTGGGACCGGCGCACAGTCGCCCT +CGTGGCTGAGGTTGATGACAGGGAAAAAGGCACAGGGCAGGAGCCGCCAGGGCAGGAGCC +GTGGGGGGAGTTGGAGAAGCCCTGTCCCAGCCTCCCGCTGCCCTGCGAGCGGCACAGTGA +GAAGCGCCTCCCACACGGGCCGTGTTCCTGCCCAGGGATGCCCGCTAGAGACTCAGCGCC +CTGGGTGTTTACTGGGGGCAGGTCCTGTCAGCGCCCTCTGCCAGGCAGGTACCTAAATCC +CCGACTCCCAGCAGCAGAGCGGGTGCTCACGTCAACCACGTTCTTCCTACAAATGGCCTA +GGTGCAGGGACCGACCTGACCACTAGAGAAAGTGTCACTGTGGCAAGGGAACTTCACCAG +CCAAGGGCCAACCTTGCCAGCCGGTGGCCTCAGGCCTGCTGGTTACTCTCTTTTGCAAAG +GGGTCTTGGTTCTTGTGAGTGGGACCATTGGGTCAAAGGGCAGGGAGGTTTCTGTGGTTC +TCATTCGGTCCTGCTTCTGCCCTCCAGACAGATGGATCAGCTGCCAGGGGGGCCCCAGCC +ATCCCAGCACAGTAGGCGGTCAAGGTGCACTTGGGGCAGCCAGCAGGGCAGAGGGGAGGG +GAGCTTGACCCAGGCTCTGATGGGCAGAGGGAACCCGTGCAGGGTGTGGGGGCAGTATGC +AGGCAGGCGCGGAGGGGAGAGCCAAGCAGCCAGGCCTGCCAGGCAGAGTTGGGGTGACTG +GAGAAGGGCCGTGTCTGCCTGTGGCCAGAGGCCACCCAGGACCTGGACAGATGCACCCAC +CATTGTCCCTGCAGTGAGGCTGTGGAAGGGCTTGGTGTGGTGGGATGAGGCCAGACCCTG +GAAACTGGAGGTTAAGGGAGCTGTAGGGGGGCAGGTGTGGGAACTGAGCATCCGAGCAGG +TCGTCTGGGACTCCAGCAGAGCTCTGGGCAGCAGCAGGGATGGGGCCGAGGCCCGGTCTG +CATTGAGCTCAGTGCTTGCACGCCCAGGTGGGCAGTCTCTCATTTTTGGAACAGCAGTCT +CTCCTGACCCCCTCCACTGAGACTGCTTTTGCTGGGGCCCCCAGCAGTCCCCCAGTGGAA +CTCCACGGGCAGTTCCGAGGGCTCCTCTCACCTGGCCCCAGCACTGCGGGATGCAGGCGA +CCCCATCCTTTTCTCGGACCACCCCCTTCCCCTGGCTTCCAGGTCTCCTTGCCATCTGTA +CTTGGTCACCTGCTGGGCCCCTGCATTGAAGCAAACACGTCTTAAGCAAAGCTCCTCACC +TGCTGCTCCCACCTGGCCCCCTGCAGTTGTCCTCGTGTCTGTTGACGGTGCCTCCACCCT +GCCGCCTGGCATCAGCTCGCAGTCACAGGGTGTTCAGAGCCGACCCCCACCCCCCGCCCA +CGCCCTGCGCATAGCCCCTGCCGTCCCCCCGTTCGTCCTCCCTGAGTCTGCTCTTTCCCC +GTGCCAGGGCCCGGGCGGCCAGCGAGCAGGCGCGGCAGCTGGAGAGTGAGCGCGAGGCGC +TGCAGCAGCAGCACAGCGTGCAGGTGGACCAGCTGCGCATGCAGGGCCAGAGCGTGGAGG +CCGCGCTCCGCATGGAGCGCCAGGCCGCCTCGGAGGAGAAGTGAGTCAGCGGGGGCGGGG +CCGCACCGCAGGGTCTGTGGTTCTACACTTGATCTTAGCCGAAAGGCTGAGAAGTGTCGG +GTCCATGGTTCTTTCTGCCTTCTGAGGACTCCTTCAGATTCTGCCTGTGGCTGTGGGCCC +ATTCTGTCCCTTAGCCTTGCTAACGGTAGAGGCGACCATGATGACACCCGGTTTGTCTTT +GATACAGTCATGCCATCTGCTCTCCAGACCACGTTTCACTGCGTGTCCACACGTGGCCTT +TTTTGTAGTTTTTTTTTCCTAGCCACTAGGTCATCAGGGGACTTGTCCTTTAAAACCCCT +TCTAGGCCAGGTGCTGTGGCTCACGCCTGTAATCCCAACACTTTGGGAGGCCAAAGTGGG +TAGATGGCTTAAGCCCGGGAGTTCCAAGACCAGCCTGGGCAACAGAAAGACAACAAAAAT +ACCCCCAAACCCCCCCGTCTACCAGCATCCAATCTGGGACCTCAGGTTCCTGTCCTTGGC +GTGCCTTTTCAGTCTCCTTTAATCTAGAACAGTTCCCCTGCCTTTCTGAGCTGTTTGTGA +AGTTCACAGTTTTGAACAGTGCAGGGTAGTTCCATTGTATTATTACTATTATTTTCAAGA +CAGGGTCTTGCTCTACCGTCCAGGCTGGAGTGCAGTGGCATAATCTCGGCTTACTGTACC +TTCCGCCTCTTGGTCTCAAGCGATCCTCCCAGGTAGCTGGGACTATAGGCGCAGGCCAGC +ACACCTGGCTAATTTTTGCATTTTTGGTAGAGGTGGCGTTTTCCTATGTTGCCCGGGCTG +GTCTTGAACTCCTGAGCTCAAGCGATCCTCCTGCCTTGGCTTCTCAAAGTGTTGGGATTA +TGGGCGTGAGCCACCGCGTCTGGCCGCGATTTTATTATAAACATTAAAAATACTAGCTTT +TAGGAAAACGATATTAACTGCCTGGTGACCAGCCCACCAAAGCCTGCTTTAGAGTTGACG +GCCTCAGGAGTCCTCACACAGCCTTGGAAGACCCCATTCCAGGCCTGTGATGCGAGGGAG +GGAAGGAAGGGGGTAGAGTTGGAAGCAGGCAGCACCGTGGCTGGACTGGCATGAGGTGGT +TTCTCCAGCAAAAGCTCCCTTTCCTCAGGAGGAAGCTGGCCCAGTTGCAGGTGGCCTATC +ACCAGCTCTTCCAAGAATACGACAACCACATCAAGAGCAGCGTGGTGGGCAGTGAGCGGA +AGCGAGTGAGTGCGACCACTGGGGCTCTAGGGCTGGCCTTGCCTCTTCCTCTCCCCGTGG +CCCTGAACCTTGAGAATGGGTAGACCTGCCTTAGACTTGCCTTAGACCTGTGTCAGGCTG +CAGCTGCGACAGCTCAGGGAAGCTGTGGGGAGATGGCAACCCCAGGATGTTGCTCTCAGG +AGTGTCAGCAGGCCATCTTAATGGGGGGCTGGGCCAGAGCCTTGGGGTGCTCCCTCTGTG +GGGCTGGGGACGTCTTGTCTCCATGGACATTCCCTCTTGCCAGCCATCGCCATCTGGCAC +CTGGCTCAGCTTCCCCCAAGCCAAGGTAAGCCCGACAGCATTTCCACCCCAGTGTTGGCT +GGGAGCCTTTTCCTAGTTTGTCCTCATCAGACCTAAGCTGGGGTGCAGTTTGCTAGTGAT +CACATTTTAGCAGGACACCGTCAATCGTAAGTGTACCCAGAGGAGATTTATAAGGACAAA +GCCTGAAGCCAGGTCACATGGGGAAGAGTTAGCTACAAAACTGGCCACTTAATCTCTGGA +GGGGGGCGTTGGTGGGGTGTGTCTGTGTGTGTCTCAGGGGGCTGGAGATGCCTGCGTGGG +AGGAGTGCACCTCTGACCAGGTGGCAGAGTGGAAGGACTGAGGGCTCTCAGCTGAGCTGT +GCACATGGCGGGCACAGGACCGGCTGGCTGTGAGTGGGTGTGGCCTGTGGCCTGTGAAGG +GTGGGAGGAGGGCTGTGGAGCTGGGGATTCTGGGAAGGGAATGTCGGCCCAGCTGGGAGG +TTGTACCAGATGACCTCAGCGGCCTCTTCAGTCCTGAAAAAAACCTCAGCATCTCCTCTG +TCGTTTTGGGCCGTGACAGGACGCAGCCATCTCCCTGTGCACGCTGAGATCCTGCAATGG +GCCCTCAAATCAGGGGCTGGCATCACCCAGCCTGGTCAGCCAGGGCCACTCTTTCATCCT +TCTCAGTTCTTCTCAGCCAGCCTCGCCCTGGGCTGACGAGGCTCCGTCAGCTCCCCTTGC +CCGTCCTTAGGGAATGCAGCTGGAAGATCTCAAACAGCAGCTCCAGCAGGCCGAGGAGGC +CCTGGTGGCCAAACAGGAGGTGATCGATAAGCTGAAGGAGGAGGCCGAGCAGCACAAGAT +TGTGATGGAGACCGTTCCGGTGCTGAAGGCCCAGGTGAGGGCCCTCCTCTCTGACCCACC +CTGGCACTGGGACCTGGAGAGTCTCTTTGGCGTCTTTTTTTTTTTTTTTGCTTTTGCTTT +TTGAGATTGAGTTTTGCTCTTGTTGCCCAGGCTGGAGTGCCACTAGTGGCACGATCTTGG +CTCACTGCAACCTCTGCCTCCCGGGTTCAAACAATTCTCTTGCCTCAGCCTCCTGAGTAG +CTGGGATTACAGGCGCCTGCCGCCATGCCCGTCTAATTTTTGTATTTTTAGTAGAGACAG +GGTTTCACCATGTTGGCCCAGCTGGTCTCGAACTTCTGGCCTCAGGTGATCTGCCCACCG +CAGTCTCTCAAAGTTCTGGGATTACAGGCGTGAGCCACCGCACCCGGCCTCTTTGGCATC +ATTTTGTAGTGGCCTTTCGTAAGCTTCTGAGCCACTTGTGCTGCTCCTTAGACCTCTCGG +TGAGCTTGGCATTACTCGCCGACGTATCTGTTTCCTCTGCGCCGCTGGGGGCTCTGGGAG +GACAGCAGTGGGTTCTGCTTTGTTCCTGTGGTGCCTGGCGCAGTGCCTGGTGGGTGGCTG +GCTTGTGGCGGGCACATCCCTTTCTGTTGGATTTGCCAGGCGGATATCTACAAGGCGGAC +TTCCAGGCTGAGAGGCAGGCCCGGGAGAAGCTGGCCGAGAAGAAGGAGCTCCTGCAGGAG +CAGCTGGAGCAGCTGCAGAGGGAGTACAGCAAACTGAAGGCCAGCTGTCAGGAGTCGGCC +AGGTGGGCCTCTGAGAGCGTGCCCGTGTGAGCAGTGGGTGCGACACTGGGGGGTCGCCAG +TGGTGACCCCGCAGTGGGTGCGACACTGGGGGGTTGCCAGTGGTGACCACAGGAGACGGA +TGGCTCCTGGTGTTCTGGGTTAGGGCTCACTGTGGTCCCTCTCCTCTCACCTGAGCTTCC +AAGAGCTGCTTTGACACTAGTCCAGCCAAGGAGCTTTACAGAAATGCGTGGCTTGACTGG +ACGGTTTCTGTTTCCAAAGGATCGAGGACATGAGGAAGCGGCATGTCGAGGTCTCCCAGG +CCCCCTTGCCCCCCGCCCCTGGTGAGTGAGCGAGAACTGGGCCTGCGGGAGGAGGTGGGT +GGGGAGGGCAGGTGCTGCGCCGCGGGAGGTCACAGTTCGACCTTCCTGTTGCTCTCTGGA +GACTTGACGGCGGGAGCTCGTGTAGGCCACCCCATCGGTAGCCCACCCCCTTCCCCGAGG +CTAAGGGAGGCATGCCGTGGTAGCGGCGGCTCCTGGTCTTACATGAGTGGCCTGTGAGAC +CAGGCCTGCCATTGACAGTCCTGCCAAGTCTCCGTCCCCCTCCATCCTCCCCTTCCCTCT +GACTCTTCTCTTTTCCCAGCCTACCTCTCCTCTCCCCTGGCCCTGCCCAGCCAGAGGAGG +AGCCCCCCCGAGGAGCCACCTGACTTCTGCTGTCCCAAGTGCCAGTATCAGGCCCCTGAT +ATGGACACCCTGCAGATACATGTCATGGAGTGCATTGAGTAGGGCCGGCCAGTGCAAGGC +CACTGCCTGCCGAGGACGTGCCCGGGACCGTGCAGTCTGCGCTTTCCTCTCCCGCCTGCC +TAGCCCAGGATGAAGGGCTGGGTGGCCACAACTGGGATGCCACCTGGAGCCCCACCCAGG +AGCTGGCCGCGGCACCTTACGCTTCAGCTGTTGATCCGCTGGTCCCCTCTTTTGGGGTAG +ATGCGGCCCCGATCAGGCCTGACTCGCTGCTCTTTTTGTTCCCTTCTGTCTGCTCGAACC +ACTTGCCTCGGGCTAATCCCTCCCTCTTCCTCCACCCGGCACTGGGGAAGTCAAGAATGG +GGCCTGGGGCTCTCAGGGAGAACTGCTTCCCCTGGCAGAGCTGGGTGGCCGCTCTTCCTC +CCACCGGACACCGACCCGCCCGCCGCTGTGCCCTGGGAGTGCTGCCCTCTTACCATGCAC +ACGGGTGCTCTCCTTTTGGGCTGCATGCTATTCCATTTTGCAGCCAGACCGATGTGTATT +TAACCAGTCACTATTGATGGACATTTGGGTTGTTTCCCATCTTTTTGTTACCATAAATAA +TGGCATAGTAAAAATCCTTGTGCATTAGTCGTGCGTATCTTTGGCATAGATTCTGAGAAG +TGACACCACTGAGCATGGGCGATGGCGTAGATGGTACCTGAGCCCCCTTCCTCCTTGGAG +CTTGGTTTCCCATCTCTCCCCACCCCCTATTTCCCTAGCCTTGCCAAGGAGGAGGTGGGA +AAGCCCGTTTGGGTTTTTGTCATTCGCTAGGCCATGCAGTTCTCTGTTAAGAGTGAGCTT +AAACATCTTTCCTGAGGCTTTAAGGACCTTTTTTAGTTCTGCTTCTGAATGGGCTGCTCA +TATCATATATATATATGTATATGTATAGTTGTGTATATGTATGTGTGTGTGTGTGTGTGT +GTGTGTATTTTTTTTTTTTTTTGAGACAGAGTTTTGCTCTTCTCGCCCAGACTGGAGTGC +AGTGGCGTGATCTCAGCTCACTGCAACCTCTGCCTCCTGCGTTCAACCTATTCTCCTGCC +TCAGCCTCCCTAGTAGCTGGGACTACAGGCGCCTGCCACCACGCTCGGCTAATTTTTGTA +TTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCA +CGTGATCCACCTGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTACC +TGGCCAATTTTTGTATTTTTAGTAGAGATGGGGGTTTCAGCACTTTGGCCAGGCTGGTCT +CAAACTCCTGACCTCAGGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTAGGATTACAGG +TGTGAGCCACTGCGCCCCGCTGGGCTGCTCATATCTTTTATCCGTTTTTCTACTGGGTTA +TCTTTTGTTGCTCAGCTTTTAGAAACTCTTTGTCCATGAGCAGGATTGGGTTTGTGGCTG +TGATAGAAATTGCAGATATCTTTTCCTGCTTTGACCTCGCTACTAGTGTTTCTTTGTTTT +TTGTTTTGAGATAAGGTCTCTGTCACCCGGGTGGGAGTGTAGTGGCACAATCAGAGCTCA +CTGCAGCCTTGACCTCCTGAGCTCCAGCAGTCCTCCTGCCTCGGCCTTGCGAGTAGCTGG +GATCACAGGCGTGCGCCACCACACCCTGCTAAACTACTGGTGCTTCTCAGCCTCGAAGTT +TTTTTTTCTTTATGTAGTCACATTTATCCATTTTTTCCCCCTTTTACTGCTTTGACTTTG +GAGTCGTCGTGACCACCTTGTGGTCTGTGGGAGGGATGCTTCACACATGAGCACTTTTCT +TTTTCTAACACCGAAGCGTGCTCTTGTCTTCACCAAGAGTGTTGTTGCCCAGTGTGACTG +GTGATGGCTGTTTCATCATATTCCAGAAGTTTGTGATGCCATTTCCCACACTGGCCCCAA +TTAGGAAGAACAGACGACTCTACTTGTTTTGCAAGATTCTGGAAATTTCCCGGGGTGTCA +GAGCCCTGGGCATTAACCCTGCAATTGTAGCAAAGGAAACAAAGTTGGAAGTTGACTCCC +CAGCACCCGGGCCTTCCTTCCTGCCCTCCTCCAAGCTGCCCTGGTTGGGAACGGGATATT +GTGAATCCACCCTGAGGCTTCCTCACAGGGCTTCCTGCTATTTGACAGTGTGAGGCTGGC +TAGAATTGGGATATGAGGCCTTCTAAAAATTAACCTGGCCTGAGGGCTTTCTCAGCATCT +GGGGGCGGGGTGTGGGGGCGGGGAGTGACCTTTCCCCTTCTTCAAGCCAGGTGCCCATCA +GATTCTACTTGGGAAAATGTGAAAGGCACAGACAGCTTGTCTGCTCACGGGTGCTGCACT +TAAATCCTAATCTTGCAATTTTTCTACAAGAGCACTTTACATTTTTTTTTTGGAGGGAGG +GTGTATGAGTAGACGGTGGCTGCTGTAACAAAGGACTGCACACTTGGGGCTTAAAACAAC +AGAAACTCCTTCACAGTGTTGGAAGCCACAGTCTGCAATCAAGTTGTCAGCAAGGCTGGT +TTCTTTTTTCTTTCTCTTTCTTTCTTTCTTTCTCTTTCTTTCTTTCTTTCTTTCTCTTTC +TTTCTTTTTCTTTCTCTTTCTCTTTCTTTCTTTCTTTCTTTTTTCTTTCTTTTCTTTCTC +CCTCTCTCTCTCCCTCTCTCTCTCCTCTCTCTCTTTCTCTCTTTCTTTCTTGTTGGATAC +AGTCTTGCTCTGACACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGTTCACTGCAACTTC +CGCCTCCAGGGTTCAAGCAATTCTTGTGCCTCAGCCTCCCGAGTATCTGGGATTACAAGT +GCCCGCCACTATGTCCGGCAAGGCTGGTTTCCTCTGCAGGTGCCAAGAGAATCGTTTACA +GGTCTCTGTCTTAGCCTCTGCTGGTTGCTTATAGAGCCATCACCCTAGTCTCTGCCTCTG +TCATCACACAGTCTCTTCCTCTTCGTCTGTGTCTTCCCTTTTGTCTCATAAAGGACACCA +GTCATTGGATGTAGGGCCCACCTGGATAATCCAGAATGATCTCCTCATCTCGGCATTCAT +TACATCTGCAAAGACCCTTTTCCCAAAGAAGGTCACACCCATAAGATATGCACGTATCTC +TTCAGGGCCATCACACAACCCAGTATGGGGACATGTTCCAGTGCACACAGATGAGGGCAA +ATGCAGCAGCAGTGCCTGGGAGCTGGAAAAACACCCCTGCTTGGGCCCCCTGGACCAACC +GAGTCAGAATCTTGGTCTAGACCCACCAGGCATTGGTCTAGACCCACCGCCCTGCCCGGA +GGTGGGGCTGCCTCCTCCCTCCTCCGGTAGCACAGTGTAGGGTTGACCATACACCAGACA +CCAAGGATTTAAAACCATACCATTTGCGCAGAAGTACGGCATTCCAGGGTTTTCCTTTTC +GGCTTTACTTAGGTTGAGCTTCCAGAACAACTGGTTGGATTACTCTTGGGAGGGAACCAG +CCTGCCTGCCCTATTGGCTGCGGCCCCTTGATCCACAAACGAAGCCCATCCCTGCCGGAG +CTTGTGGCGTTCTCTGGTGTTCATGCGCGCTCCCCCTCCTGCCGACCAATGCACAAGAAC +AGTGCGGGCGACCAGTAAGCACAGTGAGAAAGCATTTCACCCTCATCAGTAACTAAAACA +ACAGCAAGTCTGACGGCAAGCAAAAAGTGATCTGAAAATGGCAACCGATGAAGCCCTAGG +TGAGCGCTCTGCTGGCAAGGTGCAGGACAGCGGCCCTTGCAATGCTGCCCTTTCCGAGGG +GGCCATTTGCTGGGATGGAGCAAGCCCGTAGGCAGCGTCCTCAGCCCTGGTGCGTAACCC +TAAGAGAGAGTCGGGCACTCAGCGATGACTCTGGCATTGGGAAGGTGCTTCACTTCGTCA +TTTGTAGCTGCCAAAGGTCGGGGTGAGATAAGAGGCTGCGTATGTAACGATGTGGTCCGG +CACAGCTGGCATTTTGAACATCTCCTCACACGCAACATGCCTTGTAGCATGTTGGTGTGA +ACAGGCCAGGTGACAAACTTATGGATAAAGCGTGACCCCAATAGAGAAGTAAAAGCTCTG +AGTGCATACCCTGCTAACATTTGGACGGACGTGAGTAGCTGGGTGGCCAGTAATGAGAAG +TTTTCCTTACATGTTAGTACCTTTCCTAAATGCCTATCATTTGCACGCATTCCTTTGATA +CACAGAGAATACGTCTTCCCACACAGTCGCTCGGGTGGTAACGCAGCTTGGTTTTCTTCT +GTGCCAGTGGCAGGGAAGAGCCCGCTGTTGACACAGCCTCTCAGCAAGGCACGGGGCAGG +GGCTGACTGTGTCTCCTGGGGCTGCCGTGACCAAGCACCACAGACTGCGGGGCCGAAGCC +ACAGAAACGCGTGGCCTCCCGCTTCTGGAGGCCTGGAGGCTGAGCTAGCGGTGGTGTCGG +CAGGGTGGGCTCCCCGCCAGGGCCGCGAGGGAGCTGCCTTCCAGGCCTCTCCACGGCGCC +GGGGGCCGCCGGCTGCACCTCTCCAGCCTCCATCTCCGTCATCCTGTGGCCTTGTCCCCG +CGGGCCTCTGTGCCTGTCCTCCTCTTTTGACAAGAACACCGGAGATACACAAAGGTACAC +AAAAGCGGGCCTTTGTTCAAGCTGGCAAAAGAGATCTTCTTCAGAAACCCCTGCTTGCGG +GGGAGAGAGCTGAGCTCCGTTCCCGCCCCAGCAGAGGCGGCCTGGCCTTGCGAAGGGAGA +AGGAGGGAGTCGGGAGGGGGCGAGTGCAGGCTCAGGTGAAAGATGACGGGGCAGCCAGCG +TCCTTGCCGCGAGGCCAGCCGTGTGTGGGAGCTGCCGGTGCTTACCAAGGTTGGGATGCT +TCCGTCCCGTGGAGACTGGGAGACTGGGCCCCGCGCCTCCTGAGGTTTCCGTTTCCAAGG +AGTGGCTGCGGGGCCCTCGGGAAAGCCCCTGGGTTGTGGGTGCTACACAGATGTCTCAAA +GGGACAGGGTAAGCCCTTTGTAGTAAATGCTGTCAGAAAGGGAGGTCAGGTGTTGGCCGG +AACAGACAGTACATGCTCTGGGCAGCCCTGAGCGTTTCCAGACGGGAACTCACTCAAAAG +GGGGCTGGGGCGTCCCAGGGGCGCGGCCTTAGGCTCCCAGAGGCCCCGCGAGGTGGTGGC +CGGGTGTCTTCGGGCAGGGGTTTGAGTGCAGTGTGCCTGCCGAGAGGTTCTGCAGTTCCG +AGCACCATCATTTTCTCCTCCTCAGACCCCTTGGTTCTCCTTCCACGTCCTGGCAGCTGC +TTCGCAGGCTCCTTGCTGGTTCCTGTGTCTCCGAGCTGACTCCCGAATTCTCTTCCTCTC +CTTGCTCAGACACTGCCCCTTTGTGACCTCGTCCATCTTCAAGGCCTTGATGCTGATGAC +AGATTTCTGTCTTCCAGTCCTGATCTGTTCCTTCACGAAAATGAGCCCAGTAGCCCGTCC +AAGCCAGAAATGGACTTTTAGCCCCCACCCCCTGCCAACTCTGCCCTTTCCCTCATCCCC +AGTTGCTTAAACCAAAACGGATTCCTCTTCCTCTCATGATCCAAACTCCTGAGTCCCTTC +ACCTTTTGCCTACACTATCACAGTGACCTCCTTGCTGCTTTCACACTGGAGAGCGTGGGC +TCCCTGTGATCTACTCTCCACATGGCAGCCAGTGTCATCTGGTAAACCTTTGCTGAAACC +CTGCCATGCCCTTCAGTTGCCCTGGAAACCTGAACTCATCCTCAGCCTGGCTCGCAGAGC +CCTCATGCCGCTGGGACGTCACGTTATGTCCCTCTCCTGCTGGCCCGCTGCACCCAGCCG +CACCACATGCCGGCCGCACCTCACACACGCTGGCCTCTTGGCCCTTCCTCGAACACACGG +CGCTTGTCCTTGTTGTCCCCTCATCTTTGCATGGCCGATTTCTTTTTCTCATTCAGCTCT +AAGTTTAAACTTTCAACAGTTCTAAGCGTATCACCTTCTTCATCTTAAAGTCCTCATCCT +AAATCACACTGCACTGTTTTAACTCCCAGCTTGGCAGTCCAAACGGCCTCATTTATTCTG +CTTTGTTTTCTCTCTCCTTCCACCAGAGGGAGAGCAGAGGCCTCCTCCAGCTCAGTCAGG +CACCATCCCCCGGAACAGCGGTCTCCAACCTTTTTGGCACCAGGGACCGGTTTTGTGGAA +GACGAGTTTTCCACAGACGGGGATGGGGCGTGGGATGACGGTTCGGGGATGAAACTCTTC +CACTTCAGATCATCAGGCATTAGTCAGATGCTCCTAAGGAGCACACAACCTAGATCCCTC +GCACACACAGTTCACAATAGGGTTTGCGCTCCTGTGAGAGTCTGAGGCCGCTGGCTGATC +TGACAGGAGGCAGAGCTCAGGCGGTCATGCGAGCAATGGGGAGCAGCTGTACACACAGAT +GAAGCTTCGCTCGCGTGCCCACCACTCGCCGCCTGCTCTGTGGCTCAGTTCTTAACAGGC +CACAGACCAGTATCGGTCCGTGGCCGGGGGGTTGGGGACCCCTGCCCTAGAACGATGCTC +GGCACAGCTACACTCGGTGCACATTTCTTGGTGGCCTGAGCTTGTCTTGAGCTCCATCAG +GATTTCTCTGTCAATATTTTTGTAGACCATACCCTGTCACCATAAATGGTGCTTGGTAGA +AATGACTGTATGCATGTGATAGGGATAACACAGAAAACCCATGGTAGAGAGCATGGCGAC +TAACTTGGAGGGACTTCTTTTTGTCTTTTTTTTTTTTTTTTTTTTTTTTTTCGTTTTTGC +AGAGATGCAGGTATTGTCATGTTGCCTGGGCTGGTCTCAAACTCCTGGGTTCAAGCAATC +CTCCCACCTGTGCCTCCCTAAGTGTTGGGATTACAGGCATGAGCCGCTGCCTCCCCGCAG +CATGGGGGGACTTCTTATAAGCATTTTTATTTATTTATTTGGAATTCGGATGGATAGATC +ATGTTGGCCTGATAATGTATGTGCAAGAGCACAGACTCTGGGGTCTCATGGACAAGCCCC +ACTCTGCCCCTTGCTTAATGTCTCTGAGCCTCAGTTTCCTCATCTGTAACAAGGGCAGAA +ATGCCATCCGTGTGCAGCAGAGTACCGACGAAAGGATGGATGCAATGCTCTTACGGCAGT +GCCCCACCCAAGGCACAAGGAAAAACCACGAAAGCAATTCCAGTTTACACCACGTCCCAA +ACTGGATTTTCCATTCGACTATAATTCCTCCCAATAAGGGCTCAAAACGTTGTCTAAGGA +AAGTCGATTGGTTGTTGCCCAGGGCTGGAGGAGTTGGGGGAAAACGGGGAGTGACTGCTG +ATGGAGACAGAGTTTCCTTTTGAGATGATGAGAATGTCCTGAAGCTGGTCATGCTGATGG +TTGCACGTATCTATGAATATACTAAAAACCATCGAATTGTACACTTTAGACATGTGAATT +GTGTAGCATGTGAATTAAACCTCAAGTCGTTATTTTAAAAATGTTGTCTAAGACACCTGG +GCGCCTCCTGTAAGCTCCCATGAAAATGCAAATGAAGACACAGAAGAAGACCAACATTGA +ACATTTACATCCTCTCCTAAAGCTATGTTTGACAGGTGACCTTTTTTTCATCCTTCCTTC +CTTCCTTCCTCTTCCTTATTCCTTCCTTCCTTACTTCACTCCCTCTTCCTTATTCCTTCC +TTCCTTCTTTCCCTCTTCCTTATTCCTTCCTTCCTTCTTTCCCTCTTCCTTATTCCTTCC +TTCCTTCTTTCCCTCTTCCTTATTCCTTTCTTCGTTCCTTCCTCCCTTCCTTCCTTCCCT +CCTTCCTTTTTTCCTTATTCCTTCCTTTCCTCCCTTCCTCCCTCCCTCCCTCCCTCCCTC +CCTCCCTCCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTACGTTTTTTAACAGTAGAGA +CAGGGTCTCACTGTGTTGCCCAGGCTGGACTCGAACTCCTGGAAATCAAGTGATCCTCCC +ACCTGGGCCTCCCCAAATGCTGGGATTACAGGCTTGAGAACTGGCAGGGAGAATGGGCGA +CTTTGTGTTGTGAAGCAGAGTGACCCTACTCTGTCCTTTCCTCAGAGACCCAGGGTTGGT +ACCAACCAGGAGACTAGCAGCTGCTCTTCTACCCGGCCCAGGTTGCTTTCTCTATAGTCA +TTGAGAGGGAAACCCTCAAACAGAAAGTGTGTCAGCAGGGGGCAGGCAGCTGATGGATGT +CATTGTACCCTTGGAGGAACAGAACTCAGCGGTCACAGGAAGCGGGGAGATCCAGGGATT +CCATCGGTCGCGTTGGTTCCGAGTGTCGCAGGCCTGAGGGGAAGGGGTGAGATTTGCTTT +CTCAGTCTAGGAGAGCCAAAGTCGACAAATGTGTGCTATAGCTGGAAATAAGTCTTCTCC +CGCACCTAGCGGGCGGTGGTCAGGGTGGTCTGTCTGTCCATCCCCGAAAGCACTGCAAGA +CCTGCCATCTGTTTCTGGTGATGCTGGGTGGTATTATTGAGACAGATATTTCTGCTGAAA +ATAACTGCAAATGATAGATGAAACAGTTTGAGACTGTTCAAAACAGCTGAGAACCAACAA +GAGAGCAAAACCTGAGAGGAAGTGGGAATCCAAGAAGAAGTACGCCACTGAAGCTGCTTA +TTTCCCGAGGACGCTGGCCGACCTGGGCAAACTTGGGTTTTGGTTCTGGAGGCCGTCCAG +GCAAGGGAGACAGCAGGCAAGGCCCAAAGCCTTTGTGAAGTGTGGGGAGTTGTATAGCAG +CCCCCGCTACATTAAGGAAACCCCACCCCAGCTCTGCAGATTGCGGGGAAGCAGGAGATG +AAAGGAAAGGGAAAACACCCCTGTGCAAAGTTGTGGCCAAGGGCCAGGTTCTGTGCAGAC +TTGCAGTCTAGGTTTTCCTGGTGGGGCCGTTCAGTGGCCCTCGAGTCCTAAATTTGGTCT +GAGGTGGTCCTGGGCCGACAGTACCCATATTAGTTTTCTAGGGCTGCCGTAAGAAATGGC +CACAAACTGGGTGGCCGAAAACAACAGAAACGGATTCTCCCGTGGTTCTGGAAGCTACAA +GTCTGGAATCCAGCTGTCAGCAGGGTTGGTTCCTACTGGAGGCTCTGTGGGAGAATCTGT +TCCATGCCTCTCTCGTAGCTCCAGGTGGTTTCTGGCAATCCCTGGCACTCTCTGTCTCAT +AGAGACACCGCTCCAATCTCTGCCTCCTTCTTCGTGGGGTCATCTTCCCAGTGTGTCTGC +CTCTCTCTCTTATGAGGACACTAGTCACATTTGATTAGCGCCCACCCCAATCTAGTATGA +CCTTAACTTGATTACATCGGCAAAGACCCTATTTCCAAATCAGGTCACATTCCCAGGCAC +CCAGGTGTTAGGACTTGAACATACCTTTTTGGGGGACACAATTCAACCCACAACGGTTAC +CCCAACGGATGTGTGGATTCGTGGGAGCCTGGGGTTGAGATGCAGTCTCCACAAAAGTTC +CTGAGTGAGAGCATGTCCTTTGCAGGAGCGTGGATGGAGCTGGAGGCCATTATCCTTAGC +AAACTGACGCAGGAACAGACAACCAAATACCACATGTTCTCACTTATAAGTGGGAGCTAA +ATGATGAGAACACACGGATACATAGAGGGGAATGAAACACACTGGGGCCTTTTGCAGGGC +GGAGGTTGGGAGGAGGGAGAGTATGAGGAAAAATAGCTCATGGGTGCTAGGCTTAATACC +TAGGTGATAAAATAATCTGTACAGCAAACCCCCATGACACAAGTTTACTCATATAACAAA +CCTGCACATGTACACCTGAGCTCTTAAAATAAAAAGAGCGATGTATTAATATAACAACTT +TAGGAGCACTGGGGGAAAAAAGTTCTTAATGACACTGAGCAGTGCACATAAAAACTAAGC +ACACCCAGGGCAGGAAAGAGACCATGAGACTAAGAGTTCAGATGCTGGAGGCTGGGTGCA +GTGACTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCAGATCACAAGGTCA +GGAGTTCTAGACCAGCCTGGCCAATATGGTGAAACCCCGTCTCTACTAAAAATACAAAAA +TTAGCCGGGCGTGGTAGTGGGCGCCTGTACTCCCAGCTACTCGGGAGGCTGAGGCAGGAG +AATCACTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCGAGATGGCACCATTGCACTCCA +GCCTGGGTGACAGAGAGAGACTCAAAACAGATGCTGGAGCACCACACAGATGACAGAACA +ACTGAACTTTAAAAACTTAAAAAAAAAAAAAAAGGCTTGTGAATATCCACAGGGAATAGG +GAATATATATATCCAAAAAGTAAAAGCTGTCTAACTTTTTATTTCTTTGTTTGTTTGTTT +ATTTATTTGAGACAGGGTCTCACTCTGTCACCCAGGCTGGAGTGTAGTGGCACGATCACA +GCTCATTGCAGCCTCAACCTCCCGGGCTCAAGCGATCCTCCCGCCTCAGCCTCCCGAGTA +GCCGGGACTACAGGCGCGTGC diff --git a/paraphase/data/ikbkg/ref.fa.fai b/paraphase/data/ikbkg/ref.fa.fai new file mode 100644 index 0000000..abbf24f --- /dev/null +++ b/paraphase/data/ikbkg/ref.fa.fai @@ -0,0 +1 @@ +chrX_154555600_154575000 19401 26 60 61 diff --git a/paraphase/data/ncf1/homopolymer_sites.txt b/paraphase/data/ncf1/homopolymer_sites.txt new file mode 100644 index 0000000..22aa7d5 --- /dev/null +++ b/paraphase/data/ncf1/homopolymer_sites.txt @@ -0,0 +1,2379 @@ +chr7 74763881 T +chr7 74763882 G,C,A,T +chr7 74763883 0 +chr7 74763884 0 +chr7 74763885 0 +chr7 74763886 0 +chr7 74763887 1,T +chr7 74763914 T +chr7 74763915 G,C,A,T +chr7 74763916 0 +chr7 74763917 0 +chr7 74763918 0 +chr7 74763919 0 +chr7 74763920 0 +chr7 74763921 0 +chr7 74763922 1,T +chr7 74763923 G,C,A,T +chr7 74763924 0 +chr7 74763925 0 +chr7 74763926 0 +chr7 74763927 0 +chr7 74763928 1,T +chr7 74763929 G,C,A,T +chr7 74763930 0 +chr7 74763931 0 +chr7 74763932 0 +chr7 74763933 0 +chr7 74763934 1,T +chr7 74763935 G,C,A,T +chr7 74763936 0 +chr7 74763937 0 +chr7 74763938 0 +chr7 74763939 0 +chr7 74763940 0 +chr7 74763941 1,T +chr7 74764061 G +chr7 74764062 G,C,A,T +chr7 74764063 0 +chr7 74764064 0 +chr7 74764065 0 +chr7 74764066 0 +chr7 74764067 1,G +chr7 74764318 G +chr7 74764319 G,C,A,T +chr7 74764320 0 +chr7 74764321 0 +chr7 74764322 0 +chr7 74764323 0 +chr7 74764324 1,G +chr7 74764639 A +chr7 74764640 G,C,A,T +chr7 74764641 0 +chr7 74764642 0 +chr7 74764643 0 +chr7 74764644 0 +chr7 74764645 1,A +chr7 74764647 A +chr7 74764648 G,C,A,T +chr7 74764649 0 +chr7 74764650 0 +chr7 74764651 0 +chr7 74764652 0 +chr7 74764653 0 +chr7 74764654 1,A +chr7 74764799 A +chr7 74764800 G,C,A,T +chr7 74764801 0 +chr7 74764802 0 +chr7 74764803 0 +chr7 74764804 0 +chr7 74764805 0 +chr7 74764806 0 +chr7 74764807 0 +chr7 74764808 1,A +chr7 74764910 A +chr7 74764911 G,C,A,T +chr7 74764912 0 +chr7 74764913 0 +chr7 74764914 0 +chr7 74764915 0 +chr7 74764916 1,A +chr7 74764945 C +chr7 74764946 G,C,A,T +chr7 74764947 0 +chr7 74764948 0 +chr7 74764949 0 +chr7 74764950 0 +chr7 74764951 0 +chr7 74764952 1,C +chr7 74764978 C +chr7 74764979 G,C,A,T +chr7 74764980 0 +chr7 74764981 0 +chr7 74764982 0 +chr7 74764983 0 +chr7 74764984 1,C +chr7 74765007 T +chr7 74765008 G,C,A,T +chr7 74765009 0 +chr7 74765010 0 +chr7 74765011 0 +chr7 74765012 0 +chr7 74765013 0 +chr7 74765014 0 +chr7 74765015 0 +chr7 74765016 0 +chr7 74765017 0 +chr7 74765018 0 +chr7 74765019 0 +chr7 74765020 0 +chr7 74765021 1,T +chr7 74765171 T +chr7 74765172 G,C,A,T +chr7 74765173 0 +chr7 74765174 0 +chr7 74765175 0 +chr7 74765176 0 +chr7 74765177 1,T +chr7 74765179 T +chr7 74765180 G,C,A,T +chr7 74765181 0 +chr7 74765182 0 +chr7 74765183 0 +chr7 74765184 0 +chr7 74765185 1,T +chr7 74765447 G +chr7 74765448 G,C,A,T +chr7 74765449 0 +chr7 74765450 0 +chr7 74765451 0 +chr7 74765452 0 +chr7 74765453 1,G +chr7 74765740 A +chr7 74765741 G,C,A,T +chr7 74765742 0 +chr7 74765743 0 +chr7 74765744 0 +chr7 74765745 0 +chr7 74765746 1,A +chr7 74765822 T +chr7 74765823 G,C,A,T +chr7 74765824 0 +chr7 74765825 0 +chr7 74765826 0 +chr7 74765827 0 +chr7 74765828 1,T +chr7 74765972 C +chr7 74765973 G,C,A,T +chr7 74765974 0 +chr7 74765975 0 +chr7 74765976 0 +chr7 74765977 0 +chr7 74765978 1,C +chr7 74766326 A +chr7 74766327 G,C,A,T +chr7 74766328 0 +chr7 74766329 0 +chr7 74766330 0 +chr7 74766331 0 +chr7 74766332 1,A +chr7 74766334 A +chr7 74766335 G,C,A,T +chr7 74766336 0 +chr7 74766337 0 +chr7 74766338 0 +chr7 74766339 0 +chr7 74766340 1,A +chr7 74766489 A +chr7 74766490 G,C,A,T +chr7 74766491 0 +chr7 74766492 0 +chr7 74766493 0 +chr7 74766494 0 +chr7 74766495 0 +chr7 74766496 0 +chr7 74766497 0 +chr7 74766498 0 +chr7 74766499 0 +chr7 74766500 0 +chr7 74766501 0 +chr7 74766502 0 +chr7 74766503 0 +chr7 74766504 0 +chr7 74766505 0 +chr7 74766506 0 +chr7 74766507 1,A +chr7 74766522 G +chr7 74766523 G,C,A,T +chr7 74766524 0 +chr7 74766525 0 +chr7 74766526 0 +chr7 74766527 0 +chr7 74766528 1,G +chr7 74766608 G +chr7 74766609 G,C,A,T +chr7 74766610 0 +chr7 74766611 0 +chr7 74766612 0 +chr7 74766613 0 +chr7 74766614 1,G +chr7 74767463 T +chr7 74767464 G,C,A,T +chr7 74767465 0 +chr7 74767466 0 +chr7 74767467 0 +chr7 74767468 0 +chr7 74767469 0 +chr7 74767470 0 +chr7 74767471 0 +chr7 74767472 0 +chr7 74767473 0 +chr7 74767474 0 +chr7 74767475 1,T +chr7 74767627 T +chr7 74767628 G,C,A,T +chr7 74767629 0 +chr7 74767630 0 +chr7 74767631 0 +chr7 74767632 0 +chr7 74767633 1,T +chr7 74767635 T +chr7 74767636 G,C,A,T +chr7 74767637 0 +chr7 74767638 0 +chr7 74767639 0 +chr7 74767640 0 +chr7 74767641 0 +chr7 74767642 1,T +chr7 74768140 A +chr7 74768141 G,C,A,T +chr7 74768142 0 +chr7 74768143 0 +chr7 74768144 0 +chr7 74768145 0 +chr7 74768146 1,A +chr7 74768174 C +chr7 74768175 G,C,A,T +chr7 74768176 0 +chr7 74768177 0 +chr7 74768178 0 +chr7 74768179 0 +chr7 74768180 1,C +chr7 74768222 C +chr7 74768223 G,C,A,T +chr7 74768224 0 +chr7 74768225 0 +chr7 74768226 0 +chr7 74768227 0 +chr7 74768228 1,C +chr7 74768229 C +chr7 74768230 G,C,A,T +chr7 74768231 0 +chr7 74768232 0 +chr7 74768233 0 +chr7 74768234 0 +chr7 74768235 1,C +chr7 74768460 A +chr7 74768461 G,C,A,T +chr7 74768462 0 +chr7 74768463 0 +chr7 74768464 0 +chr7 74768465 0 +chr7 74768466 0 +chr7 74768467 1,A +chr7 74768469 A +chr7 74768470 G,C,A,T +chr7 74768471 0 +chr7 74768472 0 +chr7 74768473 0 +chr7 74768474 0 +chr7 74768475 1,A +chr7 74768625 A +chr7 74768626 G,C,A,T +chr7 74768627 0 +chr7 74768628 0 +chr7 74768629 0 +chr7 74768630 0 +chr7 74768631 1,A +chr7 74768812 G +chr7 74768813 G,C,A,T +chr7 74768814 0 +chr7 74768815 0 +chr7 74768816 0 +chr7 74768817 0 +chr7 74768818 1,G +chr7 74768842 G +chr7 74768843 G,C,A,T +chr7 74768844 0 +chr7 74768845 0 +chr7 74768846 0 +chr7 74768847 0 +chr7 74768848 1,G +chr7 74768958 T +chr7 74768959 G,C,A,T +chr7 74768960 0 +chr7 74768961 0 +chr7 74768962 0 +chr7 74768963 0 +chr7 74768964 1,T +chr7 74768986 A +chr7 74768987 G,C,A,T +chr7 74768988 0 +chr7 74768989 0 +chr7 74768990 0 +chr7 74768991 0 +chr7 74768992 0 +chr7 74768993 1,A +chr7 74769052 T +chr7 74769053 G,C,A,T +chr7 74769054 0 +chr7 74769055 0 +chr7 74769056 0 +chr7 74769057 0 +chr7 74769058 0 +chr7 74769059 1,T +chr7 74769206 T +chr7 74769207 G,C,A,T +chr7 74769208 0 +chr7 74769209 0 +chr7 74769210 0 +chr7 74769211 0 +chr7 74769212 0 +chr7 74769213 0 +chr7 74769214 1,T +chr7 74769215 G,C,A,T +chr7 74769216 0 +chr7 74769217 0 +chr7 74769218 0 +chr7 74769219 0 +chr7 74769220 1,T +chr7 74769338 T +chr7 74769339 G,C,A,T +chr7 74769340 0 +chr7 74769341 0 +chr7 74769342 0 +chr7 74769343 0 +chr7 74769344 0 +chr7 74769345 0 +chr7 74769346 0 +chr7 74769347 0 +chr7 74769348 0 +chr7 74769349 0 +chr7 74769350 1,T +chr7 74769353 A +chr7 74769354 G,C,A,T +chr7 74769355 0 +chr7 74769356 0 +chr7 74769357 0 +chr7 74769358 0 +chr7 74769359 0 +chr7 74769360 0 +chr7 74769361 1,A +chr7 74769520 A +chr7 74769521 G,C,A,T +chr7 74769522 0 +chr7 74769523 0 +chr7 74769524 0 +chr7 74769525 0 +chr7 74769526 0 +chr7 74769527 0 +chr7 74769528 0 +chr7 74769529 0 +chr7 74769530 0 +chr7 74769531 1,A +chr7 74769553 A +chr7 74769554 G,C,A,T +chr7 74769555 0 +chr7 74769556 0 +chr7 74769557 0 +chr7 74769558 0 +chr7 74769559 1,A +chr7 74769562 A +chr7 74769563 G,C,A,T +chr7 74769564 0 +chr7 74769565 0 +chr7 74769566 0 +chr7 74769567 0 +chr7 74769568 0 +chr7 74769569 1,A +chr7 74769583 A +chr7 74769584 G,C,A,T +chr7 74769585 0 +chr7 74769586 0 +chr7 74769587 0 +chr7 74769588 0 +chr7 74769589 0 +chr7 74769590 1,A +chr7 74769958 T +chr7 74769959 G,C,A,T +chr7 74769960 0 +chr7 74769961 0 +chr7 74769962 0 +chr7 74769963 0 +chr7 74769964 0 +chr7 74769965 0 +chr7 74769966 0 +chr7 74769967 0 +chr7 74769968 0 +chr7 74769969 0 +chr7 74769970 0 +chr7 74769971 0 +chr7 74769972 0 +chr7 74769973 0 +chr7 74769974 0 +chr7 74769975 1,T +chr7 74770126 T +chr7 74770127 G,C,A,T +chr7 74770128 0 +chr7 74770129 0 +chr7 74770130 0 +chr7 74770131 0 +chr7 74770132 1,T +chr7 74770134 T +chr7 74770135 G,C,A,T +chr7 74770136 0 +chr7 74770137 0 +chr7 74770138 0 +chr7 74770139 0 +chr7 74770140 1,T +chr7 74770338 A +chr7 74770339 G,C,A,T +chr7 74770340 0 +chr7 74770341 0 +chr7 74770342 0 +chr7 74770343 0 +chr7 74770344 1,A +chr7 74770727 T +chr7 74770728 G,C,A,T +chr7 74770729 0 +chr7 74770730 0 +chr7 74770731 0 +chr7 74770732 0 +chr7 74770733 1,T +chr7 74771237 G +chr7 74771238 G,C,A,T +chr7 74771239 0 +chr7 74771240 0 +chr7 74771241 0 +chr7 74771242 0 +chr7 74771243 1,G +chr7 74771428 A +chr7 74771429 G,C,A,T +chr7 74771430 0 +chr7 74771431 0 +chr7 74771432 0 +chr7 74771433 0 +chr7 74771434 1,A +chr7 74771436 A +chr7 74771437 G,C,A,T +chr7 74771438 0 +chr7 74771439 0 +chr7 74771440 0 +chr7 74771441 0 +chr7 74771442 0 +chr7 74771443 1,A +chr7 74771566 G +chr7 74771567 G,C,A,T +chr7 74771568 0 +chr7 74771569 0 +chr7 74771570 0 +chr7 74771571 0 +chr7 74771572 1,G +chr7 74771591 A +chr7 74771592 G,C,A,T +chr7 74771593 0 +chr7 74771594 0 +chr7 74771595 0 +chr7 74771596 0 +chr7 74771597 0 +chr7 74771598 0 +chr7 74771599 0 +chr7 74771600 0 +chr7 74771601 0 +chr7 74771602 0 +chr7 74771603 0 +chr7 74771604 0 +chr7 74771605 0 +chr7 74771606 1,A +chr7 74771607 G,C,A,T +chr7 74771608 0 +chr7 74771609 0 +chr7 74771610 0 +chr7 74771611 0 +chr7 74771612 0 +chr7 74771613 0 +chr7 74771614 1,A +chr7 74771858 A +chr7 74771859 G,C,A,T +chr7 74771860 0 +chr7 74771861 0 +chr7 74771862 0 +chr7 74771863 0 +chr7 74771864 0 +chr7 74771865 0 +chr7 74771866 0 +chr7 74771867 0 +chr7 74771868 0 +chr7 74771869 0 +chr7 74771870 0 +chr7 74771871 0 +chr7 74771872 0 +chr7 74771873 0 +chr7 74771874 0 +chr7 74771875 0 +chr7 74771876 0 +chr7 74771877 0 +chr7 74771878 0 +chr7 74771879 0 +chr7 74771880 0 +chr7 74771881 1,A +chr7 74772001 A +chr7 74772002 G,C,A,T +chr7 74772003 0 +chr7 74772004 0 +chr7 74772005 0 +chr7 74772006 0 +chr7 74772007 1,A +chr7 74772222 C +chr7 74772223 G,C,A,T +chr7 74772224 0 +chr7 74772225 0 +chr7 74772226 0 +chr7 74772227 0 +chr7 74772228 1,C +chr7 74772400 G +chr7 74772401 G,C,A,T +chr7 74772402 0 +chr7 74772403 0 +chr7 74772404 0 +chr7 74772405 0 +chr7 74772406 1,G +chr7 74772445 C +chr7 74772446 G,C,A,T +chr7 74772447 0 +chr7 74772448 0 +chr7 74772449 0 +chr7 74772450 0 +chr7 74772451 1,C +chr7 74772899 A +chr7 74772900 G,C,A,T +chr7 74772901 0 +chr7 74772902 0 +chr7 74772903 0 +chr7 74772904 0 +chr7 74772905 0 +chr7 74772906 0 +chr7 74772907 1,A +chr7 74773034 A +chr7 74773035 G,C,A,T +chr7 74773036 0 +chr7 74773037 0 +chr7 74773038 0 +chr7 74773039 0 +chr7 74773040 0 +chr7 74773041 0 +chr7 74773042 0 +chr7 74773043 0 +chr7 74773044 0 +chr7 74773045 0 +chr7 74773046 0 +chr7 74773047 0 +chr7 74773048 0 +chr7 74773049 1,A +chr7 74773198 A +chr7 74773199 G,C,A,T +chr7 74773200 0 +chr7 74773201 0 +chr7 74773202 0 +chr7 74773203 0 +chr7 74773204 0 +chr7 74773205 0 +chr7 74773206 0 +chr7 74773207 0 +chr7 74773208 0 +chr7 74773209 0 +chr7 74773210 0 +chr7 74773211 1,A +chr7 74773291 A +chr7 74773292 G,C,A,T +chr7 74773293 0 +chr7 74773294 0 +chr7 74773295 0 +chr7 74773296 0 +chr7 74773297 0 +chr7 74773298 0 +chr7 74773299 1,A +chr7 74773300 A +chr7 74773301 G,C,A,T +chr7 74773302 0 +chr7 74773303 0 +chr7 74773304 0 +chr7 74773305 0 +chr7 74773306 0 +chr7 74773307 0 +chr7 74773308 0 +chr7 74773309 1,A +chr7 74773484 A +chr7 74773485 G,C,A,T +chr7 74773486 0 +chr7 74773487 0 +chr7 74773488 0 +chr7 74773489 0 +chr7 74773490 1,A +chr7 74773531 A +chr7 74773532 G,C,A,T +chr7 74773533 0 +chr7 74773534 0 +chr7 74773535 0 +chr7 74773536 0 +chr7 74773537 1,A +chr7 74773665 G +chr7 74773666 G,C,A,T +chr7 74773667 0 +chr7 74773668 0 +chr7 74773669 0 +chr7 74773670 0 +chr7 74773671 0 +chr7 74773672 1,G +chr7 74773700 G +chr7 74773701 G,C,A,T +chr7 74773702 0 +chr7 74773703 0 +chr7 74773704 0 +chr7 74773705 0 +chr7 74773706 0 +chr7 74773707 1,G +chr7 74773766 C +chr7 74773767 G,C,A,T +chr7 74773768 0 +chr7 74773769 0 +chr7 74773770 0 +chr7 74773771 0 +chr7 74773772 1,C +chr7 74773936 C +chr7 74773937 G,C,A,T +chr7 74773938 0 +chr7 74773939 0 +chr7 74773940 0 +chr7 74773941 0 +chr7 74773942 1,C +chr7 74774033 G +chr7 74774034 G,C,A,T +chr7 74774035 0 +chr7 74774036 0 +chr7 74774037 0 +chr7 74774038 0 +chr7 74774039 1,G +chr7 74774130 G +chr7 74774131 G,C,A,T +chr7 74774132 0 +chr7 74774133 0 +chr7 74774134 0 +chr7 74774135 0 +chr7 74774136 0 +chr7 74774137 1,G +chr7 74774732 T +chr7 74774733 G,C,A,T +chr7 74774734 0 +chr7 74774735 0 +chr7 74774736 0 +chr7 74774737 0 +chr7 74774738 0 +chr7 74774739 1,T +chr7 74774742 T +chr7 74774743 G,C,A,T +chr7 74774744 0 +chr7 74774745 0 +chr7 74774746 0 +chr7 74774747 0 +chr7 74774748 0 +chr7 74774749 0 +chr7 74774750 1,T +chr7 74774751 G,C,A,T +chr7 74774752 0 +chr7 74774753 0 +chr7 74774754 0 +chr7 74774755 0 +chr7 74774756 0 +chr7 74774757 1,T +chr7 74774905 T +chr7 74774906 G,C,A,T +chr7 74774907 0 +chr7 74774908 0 +chr7 74774909 0 +chr7 74774910 0 +chr7 74774911 1,T +chr7 74774913 T +chr7 74774914 G,C,A,T +chr7 74774915 0 +chr7 74774916 0 +chr7 74774917 0 +chr7 74774918 0 +chr7 74774919 1,T +chr7 74775040 T +chr7 74775041 G,C,A,T +chr7 74775042 0 +chr7 74775043 0 +chr7 74775044 0 +chr7 74775045 0 +chr7 74775046 0 +chr7 74775047 0 +chr7 74775048 1,T +chr7 74775178 T +chr7 74775179 G,C,A,T +chr7 74775180 0 +chr7 74775181 0 +chr7 74775182 0 +chr7 74775183 0 +chr7 74775184 1,T +chr7 74775187 T +chr7 74775188 G,C,A,T +chr7 74775189 0 +chr7 74775190 0 +chr7 74775191 0 +chr7 74775192 0 +chr7 74775193 1,T +chr7 74775655 T +chr7 74775656 G,C,A,T +chr7 74775657 0 +chr7 74775658 0 +chr7 74775659 0 +chr7 74775660 0 +chr7 74775661 0 +chr7 74775662 1,T +chr7 74775725 T +chr7 74775726 G,C,A,T +chr7 74775727 0 +chr7 74775728 0 +chr7 74775729 0 +chr7 74775730 0 +chr7 74775731 0 +chr7 74775732 1,T +chr7 74775733 G,C,A,T +chr7 74775734 0 +chr7 74775735 0 +chr7 74775736 0 +chr7 74775737 0 +chr7 74775738 1,T +chr7 74775739 G,C,A,T +chr7 74775740 0 +chr7 74775741 0 +chr7 74775742 0 +chr7 74775743 0 +chr7 74775744 1,T +chr7 74775745 G,C,A,T +chr7 74775746 0 +chr7 74775747 0 +chr7 74775748 0 +chr7 74775749 0 +chr7 74775750 0 +chr7 74775751 0 +chr7 74775752 1,T +chr7 74775902 T +chr7 74775903 G,C,A,T +chr7 74775904 0 +chr7 74775905 0 +chr7 74775906 0 +chr7 74775907 0 +chr7 74775908 0 +chr7 74775909 1,T +chr7 74775948 T +chr7 74775949 G,C,A,T +chr7 74775950 0 +chr7 74775951 0 +chr7 74775952 0 +chr7 74775953 0 +chr7 74775954 0 +chr7 74775955 0 +chr7 74775956 0 +chr7 74775957 0 +chr7 74775958 0 +chr7 74775959 0 +chr7 74775960 0 +chr7 74775961 0 +chr7 74775962 0 +chr7 74775963 1,T +chr7 74776113 T +chr7 74776114 G,C,A,T +chr7 74776115 0 +chr7 74776116 0 +chr7 74776117 0 +chr7 74776118 0 +chr7 74776119 1,T +chr7 74776121 T +chr7 74776122 G,C,A,T +chr7 74776123 0 +chr7 74776124 0 +chr7 74776125 0 +chr7 74776126 0 +chr7 74776127 1,T +chr7 74776405 T +chr7 74776406 G,C,A,T +chr7 74776407 0 +chr7 74776408 0 +chr7 74776409 0 +chr7 74776410 0 +chr7 74776411 1,T +chr7 74776412 T +chr7 74776413 G,C,A,T +chr7 74776414 0 +chr7 74776415 0 +chr7 74776416 0 +chr7 74776417 0 +chr7 74776418 0 +chr7 74776419 0 +chr7 74776420 1,T +chr7 74776568 T +chr7 74776569 G,C,A,T +chr7 74776570 0 +chr7 74776571 0 +chr7 74776572 0 +chr7 74776573 0 +chr7 74776574 1,T +chr7 74776576 T +chr7 74776577 G,C,A,T +chr7 74776578 0 +chr7 74776579 0 +chr7 74776580 0 +chr7 74776581 0 +chr7 74776582 1,T +chr7 74776877 T +chr7 74776878 G,C,A,T +chr7 74776879 0 +chr7 74776880 0 +chr7 74776881 0 +chr7 74776882 0 +chr7 74776883 1,T +chr7 74776885 T +chr7 74776886 G,C,A,T +chr7 74776887 0 +chr7 74776888 0 +chr7 74776889 0 +chr7 74776890 0 +chr7 74776891 1,T +chr7 74776899 G +chr7 74776900 G,C,A,T +chr7 74776901 0 +chr7 74776902 0 +chr7 74776903 0 +chr7 74776904 0 +chr7 74776905 1,G +chr7 74776969 A +chr7 74776970 G,C,A,T +chr7 74776971 0 +chr7 74776972 0 +chr7 74776973 0 +chr7 74776974 0 +chr7 74776975 1,A +chr7 74777242 C +chr7 74777243 G,C,A,T +chr7 74777244 0 +chr7 74777245 0 +chr7 74777246 0 +chr7 74777247 0 +chr7 74777248 1,C +chr7 74777259 C +chr7 74777260 G,C,A,T +chr7 74777261 0 +chr7 74777262 0 +chr7 74777263 0 +chr7 74777264 0 +chr7 74777265 1,C +chr7 74777493 T +chr7 74777494 G,C,A,T +chr7 74777495 0 +chr7 74777496 0 +chr7 74777497 0 +chr7 74777498 0 +chr7 74777499 1,T +chr7 74777560 C +chr7 74777561 G,C,A,T +chr7 74777562 0 +chr7 74777563 0 +chr7 74777564 0 +chr7 74777565 0 +chr7 74777566 1,C +chr7 74777725 T +chr7 74777726 G,C,A,T +chr7 74777727 0 +chr7 74777728 0 +chr7 74777729 0 +chr7 74777730 0 +chr7 74777731 1,T +chr7 74777733 T +chr7 74777734 G,C,A,T +chr7 74777735 0 +chr7 74777736 0 +chr7 74777737 0 +chr7 74777738 0 +chr7 74777739 0 +chr7 74777740 1,T +chr7 74777827 G +chr7 74777828 G,C,A,T +chr7 74777829 0 +chr7 74777830 0 +chr7 74777831 0 +chr7 74777832 0 +chr7 74777833 1,G +chr7 74777841 A +chr7 74777842 G,C,A,T +chr7 74777843 0 +chr7 74777844 0 +chr7 74777845 0 +chr7 74777846 0 +chr7 74777847 0 +chr7 74777848 0 +chr7 74777849 1,A +chr7 74777850 T +chr7 74777851 G,C,A,T +chr7 74777852 0 +chr7 74777853 0 +chr7 74777854 0 +chr7 74777855 C +chr7 74777856 G,C,A,T +chr7 74777857 0 +chr7 74777858 0 +chr7 74777859 0 +chr7 74777860 0 +chr7 74777861 1,C +chr7 74777983 A +chr7 74777984 G,C,A,T +chr7 74777985 0 +chr7 74777986 0 +chr7 74777987 0 +chr7 74777988 0 +chr7 74777989 1,A +chr7 74778092 T +chr7 74778093 G,C,A,T +chr7 74778094 0 +chr7 74778095 0 +chr7 74778096 0 +chr7 74778097 0 +chr7 74778098 1,T +chr7 74778113 T +chr7 74778114 G,C,A,T +chr7 74778115 0 +chr7 74778116 0 +chr7 74778117 0 +chr7 74778118 0 +chr7 74778119 0 +chr7 74778120 0 +chr7 74778121 1,T +chr7 74778296 T +chr7 74778297 G,C,A,T +chr7 74778298 0 +chr7 74778299 0 +chr7 74778300 0 +chr7 74778301 0 +chr7 74778302 0 +chr7 74778303 1,T +chr7 74778479 G +chr7 74778480 G,C,A,T +chr7 74778481 0 +chr7 74778482 0 +chr7 74778483 0 +chr7 74778484 0 +chr7 74778485 1,G +chr7 74778841 T +chr7 74778842 G,C,A,T +chr7 74778843 0 +chr7 74778844 0 +chr7 74778845 0 +chr7 74778846 0 +chr7 74778847 1,T +chr7 74778849 T +chr7 74778850 G,C,A,T +chr7 74778851 0 +chr7 74778852 0 +chr7 74778853 0 +chr7 74778854 0 +chr7 74778855 1,T +chr7 74779067 T +chr7 74779068 G,C,A,T +chr7 74779069 0 +chr7 74779070 0 +chr7 74779071 0 +chr7 74779072 0 +chr7 74779073 1,T +chr7 74779143 C +chr7 74779144 G,C,A,T +chr7 74779145 0 +chr7 74779146 0 +chr7 74779147 0 +chr7 74779148 0 +chr7 74779149 1,C +chr7 74779184 G +chr7 74779185 G,C,A,T +chr7 74779186 0 +chr7 74779187 0 +chr7 74779188 0 +chr7 74779189 0 +chr7 74779190 1,G +chr7 74779197 G +chr7 74779198 G,C,A,T +chr7 74779199 0 +chr7 74779200 0 +chr7 74779201 0 +chr7 74779202 0 +chr7 74779203 1,G +chr7 74779623 G +chr7 74779624 G,C,A,T +chr7 74779625 0 +chr7 74779626 0 +chr7 74779627 0 +chr7 74779628 0 +chr7 74779629 1,G +chr7 74779790 C +chr7 74779791 G,C,A,T +chr7 74779792 0 +chr7 74779793 0 +chr7 74779794 0 +chr7 74779795 0 +chr7 74779796 0 +chr7 74779797 1,C +chr7 74780102 G +chr7 74780103 G,C,A,T +chr7 74780104 0 +chr7 74780105 0 +chr7 74780106 0 +chr7 74780107 0 +chr7 74780108 1,G +chr7 74780295 A +chr7 74780296 G,C,A,T +chr7 74780297 0 +chr7 74780298 0 +chr7 74780299 0 +chr7 74780300 0 +chr7 74780301 1,A +chr7 74780303 A +chr7 74780304 G,C,A,T +chr7 74780305 0 +chr7 74780306 0 +chr7 74780307 0 +chr7 74780308 0 +chr7 74780309 1,A +chr7 74780460 A +chr7 74780461 G,C,A,T +chr7 74780462 0 +chr7 74780463 0 +chr7 74780464 0 +chr7 74780465 0 +chr7 74780466 0 +chr7 74780467 0 +chr7 74780468 0 +chr7 74780469 0 +chr7 74780470 0 +chr7 74780471 1,A +chr7 74780603 C +chr7 74780604 G,C,A,T +chr7 74780605 0 +chr7 74780606 0 +chr7 74780607 0 +chr7 74780608 0 +chr7 74780609 1,C +chr7 74780782 A +chr7 74780783 G,C,A,T +chr7 74780784 0 +chr7 74780785 0 +chr7 74780786 0 +chr7 74780787 0 +chr7 74780788 0 +chr7 74780789 1,A +chr7 74780845 G +chr7 74780846 G,C,A,T +chr7 74780847 0 +chr7 74780848 0 +chr7 74780849 0 +chr7 74780850 0 +chr7 74780851 0 +chr7 74780852 1,G +chr7 74780859 G +chr7 74780860 G,C,A,T +chr7 74780861 0 +chr7 74780862 0 +chr7 74780863 0 +chr7 74780864 0 +chr7 74780865 0 +chr7 74780866 0 +chr7 74780867 0 +chr7 74780868 1,G +chr7 74781084 A +chr7 74781085 G,C,A,T +chr7 74781086 0 +chr7 74781087 0 +chr7 74781088 0 +chr7 74781089 0 +chr7 74781090 1,A +chr7 74781244 A +chr7 74781245 G,C,A,T +chr7 74781246 0 +chr7 74781247 0 +chr7 74781248 0 +chr7 74781249 0 +chr7 74781250 0 +chr7 74781251 0 +chr7 74781252 0 +chr7 74781253 0 +chr7 74781254 0 +chr7 74781255 0 +chr7 74781256 0 +chr7 74781257 0 +chr7 74781258 0 +chr7 74781259 0 +chr7 74781260 0 +chr7 74781261 0 +chr7 74781262 0 +chr7 74781263 0 +chr7 74781264 0 +chr7 74781265 0 +chr7 74781266 1,A +chr7 74781360 G +chr7 74781361 G,C,A,T +chr7 74781362 0 +chr7 74781363 0 +chr7 74781364 0 +chr7 74781365 0 +chr7 74781366 1,G +chr7 74781535 T +chr7 74781536 G,C,A,T +chr7 74781537 0 +chr7 74781538 0 +chr7 74781539 0 +chr7 74781540 0 +chr7 74781541 1,T +chr7 74781699 T +chr7 74781700 G,C,A,T +chr7 74781701 0 +chr7 74781702 0 +chr7 74781703 0 +chr7 74781704 0 +chr7 74781705 1,T +chr7 74781831 T +chr7 74781832 G,C,A,T +chr7 74781833 0 +chr7 74781834 0 +chr7 74781835 0 +chr7 74781836 0 +chr7 74781837 1,T +chr7 74782276 A +chr7 74782277 G,C,A,T +chr7 74782278 0 +chr7 74782279 0 +chr7 74782280 0 +chr7 74782281 0 +chr7 74782282 1,A +chr7 74782284 A +chr7 74782285 G,C,A,T +chr7 74782286 0 +chr7 74782287 0 +chr7 74782288 0 +chr7 74782289 0 +chr7 74782290 1,A +chr7 74782432 A +chr7 74782433 G,C,A,T +chr7 74782434 0 +chr7 74782435 0 +chr7 74782436 0 +chr7 74782437 0 +chr7 74782438 0 +chr7 74782439 0 +chr7 74782440 0 +chr7 74782441 0 +chr7 74782442 0 +chr7 74782443 0 +chr7 74782444 0 +chr7 74782445 0 +chr7 74782446 0 +chr7 74782447 0 +chr7 74782448 0 +chr7 74782449 0 +chr7 74782450 0 +chr7 74782451 1,A +chr7 74782580 A +chr7 74782581 G,C,A,T +chr7 74782582 0 +chr7 74782583 0 +chr7 74782584 0 +chr7 74782585 0 +chr7 74782586 0 +chr7 74782587 1,A +chr7 74782736 A +chr7 74782737 G,C,A,T +chr7 74782738 0 +chr7 74782739 0 +chr7 74782740 0 +chr7 74782741 0 +chr7 74782742 0 +chr7 74782743 1,A +chr7 74783023 G +chr7 74783024 G,C,A,T +chr7 74783025 0 +chr7 74783026 0 +chr7 74783027 0 +chr7 74783028 0 +chr7 74783029 1,G +chr7 74783121 C +chr7 74783122 G,C,A,T +chr7 74783123 0 +chr7 74783124 0 +chr7 74783125 0 +chr7 74783126 0 +chr7 74783127 1,C +chr7 74783635 C +chr7 74783636 G,C,A,T +chr7 74783637 0 +chr7 74783638 0 +chr7 74783639 0 +chr7 74783640 0 +chr7 74783641 1,C +chr7 74783774 G +chr7 74783775 G,C,A,T +chr7 74783776 0 +chr7 74783777 0 +chr7 74783778 0 +chr7 74783779 0 +chr7 74783780 1,G +chr7 74783944 G +chr7 74783945 G,C,A,T +chr7 74783946 0 +chr7 74783947 0 +chr7 74783948 0 +chr7 74783949 0 +chr7 74783950 1,G +chr7 74784015 A +chr7 74784016 G,C,A,T +chr7 74784017 0 +chr7 74784018 0 +chr7 74784019 0 +chr7 74784020 0 +chr7 74784021 0 +chr7 74784022 0 +chr7 74784023 1,A +chr7 74784074 T +chr7 74784075 G,C,A,T +chr7 74784076 0 +chr7 74784077 0 +chr7 74784078 0 +chr7 74784079 0 +chr7 74784080 0 +chr7 74784081 1,T +chr7 74784230 T +chr7 74784231 G,C,A,T +chr7 74784232 0 +chr7 74784233 0 +chr7 74784234 0 +chr7 74784235 0 +chr7 74784236 1,T +chr7 74784238 T +chr7 74784239 G,C,A,T +chr7 74784240 0 +chr7 74784241 0 +chr7 74784242 0 +chr7 74784243 0 +chr7 74784244 1,T +chr7 74784418 T +chr7 74784419 G,C,A,T +chr7 74784420 0 +chr7 74784421 0 +chr7 74784422 0 +chr7 74784423 0 +chr7 74784424 0 +chr7 74784425 0 +chr7 74784426 0 +chr7 74784427 0 +chr7 74784428 0 +chr7 74784429 0 +chr7 74784430 0 +chr7 74784431 0 +chr7 74784432 0 +chr7 74784433 0 +chr7 74784434 0 +chr7 74784435 1,T +chr7 74784583 T +chr7 74784584 G,C,A,T +chr7 74784585 0 +chr7 74784586 0 +chr7 74784587 0 +chr7 74784588 0 +chr7 74784589 0 +chr7 74784590 1,T +chr7 74784592 T +chr7 74784593 G,C,A,T +chr7 74784594 0 +chr7 74784595 0 +chr7 74784596 0 +chr7 74784597 0 +chr7 74784598 1,T +chr7 74784752 C +chr7 74784753 G,C,A,T +chr7 74784754 0 +chr7 74784755 0 +chr7 74784756 0 +chr7 74784757 0 +chr7 74784758 1,C +chr7 74784938 G +chr7 74784939 G,C,A,T +chr7 74784940 0 +chr7 74784941 0 +chr7 74784942 0 +chr7 74784943 0 +chr7 74784944 1,G +chr7 74784967 C +chr7 74784968 G,C,A,T +chr7 74784969 0 +chr7 74784970 0 +chr7 74784971 0 +chr7 74784972 0 +chr7 74784973 1,C +chr7 74785046 G +chr7 74785047 G,C,A,T +chr7 74785048 0 +chr7 74785049 0 +chr7 74785050 0 +chr7 74785051 0 +chr7 74785052 1,G +chr7 74785221 G +chr7 74785222 G,C,A,T +chr7 74785223 0 +chr7 74785224 0 +chr7 74785225 0 +chr7 74785226 0 +chr7 74785227 1,G +chr7 74785720 A +chr7 74785721 G,C,A,T +chr7 74785722 0 +chr7 74785723 0 +chr7 74785724 0 +chr7 74785725 0 +chr7 74785726 1,A +chr7 74785883 A +chr7 74785884 G,C,A,T +chr7 74785885 0 +chr7 74785886 0 +chr7 74785887 0 +chr7 74785888 0 +chr7 74785889 0 +chr7 74785890 0 +chr7 74785891 0 +chr7 74785892 0 +chr7 74785893 0 +chr7 74785894 0 +chr7 74785895 0 +chr7 74785896 0 +chr7 74785897 0 +chr7 74785898 0 +chr7 74785899 0 +chr7 74785900 0 +chr7 74785901 0 +chr7 74785902 0 +chr7 74785903 0 +chr7 74785904 0 +chr7 74785905 1,A +chr7 74785914 T +chr7 74785915 G,C,A,T +chr7 74785916 0 +chr7 74785917 0 +chr7 74785918 0 +chr7 74785919 0 +chr7 74785920 1,T +chr7 74786070 A +chr7 74786071 G,C,A,T +chr7 74786072 0 +chr7 74786073 0 +chr7 74786074 0 +chr7 74786075 0 +chr7 74786076 0 +chr7 74786077 0 +chr7 74786078 0 +chr7 74786079 1,A +chr7 74786080 A +chr7 74786081 G,C,A,T +chr7 74786082 0 +chr7 74786083 0 +chr7 74786084 0 +chr7 74786085 0 +chr7 74786086 0 +chr7 74786087 1,A +chr7 74786234 A +chr7 74786235 G,C,A,T +chr7 74786236 0 +chr7 74786237 0 +chr7 74786238 0 +chr7 74786239 0 +chr7 74786240 0 +chr7 74786241 0 +chr7 74786242 0 +chr7 74786243 0 +chr7 74786244 0 +chr7 74786245 0 +chr7 74786246 0 +chr7 74786247 0 +chr7 74786248 0 +chr7 74786249 0 +chr7 74786250 0 +chr7 74786251 0 +chr7 74786252 0 +chr7 74786253 0 +chr7 74786254 0 +chr7 74786255 0 +chr7 74786256 1,A +chr7 74786314 T +chr7 74786315 G,C,A,T +chr7 74786316 0 +chr7 74786317 0 +chr7 74786318 0 +chr7 74786319 0 +chr7 74786320 0 +chr7 74786321 1,T +chr7 74786408 T +chr7 74786409 G,C,A,T +chr7 74786410 0 +chr7 74786411 0 +chr7 74786412 0 +chr7 74786413 0 +chr7 74786414 0 +chr7 74786415 1,T +chr7 74786419 T +chr7 74786420 G,C,A,T +chr7 74786421 0 +chr7 74786422 0 +chr7 74786423 0 +chr7 74786424 0 +chr7 74786425 0 +chr7 74786426 1,T +chr7 74786427 T +chr7 74786428 G,C,A,T +chr7 74786429 0 +chr7 74786430 0 +chr7 74786431 0 +chr7 74786432 0 +chr7 74786433 0 +chr7 74786434 1,T +chr7 74786555 T +chr7 74786556 G,C,A,T +chr7 74786557 0 +chr7 74786558 0 +chr7 74786559 0 +chr7 74786560 0 +chr7 74786561 0 +chr7 74786562 1,T +chr7 74786563 T +chr7 74786564 G,C,A,T +chr7 74786565 0 +chr7 74786566 0 +chr7 74786567 0 +chr7 74786568 0 +chr7 74786569 0 +chr7 74786570 0 +chr7 74786571 1,T +chr7 74786850 C +chr7 74786851 G,C,A,T +chr7 74786852 0 +chr7 74786853 0 +chr7 74786854 0 +chr7 74786855 0 +chr7 74786856 1,C +chr7 74786946 T +chr7 74786947 G,C,A,T +chr7 74786948 0 +chr7 74786949 0 +chr7 74786950 0 +chr7 74786951 0 +chr7 74786952 0 +chr7 74786953 0 +chr7 74786954 0 +chr7 74786955 0 +chr7 74786956 0 +chr7 74786957 1,T +chr7 74787035 A +chr7 74787036 G,C,A,T +chr7 74787037 0 +chr7 74787038 0 +chr7 74787039 0 +chr7 74787040 0 +chr7 74787041 0 +chr7 74787042 0 +chr7 74787043 0 +chr7 74787044 0 +chr7 74787045 0 +chr7 74787046 0 +chr7 74787047 0 +chr7 74787048 0 +chr7 74787049 0 +chr7 74787050 1,A +chr7 74787053 A +chr7 74787054 G,C,A,T +chr7 74787055 0 +chr7 74787056 0 +chr7 74787057 0 +chr7 74787058 0 +chr7 74787059 0 +chr7 74787060 1,A +chr7 74787143 A +chr7 74787144 G,C,A,T +chr7 74787145 0 +chr7 74787146 0 +chr7 74787147 0 +chr7 74787148 0 +chr7 74787149 1,A +chr7 74787276 A +chr7 74787277 G,C,A,T +chr7 74787278 0 +chr7 74787279 0 +chr7 74787280 0 +chr7 74787281 0 +chr7 74787282 0 +chr7 74787283 0 +chr7 74787284 1,A +chr7 74787450 G +chr7 74787451 G,C,A,T +chr7 74787452 0 +chr7 74787453 0 +chr7 74787454 0 +chr7 74787455 0 +chr7 74787456 1,G +chr7 74787540 A +chr7 74787541 G,C,A,T +chr7 74787542 0 +chr7 74787543 0 +chr7 74787544 0 +chr7 74787545 0 +chr7 74787546 1,A +chr7 74787573 T +chr7 74787574 G,C,A,T +chr7 74787575 0 +chr7 74787576 0 +chr7 74787577 0 +chr7 74787578 0 +chr7 74787579 0 +chr7 74787580 1,T +chr7 74787585 T +chr7 74787586 G,C,A,T +chr7 74787587 0 +chr7 74787588 0 +chr7 74787589 0 +chr7 74787590 0 +chr7 74787591 0 +chr7 74787592 1,T +chr7 74787939 C +chr7 74787940 G,C,A,T +chr7 74787941 0 +chr7 74787942 0 +chr7 74787943 0 +chr7 74787944 0 +chr7 74787945 1,C +chr7 74788069 G +chr7 74788070 G,C,A,T +chr7 74788071 0 +chr7 74788072 0 +chr7 74788073 0 +chr7 74788074 0 +chr7 74788075 0 +chr7 74788076 1,G +chr7 74788094 G +chr7 74788095 G,C,A,T +chr7 74788096 0 +chr7 74788097 0 +chr7 74788098 0 +chr7 74788099 0 +chr7 74788100 1,G +chr7 74788177 G +chr7 74788178 G,C,A,T +chr7 74788179 0 +chr7 74788180 0 +chr7 74788181 0 +chr7 74788182 0 +chr7 74788183 1,G +chr7 74788215 G +chr7 74788216 G,C,A,T +chr7 74788217 0 +chr7 74788218 0 +chr7 74788219 0 +chr7 74788220 0 +chr7 74788221 1,G +chr7 74788261 G +chr7 74788262 G,C,A,T +chr7 74788263 0 +chr7 74788264 0 +chr7 74788265 0 +chr7 74788266 0 +chr7 74788267 1,G +chr7 74788288 G +chr7 74788289 G,C,A,T +chr7 74788290 0 +chr7 74788291 0 +chr7 74788292 0 +chr7 74788293 0 +chr7 74788294 1,G +chr7 74788313 G +chr7 74788314 G,C,A,T +chr7 74788315 0 +chr7 74788316 0 +chr7 74788317 0 +chr7 74788318 0 +chr7 74788319 0 +chr7 74788320 0 +chr7 74788321 1,G +chr7 74788330 G +chr7 74788331 G,C,A,T +chr7 74788332 0 +chr7 74788333 0 +chr7 74788334 0 +chr7 74788335 0 +chr7 74788336 1,G +chr7 74788411 G +chr7 74788412 G,C,A,T +chr7 74788413 0 +chr7 74788414 0 +chr7 74788415 0 +chr7 74788416 0 +chr7 74788417 1,G +chr7 74788456 C +chr7 74788457 G,C,A,T +chr7 74788458 0 +chr7 74788459 0 +chr7 74788460 0 +chr7 74788461 0 +chr7 74788462 1,C +chr7 74788466 C +chr7 74788467 G,C,A,T +chr7 74788468 0 +chr7 74788469 0 +chr7 74788470 0 +chr7 74788471 0 +chr7 74788472 0 +chr7 74788473 1,C +chr7 74788507 G +chr7 74788508 G,C,A,T +chr7 74788509 0 +chr7 74788510 0 +chr7 74788511 0 +chr7 74788512 0 +chr7 74788513 1,G +chr7 74788865 G +chr7 74788866 G,C,A,T +chr7 74788867 0 +chr7 74788868 0 +chr7 74788869 0 +chr7 74788870 0 +chr7 74788871 1,G +chr7 74789082 C +chr7 74789083 G,C,A,T +chr7 74789084 0 +chr7 74789085 0 +chr7 74789086 0 +chr7 74789087 0 +chr7 74789088 1,C +chr7 74789400 G +chr7 74789401 G,C,A,T +chr7 74789402 0 +chr7 74789403 0 +chr7 74789404 0 +chr7 74789405 0 +chr7 74789406 0 +chr7 74789407 1,G +chr7 74789647 A +chr7 74789648 G,C,A,T +chr7 74789649 0 +chr7 74789650 0 +chr7 74789651 0 +chr7 74789652 0 +chr7 74789653 0 +chr7 74789654 1,A +chr7 74789787 A +chr7 74789788 G,C,A,T +chr7 74789789 0 +chr7 74789790 0 +chr7 74789791 0 +chr7 74789792 0 +chr7 74789793 1,A +chr7 74789795 A +chr7 74789796 G,C,A,T +chr7 74789797 0 +chr7 74789798 0 +chr7 74789799 0 +chr7 74789800 0 +chr7 74789801 1,A +chr7 74789953 A +chr7 74789954 G,C,A,T +chr7 74789955 0 +chr7 74789956 0 +chr7 74789957 0 +chr7 74789958 0 +chr7 74789959 1,A +chr7 74789961 A +chr7 74789962 G,C,A,T +chr7 74789963 0 +chr7 74789964 0 +chr7 74789965 0 +chr7 74789966 0 +chr7 74789967 1,A +chr7 74790115 A +chr7 74790116 G,C,A,T +chr7 74790117 0 +chr7 74790118 0 +chr7 74790119 0 +chr7 74790120 0 +chr7 74790121 0 +chr7 74790122 0 +chr7 74790123 0 +chr7 74790124 0 +chr7 74790125 0 +chr7 74790126 0 +chr7 74790127 0 +chr7 74790128 0 +chr7 74790129 0 +chr7 74790130 0 +chr7 74790131 0 +chr7 74790132 0 +chr7 74790133 0 +chr7 74790134 0 +chr7 74790135 0 +chr7 74790136 1,A +chr7 74790421 G +chr7 74790422 G,C,A,T +chr7 74790423 0 +chr7 74790424 0 +chr7 74790425 0 +chr7 74790426 0 +chr7 74790427 1,G +chr7 74790687 A +chr7 74790688 G,C,A,T +chr7 74790689 0 +chr7 74790690 0 +chr7 74790691 0 +chr7 74790692 0 +chr7 74790693 1,A +chr7 74790840 A +chr7 74790841 G,C,A,T +chr7 74790842 0 +chr7 74790843 0 +chr7 74790844 0 +chr7 74790845 0 +chr7 74790846 0 +chr7 74790847 0 +chr7 74790848 0 +chr7 74790849 0 +chr7 74790850 0 +chr7 74790851 0 +chr7 74790852 0 +chr7 74790853 0 +chr7 74790854 1,A +chr7 74790857 A +chr7 74790858 G,C,A,T +chr7 74790859 0 +chr7 74790860 0 +chr7 74790861 0 +chr7 74790862 0 +chr7 74790863 0 +chr7 74790864 1,A +chr7 74790895 T +chr7 74790896 G,C,A,T +chr7 74790897 0 +chr7 74790898 0 +chr7 74790899 0 +chr7 74790900 0 +chr7 74790901 1,T +chr7 74791142 T +chr7 74791143 G,C,A,T +chr7 74791144 0 +chr7 74791145 0 +chr7 74791146 0 +chr7 74791147 A +chr7 74791148 G,C,A,T +chr7 74791149 0 +chr7 74791150 0 +chr7 74791151 0 +chr7 74791152 0 +chr7 74791153 1,A +chr7 74791290 A +chr7 74791291 G,C,A,T +chr7 74791292 0 +chr7 74791293 0 +chr7 74791294 0 +chr7 74791295 0 +chr7 74791296 0 +chr7 74791297 0 +chr7 74791298 0 +chr7 74791299 0 +chr7 74791300 0 +chr7 74791301 1,A +chr7 74791428 A +chr7 74791429 G,C,A,T +chr7 74791430 0 +chr7 74791431 0 +chr7 74791432 0 +chr7 74791433 0 +chr7 74791434 1,A +chr7 74791601 A +chr7 74791602 G,C,A,T +chr7 74791603 0 +chr7 74791604 0 +chr7 74791605 0 +chr7 74791606 0 +chr7 74791607 1,A +chr7 74791623 A +chr7 74791624 G,C,A,T +chr7 74791625 0 +chr7 74791626 0 +chr7 74791627 0 +chr7 74791628 0 +chr7 74791629 1,A +chr7 74791630 G,C,A,T +chr7 74791631 0 +chr7 74791632 0 +chr7 74791633 0 +chr7 74791634 0 +chr7 74791635 0 +chr7 74791636 0 +chr7 74791637 1,A +chr7 74791647 A +chr7 74791648 G,C,A,T +chr7 74791649 0 +chr7 74791650 0 +chr7 74791651 0 +chr7 74791652 0 +chr7 74791653 0 +chr7 74791654 1,A +chr7 74791686 T +chr7 74791687 G,C,A,T +chr7 74791688 0 +chr7 74791689 0 +chr7 74791690 0 +chr7 74791691 0 +chr7 74791692 0 +chr7 74791693 0 +chr7 74791694 1,T +chr7 74791728 T +chr7 74791729 G,C,A,T +chr7 74791730 0 +chr7 74791731 0 +chr7 74791732 0 +chr7 74791733 0 +chr7 74791734 1,T +chr7 74791832 A +chr7 74791833 G,C,A,T +chr7 74791834 0 +chr7 74791835 0 +chr7 74791836 0 +chr7 74791837 0 +chr7 74791838 0 +chr7 74791839 0 +chr7 74791840 1,A +chr7 74792055 T +chr7 74792056 G,C,A,T +chr7 74792057 0 +chr7 74792058 0 +chr7 74792059 0 +chr7 74792060 0 +chr7 74792061 1,T +chr7 74792072 T +chr7 74792073 G,C,A,T +chr7 74792074 0 +chr7 74792075 0 +chr7 74792076 0 +chr7 74792077 0 +chr7 74792078 0 +chr7 74792079 0 +chr7 74792080 0 +chr7 74792081 0 +chr7 74792082 0 +chr7 74792083 0 +chr7 74792084 0 +chr7 74792085 0 +chr7 74792086 0 +chr7 74792087 1,T +chr7 74792233 T +chr7 74792234 G,C,A,T +chr7 74792235 0 +chr7 74792236 0 +chr7 74792237 0 +chr7 74792238 0 +chr7 74792239 0 +chr7 74792240 1,T +chr7 74792242 T +chr7 74792243 G,C,A,T +chr7 74792244 0 +chr7 74792245 0 +chr7 74792246 0 +chr7 74792247 0 +chr7 74792248 1,T +chr7 74792377 T +chr7 74792378 G,C,A,T +chr7 74792379 0 +chr7 74792380 0 +chr7 74792381 0 +chr7 74792382 0 +chr7 74792383 1,T +chr7 74792504 G +chr7 74792505 G,C,A,T +chr7 74792506 0 +chr7 74792507 0 +chr7 74792508 0 +chr7 74792509 0 +chr7 74792510 1,G +chr7 74792758 A +chr7 74792759 G,C,A,T +chr7 74792760 0 +chr7 74792761 0 +chr7 74792762 0 +chr7 74792763 0 +chr7 74792764 1,A +chr7 74792891 A +chr7 74792892 G,C,A,T +chr7 74792893 0 +chr7 74792894 0 +chr7 74792895 0 +chr7 74792896 0 +chr7 74792897 1,A +chr7 74792899 A +chr7 74792900 G,C,A,T +chr7 74792901 0 +chr7 74792902 0 +chr7 74792903 0 +chr7 74792904 0 +chr7 74792905 1,A +chr7 74792919 G +chr7 74792920 G,C,A,T +chr7 74792921 0 +chr7 74792922 0 +chr7 74792923 0 +chr7 74792924 0 +chr7 74792925 0 +chr7 74792926 1,G +chr7 74793053 A +chr7 74793054 G,C,A,T +chr7 74793055 0 +chr7 74793056 0 +chr7 74793057 0 +chr7 74793058 0 +chr7 74793059 0 +chr7 74793060 0 +chr7 74793061 1,A +chr7 74793066 A +chr7 74793067 G,C,A,T +chr7 74793068 0 +chr7 74793069 0 +chr7 74793070 0 +chr7 74793071 0 +chr7 74793072 1,A +chr7 74793518 A +chr7 74793519 G,C,A,T +chr7 74793520 0 +chr7 74793521 0 +chr7 74793522 0 +chr7 74793523 0 +chr7 74793524 0 +chr7 74793525 1,A +chr7 74793527 A +chr7 74793528 G,C,A,T +chr7 74793529 0 +chr7 74793530 0 +chr7 74793531 0 +chr7 74793532 0 +chr7 74793533 1,A +chr7 74793681 A +chr7 74793682 G,C,A,T +chr7 74793683 0 +chr7 74793684 0 +chr7 74793685 0 +chr7 74793686 0 +chr7 74793687 1,A +chr7 74793688 G,C,A,T +chr7 74793689 0 +chr7 74793690 0 +chr7 74793691 0 +chr7 74793692 0 +chr7 74793693 1,A +chr7 74793815 G +chr7 74793816 G,C,A,T +chr7 74793817 0 +chr7 74793818 0 +chr7 74793819 0 +chr7 74793820 0 +chr7 74793821 1,G +chr7 74794066 G +chr7 74794067 G,C,A,T +chr7 74794068 0 +chr7 74794069 0 +chr7 74794070 0 +chr7 74794071 0 +chr7 74794072 1,G +chr7 74794090 C +chr7 74794091 G,C,A,T +chr7 74794092 0 +chr7 74794093 0 +chr7 74794094 0 +chr7 74794095 0 +chr7 74794096 1,C +chr7 74794100 T +chr7 74794101 G,C,A,T +chr7 74794102 0 +chr7 74794103 0 +chr7 74794104 0 +chr7 74794105 0 +chr7 74794106 1,T +chr7 74794188 C +chr7 74794189 G,C,A,T +chr7 74794190 0 +chr7 74794191 0 +chr7 74794192 0 +chr7 74794193 0 +chr7 74794194 1,C +chr7 74794286 A +chr7 74794287 G,C,A,T +chr7 74794288 0 +chr7 74794289 0 +chr7 74794290 0 +chr7 74794291 0 +chr7 74794292 1,A +chr7 74794396 A +chr7 74794397 G,C,A,T +chr7 74794398 0 +chr7 74794399 0 +chr7 74794400 0 +chr7 74794401 0 +chr7 74794402 0 +chr7 74794403 0 +chr7 74794404 0 +chr7 74794405 0 +chr7 74794406 0 +chr7 74794407 0 +chr7 74794408 0 +chr7 74794409 0 +chr7 74794410 0 +chr7 74794411 0 +chr7 74794412 0 +chr7 74794413 0 +chr7 74794414 0 +chr7 74794415 0 +chr7 74794416 0 +chr7 74794417 0 +chr7 74794418 0 +chr7 74794419 0 +chr7 74794420 0 +chr7 74794421 0 +chr7 74794422 0 +chr7 74794423 0 +chr7 74794424 0 +chr7 74794425 1,A +chr7 74794428 A +chr7 74794429 G,C,A,T +chr7 74794430 0 +chr7 74794431 0 +chr7 74794432 0 +chr7 74794433 0 +chr7 74794434 0 +chr7 74794435 1,A +chr7 74794440 A +chr7 74794441 G,C,A,T +chr7 74794442 0 +chr7 74794443 0 +chr7 74794444 0 +chr7 74794445 0 +chr7 74794446 1,A +chr7 74794601 A +chr7 74794602 G,C,A,T +chr7 74794603 0 +chr7 74794604 0 +chr7 74794605 0 +chr7 74794606 0 +chr7 74794607 1,A +chr7 74794608 G,C,A,T +chr7 74794609 0 +chr7 74794610 0 +chr7 74794611 0 +chr7 74794612 0 +chr7 74794613 0 +chr7 74794614 1,A +chr7 74794811 A +chr7 74794812 G,C,A,T +chr7 74794813 0 +chr7 74794814 0 +chr7 74794815 0 +chr7 74794816 0 +chr7 74794817 0 +chr7 74794818 0 +chr7 74794819 1,A +chr7 74794937 G +chr7 74794938 G,C,A,T +chr7 74794939 0 +chr7 74794940 0 +chr7 74794941 0 +chr7 74794942 0 +chr7 74794943 1,G +chr7 74795033 T +chr7 74795034 G,C,A,T +chr7 74795035 0 +chr7 74795036 0 +chr7 74795037 0 +chr7 74795038 0 +chr7 74795039 1,T +chr7 74795040 G,C,A,T +chr7 74795041 0 +chr7 74795042 0 +chr7 74795043 0 +chr7 74795044 0 +chr7 74795045 1,T +chr7 74795054 T +chr7 74795055 G,C,A,T +chr7 74795056 0 +chr7 74795057 0 +chr7 74795058 0 +chr7 74795059 0 +chr7 74795060 0 +chr7 74795061 0 +chr7 74795062 0 +chr7 74795063 0 +chr7 74795064 0 +chr7 74795065 0 +chr7 74795066 0 +chr7 74795067 0 +chr7 74795068 0 +chr7 74795069 0 +chr7 74795070 1,T +chr7 74795218 T +chr7 74795219 G,C,A,T +chr7 74795220 0 +chr7 74795221 0 +chr7 74795222 0 +chr7 74795223 0 +chr7 74795224 1,T +chr7 74795226 T +chr7 74795227 G,C,A,T +chr7 74795228 0 +chr7 74795229 0 +chr7 74795230 0 +chr7 74795231 0 +chr7 74795232 1,T +chr7 74795376 A +chr7 74795377 G,C,A,T +chr7 74795378 0 +chr7 74795379 0 +chr7 74795380 0 +chr7 74795381 0 +chr7 74795382 0 +chr7 74795383 1,A +chr7 74795387 A +chr7 74795388 G,C,A,T +chr7 74795389 0 +chr7 74795390 0 +chr7 74795391 0 +chr7 74795392 0 +chr7 74795393 0 +chr7 74795394 0 +chr7 74795395 1,A +chr7 74795495 T +chr7 74795496 G,C,A,T +chr7 74795497 0 +chr7 74795498 0 +chr7 74795499 0 +chr7 74795500 0 +chr7 74795501 1,T +chr7 74795642 G +chr7 74795643 G,C,A,T +chr7 74795644 0 +chr7 74795645 0 +chr7 74795646 0 +chr7 74795647 0 +chr7 74795648 1,G +chr7 74795813 A +chr7 74795814 G,C,A,T +chr7 74795815 0 +chr7 74795816 0 +chr7 74795817 0 +chr7 74795818 0 +chr7 74795819 1,A +chr7 74795821 A +chr7 74795822 G,C,A,T +chr7 74795823 0 +chr7 74795824 0 +chr7 74795825 0 +chr7 74795826 0 +chr7 74795827 1,A +chr7 74795975 A +chr7 74795976 G,C,A,T +chr7 74795977 0 +chr7 74795978 0 +chr7 74795979 0 +chr7 74795980 0 +chr7 74795981 0 +chr7 74795982 0 +chr7 74795983 1,A +chr7 74795985 C +chr7 74795986 G,C,A,T +chr7 74795987 0 +chr7 74795988 0 +chr7 74795989 0 +chr7 74795990 0 +chr7 74795991 1,C +chr7 74795996 A +chr7 74795997 G,C,A,T +chr7 74795998 0 +chr7 74795999 0 +chr7 74796000 0 +chr7 74796001 0 +chr7 74796002 1,A +chr7 74796162 T +chr7 74796163 G,C,A,T +chr7 74796164 0 +chr7 74796165 0 +chr7 74796166 0 +chr7 74796167 0 +chr7 74796168 1,T +chr7 74796239 A +chr7 74796240 G,C,A,T +chr7 74796241 0 +chr7 74796242 0 +chr7 74796243 0 +chr7 74796244 0 +chr7 74796245 0 +chr7 74796246 0 +chr7 74796247 0 +chr7 74796248 1,A +chr7 74796250 A +chr7 74796251 G,C,A,T +chr7 74796252 0 +chr7 74796253 0 +chr7 74796254 0 +chr7 74796255 0 +chr7 74796256 0 +chr7 74796257 0 +chr7 74796258 0 +chr7 74796259 0 +chr7 74796260 0 +chr7 74796261 0 +chr7 74796262 0 +chr7 74796263 1,A +chr7 74796407 A +chr7 74796408 G,C,A,T +chr7 74796409 0 +chr7 74796410 0 +chr7 74796411 0 +chr7 74796412 0 +chr7 74796413 1,A +chr7 74796561 A +chr7 74796562 G,C,A,T +chr7 74796563 0 +chr7 74796564 0 +chr7 74796565 0 +chr7 74796566 0 +chr7 74796567 0 +chr7 74796568 1,A +chr7 74796569 G,C,A,T +chr7 74796570 0 +chr7 74796571 0 +chr7 74796572 0 +chr7 74796573 0 +chr7 74796574 0 +chr7 74796575 0 +chr7 74796576 0 +chr7 74796577 0 +chr7 74796578 1,A +chr7 74797340 A +chr7 74797341 G,C,A,T +chr7 74797342 0 +chr7 74797343 0 +chr7 74797344 0 +chr7 74797345 0 +chr7 74797346 1,A +chr7 74797697 T +chr7 74797698 G,C,A,T +chr7 74797699 0 +chr7 74797700 0 +chr7 74797701 0 +chr7 74797702 0 +chr7 74797703 1,T +chr7 74797890 T +chr7 74797891 G,C,A,T +chr7 74797892 0 +chr7 74797893 0 +chr7 74797894 0 +chr7 74797895 0 +chr7 74797896 0 +chr7 74797897 1,T +chr7 74797945 G +chr7 74797946 G,C,A,T +chr7 74797947 0 +chr7 74797948 0 +chr7 74797949 0 +chr7 74797950 0 +chr7 74797951 1,G +chr7 74797984 T +chr7 74797985 G,C,A,T +chr7 74797986 0 +chr7 74797987 0 +chr7 74797988 0 +chr7 74797989 0 +chr7 74797990 1,T +chr7 74798038 T +chr7 74798039 G,C,A,T +chr7 74798040 0 +chr7 74798041 0 +chr7 74798042 0 +chr7 74798043 0 +chr7 74798044 0 +chr7 74798045 1,T +chr7 74798506 T +chr7 74798507 G,C,A,T +chr7 74798508 0 +chr7 74798509 0 +chr7 74798510 0 +chr7 74798511 0 +chr7 74798512 1,T +chr7 74798675 T +chr7 74798676 G,C,A,T +chr7 74798677 0 +chr7 74798678 0 +chr7 74798679 0 +chr7 74798680 0 +chr7 74798681 1,T +chr7 74798683 T +chr7 74798684 G,C,A,T +chr7 74798685 0 +chr7 74798686 0 +chr7 74798687 0 +chr7 74798688 0 +chr7 74798689 1,T +chr7 74798997 T +chr7 74798998 G,C,A,T +chr7 74798999 0 +chr7 74799000 0 +chr7 74799001 0 +chr7 74799002 0 +chr7 74799003 0 +chr7 74799004 0 +chr7 74799005 1,T +chr7 74799006 G,C,A,T +chr7 74799007 0 +chr7 74799008 0 +chr7 74799009 0 +chr7 74799010 0 +chr7 74799011 1,T +chr7 74799012 G,C,A,T +chr7 74799013 0 +chr7 74799014 0 +chr7 74799015 0 +chr7 74799016 0 +chr7 74799017 0 +chr7 74799018 0 +chr7 74799019 0 +chr7 74799020 0 +chr7 74799021 0 +chr7 74799022 0 +chr7 74799023 0 +chr7 74799024 0 +chr7 74799025 0 +chr7 74799026 0 +chr7 74799027 0 +chr7 74799028 1,T +chr7 74799263 T +chr7 74799264 G,C,A,T +chr7 74799265 0 +chr7 74799266 0 +chr7 74799267 0 +chr7 74799268 0 +chr7 74799269 1,T +chr7 74799418 T +chr7 74799419 G,C,A,T +chr7 74799420 0 +chr7 74799421 0 +chr7 74799422 0 +chr7 74799423 0 +chr7 74799424 1,T +chr7 74799426 T +chr7 74799427 G,C,A,T +chr7 74799428 0 +chr7 74799429 0 +chr7 74799430 0 +chr7 74799431 0 +chr7 74799432 0 +chr7 74799433 1,T +chr7 74799617 T +chr7 74799618 G,C,A,T +chr7 74799619 0 +chr7 74799620 0 +chr7 74799621 0 +chr7 74799622 0 +chr7 74799623 1,T diff --git a/paraphase/data/ncf1/ncf1_config.yaml b/paraphase/data/ncf1/ncf1_config.yaml new file mode 100644 index 0000000..ec43783 --- /dev/null +++ b/paraphase/data/ncf1/ncf1_config.yaml @@ -0,0 +1,24 @@ +gene: ncf1 +check_nm: 0.2 + +# data files +data: + reference: ref.fa + homopolymer: homopolymer_sites.txt + depth_region: genome_region.bed + +# coordinates +coordinates: + hg38: + nchr: "chr7" + nchr_old: "chr7_74768800_74792800" + nchr_length: 159345973 + extract_region1: "chr7:74769011-74792315" + extract_region2: "chr7:73215625-73238945 chr7:75153639-75176964" + + pivot_site: 74777266 + left_boundary: 74769011 + right_boundary: 74792315 + + #mutli-alleleic site + noisy_region: [[74783765, 74783765], [74781743, 74781743], [74776715, 74776718], [74780432, 74780432], [74778377, 74778377]] \ No newline at end of file diff --git a/paraphase/data/ncf1/ref.fa b/paraphase/data/ncf1/ref.fa new file mode 100644 index 0000000..cb957fc --- /dev/null +++ b/paraphase/data/ncf1/ref.fa @@ -0,0 +1,402 @@ +>chr7_74768800_74792800 +GCGGCGGGGTGCAGGGGGCCCTTCCTATTAGCACTCGGGACGTGGGGGAATTCTTGGGGC +CCTGGTATTGTTCTAACACCCAGTGATGGTTCACCAGCTGTCTCCTTTATAATAATTAGG +TAAGAGATGTGGTTTTCTACATATTTCACAATACAAACATTTTTAGAAATTCTATTTCCT +TCCCCTGAAAAAACTCTCTTATCTCTTTATTACTTCCTTCCTTAACTTTATTTTATTTTA +TTTTATTTATTTATTTTTTGAAACCAAGTCTACTCTGTTACCCAGCCTGGAGTGCAGTGG +TGTGATCTCGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCGATTCTTCTGCCTCAGC +CTCCCAAGTAGCTGGGATTATAGGCTTGCAATGCCATGCCTGGCTAATTTTTTTATTTTT +AGTAGAGACCCGGTTTCACCGTGTTGGCCAAGCTGGTCCCGAACTCCTGACCTCAGATGA +CCCACCCACCTTGGCCTCCCAAAGTGCTGGAATTACAGGCATGAGCCACTGCACCTATCT +TTTTTTTTTTAATTAAAAAAATTATTTGGTACCTTGTTTCATCCATGCATTAAATTAAAT +CCTGGCCAGACACAGTGGCTTATGCCTGTAATCCCAGCATTTTGGGAGGCTGAGGTGGGA +GGACCACTTGATACTGGAGCTTGAGACCAGGCCGAGCAGCATCTCGAGACCCCGTCTCTA +CAAAAAAAAAATAATAATAATAATAATAATAAATAAAAAGTGGAAAAAATCCTATGTCAT +CCTGAAAAAAGGCTGTAAGCCTGCTTACAGAGGTCATTACAAGGTCAAACTCAAGTTCGG +AGCGCTTCCTGCCTCTGCTCATCCAACAAACTTGCTGGATACCTCCTGTCTGCAGAGCAC +TTTGAGGGAACATAACAGGGTCTTGGGAGGCCACAGGAGGAGAGTTGAAAGATCACAGCC +AGGGGCTCAGGGTGTCCACAGGACAAGTACCCTTGGCCAGGCAGTTACGCAAGTGTGGAA +AGACTGCTAGAGGAAGGGAAGGAAGTGCCGAGAGCCCACAAAATTCTCTGCTTACAACCA +GCCCCACTAGAACCTTCCTCTGCCCTGCCTCGACATGCCCAGGAGAGCACCGCTGCAGGT +CTGGCCTCTGTGCTGAGCCTTTTTTTTTTTTTTTTCCTGAGACAGAATCTCACTCTGCTT +CCCAGGCTGGAGTGCAGTGGCAGGATCTCGGCTCACTGCAACCTCCACCTCCCTGGTTCA +AGTGATTCTCCTGTCTTACCCTCCGGAGTAGCTGGGATTACAGGTGTGTGCCACCATGCC +CAGCTAATTTTTGTATTTTTAGTGGAGACTGGGTTTCACCATGTTGGGCCAGCTGGTCTT +GAACTCCTGACCTCAGGTGATCCGCCCACCGTAGCCTCCCAAAGTTCTGGGATTAGAGCT +ATGAGCCACCATGCCTGGCTACCGTGCTGGGCCTTTCGAGGAGGCATTTGACAGGGAAGA +TGAGAGACAAATTGAGTGTCAGGGAAGGGGTGTTGATAGAAAAATTACAGGAGAGCACAC +AACTTTCAGCGGGTGAGCCCAGTGCCTGAGCTGCGGGACCACCCTACCAATGACCTTGAA +CTTATCTGACTGCAGCCTTGAACTCCTGAGCTCAAGGAGTCCTTCTGCCTCAGCCTCCTC +CCAAGTAGCTGGGACTACTGGCACATGCCACCATGCCCAGCTAATTATTTTATTTATTTT +ATTTTATTTTATTTTATTTTATTTTATTTTGAGATGGAGTTTTGTCCTTGTTGCCCAGGC +TGGAGTGCAATGGTGCAATCTCAGCTCGCCGCAACTTCTGCCTCCCAGGTGCAAGCGAAT +TCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCATGTGCCACCACGCCTGGCTA +ATTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCACGCTGGTCTCGAACCCC +TGACCTCAGGTGATCCACCTGCCTCGGCCTCCCAAAGTACTGAGATTACAGGCATGAGCC +ACCGCACCTGGCCCCACTTGTGGAACTAGCATCTATCTGGAGAGGAGGCAAACATCGCCC +ACCACCTCCCGCTCTCTCCTGTCACCACTGTCCCCACCATCATTCCAGAGGTCACCCTGG +CTTCCAACACCACAGCCTGGCTTGGGCAGTTTTCAAGCCTCGTATAAATGACATCCTCCA +GAACATGTGCTCTGTGCCTGCCTTCCTTCCGTCAGTGATGTATCTGGAAGATTCCACTGT +GTCGCCCTGTGGGACAGGTCCTTGTCATTGCTGAGTAGATCCTGTTGCAAATGCCTATCT +CTCTTCATGGAAAGATCCAAGATACACAGATGGAAATCATCATAGGAAGGGCTGGCAAGG +CCGTTCACACCCAGGGCTGGGGACCTCAGGGTGGAGGTGGGGGACAGTAAGGACCAGAAG +GAGCAGGTGCCGGCGGGTGATGTGAGCTTTCTTCTCTATAGAGAAGTGAAGGCCGGGTGC +AGTGGCTCACTCCTGTAATCCCAGCGCTTTGGGAGGTCGAGGCGGGCAGATCACTTGAGG +TCAGGAGTTCGAGACCAGCCTGGGCAATTTGGTGAAACCCCATCACTATAAAAATACAAA +AAATTAGCCGGACATGGTGGTGCACGCCTGTAATCCCAGCTATTTGGGAGGCTGAGGCAG +GAGAATTGCTTGTACCCGGAAGGTGAAGGTTGCAGTGAGCCGAGATCATGCCACTGCATA +CCAGCCTGGGGGACAGAAAGAGACTCTGTCTCAAAAAAAAAAAAAAGAAAAAAAGAAGTG +AAGCACTTGCCAAGCAAATCTTTCAGAGCAGGTGGAGTGGACCCTACACCTCTTGGATAA +TAAATGCACTGGATAATAAAAGCAGGAACAGGCCAGGTGCGGTGGCATGTGCCTGTAGTC +CCAACCTACTGGGGAGGCCAAGGCAGGAGGACTGCTTGAGCCCAGGAGTTGGAGGCTGCA +GTGAGTTATGACCAGGCAACTGCACTCCAGCCTGGGTGACAGATAGAGACCCTGTCTTTA +AAAAAAAAAAAAAAAAAAAAAGGGCCAAGCACAGTGGCTCATGCCTGTAATCCCAACACT +TTGGGAGGCTGAGGTGGGTGGATCTCCTGAGCTCAGGAGTTCAAGACCAGCCTGGCCAAC +AGGGTGATACCCCTTCTCTACTAAAAATACAAAATTAGCCAGGCGTGGTGGCGCACACCT +GTAATCCCAGCTACTTGGGAAGCTGAGGCAGGAGAATCGCTTGAACCTGGAAGGCAGAGG +TTGCAGTGAGCCGAGATTGTGCCACTGCACTCCAGCTTGGGCAACAAGAGCGAAACTTCG +CTTCAAACAAATAAATTAACGCCCAGCATGTCTTGGCTTTCATCTGCCAGACCTCAACCC +TCACCCCCAGGAGATCAGGTCCGGACCATGAGCTGACCCTGGACTCAGGCAAGGGTGAGT +TGGTGCAGCCCTGGCCTGCTGGGAGGCACAGGCTGCAGCAGGCTGCCTGGGGCTGAGGCC +CGCCACTCATGAACTCATGACCTTGAATGAGCTCCAAAAGCTCTGGGCCTCCCAGGCTCT +AGGGGGAGTGGGAGAGAGAGGCCTCAGCCTGTCCCTGGGCATGCTGCCCCCTCCTCACCT +CTTTGTCCCAAATCCCCTTCCTGGCAAAGCTGACAGTCTTAATATCACTCTGGAGAAAAC +TGAGTCAGCCCTAAGGAACAATTCAATGAACCATTTGCTTACTTGAGGATTGGAACTCAA +GTCTCACTCAAAGTCTGTGCCATTTTCGTCCCAGCTGTCACTGGCCCTCATCCACACACA +CCCAAGGATGAGCATCTAACGCTTGCATGCACACTCCCATGCCCGCGTTCATTCACTCAT +TCATTCATTCATTCACTCATTCATTGACTCATTCATTCATTCACTCACTCATTCATTCAC +TCAGTGAATGTTGCAGTCACGATCCAAATATTTATGGCCTCTGTGTGCCAGGCACTAGAT +GGAGGGGCTGGGGCTAGAGCCCCTGATAACCCGGTCATGCCCTAGCTTTCCTGGGACACA +CATTGTGGTAAGGGGAGACTAAAAAAATTAAGTCAGGCCAGGCACGGTGGCTCATGCCTG +AATCCCAGCACTTTGGGAGGCCGAGGCGAGTGAATTACCTGAGGTCAGGAGTTCAAGACC +AGCCTGGCCAACATGGAGAAACCCAGTCTCTAATTAAAAAAAAAAAAAATTAACCAGGTG +TGGTGGCACATGCCTGTAATCCCAGCTACTCAGGAGACTAACGCAAGAGAATTGCTTGAA +CCCAGGAGGCAGAGGTTGCGGTGAGCCGAGATCGCGCCATTGCACTCCAGCCTGGGAAAC +AAGAGCGAGACTCCATCTCAAAAAAAAAAAAGTGGGAGGCAGAGGCAGGAGGATCACTAG +AGGCCAGTAGTTTGAGACCATCCTGGGCAACATAGCAGGACCCTGTCTGTACAAAAAAAT +TAAAAAAAATTTAACCGGGCATGGTGGCACACACCCGTAGTCCCAGCTACTCCAGAGGCT +GAGGCAGGAGGATCGCTGGAGCCCAGGAGTTGGAGGCTGCAGTGAACTGTGATCCCACCA +CTGCACTTAAGCCTGGATAACAAAGCAAGACCCTGTCTCAAATAACAATAGCAATAATAA +TAAAGAAAAATTAAATGCAATTTGCGATGCATCAGTGATAAGTGCTCTGCAGAAAAAGGA +GGCAGGAAGAGGCTGAGAAAGGTATGAGGTTTGCTATGCAATGTGAAGTTATCAAGGAAG +GCTTCTCGGAAGAGGTGACATTTGAGCAGAGAAATGGAGGAGAGTTATGGAGGGAAGATG +GTGAATGGGGGGAACATGGTCAAGACCAGGAATATGGTCAAGGGGGGAAAGATGGTCAAG +GGGACGCAGCAAATGCAAAGGCCCTGAGGCAGGAGCAGCTTGATTCACCCCCAAAACCCG +TGGGGCCCGTGCAGGCGACGGGAAGGACAAGTGTAAACCCTTTTCCTTGTCCCTGCAGGT +GTGTGTGAACATGAGTCTGCCCATGTTTACACCCTGCAAGCCTGAAGAGTCCCCAGAAAC +TGAAAGAAGAAGCAAAGCCCTTTCTGTACCCTCCCTGCCCCCTGTCCCGACCGCGACAAA +AGCGACTTCCTCTTTCCAGTGCATTTAAGGCGCAGCCTGGAAGTGCCAGGGAGCACTGGA +GGCCACCCAGTCATGGGGGACACCTTCATCCGTCACATCGCCCTGCTGGGCTTTGAGAAG +CGCTTCGTACCCAGCCAGCACTATGTGAGTAGCTGGTGGAGGGCATCCCGTGGGGGGAAT +ACGGGAGGGACAGCACGGCCACCCTTGCAGTCCCAGGGCCAACCAGCTCCAGTGAGGACT +AACGGGGCAGGGTCTTGGGCACCTGGTCCCTGGTCTTTGAGCCTGGATCTACCCCTCTGA +TCCCTGGGAAGACAGTTCCCTTGGACCCGCCCTGGGCCCCAGCCCTTTACTGTCCCCGCC +TGTGTCCCCAGCCAGGCCCTCAGCCTTAGCCAGGAGTCCTCTTTCTGCTCCCCTGCCATG +GCCAGGCAGCCCAGCGCTCTCTCAGGTCCGAGGCCCACTCCTCCAGGAAGCCTTCCCTGA +CTAGCCCAGCTATCAGAGAGTGGCCCTCCCAAGAGGGAGGCCTGGAAACTAAAGCTCTCT +CTCTCCCCAGCTGCCTGTAGTGTCAGTTAGAGTCTTATCCTCTCCAGTAGGGTGACACCA +TGACAGGGGCCAATAGAGTCCTCCCATCTGTCCCCAAGGAGGCTGGACAAATGCCTGCTC +AGACACACAAGTCCACTGGGTCCCCTAATCCCATAGGAAGGCCAGGGAGGAACTACATTT +AGGAAATTGAAGCTTGTATGGAACATTTAGTCCTATGTGCCAAGACCTTTCTCTTTTTTG +TTATTTTTTTGTTTTTTGAGACAGAGTCTTGATCTGTTGCCCAGGCCAGAGTGCAGTGGC +ACGATCTCAGCTCACTGCAACCTCCGCCTTCCAGGTTCAACTGGTTCTCCTGCCTCAGCC +TCCAGAGTAGTTGGGATTACAGGTGCCCACCACCACGCCTGGCTAATTTTTGTATTTTTA +GTAGAGACAGGGTTTCACCATGTTGGCCAGACTGGTCTCAAACTCCTGACCTCAAGTGAT +CCACCCACCTGGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGTGCCTGGCCT +GTTTTTTTGAAATGAGGTCTGGAGTGCAGTGGTGCGATCATAGTTCACTGCAGCCTCGAC +CTCCCAGGCCCAAGTGATCCTCCTGCCTCAGCCCCTTGAGTAGCTGGGGCTACAGGCGCA +CACCACCATGCCTGGCTAGTTTTTAAAATTTTTGTGGAGATGAGGTTTCACTATGTTGTC +CAGGCTAATCTTGAACTCCTCGGCTTAAGCAACCCTCTGGTCTCAGCCTCCCACAGTGCT +AGGATTACAAGCGTGAGCTACCGTGCCTAGTCACTTTTCTCCTTTTCTTTGTAACTTTCA +GTTTTGAAATTTCAAATTTACAGAAAGGCTACTGGGTGTCAAAACGGTACCAGTCACTCC +AATAGTCTTTCACTCACCTTCATCCACACCTCTCTTTCTGGGGATATTTTCTGAATTATT +TGAGAGTGAGTTGAAGACGTGTTTCTTTACCTCTAAATACTAGTTGTTGGGCATTTCTTA +AAATCAAGGCATTCTCTTACATAATCACAACACACGTGTCAAAATCAGGAAATTAACATG +GACAAAACACCATTATCCACCCACAGACTTTACTGAGGTTTCCCCGATTATCCTGCTTGT +CCTCTGCAGTGAAAACTTTTTTCAGGTCTAGGATCCAGTCAAGGATCAATGTCATAGCCT +TTAACCTTCTTTAATCTGGATCAGTCTTTTTTCTTTTTCTTTTTCTTTTTTTGGACACGG +AATCTCACTCTGTCGCCAGACTGGAGTGCAGTGGTGCAATCTCGGCTCATTGCAACCTCT +GCCTCCTGGGTTCAAGAGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGAATACAGGTG +CGCGCCACCATGCCCAGCTCGCATTTTTTGGTAGAGACAGGGTTTTGCCATATTGATTCT +GGATCAGTCTTTTTTTTTTTTTTATGAGATGGAGTCTTACTCTGTCACCCAGGCTGGAGT +GCAATGGCACAATCTCCACTCACTGCATCCTCCGCCTCCCAGGTTCAAGCAATTCTCGTG +CCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGCGCCACCATGCCCGGCTACTTTTTG +TATTTTTAGTAGAGACAGGGTTTCACCATGTTAGCCAGGCTGATCTCGAACTCCTGACGT +CAGGTGATCTGCCCGCCTCGACCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTG +CCAGCGGATTCTGGATCGGTCTTAATCAGTCTTTGTCTTTTGCAACTTTGATGTTTTGCA +GAGAGCAGACCAGTTACCTTGTAGAATGTCCCTTAGTTTGGGTTTATCTTCATTAGATTC +AGTTTGTGTATCCAGGGCAGTGGATCTTAGATGCAATTCTGTCTTCTTTTTAATTTTTTT +GAGAGGGAGTCTCGCTCTGTCACCCAGGCTGGAGTGCAGTGGCACAACCTCAGCTCACTG +CAGCCTCCGCCTCCCGGGTTCAAGCAATTATCCTGTCCCAGCCTCCCAAGTAGCTGGGAT +CACAGGTGCCCATCACCACTACCGGGTAATTTTTGTGTTTTTAGTAGAGACAGGGTTTCA +CCATATTGGTCAGGCTGGTCTTGAACGCCTGACCTCAGGTGATCCACCTGCCTTGGCCTC +CCAAAGTGCTGGGATTACAGACGGGAGCCAACATGCCCAGCCTTCCTGCCCCTCCCGTCC +CCTCCCCTCTCCTCCTGTCCCCTCCCTTCCCCTCCCCTCCCCACCCAAGCTGGAGTGCAG +TGGTGCAATCATAGCTCACTAAAGCCTTGACCTCCAAGTCTCAAGCAATTCTCCTGCCTC +ACCTGGGGCCACAGGTGTGCGGCACCACACCCGGACAATTTTTGTGTTTTTAGTAGATAT +GGGGGTCTCGCTATGTTGCCCAGGCTGGTCTCAAACTCTTGGACTCAAGCGATCTTCCCA +CCTCGGTACTAAAAAGTGCTGGGATTCCAGGTGTGAGCCACCGTGCCCAGCCTAGGTCCT +ACTTTTATCTCCAATTTACAGATGAGTCCATTTGAGAGAAGCTGACCCTCTTGCCCTGGG +TCTCAAGGCTGGGGCGTGGCAGCACTTGGGTCCACGTTTGTGCCCTTTCTGCAATCCAGG +ACAACCGCAAAGATGGTCCTCACCCCAATCCTCTGGGCTTCCTCCAGTGGGTAGTGGGAT +CCTGGGTGCACACAGCAAAGCCTCTTTGGAGGCTGAATGGGGTCCCCCGACTCTGGCTTT +CCCCCAGGTGTACATGTTCCTGGTGAAATGGCAGGACCTGTCGGAGAAGGTGGTCTACCG +GCGCTTCACCGAGATCTACGAGTTCCATGTGAGTGTGGGGATGGAGGAGGGACAGGGACC +CACCGTTCCAGCTCCACCCTTTGGGAAGGACCTTAGCCCAGGTGATGGGGAAACTGCAGA +ACCCAGAATCCCCTCCCAGACCACAGTTAAAGGGGATTTATTTATTTATATAAATTTTTG +TGACAGGGTCTTGCTCTGTCACCACTCTGAACACCTCATGTTCTCTGATTACAGGCATGA +GCCCCCACGCTCGGCCTTTTAGGTGGTTTTGAGAGGTATTTAGGTTTGCAGTGCAGGGGC +GCAATCATAGCTCACTGCAGCCTCGACCTCTGGGGCTCAAGCGATCCTCCTGCCTCAGCC +TCCTGAGTAGCTGGGACTATAGGTGCGCATCACCATGTGTGGCTAATTTTTGTATTTTTT +ATAAAGATGGGGATCTCACTATGTTGCCCAGGCTGGTCTTGAACTCCAGACCTCAAGTGA +TCCTCCTGCCTTGGCCTCCCAAAGCTAAGGGGGCATTAAAAGAAAAAAACATTTTTCCCC +CTGAAACATTTAAGTAGTCTTACTGAAAACAATAAAACACAGAAACACCAGATTCTCATT +TTAAAGTAAAACAGACAGGATCTCCCAGAACCTTCCTAGAATGGAACCATTCTTGTCGCT +TTTGAAAAACAAAGCCAAGTTCTAGATCCCAAATAAATGCACCTGCTGGTGAACATTCTC +CTTGTGGTTCTCGTCCCTATGTTAGTTATTTTCCTAAATTTTACATTTGTACCTTTTTAA +GAATGAGTTATCAGTTTTTTTATATTTGCTTTTCTTTTGAGATGGGGTCTTGCTCTGTCA +CCCAGGCTGGGGTGCAGTGGTGCAATCACGGCTCACTGCAGCCTCAACCTCCAGGGCTGA +AGCGATTCTCCCATCTCAGCCTCCCATGTTGAGATCACAGGTGTGCACCACCACACCTGG +CTCCTTTTCCTGATTTGTTTTTTGTAGAGATGGGATTTCGCTATGTTGCCCAGGCTGGTC +TCTAACTCCTGGACTCAAGTGATCCTCCCGCCTCAGCCTCCCAAATTGCTAGGATTACAG +GTTTGAGCCCCTGCACCTGGTCAACCTGAGTTTTAAGAGGATCCCTTTGGCGACTGGATT +GAGGACAGACAAGAGTGGACGGGGGACACAAGGAGGCCATTTTCGTTATCCAGGCCTGGT +AGTGGCTAGGGCCAGGAGGGTGGGGTTGGTGGGAAGCAGTCAGATCCCAAAGAGATTTGG +GGATTGGAAGCAAAAGGATTTGCTGGTGACTTGCACATGGGAGGGAGAGAGGTCAGTGCC +TCTGCTAATCAAGGAATCCAGATTGCCACCGAAATTTCTAGGCCCGAGATATTTAGGTAG +TGTCTCACTCTGTCACCCAGGATGGAGTGCAGTGGCACCATCTCGGCTCACTGTAACCTC +CGCCTCCCAGGTTTAAGCGATTCTCCCACCTCAGCCTCCTGAGTAGCTGGGATTACAGGC +ATGTGCCACCACTCCCGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCACGTT +GGCCAGGCTGGTCTTGAACTCCTGACCTCAAGTGATCCACCCACGACAGCCTCCCAAAGT +GCTGGGATTACAGGCGTGAGCCACCATGCTCGGCCTTTTAGGTGGTTTTGAGAGGTATTT +AGGTCACTTCCAATCTCGTGCTTTTCCAAGTGTTGTAAACTACAAATATTCCTTCACGTC +TTCTTGTCTTTTTAATGTTTAGAAAACCTTAAAAGAAATGTTCCCTATTGAGGCAGGGGC +GATCAATCCAGAGAACAGGATCATCCCCCACCTCCCAGGTGAGCACGGGGCTGAGCCGCC +TGTCAGGGGGTCATTGGCGGGGGCTCACCTGCCCTCCCAGCCCCTCTCGGGCTTGACCTC +ATGTTCTCTGGTGCCAGCTCCCAAGTGGTTTGACGGGCAGCGGGCCGCCGAGAACCGCCA +GGGCACACTTACCGAGTACTGCAGCACGCTCATGAGCCTGCCCACCAAGATCTCCCGCTG +TCCCCACCTCCTCGACTTCTTCAAGGTGCGCCCTGATGACCTCAAGCTCCCCACGGACAA +CCAGTGAGTGAACTTTTCACCCTGCCAGGTGGGAGAGGGAAGGAGGGGTGGGACTTTCTG +TGTTTTGCAGATGAGGAAACCAAGGCTCAGAGAGGGAAAGCCACCTTCCCAGAGCCACAC +AGCCAGAAAGAGGAGGCAAATTCCACCTCCGGCCCCTGTGACCCCGCCAAGCCTCCACCT +TAATCTTTCACACCTCAGGGCACTGGGGGAAGCACTCGGGGCTGGAGGTTCAAAGTCCTG +GGTCCTCATCCTGACATTATGGCCACCTGGCCATGGGACCTGGAGCCAGTCACCACTGCT +CTCTGAATGCAGGTTCTCCATTTCTATAATGGGCAGTGAGGATCAGATGAAGCATTGGGT +GTCTTGCGGAGCCCCCCAGAAGGATGTGGGGTTGATGCCTCTGCTAAGTGCTGAGCATGT +CTGGGGTCTCCTGTACCCAGGACCCTGTGTGGAAGGCACCTGAGAGGCTGAGGGAGCTCC +AGGCAGGCTGGGGAAGTCCCCTTCTCCACTCCTCTCTGGTCACTGAAGCTCGAAGTGGGG +AGCATGAGGACAGGACGTTACCCCTTGTCAAGGCACCCAGGCTGCCAAGACAGAGACAAG +CAGCATTGCTCCGGCCAGCACTTATTGACGCTTGAAGGTGTCCCCTGGCCCAAGGAAGGG +CAGTTATCATCAGCCCGGGAGGCGGGGGAAGGATGGACTCTGCAGTGGGGTCCGCTCCTC +ATTGCCTGCTCTCTCAGGGCTCCAGAAGGAGGAAGAGGCCGGGCACAGTGGCTCACACGT +ATAATCCCAGCACTTTGGAAGGTCGAGGTGGGCAGATCACCTGAGGTTGGGAGTTTGAGA +CCAGCCTGGCCAACATGGTGAAACCCCATCTCTACCAAAAATATAAAAATTTAGTCAGGC +ATGGTGGTGTGCGCTTGTAATCCCAGCTACTTGGGAGGCCGAGGCAGGAGAATCGCTTGA +ACCCGGGAGGCAGAGGTTGCAGTGAGCTGAGACTGCGCCACTGCACTCCAGCCTGGGTGA +CAGAGCGAGACTCTGTCTAAGAAAAAAAAAAGAAAAGAAGAAAGAAGATGGCCTGGGAGC +CCGCAAGAGCATTTTCCAGGCTTAGGGCATCCTTTGGGTCTGCAGAAGGCTATGCAGTGT +CCTCCTCATGTCCCTCCCTTGGGCTGCCCGAGCAGATCCGCCCGCCCCCATCACTTCCTG +AAGCCCTTCCTCAGCCAGTCCAGTTGCTGTCTTCTCTCCGCAGTGCCCCTTCCCTTTCCC +GGGTCCCTCTTCTCTTGGGAAGTTCTTCTGCAGGTCTACCCAGTGCCTCTTCTTCCTCCA +TGGGAAGCCAAGGGTCTCACCCAGACTGTTCTCTCCTCAGGACAAAAAAGCCAGAGACAT +ACTTGATGCCCAAAGATGGCAAGAGTACCGCGACAGGTGAGAGGACGGGGGGCAGCCGGC +GGGGGGGGACACCCTGAGGAGACCCAGAGTGTTCAGGGAATGGAGCAGGGGCTGGGAGCA +GGCTGGGAGGGCTCACAGCTACCCTGCTGAAGAATTGGGTCTTTGGGCCGGGTGCGGTTG +CTCATGCCTGTAATCCCAGCAGTTTGGGAGGCCGAGGCAGGTGGATCACTTGAGGTCAGG +AGTTTGAGACCAGCCTGGCCAACATGGAGAAACCCTGTCTCTACTAAAAATCCAAATTAG +CCAGGCGTGGTGACAGGTGCCTGTAGTCCCAGCCACTTGGGAGGCTGAGGCAGGAGAATT +GCTTGAACCCGGAAGACGGAGTTTGCAGTGAGCCGAGATCGTGCCACTGCACTCCAGCCT +GGGCAGCAGAGCCAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAAGAAGAATTGGGTCT +TTGGAAGGTCCCTGGAGACTGAAAGGAGCCCTTTGCAGGTGGCAGTGCAGAGACCAGCGC +AGACCCTTGCTACTGGCAGCCGGGGGAGTGTTTGCGGCTGAATGAATGAACAGGTTTTGG +AGGGCAGTGTGGCCTTCAGAGGCGATGCAGGGCTGTGGCAGTTTCTAATACTTATTGCAC +AGTCACTGCTAATAACAATAATAATAATAATACCTAACATTAATGGAGTGCTTACTCTGT +GCCAGCCACTATTTTGTTTTTGTTGTTTTCAGTGACAGGGTCTCGCTCTGTTGCCCAGGC +TAGAGTGAAGTGGTGTGATCATAGCTCACTACAGCCTCGACCTCCTGGGCTGAAGCGATC +CTCCCACCTCAGCCTCCCAAGTAGCTGGGATTACAGGTGTGTGCCACCATGTCCAGCTAA +TTTTTAATTTTCTGGTAGAGATGGGGTCTCACTACATTGCCCAAGCTGGTCTTAAGCTCT +TGGCCTCAAGCAACCCTCCTGCCTCAGCCTCCCAAAGTGCTGAGATTATAGACATGAGCC +ACTGTGCCCGGCTTTTTCTTCTTCTTATAAGGACACGAGGCCTGTTGGGTTAGGGCCCAC +TCTACTGACCTCATTTTAATTTAATTACCTCTTGAAACGTACTTAAGAGTACCTTTCTCT +TAATACACCCACACTGTAAGGTACTGGGTGGTTAGGACTTCAACATATGAATTTTGAGAA +GGCGGATGTCAGCCAATACTAAACAGCATCAGCACCTCCACGGTTGGATGAAGGGCTGGT +CAGAAATGCACACTCAGGTCCCACAGTGGACCTACTGAACAGGATAGGCATTTTAGCAAA +ATCCCAGGTATTCGGGTGCACCTTAAAGTTAGGAAAAGGTCAGGCACTGTGGCTCATGCC +TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGTTGAATCACCTGAGGTCAGGAGTTCGAG +ACCAGCCTGACCAATATCGTGAAACTCCATCTCTACTAAAAATACAAAAATTAGCCAGGT +GTGGTGGCGGGTGCTTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGTGAATTACTTGA +ACCTGGGAGGTGGAGGTTGCAATGAGCCAAGATTGCACCACTGCACTCCAGTGACAGAGC +GAGACTCCATCTCAAAAAAAAAAAAAAAAAAGTTGGGAAAAGGCCAGGTGCAGTGGCTCC +ACACCTGTAATCCCAACACTTTAAGAGGCTGAGGTGGGAGAATCCTTTGAGCCCAGGAGT +TCGAGACCAGCCTGGGCATTGTCCCAAGACCTTGTCTTTACAAAAAATTAGCCGGGTGTG +GTGGCATACGTCTATGGTCCCAGCTATTCGGGAGGCTGAGGCAGGGAGATTGCTTGAGCC +TAGGAGTCCAGGGCTGTAGTGAGCTGTGATCACGTCACTGTACTCTAGCCTGGGCAACAG +AGCAAGACTCTGTCTCCAAAAAAGAAAATAAAGTTGGGAAAGGCTCACTAACTTCATCAG +ATGAGAACAAGGACATGTTTGAAGTGTGAGGCCGAAGCCTGGAGAACGCTATGCGCCCAG +GAAATGCAGGGCAGCAGAGACTCAAGATGCCAGCGCCTGTTCTGGAGGCCCAGATGGGCC +CTGCAATGCCCACTCACCCTGCCCTCCCTCTTGCCCCAGACATCACCGGCCCCATCATCC +TGCAGACGTACCGCGCCATTGCCAACTACGAGAAGACCTCGGGCTCCGAGATGGCTCTGT +CCACGGGGGACGTGGTGGAGGTCGTAGAGAAGAGCGAGAGCGGTCAGACCTCCCACCTTA +CGGGGCTCCTTCCCCTGGTGCTCAGGAACCCACAGCCACAAGCCCCCTGCCAAGGCTCAG +GCAGCCTTGCCCCTGGGAGGACTCCGGCTCTGTTAGGGGCCCTAAATGTCCTCCCCACAC +TGTGGGTCGCCTTCTGTCTTAGTGTGCACCCTGTGGTGGCTGTGGGCATCTGTGCATGGC +AGGCCGGGGCGGGGCATGTCTGCGTGTTCTGTCTGGATGGGTATGGGACCGTCTGTTCAT +TATGAAGTGGGCTCAGAGCTGTGATTCTGTGAGCATGTGTGCATGCATGCATGTGACCTC +ATTGTCCAGTGTGGTGAAGGTGACATTTCCAAATCTGAGCATTGGACATCAGTGTGTCTG +TGTCCCTGTGTCCTCACCATCCCTGATGGCTGCAGGGAGCCGCTGGGCCCTGCCCCTCAG +TCACATTCCCGCACCTCTGGCACAGGTTGGTGGTTCTGTCAGATGAAAGCAAAGCGAGGC +TGGATCCCAGCGTCCTTCCTCGAGCCCCTGGACAGTCCTGACGAGACGGAAGACCCTGAG +CCCAACTATGCAGGTGCCCCCTGCCCTCCGAGGCTGTAGGGGTGTGGGAGAAAGGGGCAG +GCAGGGCTCAGGGATATTGAGTGACTGCTTTGGAGTCTGGGCTGGTTGCTGGCTTGGCAG +AAAAGTCAGGGCTAAGATCTCATCGACTCTGGCTTGGGGGCCCTGGCAGGTTGTGATGCC +CTTGGTCTGGACAGGGAACCAGGAGGAGGAGCAGACGACTGGGGAGAGTGGGAGGCCAGT +GGTGTCTGTGGATATGTGGCCAGGTTCAGTGGGAAGCTGAAGGATGAGCAGACCTTAGGC +TCAGGAAGGAGGGCTGCCTGGAAGTGGGGGCATCATCACTGACCAGAAAGGGAAAACTGG +CAGTGCCAGGGCTGGATGGGGCCTGCATTGAGCTTGAAAAAAACTATAATAGAATTGGTT +ACCATTTTATTTTATTATTTATTTATTTATTTTACTTTTTTGAGATAGAGTCTCACTCCC +TTGCTAAGGCTGGAGTGCGGTGGTGCTATCTCAGCTCACTGCAACCTCTGCCTCCCAGGA +TCAAGTGATTCTCCAGCCTCAGCCTCCCCAAGTAGCTGGGATTACAAGCATGCACCACCA +TGCCTGGATAATTTTTGTATTTTTAGTTGAGACGGGGTTTCACCAGGTTGGCCAGACTGG +TCTCGAACTTCTGACCTCAGGTGATCTGCCTGCCTTGGCCTCCCAAAGTGCTGGAATTAC +AGATGTGAGCCACTGTCCCTGGCCTGGTTACCCACATTTTAAAATGGAGTGATTTCACCC +TTTTATGTGGATTTACAGCTTTTTTTTTTTTTTTTGAGACAAAGTCTGGCTCTGTCACCC +AGGCTGGAGTGCAGTAATGCAATCTCAGCTCACTGCAACCTTAGCCTCCTGGGTTCAAGC +AATTCTCCTGCCTCAGCCACCTGAGTAGCTGGGATTACAGGCATGCACCACCACGCCAGG +CTAATTTTTTGTATTTTTAGTAGAGATGGGGTTTCGCCATGTTGGCCAGGCTGGTCTCGA +ACTCCTGACCTCAGGTGATCCGCCCGCCTTGGCCTCCCAAAGTGCTAGGATTACAGGTGG +GAACCACCTTGCCCAGCCTGTGGCTATCGTTTAAACACTGGGAAGGCCTGCAGCCCCCAG +GCCGACAGTTAGCTGCAGCTGAGCAGTTCCCAGTGCCAGGTAGACGGATGCTCCACCCAC +CTACTCATGGCTGATCTCTTGTCATAGTGAAGTGTCTGGACAGACCTTCATCGTTATGGG +ATCTCTGGTCCCCAGAGTGGGTGGCAATGAATGGGAGTGGACAAGCTCACCTGGGTGTAG +GGGGCAGAGGGCCGAAGTCCAGAGTGTACCCCCAGAGTGGGTGCCAGCAGGAGCTTGCCG +AGGGATCTGGGATGGAGCAGGAGGGTGGAGGGAGGAGACCCAGAAGAGGGGGAACTGTGG +GCCCTGGGTGGGTCTGGAGTGCCTGGAGGAAGCCCAGGCGCAGAGAGGAGAAGATGGGAT +GGGTGGCGAGCCCCAGGCTGGGCCGACCTCACACTGTGCTCTGTGCCCCTGCCGTGGACC +AGGTGAGCCATACGTCGCCATCAAGGCCTACACTGCTGTGGAGGGGGACGAGGTGTCCCT +GCTCGAGGGTGAAGCTGTTGAGGTCATTCACAAGCTCCTGGACGGCTGGTGGGTCATCAG +GTAGGAGGGCCCCTCTCCATCCAGAGCACCCATCTGAGTCAGCCCCAGCCAGGACGGGGT +GTTTAGGGATCTGGGGTGACTTGTCCCTGGGACTCTGGGTAAGCCACTGCCCCTCTCTGG +GCTTAGTTTCCATCTCAGTAGCAGGGAGGAATGAGCCCACCCTTGCCTGTCTTGTGGGGA +TCCAATGTCCTTGTCCAAGTGGGTGCATTTCTCCTTTGTGATTTAGGGTCTCTTCCCAAC +CATCTATTATTATTCCTTCTCTGGCAACATGGTGAACTGTTGTATAAATAATTACATTCC +TAGCTAGGCGCAATGGCTCAGGCCTGTAATCCCAGCACTTTGGGAGCCCAGGACAGGACG +ATCATGTGAGGTCAGGAGTTCGAGACCACCCTGGCCAACATGGCAAAACCCTATCTCTAC +TAAAAACACAAACATGAGCCGGGTGTGGTGGTGGGAGCCTGTAATCCCAGCTACTCGGGA +GTCTGAGACAAGAGAATCACTTCAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATCGC +GCCATTGCACTCCAGCCTGGGCAACGAGAGCGAAACTCCGTCTCAAAAAAAAAAAAAAAA +AAAAAGATTACTTTCTTTTTATCATTCCTTTATCTTTTAAAGCTTTCTTGCAGTCAGGTG +CAGTGTCTCATGCCTGTAATCCCAACACTTTGGGAAGCTGAGGTGGGAGGATCACTCAAG +GCTACAAGTTCAAGACCAACCTGGGCAATGTAGGGAGACCTCTGTCTCTACAAAAAAAAT +TAAAAAATAGCTGGATGTGGTAGCACACACCTGTAGCCCCAGCTACTCAGGAGGCTGAGG +TGAAAGGATCACTTGACCCCAGGAGTTGGAGGCTGCAGTGAGCTATGACTGCACCACTGC +ACCCCAGCCTGGGTGATGGAGCAAGACCCTGTCTCAAAAAAAAAAAAAAAAAAAAAGCTT +CCATTGCAATTCCCATCTGTTTATCCTCCAAATGAATGCAGAAATACTAATTATCTTTTT +TCTGGTTCTGGGGAACACAGAATTCTAGCGGCTTGTGGAGCCATTTCCCTGGAGCCATGG +GGCCTCCCAGGTCCTTTCCTGTGTCTTCATTTTTTACGAATTTTTTCATTTTTTGAGACA +GGATCTTGCTCTGACTCCCAAGCTGGAGCACAATCATCGCTCACTCAAGCGATCCTCCCA +CCTCAGGCTCCCACGTAGCTGGGACTACAGGTGAGCACCACCACATCTGGCTAATGTTTT +TTAATTTTTTTGTAGGGATGGGGTCTCACTATGGTGCCAAGACTAGTCTTAAACTCCTGG +CCTCAAGAGTTCCTCCTGCCTTGGCCTCCCAAAGCACTGGGATTACAGGAATGAGCCTCC +ATGCTGGGCCTTTGCTGGCGTCTTCAGAGCCCTAGGTCACAGGGCCAGCCTGGCGCCCTG +CCGCAAGCTTATCTTAAAGCTGGGACCACAACATGCATACCTGCAGCCGGGCCCGGGGCC +AGAGGGCTTTGAGGCAGCATTTCTCAGCCTTTTAGACACACACTCTGTTAACCCCCATCC +TGTGTCTCTGATAATCTTCTTGTGATCCTCCCACCAGCCAAGAATTGGGTTTTATGTGAA +CCTTGTATTATGCAAAGTTTTCTTTTGTTTTTTTTTTCACTCCCAAATATAATATTGAGA +ATAGAAAGAAAGTCTTTTCAACAAATGGTGCTGGAACAGATGGATTTCCATACTGGAAAA +AAAAAAAAAAGAGCAAAAAACAAACCTAGACCCCTTCCTCACACTGTACACATATGTTTA +CTTCAGATGGATCACAGGTTTATCCCAGAGTAAAACCTGAAACTAAAAACCATTTGGGGC +TGGACAGGGAGCTCACGCCTGTAATCTCAGCACTTTGGGAGGCTGAGGCAGGTGGATCAC +TTGATGTCAGGAGTTTGAGACCAGCCATGACCAACATGGTGAAATCCTGTCTCTACTAAA +AAAATACAAAATTAACCAAGTGTGGTGGTGCATGCCTGTAATCCCAGCTACTTGGGAAGC +TGAGACAGGAGAATTGCTTGAACTTGGGAAGCAGAGGTTGCAATGAGTCGACATCATGCC +ATTGCACTCCAGCCTAGGCAACAAGAGCAAAACTCTGTCTTGGGGTTGGGCGGGGGAAAA +GCATTTGGAAGAAAGCATAGAATTTGGTGGCTTGGAGGTAGGCAAAGGTTCGTAGGAGAC +AGAAGGCAGTTAACATAAAAGAAAAATTGGCAAATATAATCCGCCAATGTCTTCTTTTTT +CTTTACTTTTTTCGGGAGGTAGAGATAGGGGTCTTGCTATGTTACCCAGGCTGATCTCCA +ACTCCTGGCCTCAAGCGATCCTCCCACCTAGATCCCTCAAAGTACTGGGATTACAGGCGT +GAGCGACCGTGCCCTGCCCATTCTTGCCAATGTCTTATAGCAAATACCTGTCCCCTGCGG +TGACCTGGATCTGCTAACCTCCACCCCTGCCTAGACTGTGGAAGGATTGCTGGAAGGGTC +TCAGTTGCACAGACCAGGAAACTGAGGCCCACAGAGGCAGGTGTCCGGTTGTTTGCAACC +TCTCAGCCTGTGCTAACCCCAATTGTTCAGAGAGAGCCCTGAAACCCTCTCCTCTGGGCG +CCCCCAGGTGACTGCCCCAGCCTCAAGGGCTGCCTCTGTTGCAGGAAAGACGACGTCACA +GGCTACTTCCCGTCCATGTACCTGCAAAAGTCAGGGCAAGACGTGTCCCAGGCCCAACGC +CAGATCAAGCGGGGGGCGCCGCCCCGCAGGTAAGCGGGGGTCCCCGGGGCTGGGCGGGGT +CGAGCGGGGCGCACCACGGGTTCGCTCTGTCTAGGCCATAGCTTGGCAGTGCCGGGGCGG +GGGCTCTCAGCCTGGCAGGAGAGGCAGGACCCTCACGGGGGAAAGGGGCTGGACGCGCCT +GGCCGCGGTGTGGGGCTGGCACGGGGGCGGAAGGAAAGCGGCGATGCCCGGGGGCTTTGG +GGATGGGCAGTCCAGGGGGGGTCCCCGGAGAGGGGGACGACAGACCGAAGGCTGGTGAGG +GGCGTGGAAAACCGCCCAGGCTCTGCTGCAGGGCAAGGGTCCTTGTCGTGACGGGGGCAG +CCGCCTCTTGTCCCGCCGGGGTCGTGCAGACTACCGGCCCCCTACTGCCCCCCACTTCCT +CGGACCAGGGGTGCCCATCTGAGTCCCTGGGGGCAGGGGCGCCCTCGGGCTTTGACGACG +CCCCGTCCCGCTGGGCCAGGTCGTCCATCCGCAACGCGCACAGCATCCACCAGCGGTCGC +GGAAGCGCCTCAGCCAGGACGCCTATCGCCGCAACAGCGTCCGTTTTCTGCAGCAGCGAC +GCCGCCAGGCGCGGCCGGGACCGCAGAGCCCCGGGAGCCCGCTCGGTGAGTGCAGCGGGA +GAGGGCAGGAAGGGCAAGCCCTAGAGGCGGAGTCAGCGGGAGAGGCGGGGCCAGAGGTAG +GGCCAGAGTAGCGGGGCGGGACCAGAGGGCGGAATCAGAGGGAGAGGCGGGGACTGGAGG +CGGGGTCAGAGGAGGAGCCAGCGCTAGGGGGCGGAGCGATCCCTAAGAGGCGGAGTCAGA +GGGAGAGGCACAAGCGGGAGGCGAGGCCAGAGCGCGGAGCAGGAGTTGGAGACCGCGGCG +GGGCGAGGCCAGAGAGCGCTGTGGGCGGGGCCAGTGTGCGGGGCGGGGCGTCTGACTCGG +CCCCGCTCTCTGCCCGCAGAGGAGGAGCGGCAGACGCAGCGCTCTAAACCGCAGCCGGCG +GTGCCCCCGCGGCCGAGCGCCGACCTCATCCTGAACCGCTGCAGCGAGAGCACCAAGCGG +AAGCTGGCGTCTGCCGTCTGAGGCTGGAGCGCAGTCCCCAGCTAGCGTCTCGGCCCTTGC +CGCCCCGTGCCTGTATATACGTGTTCTATAGAGCCTGGCGTCTGGACGCCGAGGGCAGCC +CCGACCCCTGTCCAGCGCGGCTCCCGCCACCCTCAATAAATGTTGCTTGGAGTGGACCGA +GGCTCTGCAGGAATGCAGGGAGGGCCGGGCTCCGCCCCAGGGTTATTTCTAAGTTGAGGA +CAGGAGGTTGTGAGTTCTGCTGGGGGGAAGTTGCAAGAGCCGAGGTCTGGTTGCATGTTG +CCCTGGTCTTGGCCAAGAACAGGTTTGCACAAGGCCAAGTTCAAGAGGAACTCCCGGTTT +CCTGCTGACCGTTTGGTCAGAAACCACCTGCTTGGACTCTGGCGGAAGAGTGCTGAAGAT +GGGTGCACACAGTGCAGCAGGGCAGCCCTGTCTCATGACAGGAGACAGGCTGCCGTCCAG +GGTGTAGGAGTGACCTCATAGCTGGGATAAAAAATATATTATAACTTAGGTTCGGGCGCG +GTGGCTCACGCCTGTAACCCAGCACTTTGGGAGACCGAGGTGGGAGGATCCGTTGAGCTC +AGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACCAAAAATATAAAA +ATTAGCTGGGCGTGGTGGCATGCATCCATAATCCCAGCTACTGGGGAGGCTGAGGCATGA +GAATCGCTTGAACCGGGGAGGCAGATGTTGCAGTGAGCCGAGACGGCGCCACTGGACTCC +AGCCTGGACAACATGGTGAAACCCCATCTGTACCAAAAATATAAAAATTAGCTGGGCGTG +GTGGCATGCATCCACAATCCCAGCTACTGGGGAGGCTGAGGCATGAGAATCGCTTGAACC +GGGGAGGCAGATGTTGCAGTGAGCCGAGACGGCGCCACTGCACTCCAGCCTGGACTACAG +AGCGAGACTCTAGCTCAAAAAAAAAAAAAAAAAAAAGTAACTTAGGTGCAGGGTGTCCTC +TGTTATTCACTGAGACCGTGCCCCGGTTATGAGGTTGTACCAGAAAGCAAGTATTCACTA +TGCACACTATTCACCGCTCACCCTAGCATTGAAGCCAGCCTGTAGCCTGAAAGCCTTTGC +TTTGAGGGCAGGTCTTTCCCCAAAATGCAGACACGAAGGTGCAAAGTGAAGCTGCCAGTC +TTGCAAAAGATGTAACTTGTCACGAAGGCCACGAGTGGCAGGGAGAGCTGTCCCACATTT +GCGGAAGTGGCTATGTGAGGACGGGGGAGGCGGGTCCCTTAGAGATAAGAGACAATCATA +AGGGGAGATATCAGAGAAAATCGTAAGGGGAGCAGATGGTTGTCAAGAGAATAGGCTGAC +CATCGAAGGACTGGCAGAAGCTTTCAGAAAACCACTGGACGGCTGGGCACAGTGGCTTAG +GCCTGTAATCCCAGCACTTTGGGAGGCTGACGCGGGTGAATCACTTGAGGTCAGGAGTTC +CAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACAGAAAATATAAAAATTAGCCA +GGCGTGGTGGCACAAGCCTAGAATCCCAGCTACTTGGGAGGCTGAGGCAGGCGAATGGCT +TGAACCCAGGAGTCAGAGGCTGCAGTGAGTCGAGAGTGTTCCACTGCACTCCAGCCTGGG +TGACAGTGCAAGACTCCTTCCAAAAAAAAAAAAAGAAGAAAAAAGAAAACCACTGCAGCT +CTAAACTAGTTCTGCATTTTTGCCGAACCTGGTTTGCTGGAAAAGCCCAGCACCAAAGGC +TATCATACAAAGCTGTGGGAAATTGAATCACCAACCTCACCCCTTCTGCTTGTTCAGTTG +CAGTTATAACCCTTTTATTAAATACAGTATAAAATACCACGCCTCTAATCCGAGTGCTCT +GGGAGGCCAAGGTGGAAGGATTGCTTGGGGCAGGGAGTTCAAGACCAGCCTGGGCCACAC +GGTGAGACCCATCTAGACAAACATTTTTAAAAATTAGACAGGTGTGGTGGTGTGCACCTG +TAGTCCCAGGCTGGGTGGGAGGATCGCTTGAGCCCAGGAGTTTGAGGCTGCAGTAAGCTG +TGATCGTGCCACTGTACTTCAGCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAAAAAAA +ACAAATTAAGGCCAGCCACAATGTTTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAG +GCAGGCGGATGACTTGAGGCCAGGAGTTCGAGACCAGCTTGGCCAACATGGTGAAAGCTT +GTCTCTGCTAAAAATACAACAACAACAACAAAATTAGCCGGGTGTGGTGGTACACGTCTG +TAATCCCAGGTACTCAGGAGGCTGAGGCGGAAGAATCACTTGAACCCGGGAGGCAGAGGT +TACAGTGAGCCGAGATCACGCCACCACACTCCAGCCTGGGCGACAGAGCATGACTCTGTC +TCAAAAATAAATAAATAAATAAATAAAAATAAAAAAATACATACATATAAAAAAGAAGAG +AAAATACTTATTTTCATGATTGTTTCATTTTTTTCCAAGCTGAGGTCCTGACCAAATGTT +TCCCCGTGGTTTTTGTATATTTCTGATCCCCTTAAAGTGGCCCTTGCAGCGTGCTGGGGA +GGAGGCCTCTCGGTGGGTGGGTGATTGGGAAGCCACGCCCACAGGGAAGGGAGAAAAAAA +CTGAAGTTGCCTGCTCTAAGGGCTGCACCTCTGTTTAGCCAACAATTTTCTTGAGGCCAG +TCATAGTGCTGGGCTCATTCCATCATCTCCAGTTCTGAGACCAACACTGTCTGGTAGGTG +TTTTATGGATGAGGAAACCAAAGCTCATAAAATTAATTAAGTGGTTTGCTTATGAAATAA +TGCAAGAAAGCAACAGTGCTGGGAGGTGGAACCGGGTTTTTCTTTTCTTCTTCTTTTTTT +TTTTTTTGATGGAGTTTCGCTCTTGTCACCCAGGTGGAGTACAATGGAGCGACCTCAGCT +CACTGCAACCTCTGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCGAGTAGCT +GGGATTACAGGTGCCCGCCACCCTGCCCAGCTAATTTTTTGTATTTTTAGTAGAGATGAG +GTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTAATCCGCCTGCCTT +GGCCTTCCAAAGTGGTGGGATTACAGGCATGAGCCACTGCGGCTGGCCCAGGCTTTTCTT +TTTGATTCTAAGGCCTTCCAGATCCAGTACTTTATCCTGACCCTGCAGAGCTCAGGATCT +GTGGTTCAGGCCTGCGGCTCAGAGGAGGGAATACGGCACACAGGTACGTGCCAGGCCAAG +TGTGAGGGGGTCTGGCTTCCAGGGCCCTCTGCAGACCCCTAGTCCCAGGGCCTGTGTTGG +AGGAGAGATTGGCTCTTTCCTCTGCCCATCCTGGGATGAGAAGTCGGGGACTTGGGATAG +ATGCAGTGCAATCCCTGCCCCTGAAAATTGACAAAGACCCACCAAATCTAGCCCCTACTC +CTAGGCTGGGCCCCTGCCCATCTCCCTGGACCACACCACTCTCAAGGGCTCTCGTTTGCA +GACACCCCATTGCCCTACTAAAAACCTCTCCTGGCCAGGCTTGGTGGCTCACGCCTGTAA +T diff --git a/paraphase/data/ncf1/ref.fa.fai b/paraphase/data/ncf1/ref.fa.fai new file mode 100644 index 0000000..237e259 --- /dev/null +++ b/paraphase/data/ncf1/ref.fa.fai @@ -0,0 +1 @@ +chr7_74768800_74792800 24001 24 60 61 diff --git a/paraphase/data/neb/homopolymer_sites.txt b/paraphase/data/neb/homopolymer_sites.txt new file mode 100644 index 0000000..6130a8c --- /dev/null +++ b/paraphase/data/neb/homopolymer_sites.txt @@ -0,0 +1,656 @@ +chr2 151578919 A +chr2 151578920 A,C,G,T +chr2 151578921 0 +chr2 151578922 0 +chr2 151578923 0 +chr2 151578924 0 +chr2 151578925 A,1 +chr2 151578927 A +chr2 151578928 A,C,G,T +chr2 151578929 0 +chr2 151578930 0 +chr2 151578931 0 +chr2 151578932 0 +chr2 151578933 A,1 +chr2 151579080 A +chr2 151579081 A,C,G,T +chr2 151579082 0 +chr2 151579083 0 +chr2 151579084 0 +chr2 151579085 0 +chr2 151579086 0 +chr2 151579087 0 +chr2 151579088 0 +chr2 151579089 0 +chr2 151579090 0 +chr2 151579091 0 +chr2 151579092 0 +chr2 151579093 0 +chr2 151579094 0 +chr2 151579095 0 +chr2 151579096 0 +chr2 151579097 0 +chr2 151579098 0 +chr2 151579099 0 +chr2 151579100 0 +chr2 151579101 0 +chr2 151579102 0 +chr2 151579103 0 +chr2 151579104 0 +chr2 151579105 0 +chr2 151579106 0 +chr2 151579107 0 +chr2 151579108 A,1 +chr2 151579306 G +chr2 151579307 A,C,G,T +chr2 151579308 0 +chr2 151579309 0 +chr2 151579310 0 +chr2 151579311 0 +chr2 151579312 1,G +chr2 151579521 T +chr2 151579522 A,C,G,T +chr2 151579523 0 +chr2 151579524 0 +chr2 151579525 0 +chr2 151579526 0 +chr2 151579527 1,T +chr2 151579681 T +chr2 151579682 A,C,G,T +chr2 151579683 0 +chr2 151579684 0 +chr2 151579685 0 +chr2 151579686 0 +chr2 151579687 1,T +chr2 151579696 T +chr2 151579697 A,C,G,T +chr2 151579698 0 +chr2 151579699 0 +chr2 151579700 0 +chr2 151579701 0 +chr2 151579702 1,T +chr2 151579744 T +chr2 151579745 A,C,G,T +chr2 151579746 0 +chr2 151579747 0 +chr2 151579748 0 +chr2 151579749 0 +chr2 151579750 G +chr2 151579751 A,C,G,T +chr2 151579752 0 +chr2 151579753 0 +chr2 151579754 0 +chr2 151579755 0 +chr2 151579756 0 +chr2 151579757 0 +chr2 151579758 1,G +chr2 151579770 T +chr2 151579771 A,C,G,T +chr2 151579772 0 +chr2 151579773 0 +chr2 151579774 0 +chr2 151579775 0 +chr2 151579776 1,T +chr2 151579977 T +chr2 151579978 A,C,G,T +chr2 151579979 0 +chr2 151579980 0 +chr2 151579981 0 +chr2 151579982 0 +chr2 151579983 0 +chr2 151579984 0 +chr2 151579985 0 +chr2 151579986 1,T +chr2 151580002 T +chr2 151580003 A,C,G,T +chr2 151580004 0 +chr2 151580005 0 +chr2 151580006 0 +chr2 151580007 0 +chr2 151580008 0 +chr2 151580009 0 +chr2 151580010 0 +chr2 151580011 0 +chr2 151580012 0 +chr2 151580013 0 +chr2 151580014 0 +chr2 151580015 0 +chr2 151580016 0 +chr2 151580017 0 +chr2 151580018 0 +chr2 151580019 0 +chr2 151580020 0 +chr2 151580021 0 +chr2 151580022 0 +chr2 151580023 1,T +chr2 151580172 T +chr2 151580173 A,C,G,T +chr2 151580174 0 +chr2 151580175 0 +chr2 151580176 0 +chr2 151580177 0 +chr2 151580178 1,T +chr2 151580180 T +chr2 151580181 A,C,G,T +chr2 151580182 0 +chr2 151580183 0 +chr2 151580184 0 +chr2 151580185 0 +chr2 151580186 1,T +chr2 151580313 T +chr2 151580314 A,C,G,T +chr2 151580315 0 +chr2 151580316 0 +chr2 151580317 0 +chr2 151580318 0 +chr2 151580319 1,T +chr2 151580347 A +chr2 151580348 A,C,G,T +chr2 151580349 0 +chr2 151580350 0 +chr2 151580351 0 +chr2 151580352 0 +chr2 151580353 A,1 +chr2 151580373 A +chr2 151580374 A,C,G,T +chr2 151580375 0 +chr2 151580376 0 +chr2 151580377 0 +chr2 151580378 0 +chr2 151580379 0 +chr2 151580380 0 +chr2 151580381 A,1 +chr2 151580424 A +chr2 151580425 A,C,G,T +chr2 151580426 0 +chr2 151580427 0 +chr2 151580428 0 +chr2 151580429 0 +chr2 151580430 0 +chr2 151580431 0 +chr2 151580432 A,1 +chr2 151580976 A +chr2 151580977 A,C,G,T +chr2 151580978 0 +chr2 151580979 0 +chr2 151580980 0 +chr2 151580981 0 +chr2 151580982 A,1 +chr2 151581068 T +chr2 151581069 A,C,G,T +chr2 151581070 0 +chr2 151581071 0 +chr2 151581072 0 +chr2 151581073 0 +chr2 151581074 1,T +chr2 151581107 T +chr2 151581108 A,C,G,T +chr2 151581109 0 +chr2 151581110 0 +chr2 151581111 0 +chr2 151581112 0 +chr2 151581113 0 +chr2 151581114 1,T +chr2 151581146 A +chr2 151581147 A,C,G,T +chr2 151581148 0 +chr2 151581149 0 +chr2 151581150 0 +chr2 151581151 0 +chr2 151581152 A,1 +chr2 151581155 A +chr2 151581156 A,C,G,T +chr2 151581157 0 +chr2 151581158 0 +chr2 151581159 0 +chr2 151581160 0 +chr2 151581161 0 +chr2 151581162 0 +chr2 151581163 A,1 +chr2 151581185 A +chr2 151581186 A,C,G,T +chr2 151581187 0 +chr2 151581188 0 +chr2 151581189 0 +chr2 151581190 0 +chr2 151581191 A,1 +chr2 151581263 T +chr2 151581264 A,C,G,T +chr2 151581265 0 +chr2 151581266 0 +chr2 151581267 0 +chr2 151581268 0 +chr2 151581269 1,T +chr2 151581283 T +chr2 151581284 A,C,G,T +chr2 151581285 0 +chr2 151581286 0 +chr2 151581287 0 +chr2 151581288 0 +chr2 151581289 1,T +chr2 151581592 A +chr2 151581593 A,C,G,T +chr2 151581594 0 +chr2 151581595 0 +chr2 151581596 0 +chr2 151581597 0 +chr2 151581598 A,1 +chr2 151581599 A +chr2 151581600 A,C,G,T +chr2 151581601 0 +chr2 151581602 0 +chr2 151581603 0 +chr2 151581604 0 +chr2 151581605 A,1 +chr2 151581691 A +chr2 151581692 A,C,G,T +chr2 151581693 0 +chr2 151581694 0 +chr2 151581695 0 +chr2 151581696 0 +chr2 151581697 0 +chr2 151581698 0 +chr2 151581699 A,1 +chr2 151581747 T +chr2 151581748 A,C,G,T +chr2 151581749 0 +chr2 151581750 0 +chr2 151581751 0 +chr2 151581752 0 +chr2 151581753 1,T +chr2 151581811 T +chr2 151581812 A,C,G,T +chr2 151581813 0 +chr2 151581814 0 +chr2 151581815 0 +chr2 151581816 0 +chr2 151581817 1,T +chr2 151582072 T +chr2 151582073 A,C,G,T +chr2 151582074 0 +chr2 151582075 0 +chr2 151582076 0 +chr2 151582077 0 +chr2 151582078 1,T +chr2 151583053 A +chr2 151583054 A,C,G,T +chr2 151583055 0 +chr2 151583056 0 +chr2 151583057 0 +chr2 151583058 0 +chr2 151583059 0 +chr2 151583060 0 +chr2 151583061 A,1 +chr2 151583087 C +chr2 151583088 A,C,G,T +chr2 151583089 0 +chr2 151583090 0 +chr2 151583091 0 +chr2 151583092 0 +chr2 151583093 0 +chr2 151583094 1,C +chr2 151583789 A +chr2 151583790 A,C,G,T +chr2 151583791 0 +chr2 151583792 0 +chr2 151583793 0 +chr2 151583794 0 +chr2 151583795 A,1 +chr2 151584096 T +chr2 151584097 A,C,G,T +chr2 151584098 0 +chr2 151584099 0 +chr2 151584100 0 +chr2 151584101 0 +chr2 151584102 1,T +chr2 151584106 A +chr2 151584107 A,C,G,T +chr2 151584108 0 +chr2 151584109 0 +chr2 151584110 0 +chr2 151584111 0 +chr2 151584112 A,1 +chr2 151584561 A +chr2 151584562 A,C,G,T +chr2 151584563 0 +chr2 151584564 0 +chr2 151584565 0 +chr2 151584566 0 +chr2 151584567 A,1 +chr2 151584659 G +chr2 151584660 A,C,G,T +chr2 151584661 0 +chr2 151584662 0 +chr2 151584663 0 +chr2 151584664 0 +chr2 151584665 1,G +chr2 151584694 T +chr2 151584695 A,C,G,T +chr2 151584696 0 +chr2 151584697 0 +chr2 151584698 0 +chr2 151584699 0 +chr2 151584700 1,T +chr2 151584701 A,C,G,T +chr2 151584702 0 +chr2 151584703 0 +chr2 151584704 0 +chr2 151584705 0 +chr2 151584706 1,T +chr2 151584707 A,C,G,T +chr2 151584708 0 +chr2 151584709 0 +chr2 151584710 0 +chr2 151584711 0 +chr2 151584712 0 +chr2 151584713 0 +chr2 151584714 0 +chr2 151584715 0 +chr2 151584716 1,T +chr2 151584866 T +chr2 151584867 A,C,G,T +chr2 151584868 0 +chr2 151584869 0 +chr2 151584870 0 +chr2 151584871 0 +chr2 151584872 1,T +chr2 151584894 T +chr2 151584895 A,C,G,T +chr2 151584896 0 +chr2 151584897 0 +chr2 151584898 0 +chr2 151584899 0 +chr2 151584900 1,T +chr2 151585213 A +chr2 151585214 A,C,G,T +chr2 151585215 0 +chr2 151585216 0 +chr2 151585217 0 +chr2 151585218 0 +chr2 151585219 A,1 +chr2 151585418 T +chr2 151585419 A,C,G,T +chr2 151585420 0 +chr2 151585421 0 +chr2 151585422 0 +chr2 151585423 0 +chr2 151585424 A +chr2 151585425 A,C,G,T +chr2 151585426 0 +chr2 151585427 0 +chr2 151585428 0 +chr2 151585429 0 +chr2 151585430 A,1 +chr2 151585442 A +chr2 151585443 A,C,G,T +chr2 151585444 0 +chr2 151585445 0 +chr2 151585446 0 +chr2 151585447 0 +chr2 151585448 A,1 +chr2 151585475 A +chr2 151585476 A,C,G,T +chr2 151585477 0 +chr2 151585478 0 +chr2 151585479 0 +chr2 151585480 0 +chr2 151585481 A,1 +chr2 151585599 T +chr2 151585600 A,C,G,T +chr2 151585601 0 +chr2 151585602 0 +chr2 151585603 0 +chr2 151585604 0 +chr2 151585605 1,T +chr2 151585651 A +chr2 151585652 A,C,G,T +chr2 151585653 0 +chr2 151585654 0 +chr2 151585655 0 +chr2 151585656 0 +chr2 151585657 A,1 +chr2 151585723 A +chr2 151585724 A,C,G,T +chr2 151585725 0 +chr2 151585726 0 +chr2 151585727 0 +chr2 151585728 0 +chr2 151585729 0 +chr2 151585730 A,1 +chr2 151585958 T +chr2 151585959 A,C,G,T +chr2 151585960 0 +chr2 151585961 0 +chr2 151585962 0 +chr2 151585963 0 +chr2 151585964 1,T +chr2 151585965 C +chr2 151585966 A,C,G,T +chr2 151585967 0 +chr2 151585968 0 +chr2 151585969 0 +chr2 151585970 0 +chr2 151585971 1,C +chr2 151586055 A +chr2 151586056 A,C,G,T +chr2 151586057 0 +chr2 151586058 0 +chr2 151586059 0 +chr2 151586060 0 +chr2 151586061 A,1 +chr2 151586099 A +chr2 151586100 A,C,G,T +chr2 151586101 0 +chr2 151586102 0 +chr2 151586103 0 +chr2 151586104 0 +chr2 151586105 A,1 +chr2 151586120 A +chr2 151586121 A,C,G,T +chr2 151586122 0 +chr2 151586123 0 +chr2 151586124 0 +chr2 151586125 0 +chr2 151586126 A,1 +chr2 151586615 A +chr2 151586616 A,C,G,T +chr2 151586617 0 +chr2 151586618 0 +chr2 151586619 0 +chr2 151586620 0 +chr2 151586621 A,1 +chr2 151586622 T +chr2 151586623 A,C,G,T +chr2 151586624 0 +chr2 151586625 0 +chr2 151586626 0 +chr2 151586627 0 +chr2 151586628 1,T +chr2 151586848 A +chr2 151586849 A,C,G,T +chr2 151586850 0 +chr2 151586851 0 +chr2 151586852 0 +chr2 151586853 0 +chr2 151586854 A,1 +chr2 151587273 T +chr2 151587274 A,C,G,T +chr2 151587275 0 +chr2 151587276 0 +chr2 151587277 0 +chr2 151587278 0 +chr2 151587279 1,T +chr2 151587597 A +chr2 151587598 A,C,G,T +chr2 151587599 0 +chr2 151587600 0 +chr2 151587601 0 +chr2 151587602 0 +chr2 151587603 A,1 +chr2 151587641 A +chr2 151587642 A,C,G,T +chr2 151587643 0 +chr2 151587644 0 +chr2 151587645 0 +chr2 151587646 0 +chr2 151587647 A,1 +chr2 151587649 A +chr2 151587650 A,C,G,T +chr2 151587651 0 +chr2 151587652 0 +chr2 151587653 0 +chr2 151587654 0 +chr2 151587655 A,1 +chr2 151587806 A +chr2 151587807 A,C,G,T +chr2 151587808 0 +chr2 151587809 0 +chr2 151587810 0 +chr2 151587811 0 +chr2 151587812 0 +chr2 151587813 0 +chr2 151587814 0 +chr2 151587815 0 +chr2 151587816 0 +chr2 151587817 0 +chr2 151587818 0 +chr2 151587819 0 +chr2 151587820 0 +chr2 151587821 0 +chr2 151587822 0 +chr2 151587823 0 +chr2 151587824 0 +chr2 151587825 0 +chr2 151587826 0 +chr2 151587827 0 +chr2 151587828 0 +chr2 151587829 0 +chr2 151587830 0 +chr2 151587831 A,1 +chr2 151587851 A +chr2 151587852 A,C,G,T +chr2 151587853 0 +chr2 151587854 0 +chr2 151587855 0 +chr2 151587856 0 +chr2 151587857 0 +chr2 151587858 A,1 +chr2 151587930 A +chr2 151587931 A,C,G,T +chr2 151587932 0 +chr2 151587933 0 +chr2 151587934 0 +chr2 151587935 0 +chr2 151587936 A,1 +chr2 151588013 A +chr2 151588014 A,C,G,T +chr2 151588015 0 +chr2 151588016 0 +chr2 151588017 0 +chr2 151588018 0 +chr2 151588019 0 +chr2 151588020 T +chr2 151588021 A,C,G,T +chr2 151588022 0 +chr2 151588023 0 +chr2 151588024 0 +chr2 151588025 0 +chr2 151588026 0 +chr2 151588027 0 +chr2 151588028 0 +chr2 151588029 1,T +chr2 151588091 A +chr2 151588092 A,C,G,T +chr2 151588093 0 +chr2 151588094 0 +chr2 151588095 0 +chr2 151588096 0 +chr2 151588097 0 +chr2 151588098 0 +chr2 151588099 0 +chr2 151588100 0 +chr2 151588101 A,1 +chr2 151588102 A,C,G,T +chr2 151588103 0 +chr2 151588104 0 +chr2 151588105 0 +chr2 151588106 0 +chr2 151588107 0 +chr2 151588108 0 +chr2 151588109 0 +chr2 151588110 0 +chr2 151588111 0 +chr2 151588112 A,1 +chr2 151588113 A +chr2 151588114 A,C,G,T +chr2 151588115 0 +chr2 151588116 0 +chr2 151588117 0 +chr2 151588118 0 +chr2 151588119 0 +chr2 151588120 A,1 +chr2 151588153 A +chr2 151588154 A,C,G,T +chr2 151588155 0 +chr2 151588156 0 +chr2 151588157 0 +chr2 151588158 0 +chr2 151588159 A,1 +chr2 151588204 A +chr2 151588205 A,C,G,T +chr2 151588206 0 +chr2 151588207 0 +chr2 151588208 0 +chr2 151588209 0 +chr2 151588210 0 +chr2 151588211 A,1 +chr2 151588240 T +chr2 151588241 A,C,G,T +chr2 151588242 0 +chr2 151588243 0 +chr2 151588244 0 +chr2 151588245 0 +chr2 151588246 1,T +chr2 151588270 A +chr2 151588271 A,C,G,T +chr2 151588272 0 +chr2 151588273 0 +chr2 151588274 0 +chr2 151588275 0 +chr2 151588276 A,1 +chr2 151588351 A +chr2 151588352 A,C,G,T +chr2 151588353 0 +chr2 151588354 0 +chr2 151588355 0 +chr2 151588356 0 +chr2 151588357 0 +chr2 151588358 A,1 +chr2 151588372 T +chr2 151588373 A,C,G,T +chr2 151588374 0 +chr2 151588375 0 +chr2 151588376 0 +chr2 151588377 0 +chr2 151588378 1,T +chr2 151588482 A +chr2 151588483 A,C,G,T +chr2 151588484 0 +chr2 151588485 0 +chr2 151588486 0 +chr2 151588487 0 +chr2 151588488 A,1 +chr2 151588498 T +chr2 151588499 A,C,G,T +chr2 151588500 0 +chr2 151588501 0 +chr2 151588502 0 +chr2 151588503 0 +chr2 151588504 1,T +chr2 151588656 T +chr2 151588657 A,C,G,T +chr2 151588658 0 +chr2 151588659 0 +chr2 151588660 0 +chr2 151588661 0 +chr2 151588662 1,T diff --git a/paraphase/data/neb/neb_config.yaml b/paraphase/data/neb/neb_config.yaml new file mode 100644 index 0000000..ed67f38 --- /dev/null +++ b/paraphase/data/neb/neb_config.yaml @@ -0,0 +1,21 @@ +gene: neb +use_supplementary: True + +# data files +data: + reference: ref.fa + homopolymer: homopolymer_sites.txt + +# coordinates +coordinates: + hg38: + nchr: "chr2" + nchr_old: "chr2_151578750_151588780" + nchr_length: 242193529 + extract_region1: "chr2:151578700-151609700" + extract_region2: "" + + left_boundary: 151578800 + right_boundary: 151588500 + + noisy_region: [[151584195, 151584196], [151578903, 151578903], [151579992, 151580022]] diff --git a/paraphase/data/neb/ref.fa b/paraphase/data/neb/ref.fa new file mode 100644 index 0000000..885978e --- /dev/null +++ b/paraphase/data/neb/ref.fa @@ -0,0 +1,169 @@ +>chr2_151578750_151588780 +GGAAGGAACGAAGGAAGGAATGAAGGAAGGAAGGAAGGAAGGGCGGGCAAGGCTGGGCAT +GGTGGCTCACTTCTGTAATCCCAGAACTTTGGAAGGCTGAGACGGGTGGATCACCTGCGG +TCAGGAGTTCAAAACCAGCCTGGCCAAGATGGCAAAGACCTGTCTCTAATAAAAATACAA +AAATCAGCTGGGTGTGGTGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGATGCAGG +AACATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGGGCTGAGATTGCACCACTGCACTC +CAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAGT +AAGAAACTGGAAACACTGTACACATTAACAAATCTATAGGAAAATTCTGGTAGCACCTTT +AACAATGAAAACTTTCTGAAGAAACCTGGCTGTAGGTAGGCAATTAATAATTCTGATAAA +ATAATAATAAAGTGGAGTGCCTACGTACAGAGAGTTGTGTTTTCAGAGCATTGGAAGGAG +CCATCAGGCAAAGCAATGGGGGACTTGTTTCCTGGGCGACACACTTACGTCGCTCTGTAG +GTCGTAGGCTTTCCTGGCTTGGATCACATCATTCTGGTCGGGAAAGCAAGACCAGCGGTG +CAGGTAATGGCGATAGTCCACATCACTTGCCAGTGCCTGACCTTCTTTGGCAGCGCTGAT +GGACACCATGTCCACAGGGATGGAGATCTTGGCTTTGTGGTCGTTGTAGGCCTTTTTGTA +CAGCCTGTCATTCTGCATCTTCAACACATTTGCTGCCCAAACCAGTTTAGGGTCTTCCTT +GGCGCTGCGACAGCCAATGTAATGGCCTTTCTGTTTCTCATAAGCAGTTTTGTACAGATA +CTGGTAGATGTGAACAGAAAATAGAAGAGTAATTTTTCCAAACCCACTTTTTCTAGAGAG +CCTACTCAAACTAACTCCATCTCCCTTAGTTTACCTTTTTTGGGGGGGTCGGGGAATACT +GTTTTTATTTAAATTCAAGCCATAAGTTTTGATACATTGGTTTGTCTTCTGTGTCTACAT +GCATGAAAACATTGAGAGCATCTCTTAGGCAAGAACTGTGTTGCTGGATTATCTCCCAAT +ATAGCCTGGCCACCTTCAGATGTTTTGTCACACACAAGTACCAGTTGTACTTCTTATTAC +TTTAAATCAAATTAATTCTTTGTTGAAATTTTTTTTATTTTACTTTTCTTTTCTTTTTTT +TTTTTTTTTTTTTGAGACAGAGTCTCGCTCTGCTCCCCAGGCTAGAGTGCAGTAGCACGA +TCTCGGCTCACTTGCAATCTCCACCTCCTGGGTTCAAGTGATTCTCCTGCCTCTGCCTCC +CCAGTAGCTGGGATTACAGGCATCCGCCACCATGCCCAGCTAATTTTTGTATTTTTAGTA +GAGACAGGGTTTCACCGTGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCCA +CCCGCCTTGGCCTCCCAAAATGCTGGGATTACAGGCATGAGCCACCATGCCTTGCCTGAA +ATAATTTTTAATTATAAACTTATTTTAAAATTTTATTTAAAAATGTATAACCCTATCTAC +GTGGAAAAAAACAGTGTATTTCTAGCACACATTAAAACAACCATATAACTATTTTAAAAA +AATACTTGTGCATGTACTACATAAAATTATTATAGGAAACACCGGTAGTATACACGCCAC +ACTTTGAGTAACAACAGAATTCTCTGTACTATGAAATTAAATCTCTTTAAGTACTTGTAA +ATTAAGTGACATTGTGAAGATATTGAAAAGAAAATCAAAATCTGTGCTTAAACGCAAGAG +TTGCAGAACCAAACATAGATGCAAAAGGATCCTCAGATTGTAAGAGTTGTTCTTGAATTT +AGGATTTACAAATATACTAATAACAATGACCCAATTCCATTTAAATATAATTCAAAGGGC +ATTAAGTTAAACTCCCTCTATTTCTACTAGTTAGTGTTTGTTCCGGTTGGGAAGGAGGAG +CTCTTACGTCACTGGCAATATCCCTGGAAGCCTTGGCATGCTGGATCCCAATGGCATCTG +CTCGCAGGTCATAACCAGTCATCTTGACATCTTCCCAAGCTTGCTGATAGCGTTTCTGCA +AACAGAGAGTGCAATGCCACAGTCAGTCTGAAGAGGGAGCTACTGAGTCAGCATTACTGA +TGTATTTAAAAATAGATATTTACAGTTACACTGAACATTAATGTCTCCTTAAGATTTATG +AAAGAGTCTGGTTTCATATTGAGCTTGAATTCCAGTAGATTTTTAATGACATTTAGTAAA +GCTCTCTCCCAATAATTATTTTTTGAAGAAGGAAACTTAGGAGATAATAAATCATGTAAA +AATCAGAAAAAAACATATTGCTTGGCTTTTACAAGGAAAAAGCAATGATGGATGGCCCAC +CAAAACCTGAGAACTAGGTATTACTCTACGTGTATCTATAAGTTTTATGTTGTGTTTTTG +GAGAAACCATTCCATTTTTCCTTCCAACAAATATTGAGATGTACATTATACAGACACACG +CAGACCCACACAGCTACTTATCATTAACTTAAAGACTAGAAATAAAAGGAATGGGAAGAA +AATGCCTTTTGACAAACTCTCTTTGGGGTGTATAATAAAAGGTTTGGTTAGGCAGAGAGC +ACTGTGAAAAGCAAATGATGTGTGCTGTCTTACATTGCTGATCTGCAGGGCATTGATTTT +GGATTGCAGCATCAGGGGAGTGTCAGCTGGCACGTTCACATTAGCCTTCTCTTTCTCCCA +GGCATCGCGATACAATGGCTGGGAAAAATGAAAAACGATGGAATGGTCAATTAGTAAATA +AGTCAATTACCACATAAATAAAATTATAAAGTCATAAAACATACTTGAATAAATGGATGA +TGAAAAAAATTTTAAGTGAATTTTATAATAAATTAAAACTAACATAAGTAAATTAAAATT +TTTAAGTAAGATTAATAGGTAAATCTAATTTTATATTAAATAATAAGCAAAATGAAAAGT +TCTTTTTGGATGTTCTCCGATATAAACGTAGATTATACATAGCATAAGCCACTGCACCTT +GCCTCCTTTGCACTTTTATTTAAAAGGTGCCAATAGTGACTTCAATAAGAACACATTTAT +GTGTGATACAACATAATTTAAAATGAGTCAGAAATATGGTAGTAAAATTATTCTTTTCCA +TTTACTCACATCTAATTATCAGATGGGTAAGTCCACATTGCAGATAGCTCTGATAATTCC +CATTGCTGAGCAGGACCTAGCTGTTTTTGGTCCAAATAACTACCATAATGTCTAACATTG +AAGTTGTCCCTAGACTTCTCAACCTATTTCAAGCAATAAAGAAGTGGCAATTTTGTGATT +ATAGTACTAAGTGGTGGGCCTATATTATGGTCTGGATCTCCAAATGTGATCAAAGTGTGT +ATCAGGGTGTGTAGAACATGAAGATGCCCACCACATGTGCATTGACCAAACACTTATCAA +AGCAAATGAAATCCAGAGGCTGCTAAGAGACTGACTACACAAAGATCCATCTGGATACAT +GCAAACGTGTTTCAGATCCTGACGCTGACATGAAGAGTCTAAAGTTCACGTTTAAAATTC +TTCTGGATTCAGATTTCTTCCATTTGTCTGAAAACATCATCCACCTGGACTTACCTCACT +GATCTGTACAGCATTGATCTTTGCCTGGATTACTTCCGGGGTGTCAACAATGCTGGTGAA +TTTGAGGGCCTCAGGCTTTATACGATACAGCCTTTCATTCAAGAGATTCTGGGCATTCTT +CACTCTCATCACTTCCACAGAACCCTCTGGCAACCATCCAATACCCTTCAGCCATTCCAG +GTCTGACTTGTACAAGTTCTACATGAAGGAGAGGAACACAGAGTGGGGAAGAGTCAAAGT +TGCCCATAAGAGTTTGATGCTTGCAAACCACACATCACCTCTGTTTAAATCAGTTAGGTG +GCTAATCTCATAAAACTCAAGGATCAACATTAAATGTTAATATTGGGTGTTATCTCTCTG +GCACACATGGGAGAAGTTTAGAGTATCCTGCAGGAATTGGAACAGGGCTCACAAGTGGCA +CTTGGTGGAACGGCAGCCAGCAATGGCACATAACTTTGTATGCTTGGATTAGTTTCTACA +CTCACCACTTGAAGTCTGCCTTTTATGTACCACTGGAAGTAACATCAGTAATAACATCAG +TTGGATCTTAGGAGGTAAAGAACCTTAGATGTATGCGTAATGAGAAAAAAATACATACCT +GTATCTATATCCATCCTGCCCCCCATCTCTCTCTCTTTCTGTGTCAAGTAAACATGAAAT +CCAACATTTCCTTATGCTTACATCTACTTATGAAATATGTATATGACTATGCAAAGGTAA +CGGCTCAGAGCAGCTGCAACTGAAATTTTCTACCTAATTACATAGAATAAGAGACTGATT +TTCATCTTGGTAAATAAGGTCTTGCTACCATAAGAAGGAGTCCTCTATAAGATAGAAAAT +GTCTGGGGAAAACTCTGCTACCATGAATTGTACAGAATATGTTAGGTTCTATTAAGATTA +CCATTTAAATTATGCCTCTGGGGAATGCACTGGATTTAAACACACAAATACACATCTCCC +CGGGGTCTGGCGATAATATACGCACATCGCTCTGCAGGTCGTAGGCTTTCTTGGCCTGGA +TCACATCGTTCTGATCTGGCAGACACGTCCACTGGTGCAGATAATTGCGATAATCAATGT +CGCTAACCAGGACCTGGCATTTCTTGGCTTGAACAATTGACATCATGTCCAACGGTGTGT +TGTGAATTCCTTTAGACTTCTCATAAGCTTTCTTATATTCTCTGTCACTCTGGATCTTGG +CAGCATGTATAGCCCATACTGACTTGGGGTCATCTTGTAGGGTGCGGAAACCCATGTGGT +GGCCCAGTTGCTTACGGTAACCTTCTTTGTATTTGTACTAAAGTAGTAAGAAATCATATT +AAAAAGACATATGAAAATAATGTATTAAGAGGGGTTTTCTAATTTAAATGGTTTCAGTAT +GTTTTCTTAGCCTTGGTATAAATTGGCTCAACATTGGCAGTCATTACATATTTCAGTTCA +CTTTAAAGAAGTTAGATAATACGATTTTGGGTCATTTCAGAGAGGACATGAGTAAATCAG +GAGGGAGATGGTATGGTGAACAAGAGTAATGAAAACAATAGCTAACATTTACTAAGTAAT +TACTCTATGCCGGACACTATTATGACTACTTCTAATGTGTTGGGTAGATCCACATGAAAC +TACCATGTTTTTCGGTCAAAAATCATTGACTGGTGGCAATTATGATCACAGTTCATCCTC +ACAATGTCTCTATGAGGTAGATACTATTATTAACATATATTCCAGTTGGGGAAAACAAGG +CACAGGTGATAATCTCCATCACCAAGTTTCCCTGACAAGTGCAACATTAGGTCAAATGAT +CTCCTAAGAGTACTGAAGGAACCTCAGGTAGGGAGAGGTCATACGTGTGTGGTTTTGGAA +ACAGACAAGAAAGATGCATGTTTCAAGGATGGAAAGCAAGGCTGTGGCTTGAATCAGAAG +ATAACAACAAAAGTTCTGTGGACAAGAGTTTCCATCTGATGAGTGTCTCAGGACTAGTGT +AAGGTGGTTTCTGTTTCTTTCCTATGTTTGGATAACATAGGCAAGTTTGAACAAAGAGAG +AGAAAACTGGGAATCAAGCAGATAACGTGGACTAGCTATGGTTATTAGAGGCAAAAATGA +AGAATAAGAAGGAGGAAACAACTGGCTACATGATGAAAATCAGAGAAGAACGCAGAGAAA +GGGAACAAGGTGGCTTAAAGAGTTGGAGCTGGGGGCAGGGTGCACCTGGTAGCATAATTA +TTCTATTTTTATTTTTATTTTTTTTTGAAACGCAGTTTCAGTCTTGTTGCCCAGGCTGGA +ACGCAATGGCATGATCTCGGCTTACTGCAAACCTCCGCCTCCCAGGTTCAAGTGATTCTC +CTGCCTCAGACTCCTGAGTAGCTGGGATTACACGTCCCGCCACCACGCCCAGCTAAGTTT +TTGTTTGTTTGTTTGTTTGTTTGTATTTTTAGTAGAGATGAGGTTTTACCATGTTGGCCA +GGCTGGTCTCGAACTCCTGACCTCAGGTGATCCGCCCACCTCAGCCTCCCAAAGTGCTGG +GATTACAGGGGTGAGCCACCATGCCCGACAGGTAGCATAATTATTTTGACCACTAGTAAG +TTTTAAAATTAGAATTAAATGAAGTTCCAAAGATTACTAAGAGTATTCTACATAAGCCTC +CAGTATAAAATCTTGCATATAAATATGATTTCAGTCTTAAGTCTTATATGAAACTATCAT +CTGTGACTTATATTTGTGGAGAATGTCAGAAATATTTTAGGGATAAAAATGACATTGTAT +TGGGTCCTGAATTTTCCCAGGAATTTAATACAGACTCTTCATGATGTTTTGTTTCTACTT +TGTAACAGCATCAAAGTAAATTTACACATATACCTCATCTATGCTAACTTTAATGTTTTC +CATTCCATTTTAATATTTTCAAATACAGAAAGCTTCTAGCAGAGGATGAGATGCAATTCT +ATGCACAGATTTTTTAAAAACACAAAATTCTTGAAAAATAGTCTCCCTGCTCGTTTTGTA +GAAAAGAAAAACCACAAGAAAAGAGAAGGAAGCGTACCTCACTGGCAATTTCTCTGGAGG +CCTTGGCGTGCTTGATTTCAATGGCATCTGCCCTTATGTCATAGCCTTTCTGCTTGGTGT +CATTCCAGTCTTTTTGGTAGAGTTTCTATAGAGGGAAAATAAAGGTTTGTTTACAAGAAT +GGAAAAATAAGCAAATTTACTCAAATTTGTCATAATATAAATCATAAAGAATATGGTAAT +GTAGTCAATATAAGAAAAAAGCATTCATTTCTGGGTTCTTTTCGTTGTTGGATATTTCCT +CTGTGGTGTACAGGATACTTCTGGTTAAAATTTTAAAGCTAAACATCAAACTCTATTAAT +GTTTGAATATTTGCATCCATCACAATATGGTATGCATTTGAAAATATCCCACTACAAGTA +CATGGAACCTAAGGATCAATGGGGAAGCCAGAGAATCTTTATCAAGATATCAGGTAAGAT +CTGATAAGATTTTTCTCCCCCTAATTTTGTCCTTTGACTTTTCAGAAGTTTCTGCTAAAG +GGTTGTGACATAGATATCATACATAGCCCGCCATTTGGATGTAATTAAAAATGGTCCTTT +GTCTTTCTTATGGCCTTGAAATGTCAAATGAAAAAGTCCTGGATCAATAAGAAAAACCTA +AGATGATAACAGAGACTGTTAAGAGTGAAGGAGGACACCAAAAGGATTCAGGTGTCCCAA +ATCCTGCAGTCTAGGCACCCACCGATGTATCCAAGCACACATTTAAAAGAGAGAAACTCC +ATTCATATCGCATTGGGACTGGTAGCTAAAGAGAAAATGTCTAGCACCAAAAGCCACAAA +CATTGCAAGGTAGGTAATTTGTCGCTATAACAATCTCCATGAGCAGACAGATATTGCCCA +TAAAGGCTTTTACACTTCATGTTCGTGCCACTTACATTGCTGATTTGCAAGGCATTGATG +TGGGCCTGCAGCATCTCCGGAGTATCAGAAGGAATGGTGATGTTGCTTTTATCTTTATTC +CAGGCTTCCTGATACAGTTTCTGTGGAGAGGAGGGAAATAGGGAATCAATATCTGAAACA +TTAAGCCTTAGCTAATGTTTGGCTAATAGTTAACTAATGTAAGCCTTAGCTAACTACTCA +ATCTGGAAAAACGTTTTTACATATGTTTATAATGGTTAACTTTATGTGTCAGCTTGGCTG +GGGCATGGTACCCAGATATTTAGTCAAACATTATTTTAGGTGTTTCTGTGAAAGTATTGT +TAAAATGAAGTTAACATTTAAATCAGTAGATTCCAAGTAAAGGAGACTACCCTTCATGAT +GTGGGTGAGCCTTATCTAATTAGTTGAAGACCTTGATAGAAAAGACAAACCTCCCCAGTA +AAAAGGAATTCTGCCAGCAGATGGCCTTTGGGTTTGAATTGCAACACATCCCTCAGTCTC +TAGCCGGCTGGCCTACCCTGCAGACTTTGGACTTGCCAGCCTGCATGATCTCAGGAGCCA +ATTTCTTAAAGTCTCTCTCTTCTGTGTGTGTGTACAAACACACACACACAAACACACACA +CACACACAGCCCCTATTGGTTCTGTTTCTCTAGAGAATCTGGACTAATACAATGTTATGT +TATTATGTAGCATGACACTACATGTCTTAGTATTTAATATGAGGTCAACAAGTATTTGTG +TTCCTGAGTTGCCAGGGAGTGACATTTTCACTGCAGTGCTTCAAAGTGTGTATGTGTGGT +AGAAGAAAGGCATGTTGGTGTTTGTCAGAGTTCTATTTTCAGTGTATCTGCACTAGTGAA +GACATTTTTAGAGTTGCTCGTCTCACCAAAATTCTTTGAAGAAGTCCAGTCTACGGGCTC +AGTTTGGGTGGTATCTTTGGAAAAGATAGTAGGATGGTTCTTACCTCACTTATTTGCAGA +GAATTGGCTTTTGCCAAGACAACTTCTGGAGTGTCGACAATGCTGGTGAATGACAAAGCT +TCTGGACGTGTCCTATAGAGTCTTTCATTCACGAGGTCTTGAGCAACCTTCACTCTGTTC +ATTTCCACTGAGCCTTCTGGCATCCAGCCGATGCCACGCAGCCACTCCAGGTCTGCCTTA +TACACACTCTATAAAGAAGATGTCAGACAAAAATACCATTTCTGACCAACATGGTGAAAC +CCCATCTCTACTAAAAATACAAAAATTAGCCGGGCATGGTGGCGCATGCCTGTAATCCCA +GCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGCTTGCAGTGA +GCCGAGACTGTGCCATTGCACTCCAGCCTGGGCAACAAGAGCAAACCTCCGTCTCACAAA +AAAAAAAAAAAAAAAAAAAAAGGTAAATGTCAACATTTAATGAAAAAATAGGCAAAGATT +GTATGAGAGGCAAGGTGCAGGCTTTCAGCACTGAAACTCTGAGTACATGCAACGGCAAAG +CAAAAATAAAACATGAAAGTACATCATTGTGACACTGGAAATCCCTTTTGTGATCATAAG +TTGAAGTCGTGAGTGCTAACATTTAAAAAAATTTTTTTTAAGGGATGGAGTGCAATTGGA +GACTGCACTCCAGCCTGGGCAATGAGAGTGAAACTCCATTTCAAAAAAAAAGAAAAAAAA +AAGGAAAAAACATTTCAAGTGTTAGACATACAAATCTTTAGCTGAAAAATAAAGCACATG +GGGTTGAAGGGATAGGTGCAGGAGATTGTACTTCCAAAAAAGTAAGACCTGAGATCTCAC +ATGATTCAGGGTTTTTAATATTGATCACAGAAGTCATTTGTAAAAACATAATTATTTATA +CTAGCCATGAATAATAATATGTATAGTAATAAAGCTGAATTCCATGATATAATTGATATT +TCAAAAAATTTTATAGTTGAAAATTTTTAAAGGAAAATTTTGCTGAATTGGATTCCACTT +TAATCCTGTGATAATTGACCCAAGGAAATGCTGTTTAGTGTTTTGGATCTAGCATAATAG +CAGCAATTATTTTAAAAACATTCAGATGATTTTTATTTGTCTGAAAATGTATTTGTCTGA +AAAGTGCTGGGATTACAGGTGTGAGCCACCGCGCCCAGCCCTATGTCATGTTCTTTCTTC +TTCTTTATTCACCATTGTATCTTTAGTGCTTAGCACAATGCCTGGCACATGGTGTGTTTA +ATAAGTATTTTTAAAAGGATGATTAAGAATTCGATCACTAGTTCCACAGTTTTGGTCTCC +AGAAACTGTCTTAAATCTACCACTTGGCAATGCATTTTAAGTGATGAGCATGCTTTCCTG +TAATGAACTCA diff --git a/paraphase/data/neb/ref.fa.fai b/paraphase/data/neb/ref.fa.fai new file mode 100644 index 0000000..167f02b --- /dev/null +++ b/paraphase/data/neb/ref.fa.fai @@ -0,0 +1 @@ +chr2_151578750_151588780 10031 26 60 61 diff --git a/paraphase/data/pms2/pms2_config.yaml b/paraphase/data/pms2/pms2_config.yaml new file mode 100755 index 0000000..e52556c --- /dev/null +++ b/paraphase/data/pms2/pms2_config.yaml @@ -0,0 +1,24 @@ +gene: pms2 +check_nm: 0.1 + +# data files +data: + reference: pms2_ref.fa + homopolymer: pms2_homopolymer_sites.txt + +# coordinates +coordinates: + hg38: + nchr: "chr7" + nchr_old: "chr7_5967000_5992500" + nchr_length: 159345973 + extract_region1: "chr7:5970000-5989062" + extract_region2: "chr7:6735630-6754792" + + left_boundary: 5970925 + right_boundary: 5986000 + + clip_3p_positions: [5989126, 5989052] + + noisy_region: [[5980880, 5980980], [5979663, 5979669], [5975950, 5976080]] + diff --git a/paraphase/data/pms2/pms2_homopolymer_sites.txt b/paraphase/data/pms2/pms2_homopolymer_sites.txt new file mode 100644 index 0000000..8c6c825 --- /dev/null +++ b/paraphase/data/pms2/pms2_homopolymer_sites.txt @@ -0,0 +1,1378 @@ +chr5 5970049 T +chr5 5970050 A,G,T,C +chr5 5970051 0 +chr5 5970052 0 +chr5 5970053 0 +chr5 5970054 0 +chr5 5970055 T,1 +chr5 5970057 T +chr5 5970058 A,G,T,C +chr5 5970059 0 +chr5 5970060 0 +chr5 5970061 0 +chr5 5970062 0 +chr5 5970063 T,1 +chr5 5970223 C +chr5 5970224 A,G,T,C +chr5 5970225 0 +chr5 5970226 0 +chr5 5970227 0 +chr5 5970228 0 +chr5 5970229 1,C +chr5 5970798 G +chr5 5970799 A,G,T,C +chr5 5970800 0 +chr5 5970801 0 +chr5 5970802 0 +chr5 5970803 0 +chr5 5970804 1,G +chr5 5971061 T +chr5 5971062 A,G,T,C +chr5 5971063 0 +chr5 5971064 0 +chr5 5971065 0 +chr5 5971066 0 +chr5 5971067 T,1 +chr5 5971096 T +chr5 5971097 A,G,T,C +chr5 5971098 0 +chr5 5971099 0 +chr5 5971100 0 +chr5 5971101 0 +chr5 5971102 T,1 +chr5 5971281 G +chr5 5971282 A,G,T,C +chr5 5971283 0 +chr5 5971284 0 +chr5 5971285 0 +chr5 5971286 0 +chr5 5971287 1,G +chr5 5971868 A +chr5 5971869 A,G,T,C +chr5 5971870 0 +chr5 5971871 0 +chr5 5971872 0 +chr5 5971873 0 +chr5 5971874 A,1 +chr5 5971876 A +chr5 5971877 A,G,T,C +chr5 5971878 0 +chr5 5971879 0 +chr5 5971880 0 +chr5 5971881 0 +chr5 5971882 A,1 +chr5 5972034 A +chr5 5972035 A,G,T,C +chr5 5972036 0 +chr5 5972037 0 +chr5 5972038 0 +chr5 5972039 0 +chr5 5972040 0 +chr5 5972041 0 +chr5 5972042 0 +chr5 5972043 0 +chr5 5972044 0 +chr5 5972045 0 +chr5 5972046 0 +chr5 5972047 0 +chr5 5972048 0 +chr5 5972049 A,1 +chr5 5972137 G +chr5 5972138 A,G,T,C +chr5 5972139 0 +chr5 5972140 0 +chr5 5972141 0 +chr5 5972142 0 +chr5 5972143 1,G +chr5 5972307 G +chr5 5972308 A,G,T,C +chr5 5972309 0 +chr5 5972310 0 +chr5 5972311 0 +chr5 5972312 0 +chr5 5972313 1,G +chr5 5972317 G +chr5 5972318 A,G,T,C +chr5 5972319 0 +chr5 5972320 0 +chr5 5972321 0 +chr5 5972322 0 +chr5 5972323 1,G +chr5 5972344 C +chr5 5972345 A,G,T,C +chr5 5972346 0 +chr5 5972347 0 +chr5 5972348 0 +chr5 5972349 0 +chr5 5972350 1,C +chr5 5972535 T +chr5 5972536 A,G,T,C +chr5 5972537 0 +chr5 5972538 0 +chr5 5972539 0 +chr5 5972540 0 +chr5 5972541 T,1 +chr5 5972645 C +chr5 5972646 A,G,T,C +chr5 5972647 0 +chr5 5972648 0 +chr5 5972649 0 +chr5 5972650 0 +chr5 5972651 1,C +chr5 5972656 A +chr5 5972657 A,G,T,C +chr5 5972658 0 +chr5 5972659 0 +chr5 5972660 0 +chr5 5972661 0 +chr5 5972662 A,1 +chr5 5972824 T +chr5 5972825 A,G,T,C +chr5 5972826 0 +chr5 5972827 0 +chr5 5972828 0 +chr5 5972829 0 +chr5 5972830 0 +chr5 5972831 0 +chr5 5972832 0 +chr5 5972833 0 +chr5 5972834 0 +chr5 5972835 0 +chr5 5972836 0 +chr5 5972837 0 +chr5 5972838 T,1 +chr5 5972853 T +chr5 5972854 A,G,T,C +chr5 5972855 0 +chr5 5972856 0 +chr5 5972857 0 +chr5 5972858 0 +chr5 5972859 0 +chr5 5972860 T,1 +chr5 5973017 T +chr5 5973018 A,G,T,C +chr5 5973019 0 +chr5 5973020 0 +chr5 5973021 0 +chr5 5973022 0 +chr5 5973023 T,1 +chr5 5973154 T +chr5 5973155 A,G,T,C +chr5 5973156 0 +chr5 5973157 0 +chr5 5973158 0 +chr5 5973159 0 +chr5 5973160 T,1 +chr5 5973212 A +chr5 5973213 A,G,T,C +chr5 5973214 0 +chr5 5973215 0 +chr5 5973216 0 +chr5 5973217 0 +chr5 5973218 0 +chr5 5973219 0 +chr5 5973220 0 +chr5 5973221 A,1 +chr5 5973288 T +chr5 5973289 A,G,T,C +chr5 5973290 0 +chr5 5973291 0 +chr5 5973292 0 +chr5 5973293 0 +chr5 5973294 0 +chr5 5973295 0 +chr5 5973296 0 +chr5 5973297 T,1 +chr5 5973319 A +chr5 5973320 A,G,T,C +chr5 5973321 0 +chr5 5973322 0 +chr5 5973323 0 +chr5 5973324 0 +chr5 5973325 0 +chr5 5973326 A,1 +chr5 5973355 A +chr5 5973356 A,G,T,C +chr5 5973357 0 +chr5 5973358 0 +chr5 5973359 0 +chr5 5973360 0 +chr5 5973361 A,1 +chr5 5973481 C +chr5 5973482 A,G,T,C +chr5 5973483 0 +chr5 5973484 0 +chr5 5973485 0 +chr5 5973486 0 +chr5 5973487 1,C +chr5 5973578 T +chr5 5973579 A,G,T,C +chr5 5973580 0 +chr5 5973581 0 +chr5 5973582 0 +chr5 5973583 0 +chr5 5973584 0 +chr5 5973585 T,1 +chr5 5973650 A +chr5 5973651 A,G,T,C +chr5 5973652 0 +chr5 5973653 0 +chr5 5973654 0 +chr5 5973655 0 +chr5 5973656 A,1 +chr5 5973667 A +chr5 5973668 A,G,T,C +chr5 5973669 0 +chr5 5973670 0 +chr5 5973671 0 +chr5 5973672 0 +chr5 5973673 A,1 +chr5 5973676 C +chr5 5973677 A,G,T,C +chr5 5973678 0 +chr5 5973679 0 +chr5 5973680 0 +chr5 5973681 0 +chr5 5973682 1,C +chr5 5973944 C +chr5 5973945 A,G,T,C +chr5 5973946 0 +chr5 5973947 0 +chr5 5973948 0 +chr5 5973949 0 +chr5 5973950 1,C +chr5 5973964 C +chr5 5973965 A,G,T,C +chr5 5973966 0 +chr5 5973967 0 +chr5 5973968 0 +chr5 5973969 0 +chr5 5973970 1,C +chr5 5974009 C +chr5 5974010 A,G,T,C +chr5 5974011 0 +chr5 5974012 0 +chr5 5974013 0 +chr5 5974014 0 +chr5 5974015 1,C +chr5 5974131 C +chr5 5974132 A,G,T,C +chr5 5974133 0 +chr5 5974134 0 +chr5 5974135 0 +chr5 5974136 0 +chr5 5974137 1,C +chr5 5974155 C +chr5 5974156 A,G,T,C +chr5 5974157 0 +chr5 5974158 0 +chr5 5974159 0 +chr5 5974160 0 +chr5 5974161 1,C +chr5 5975242 C +chr5 5975243 A,G,T,C +chr5 5975244 0 +chr5 5975245 0 +chr5 5975246 0 +chr5 5975247 0 +chr5 5975248 1,C +chr5 5975374 C +chr5 5975375 A,G,T,C +chr5 5975376 0 +chr5 5975377 0 +chr5 5975378 0 +chr5 5975379 0 +chr5 5975380 1,C +chr5 5975536 T +chr5 5975537 A,G,T,C +chr5 5975538 0 +chr5 5975539 0 +chr5 5975540 0 +chr5 5975541 0 +chr5 5975542 0 +chr5 5975543 0 +chr5 5975544 0 +chr5 5975545 0 +chr5 5975546 0 +chr5 5975547 0 +chr5 5975548 0 +chr5 5975549 0 +chr5 5975550 0 +chr5 5975551 0 +chr5 5975552 0 +chr5 5975553 0 +chr5 5975554 0 +chr5 5975555 T,1 +chr5 5975711 T +chr5 5975712 A,G,T,C +chr5 5975713 0 +chr5 5975714 0 +chr5 5975715 0 +chr5 5975716 0 +chr5 5975717 0 +chr5 5975718 T,1 +chr5 5975919 T +chr5 5975920 A,G,T,C +chr5 5975921 0 +chr5 5975922 0 +chr5 5975923 0 +chr5 5975924 0 +chr5 5975925 T,1 +chr5 5976076 A +chr5 5976077 A,G,T,C +chr5 5976078 0 +chr5 5976079 0 +chr5 5976080 0 +chr5 5976081 0 +chr5 5976082 A,1 +chr5 5976084 A +chr5 5976085 A,G,T,C +chr5 5976086 0 +chr5 5976087 0 +chr5 5976088 0 +chr5 5976089 0 +chr5 5976090 A,1 +chr5 5976234 A +chr5 5976235 A,G,T,C +chr5 5976236 0 +chr5 5976237 0 +chr5 5976238 0 +chr5 5976239 0 +chr5 5976240 A,1 +chr5 5976248 A +chr5 5976249 A,G,T,C +chr5 5976250 0 +chr5 5976251 0 +chr5 5976252 0 +chr5 5976253 0 +chr5 5976254 A,1 +chr5 5976867 A +chr5 5976868 A,G,T,C +chr5 5976869 0 +chr5 5976870 0 +chr5 5976871 0 +chr5 5976872 0 +chr5 5976873 0 +chr5 5976874 A,1 +chr5 5976876 A +chr5 5976877 A,G,T,C +chr5 5976878 0 +chr5 5976879 0 +chr5 5976880 0 +chr5 5976881 0 +chr5 5976882 A,1 +chr5 5977005 G +chr5 5977006 A,G,T,C +chr5 5977007 0 +chr5 5977008 0 +chr5 5977009 0 +chr5 5977010 0 +chr5 5977011 1,G +chr5 5977029 A +chr5 5977030 A,G,T,C +chr5 5977031 0 +chr5 5977032 0 +chr5 5977033 0 +chr5 5977034 0 +chr5 5977035 0 +chr5 5977036 0 +chr5 5977037 0 +chr5 5977038 0 +chr5 5977039 0 +chr5 5977040 0 +chr5 5977041 0 +chr5 5977042 0 +chr5 5977043 0 +chr5 5977044 A,1 +chr5 5977056 T +chr5 5977057 A,G,T,C +chr5 5977058 0 +chr5 5977059 0 +chr5 5977060 0 +chr5 5977061 0 +chr5 5977062 0 +chr5 5977063 0 +chr5 5977064 0 +chr5 5977065 T,1 +chr5 5977066 A,G,T,C +chr5 5977067 0 +chr5 5977068 0 +chr5 5977069 0 +chr5 5977070 0 +chr5 5977071 0 +chr5 5977072 0 +chr5 5977073 T,1 +chr5 5977221 T +chr5 5977222 A,G,T,C +chr5 5977223 0 +chr5 5977224 0 +chr5 5977225 0 +chr5 5977226 0 +chr5 5977227 0 +chr5 5977228 0 +chr5 5977229 T,1 +chr5 5977231 T +chr5 5977232 A,G,T,C +chr5 5977233 0 +chr5 5977234 0 +chr5 5977235 0 +chr5 5977236 0 +chr5 5977237 T,1 +chr5 5977708 T +chr5 5977709 A,G,T,C +chr5 5977710 0 +chr5 5977711 0 +chr5 5977712 0 +chr5 5977713 0 +chr5 5977714 T,1 +chr5 5977778 A +chr5 5977779 A,G,T,C +chr5 5977780 0 +chr5 5977781 0 +chr5 5977782 0 +chr5 5977783 0 +chr5 5977784 A,1 +chr5 5977955 A +chr5 5977956 A,G,T,C +chr5 5977957 0 +chr5 5977958 0 +chr5 5977959 0 +chr5 5977960 0 +chr5 5977961 0 +chr5 5977962 A,1 +chr5 5977964 A +chr5 5977965 A,G,T,C +chr5 5977966 0 +chr5 5977967 0 +chr5 5977968 0 +chr5 5977969 0 +chr5 5977970 0 +chr5 5977971 A,1 +chr5 5978122 A +chr5 5978123 A,G,T,C +chr5 5978124 0 +chr5 5978125 0 +chr5 5978126 0 +chr5 5978127 0 +chr5 5978128 0 +chr5 5978129 0 +chr5 5978130 0 +chr5 5978131 0 +chr5 5978132 0 +chr5 5978133 0 +chr5 5978134 A,1 +chr5 5978168 T +chr5 5978169 A,G,T,C +chr5 5978170 0 +chr5 5978171 0 +chr5 5978172 0 +chr5 5978173 0 +chr5 5978174 T,1 +chr5 5978181 T +chr5 5978182 A,G,T,C +chr5 5978183 0 +chr5 5978184 0 +chr5 5978185 0 +chr5 5978186 0 +chr5 5978187 T,1 +chr5 5978202 A +chr5 5978203 A,G,T,C +chr5 5978204 0 +chr5 5978205 0 +chr5 5978206 0 +chr5 5978207 0 +chr5 5978208 0 +chr5 5978209 0 +chr5 5978210 0 +chr5 5978211 A,1 +chr5 5978246 A +chr5 5978247 A,G,T,C +chr5 5978248 0 +chr5 5978249 0 +chr5 5978250 0 +chr5 5978251 0 +chr5 5978252 A,1 +chr5 5978272 T +chr5 5978273 A,G,T,C +chr5 5978274 0 +chr5 5978275 0 +chr5 5978276 0 +chr5 5978277 0 +chr5 5978278 0 +chr5 5978279 0 +chr5 5978280 0 +chr5 5978281 0 +chr5 5978282 0 +chr5 5978283 0 +chr5 5978284 0 +chr5 5978285 0 +chr5 5978286 0 +chr5 5978287 0 +chr5 5978288 T,1 +chr5 5978437 T +chr5 5978438 A,G,T,C +chr5 5978439 0 +chr5 5978440 0 +chr5 5978441 0 +chr5 5978442 0 +chr5 5978443 T,1 +chr5 5978699 A +chr5 5978700 A,G,T,C +chr5 5978701 0 +chr5 5978702 0 +chr5 5978703 0 +chr5 5978704 0 +chr5 5978705 A,1 +chr5 5978707 A +chr5 5978708 A,G,T,C +chr5 5978709 0 +chr5 5978710 0 +chr5 5978711 0 +chr5 5978712 0 +chr5 5978713 A,1 +chr5 5978888 T +chr5 5978889 A,G,T,C +chr5 5978890 0 +chr5 5978891 0 +chr5 5978892 0 +chr5 5978893 0 +chr5 5978894 T,1 +chr5 5979032 A +chr5 5979033 A,G,T,C +chr5 5979034 0 +chr5 5979035 0 +chr5 5979036 0 +chr5 5979037 0 +chr5 5979038 A,1 +chr5 5979040 A +chr5 5979041 A,G,T,C +chr5 5979042 0 +chr5 5979043 0 +chr5 5979044 0 +chr5 5979045 0 +chr5 5979046 A,1 +chr5 5979201 A +chr5 5979202 A,G,T,C +chr5 5979203 0 +chr5 5979204 0 +chr5 5979205 0 +chr5 5979206 0 +chr5 5979207 0 +chr5 5979208 0 +chr5 5979209 0 +chr5 5979210 0 +chr5 5979211 0 +chr5 5979212 0 +chr5 5979213 0 +chr5 5979214 0 +chr5 5979215 0 +chr5 5979216 0 +chr5 5979217 0 +chr5 5979218 0 +chr5 5979219 0 +chr5 5979220 0 +chr5 5979221 0 +chr5 5979222 0 +chr5 5979223 0 +chr5 5979224 A,1 +chr5 5979230 A +chr5 5979231 A,G,T,C +chr5 5979232 0 +chr5 5979233 0 +chr5 5979234 0 +chr5 5979235 0 +chr5 5979236 A,1 +chr5 5979237 A,G,T,C +chr5 5979238 0 +chr5 5979239 0 +chr5 5979240 0 +chr5 5979241 0 +chr5 5979242 A,1 +chr5 5979390 A +chr5 5979391 A,G,T,C +chr5 5979392 0 +chr5 5979393 0 +chr5 5979394 0 +chr5 5979395 0 +chr5 5979396 0 +chr5 5979397 0 +chr5 5979398 0 +chr5 5979399 0 +chr5 5979400 0 +chr5 5979401 0 +chr5 5979402 0 +chr5 5979403 0 +chr5 5979404 0 +chr5 5979405 0 +chr5 5979406 A,1 +chr5 5979432 T +chr5 5979433 A,G,T,C +chr5 5979434 0 +chr5 5979435 0 +chr5 5979436 0 +chr5 5979437 0 +chr5 5979438 T,1 +chr5 5979466 T +chr5 5979467 A,G,T,C +chr5 5979468 0 +chr5 5979469 0 +chr5 5979470 0 +chr5 5979471 0 +chr5 5979472 T,1 +chr5 5979503 A +chr5 5979504 A,G,T,C +chr5 5979505 0 +chr5 5979506 0 +chr5 5979507 0 +chr5 5979508 0 +chr5 5979509 A,1 +chr5 5979513 T +chr5 5979514 A,G,T,C +chr5 5979515 0 +chr5 5979516 0 +chr5 5979517 0 +chr5 5979518 0 +chr5 5979519 T,1 +chr5 5979668 T +chr5 5979669 A,G,T,C +chr5 5979670 0 +chr5 5979671 0 +chr5 5979672 0 +chr5 5979673 0 +chr5 5979674 0 +chr5 5979675 0 +chr5 5979676 0 +chr5 5979677 0 +chr5 5979678 T,1 +chr5 5979689 T +chr5 5979690 A,G,T,C +chr5 5979691 0 +chr5 5979692 0 +chr5 5979693 0 +chr5 5979694 0 +chr5 5979695 0 +chr5 5979696 0 +chr5 5979697 T,1 +chr5 5979978 A +chr5 5979979 A,G,T,C +chr5 5979980 0 +chr5 5979981 0 +chr5 5979982 0 +chr5 5979983 0 +chr5 5979984 A,1 +chr5 5980070 G +chr5 5980071 A,G,T,C +chr5 5980072 0 +chr5 5980073 0 +chr5 5980074 0 +chr5 5980075 0 +chr5 5980076 1,G +chr5 5980230 A +chr5 5980231 A,G,T,C +chr5 5980232 0 +chr5 5980233 0 +chr5 5980234 0 +chr5 5980235 0 +chr5 5980236 A,1 +chr5 5980460 A +chr5 5980461 A,G,T,C +chr5 5980462 0 +chr5 5980463 0 +chr5 5980464 0 +chr5 5980465 0 +chr5 5980466 0 +chr5 5980467 A,1 +chr5 5980597 A +chr5 5980598 A,G,T,C +chr5 5980599 0 +chr5 5980600 0 +chr5 5980601 0 +chr5 5980602 0 +chr5 5980603 A,1 +chr5 5980605 A +chr5 5980606 A,G,T,C +chr5 5980607 0 +chr5 5980608 0 +chr5 5980609 0 +chr5 5980610 0 +chr5 5980611 A,1 +chr5 5980759 A +chr5 5980760 A,G,T,C +chr5 5980761 0 +chr5 5980762 0 +chr5 5980763 0 +chr5 5980764 0 +chr5 5980765 A,1 +chr5 5980767 A +chr5 5980768 A,G,T,C +chr5 5980769 0 +chr5 5980770 0 +chr5 5980771 0 +chr5 5980772 0 +chr5 5980773 A,1 +chr5 5980774 A,G,T,C +chr5 5980775 0 +chr5 5980776 0 +chr5 5980777 0 +chr5 5980778 0 +chr5 5980779 0 +chr5 5980780 A,1 +chr5 5980858 A +chr5 5980859 A,G,T,C +chr5 5980860 0 +chr5 5980861 0 +chr5 5980862 0 +chr5 5980863 0 +chr5 5980864 A,1 +chr5 5981216 T +chr5 5981217 A,G,T,C +chr5 5981218 0 +chr5 5981219 0 +chr5 5981220 0 +chr5 5981221 0 +chr5 5981222 T,1 +chr5 5981411 T +chr5 5981412 A,G,T,C +chr5 5981413 0 +chr5 5981414 0 +chr5 5981415 0 +chr5 5981416 0 +chr5 5981417 0 +chr5 5981418 T,1 +chr5 5981707 T +chr5 5981708 A,G,T,C +chr5 5981709 0 +chr5 5981710 0 +chr5 5981711 0 +chr5 5981712 0 +chr5 5981713 T,1 +chr5 5981719 A +chr5 5981720 A,G,T,C +chr5 5981721 0 +chr5 5981722 0 +chr5 5981723 0 +chr5 5981724 0 +chr5 5981725 A,1 +chr5 5981844 T +chr5 5981845 A,G,T,C +chr5 5981846 0 +chr5 5981847 0 +chr5 5981848 0 +chr5 5981849 0 +chr5 5981850 T,1 +chr5 5981939 A +chr5 5981940 A,G,T,C +chr5 5981941 0 +chr5 5981942 0 +chr5 5981943 0 +chr5 5981944 0 +chr5 5981945 A,1 +chr5 5981948 A +chr5 5981949 A,G,T,C +chr5 5981950 0 +chr5 5981951 0 +chr5 5981952 0 +chr5 5981953 0 +chr5 5981954 A,1 +chr5 5981975 A +chr5 5981976 A,G,T,C +chr5 5981977 0 +chr5 5981978 0 +chr5 5981979 0 +chr5 5981980 0 +chr5 5981981 A,1 +chr5 5982022 T +chr5 5982023 A,G,T,C +chr5 5982024 0 +chr5 5982025 0 +chr5 5982026 0 +chr5 5982027 0 +chr5 5982028 0 +chr5 5982029 0 +chr5 5982030 0 +chr5 5982031 0 +chr5 5982032 0 +chr5 5982033 0 +chr5 5982034 T,1 +chr5 5982182 T +chr5 5982183 A,G,T,C +chr5 5982184 0 +chr5 5982185 0 +chr5 5982186 0 +chr5 5982187 0 +chr5 5982188 0 +chr5 5982189 T,1 +chr5 5982191 T +chr5 5982192 A,G,T,C +chr5 5982193 0 +chr5 5982194 0 +chr5 5982195 0 +chr5 5982196 0 +chr5 5982197 T,1 +chr5 5982198 A,G,T,C +chr5 5982199 0 +chr5 5982200 0 +chr5 5982201 0 +chr5 5982202 0 +chr5 5982203 0 +chr5 5982204 T,1 +chr5 5982216 T +chr5 5982217 A,G,T,C +chr5 5982218 0 +chr5 5982219 0 +chr5 5982220 0 +chr5 5982221 0 +chr5 5982222 0 +chr5 5982223 T,1 +chr5 5982379 T +chr5 5982380 A,G,T,C +chr5 5982381 0 +chr5 5982382 0 +chr5 5982383 0 +chr5 5982384 0 +chr5 5982385 T,1 +chr5 5982503 T +chr5 5982504 A,G,T,C +chr5 5982505 0 +chr5 5982506 0 +chr5 5982507 0 +chr5 5982508 0 +chr5 5982509 T,1 +chr5 5982511 T +chr5 5982512 A,G,T,C +chr5 5982513 0 +chr5 5982514 0 +chr5 5982515 0 +chr5 5982516 0 +chr5 5982517 T,1 +chr5 5982645 T +chr5 5982646 A,G,T,C +chr5 5982647 0 +chr5 5982648 0 +chr5 5982649 0 +chr5 5982650 0 +chr5 5982651 0 +chr5 5982652 T,1 +chr5 5982653 A,G,T,C +chr5 5982654 0 +chr5 5982655 0 +chr5 5982656 0 +chr5 5982657 0 +chr5 5982658 0 +chr5 5982659 T,1 +chr5 5982683 T +chr5 5982684 A,G,T,C +chr5 5982685 0 +chr5 5982686 0 +chr5 5982687 0 +chr5 5982688 0 +chr5 5982689 0 +chr5 5982690 T,1 +chr5 5982788 G +chr5 5982789 A,G,T,C +chr5 5982790 0 +chr5 5982791 0 +chr5 5982792 0 +chr5 5982793 0 +chr5 5982794 1,G +chr5 5983034 T +chr5 5983035 A,G,T,C +chr5 5983036 0 +chr5 5983037 0 +chr5 5983038 0 +chr5 5983039 0 +chr5 5983040 T,1 +chr5 5983065 A +chr5 5983066 A,G,T,C +chr5 5983067 0 +chr5 5983068 0 +chr5 5983069 0 +chr5 5983070 0 +chr5 5983071 0 +chr5 5983072 0 +chr5 5983073 0 +chr5 5983074 0 +chr5 5983075 A,1 +chr5 5983086 A +chr5 5983087 A,G,T,C +chr5 5983088 0 +chr5 5983089 0 +chr5 5983090 0 +chr5 5983091 0 +chr5 5983092 A,1 +chr5 5983121 T +chr5 5983122 A,G,T,C +chr5 5983123 0 +chr5 5983124 0 +chr5 5983125 0 +chr5 5983126 0 +chr5 5983127 0 +chr5 5983128 0 +chr5 5983129 0 +chr5 5983130 0 +chr5 5983131 T,1 +chr5 5983277 T +chr5 5983278 A,G,T,C +chr5 5983279 0 +chr5 5983280 0 +chr5 5983281 0 +chr5 5983282 0 +chr5 5983283 T,1 +chr5 5983285 T +chr5 5983286 A,G,T,C +chr5 5983287 0 +chr5 5983288 0 +chr5 5983289 0 +chr5 5983290 0 +chr5 5983291 T,1 +chr5 5983535 T +chr5 5983536 A,G,T,C +chr5 5983537 0 +chr5 5983538 0 +chr5 5983539 0 +chr5 5983540 0 +chr5 5983541 0 +chr5 5983542 0 +chr5 5983543 0 +chr5 5983544 T,1 +chr5 5983549 A +chr5 5983550 A,G,T,C +chr5 5983551 0 +chr5 5983552 0 +chr5 5983553 0 +chr5 5983554 0 +chr5 5983555 0 +chr5 5983556 A,1 +chr5 5983557 A,G,T,C +chr5 5983558 0 +chr5 5983559 0 +chr5 5983560 0 +chr5 5983561 0 +chr5 5983562 A,1 +chr5 5983644 T +chr5 5983645 A,G,T,C +chr5 5983646 0 +chr5 5983647 0 +chr5 5983648 0 +chr5 5983649 0 +chr5 5983650 0 +chr5 5983651 T,1 +chr5 5983652 A,G,T,C +chr5 5983653 0 +chr5 5983654 0 +chr5 5983655 0 +chr5 5983656 0 +chr5 5983657 0 +chr5 5983658 0 +chr5 5983659 0 +chr5 5983660 0 +chr5 5983661 0 +chr5 5983662 0 +chr5 5983663 T,1 +chr5 5983818 T +chr5 5983819 A,G,T,C +chr5 5983820 0 +chr5 5983821 0 +chr5 5983822 0 +chr5 5983823 0 +chr5 5983824 0 +chr5 5983825 0 +chr5 5983826 T,1 +chr5 5983827 A,G,T,C +chr5 5983828 0 +chr5 5983829 0 +chr5 5983830 0 +chr5 5983831 0 +chr5 5983832 T,1 +chr5 5983961 T +chr5 5983962 A,G,T,C +chr5 5983963 0 +chr5 5983964 0 +chr5 5983965 0 +chr5 5983966 0 +chr5 5983967 T,1 +chr5 5984622 A +chr5 5984623 A,G,T,C +chr5 5984624 0 +chr5 5984625 0 +chr5 5984626 0 +chr5 5984627 0 +chr5 5984628 A,1 +chr5 5984629 A,G,T,C +chr5 5984630 0 +chr5 5984631 0 +chr5 5984632 0 +chr5 5984633 0 +chr5 5984634 0 +chr5 5984635 0 +chr5 5984636 A,1 +chr5 5984785 A +chr5 5984786 A,G,T,C +chr5 5984787 0 +chr5 5984788 0 +chr5 5984789 0 +chr5 5984790 0 +chr5 5984791 0 +chr5 5984792 0 +chr5 5984793 0 +chr5 5984794 0 +chr5 5984795 0 +chr5 5984796 A,1 +chr5 5985075 A +chr5 5985076 A,G,T,C +chr5 5985077 0 +chr5 5985078 0 +chr5 5985079 0 +chr5 5985080 0 +chr5 5985081 A,1 +chr5 5985105 T +chr5 5985106 A,G,T,C +chr5 5985107 0 +chr5 5985108 0 +chr5 5985109 0 +chr5 5985110 0 +chr5 5985111 0 +chr5 5985112 0 +chr5 5985113 0 +chr5 5985114 0 +chr5 5985115 0 +chr5 5985116 T,1 +chr5 5985216 C +chr5 5985217 A,G,T,C +chr5 5985218 0 +chr5 5985219 0 +chr5 5985220 0 +chr5 5985221 0 +chr5 5985222 1,C +chr5 5985264 T +chr5 5985265 A,G,T,C +chr5 5985266 0 +chr5 5985267 0 +chr5 5985268 0 +chr5 5985269 0 +chr5 5985270 0 +chr5 5985271 0 +chr5 5985272 0 +chr5 5985273 0 +chr5 5985274 0 +chr5 5985275 0 +chr5 5985276 0 +chr5 5985277 0 +chr5 5985278 0 +chr5 5985279 0 +chr5 5985280 0 +chr5 5985281 0 +chr5 5985282 0 +chr5 5985283 0 +chr5 5985284 0 +chr5 5985285 0 +chr5 5985286 0 +chr5 5985287 0 +chr5 5985288 0 +chr5 5985289 T,1 +chr5 5985414 A +chr5 5985415 A,G,T,C +chr5 5985416 0 +chr5 5985417 0 +chr5 5985418 0 +chr5 5985419 0 +chr5 5985420 A,1 +chr5 5985536 T +chr5 5985537 A,G,T,C +chr5 5985538 0 +chr5 5985539 0 +chr5 5985540 0 +chr5 5985541 0 +chr5 5985542 0 +chr5 5985543 0 +chr5 5985544 T,1 +chr5 5985545 A,G,T,C +chr5 5985546 0 +chr5 5985547 0 +chr5 5985548 0 +chr5 5985549 0 +chr5 5985550 0 +chr5 5985551 T,1 +chr5 5985552 A,G,T,C +chr5 5985553 0 +chr5 5985554 0 +chr5 5985555 0 +chr5 5985556 0 +chr5 5985557 T,1 +chr5 5985558 A,G,T,C +chr5 5985559 0 +chr5 5985560 0 +chr5 5985561 0 +chr5 5985562 0 +chr5 5985563 0 +chr5 5985564 0 +chr5 5985565 0 +chr5 5985566 0 +chr5 5985567 0 +chr5 5985568 0 +chr5 5985569 0 +chr5 5985570 0 +chr5 5985571 0 +chr5 5985572 T,1 +chr5 5985727 T +chr5 5985728 A,G,T,C +chr5 5985729 0 +chr5 5985730 0 +chr5 5985731 0 +chr5 5985732 0 +chr5 5985733 T,1 +chr5 5986395 A +chr5 5986396 A,G,T,C +chr5 5986397 0 +chr5 5986398 0 +chr5 5986399 0 +chr5 5986400 0 +chr5 5986401 0 +chr5 5986402 A,1 +chr5 5986529 A +chr5 5986530 A,G,T,C +chr5 5986531 0 +chr5 5986532 0 +chr5 5986533 0 +chr5 5986534 0 +chr5 5986535 A,1 +chr5 5986537 A +chr5 5986538 A,G,T,C +chr5 5986539 0 +chr5 5986540 0 +chr5 5986541 0 +chr5 5986542 0 +chr5 5986543 A,1 +chr5 5986691 A +chr5 5986692 A,G,T,C +chr5 5986693 0 +chr5 5986694 0 +chr5 5986695 0 +chr5 5986696 0 +chr5 5986697 0 +chr5 5986698 0 +chr5 5986699 A,1 +chr5 5986700 A,G,T,C +chr5 5986701 0 +chr5 5986702 0 +chr5 5986703 0 +chr5 5986704 0 +chr5 5986705 0 +chr5 5986706 A,1 +chr5 5986707 A,G,T,C +chr5 5986708 0 +chr5 5986709 0 +chr5 5986710 0 +chr5 5986711 0 +chr5 5986712 A,1 +chr5 5986713 A,G,T,C +chr5 5986714 0 +chr5 5986715 0 +chr5 5986716 0 +chr5 5986717 0 +chr5 5986718 A,1 +chr5 5986725 A +chr5 5986726 A,G,T,C +chr5 5986727 0 +chr5 5986728 0 +chr5 5986729 0 +chr5 5986730 0 +chr5 5986731 A,1 +chr5 5986733 A +chr5 5986734 A,G,T,C +chr5 5986735 0 +chr5 5986736 0 +chr5 5986737 0 +chr5 5986738 0 +chr5 5986739 A,1 +chr5 5986740 A +chr5 5986741 A,G,T,C +chr5 5986742 0 +chr5 5986743 0 +chr5 5986744 0 +chr5 5986745 0 +chr5 5986746 0 +chr5 5986747 A,1 +chr5 5987021 T +chr5 5987022 A,G,T,C +chr5 5987023 0 +chr5 5987024 0 +chr5 5987025 0 +chr5 5987026 0 +chr5 5987027 0 +chr5 5987028 T,1 +chr5 5987125 A +chr5 5987126 A,G,T,C +chr5 5987127 0 +chr5 5987128 0 +chr5 5987129 0 +chr5 5987130 0 +chr5 5987131 A,1 +chr5 5987525 T +chr5 5987526 A,G,T,C +chr5 5987527 0 +chr5 5987528 0 +chr5 5987529 0 +chr5 5987530 0 +chr5 5987531 0 +chr5 5987532 0 +chr5 5987533 0 +chr5 5987534 T,1 +chr5 5987605 T +chr5 5987606 A,G,T,C +chr5 5987607 0 +chr5 5987608 0 +chr5 5987609 0 +chr5 5987610 0 +chr5 5987611 T,1 +chr5 5987826 G +chr5 5987827 A,G,T,C +chr5 5987828 0 +chr5 5987829 0 +chr5 5987830 0 +chr5 5987831 0 +chr5 5987832 1,G +chr5 5987892 A +chr5 5987893 A,G,T,C +chr5 5987894 0 +chr5 5987895 0 +chr5 5987896 0 +chr5 5987897 0 +chr5 5987898 A,1 +chr5 5987900 A +chr5 5987901 A,G,T,C +chr5 5987902 0 +chr5 5987903 0 +chr5 5987904 0 +chr5 5987905 0 +chr5 5987906 A,1 +chr5 5988052 A +chr5 5988053 A,G,T,C +chr5 5988054 0 +chr5 5988055 0 +chr5 5988056 0 +chr5 5988057 0 +chr5 5988058 0 +chr5 5988059 0 +chr5 5988060 0 +chr5 5988061 0 +chr5 5988062 0 +chr5 5988063 0 +chr5 5988064 0 +chr5 5988065 0 +chr5 5988066 0 +chr5 5988067 0 +chr5 5988068 0 +chr5 5988069 0 +chr5 5988070 0 +chr5 5988071 0 +chr5 5988072 0 +chr5 5988073 0 +chr5 5988074 0 +chr5 5988075 0 +chr5 5988076 0 +chr5 5988077 0 +chr5 5988078 0 +chr5 5988079 A,1 +chr5 5988126 A +chr5 5988127 A,G,T,C +chr5 5988128 0 +chr5 5988129 0 +chr5 5988130 0 +chr5 5988131 0 +chr5 5988132 0 +chr5 5988133 0 +chr5 5988134 A,1 +chr5 5988390 A +chr5 5988391 A,G,T,C +chr5 5988392 0 +chr5 5988393 0 +chr5 5988394 0 +chr5 5988395 0 +chr5 5988396 A,1 +chr5 5988398 A +chr5 5988399 A,G,T,C +chr5 5988400 0 +chr5 5988401 0 +chr5 5988402 0 +chr5 5988403 0 +chr5 5988404 0 +chr5 5988405 0 +chr5 5988406 0 +chr5 5988407 0 +chr5 5988408 0 +chr5 5988409 A,1 +chr5 5988562 A +chr5 5988563 A,G,T,C +chr5 5988564 0 +chr5 5988565 0 +chr5 5988566 0 +chr5 5988567 0 +chr5 5988568 0 +chr5 5988569 0 +chr5 5988570 0 +chr5 5988571 A,1 +chr5 5988596 A +chr5 5988597 A,G,T,C +chr5 5988598 0 +chr5 5988599 0 +chr5 5988600 0 +chr5 5988601 0 +chr5 5988602 A,1 +chr5 5988816 A +chr5 5988817 A,G,T,C +chr5 5988818 0 +chr5 5988819 0 +chr5 5988820 0 +chr5 5988821 0 +chr5 5988822 A,1 +chr5 5988845 T +chr5 5988846 A,G,T,C +chr5 5988847 0 +chr5 5988848 0 +chr5 5988849 0 +chr5 5988850 0 +chr5 5988851 0 +chr5 5988852 0 +chr5 5988853 0 +chr5 5988854 T,1 +chr5 5988866 T +chr5 5988867 A,G,T,C +chr5 5988868 0 +chr5 5988869 0 +chr5 5988870 0 +chr5 5988871 0 +chr5 5988872 0 +chr5 5988873 0 +chr5 5988874 T,1 +chr5 5989022 T +chr5 5989023 A,G,T,C +chr5 5989024 0 +chr5 5989025 0 +chr5 5989026 0 +chr5 5989027 0 +chr5 5989028 0 +chr5 5989029 T,1 +chr5 5989031 T +chr5 5989032 A,G,T,C +chr5 5989033 0 +chr5 5989034 0 +chr5 5989035 0 +chr5 5989036 0 +chr5 5989037 T,1 +chr5 5989433 A +chr5 5989434 A,G,T,C +chr5 5989435 0 +chr5 5989436 0 +chr5 5989437 0 +chr5 5989438 0 +chr5 5989439 0 +chr5 5989440 0 +chr5 5989441 A,1 +chr5 5989459 A +chr5 5989460 A,G,T,C +chr5 5989461 0 +chr5 5989462 0 +chr5 5989463 0 +chr5 5989464 0 +chr5 5989465 A,1 diff --git a/paraphase/data/pms2/pms2_ref.fa b/paraphase/data/pms2/pms2_ref.fa new file mode 100644 index 0000000..7dd6516 --- /dev/null +++ b/paraphase/data/pms2/pms2_ref.fa @@ -0,0 +1,427 @@ +>chr7_5967000_5992500 +CTCTTGCCTAGTGGTGTTGCCATCTTGTTTGGAAAAGTCTAGATTATCAGAGAGAGATGA +GGGAGAGCGGGCAGACTTCTCCCCTTTTTTGTCTGCTTTTTTCTTTTCTTTCACCATTGC +CTTGGGAAGATCCAATGGTTACTTGAATCAAATGATTTCTTTGGTTCAGAGCTGCTTGTT +TCAAAGCACTGATGAGTTTTATCTGAAAAATAAAAATTACGTCTCCAAACACTTGGGGTT +TTCATTTGTAGTTAAAATTTCAGTTTTACAACACAATGTCATTATCATTCTCCTGACAAA +GTCTGAAAAATTAGTTACCAGGGCCGGGTGTGGTGGCTCACACCTGCGATCTCAGCACGT +TGAAGGGCCAAGGTGGGAGGATCATGTGAGTCCAGGAGAGGGAGATCAGCCTAGGCCACG +TAGTGAGATGCCACCTCTCTACAAAAAATAAAAATAAATTAGCCAGGCGTGGTGGCACAT +GCCTGTAGTCCCAGCTACTAGGGAGGCTCAGGTGGGAGGACTGCTTGAGCCTGGGAGGTC +GAGGCTGCAGTGAGTAGTGTCACACCACTGCACTCCAGTCTGGGTATCAGAGCAAGACTC +TGTCTCAAAAATGACTATAATTACAAGTTTCAAGTACCATCAGTTTATAAAAATACAACC +TCAACATCGCATTGCTTGTTCCTAAAATTTTTTTATTTTTAATTTTTGAGACAGAGTCTC +ACTCTGTCACCCAGGCTGGAGGGCAGTGGCACAATCACAGCTCACTCTAGCCTCAACCTC +CTAGGCTCTGACGATCCTTCCATCTCAGCCTCCCAAGTAGCTGGGACCACAGGCATGCAT +TACCACACCTGGCTAATTTTTTGTAGAGACAGGGTCTTGCTATCTTGCCCCGGCTGGTCT +GGAACTCCTGAGCTCAAGCAATCCGCCTGTCTCAGCCTCCCAAAGTGCTGGGATTATAGG +TGTGAGCCACTGTACCTTGCCCCAAAAATTATTTAAGTTGGAACCATTGTCTAGCATTGT +TTCTTGAAAGGTAACCCTACACATGAAATAGGCTACTTCACCTCTCAGGTCTTGCATGCA +GCCAATTCACACTTTAAAAGCCCCTCTCTGGCCGGATGCAGTGGCTCACACCTGTAGTCC +CAGCACTTTGGGAGGCCAAGGCAGGTGGATCACGAGGTCAGGAGATTGAGACCACCCTGG +CTAACACTGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCAGGCATGGTGGCA +CATGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAG +GTGGAGGTTGCAGTGAGCCGAGATCACACCACTGCACTCTAGCCTCGGCAACAGAGCAAG +ATTCTGTCTCAAAAAAAACAAAACAAAACAAAAGCCCCTCTCCTTATAGGTCAGCATTGT +AAAGTGTGCAAGAGCTGGATTCGGAGTCCTGCATTGCCCATTACCAGTTCTATGGGTTTG +TTTATTTATTTATTTATTTATTTTTGAGACGGAGTCTCACTCTCTTGCCCAGGCTGGAGT +GCGGTGGTGCGATCTTGGCTCACTGCAAGCTCCACCTCCCGGGTTCATGCCATTCTCCTG +CCTCAGCCTCCTGAGTAGCTAGGACTACAGGCGCCCACCACCACACCTGGCTAATTCTTT +TTGTATTTTTAGTAGAGACGGGGTTTCACCGGGTTAGCCAGGATGGTCTCAATCTCCTGA +CCTCGTGATCTGCCCGCCTTGGCCTCCAAAGTGCTGGGATTACAGGCGTGAGTCACCGGG +TGTGGGTGCCCGGCCCAGTTCTGTGTTTTTTGGGTTTGTTTTTTTTTTTTTTTTTAGACA +GACTCTCTAGCCCTGTCCTGCAGGTTGGAATGCAGTGGCAGGATCTTGGCTCACTGGCTC +ACTGCAACCTCACTGCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCTCGAGTAGCTGGGA +TTACAGGCACCTGCCTCCATACTCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTC +ACTGTGTTGGCCAGGCTGGTGTCGAACTCCTGACCTCGTGATCTGCCCGCCTTGGCCTCC +CAAAGTGCTGGGATAACAAGTGTGAGCCACCGTGCCTGGCTGGTTTCCTTTTTTTTTTTT +TTTTTTTTTTTTTTTTTTGAGACGGGGTGTTGTACATTTTGCCCAGGCTGGTTTCAAACT +CCTGGCCTCAAGCAATCTTCCCACCTTCTCCTCCCAAAGTGCTAGGATTGCGGGCATGAG +CCACTGAGCTCGGCCAAGTGCTGTGTTCTGAAGCAAGCTGCTTAATCTCCTCTGCCTCAC +AAATAGAAATAACAGAACCTTGTCTCATAAGAATTAAACCGATGACACACAGAAAAATCC +CTAAGTACAGGATACAGAGCAAAGTCAATAAATTGAGTGATTATCTCGTCCCTTTCCTGT +TTTCAAACTTGAAATCATTGGTTTCCCACTCCCTCTAGCACTGCCATCGATTGAGTGCCT +CTCATATGCCGGACAAGCAAGGACCGGTGTGCTGGAGTTGCGTTTCCAGCGCATGGTTGC +TTGAGCTGTATTTCCAGCGCATCGTTGCTCGAGCTCCGTTTCCAGCGCATATCTTTGCTT +GTCTGGCATATGAGCTGCGGTGCAGGGTTTCAAAGCCATAGTTTTCATGGCTTTCCTAAT +CTGGCCCCGCTGAAAGCACCCTGGTCCATCAGGCAGGATGCATAGGTGAGCCCTGTGGTG +AAGGCCAGGTCTGGTCCCTTTTCGTAGGCCCCGACGGGTGCAGTGGGCACTTCTTGCCCT +CTTCTTCCTGGATGCCATCCTCTGGACACCCCACCCTCCAGCTGAACCCATCTCCAGCCT +TCTCTCTTCCTCCATTATTTATTTATTGAGACAGGGTCTCGCTCTGTCACCCAGGCTGGA +GTGCAGTGGCACAGTCTTGGCTCACTGCAACCTTCCGGGTTCAAGTGATTCTTCTGCCCT +AGCCTCCCAAGTAGTTGGGATTACAGGTGCCTGCAACCACTCCCGGCTAATTTTTGTATT +TTTAGTAGAGCCAGGGTTTCACCATGTTGCCCAGGCTGGTCTCAAACTCCTAACCTCAGG +TGATCCGCCCGCCTCGGCCTCCCAAAAGTGCTGGGGTTACAGGCGTGAGCCACCGCGCCC +GGCCAAGCCTTCTCGCTTCCTTCCAACACCCTTGCTCCTTCCCGCCCCCGACTACTGTCT +CAGCGAAGCCGCCCCTGGGTCCCCTAGCCCAAGGTGCTCTCACCTCTAAATTTAGAGGGG +CCTTTACTACGAGCTTTTCGGTCTTCTGTCAGTGATCTACAAATAAGTGAACTACAAATC +AAGCCAATTTCGTTTCTCAGTGCGTTTCCTAATGTTTCCTCTGTTCTCCTGGGCTTAAGG +AGAGGCCATCTCTCTCCTGTGTCTACGGTGAGCCGGGAAGAATCGGCTCCCTGCACTGGT +CTGCCCGGCCTAGGCCCTCCTCGCCCATCACGCACGTGCTCGGGTCCTGGGAGGCCGCGT +CAGTCCGGCCGGGAAGGAGCAGGACCCAGTCGCCATGGCTGTCCCGGCGTACGCAGGACC +GCGGCCTGGGGCGCTCACCTCGCTCCAGGAGCCCAGAGACCTCGCCGGGCTCGGGCTGAG +GTGTTGCCGGGCTCTCGGCGTCCCAGACCCGGCTCCGGTCTCCAGGCAACCGCGGACGCC +GCCAGGCCCACCCTGTGCTCTTAAAGGGGCCGCGCGCCAGCGCCAAGCAGGTGTCCCGCC +CTTGCTGGTCCTGAAGGCCGGGGGAAAGGCTGGACGCTGGAGGCAGCAGGCCAGGGTTTT +CCCAGCTCTGCTAACTGCTTCATCATAAAATAGGAATAACGCAGGCATTAGTTTTCCATT +GCTGCCGCAAACAAATTACCATAAATTTAGTGGTTCGAAGCAACACAAATTTATTACCCT +ACAGTATGTAAATTAGTCCCACAGTGCTGGTTCCTTCCAGAGGCTGAGGGAGAGAATGCG +TTTTCTTGCCTTTTCCAGCTTTTCCAGAAGCGGTCCCCAACCTTTTTGGCACCAGAGACC +GGTTTCGCGGAACACAATTTTTCCATGGATTGGGGTCGGGGAGGGATGGTTTCAGGACGA +TTCAAGTGCATTGCATTTATTGTGCACTTTATTTCTACTATTATTACATTGTAACATATA +ATTAAATAATTCTACAACTCACCATCACAGAGACTCAGTGGGAGCCCTGAGCTTGTTTTC +CTGCAACTAGACAGCCCCATCTGGGGGTGACGGGAGACAGTGACAGATCATCAGGCATCA +GATTCTCATAAGGAGCGTGCAACCTACATCCCTCGTGTGTGCAGTTCACAATAGGGCTCG +TGCTGCTATGAGAATCTAATGCCCACCGCTGATGTGACAGGAGGCGGAGCTCGGGTGGTA +ACGCCAGCGACGGGGAGTGGCTATAAATACAGATGAAGCTTCGCTGGCTTGCCGGCCCCT +AACCTGCTGTGCACCCCACTTCCTAATAGGCCATGGACCACTACTGGTCTGTGTCCGGAG +TGTTGGAATCCCTGTTCTAGAGACTGCTTGCAATCCTTGACTAGTGGTACCTCCTTCCAT +CTCCAAAACCAGCAAGACAGCCTCTCTCTGGCCAGCAGGGAAAGGTCTCCACCTTTGAAG +GACTCACCCAATGGACTGGGCCCACCCAGACAATCCAGGATAATCTCTCTGTTGCAAGAT +CCTTAACTCAGCCAGGCATGATGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAG +GCGGTCGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGTCAACATGGTGAAACCCT +GTCTCTACTAAAAATACAAAAATTAGCCAGTATGGTTCTGGACACCTGTAATCCCAGCTA +CTCGGGAGGCTGAGGTGGGAGAATCACTTGAACCTGGGAGGCAGAGGCTGCAGTGAGCTG +AGATCATGCTATTGAACTCCAGCCTGGGCAACAAGAGCGAAACTTCATCTCAAAGAAAAA +AAAAAAAAAGATCCTTCACTCAATCACACCTGCCGAGTCCCTTCTGCCACGTGAGGCAGC +GTGGTCACAGGTTCTGGGGATTAGGACACAGCTGTCTTGGGGGCTGTTATCCTGCCACAG +CTCCCAATCTGGAGAGTTCATAAGTGGGATCCTGCAGACCACGCCAGCACAGTGCCAGAC +ACGATGGCACAGTGACTACTGTACTGCCTCCTCCATCTGAGGGATTCTAAAGCAGGAAGG +GGAGCCGCCCACAGTCTGGAGAAGGGGTGGGGGCAGCAGGGGGAGCCACATCTGTCATCT +CTGGGCCCCCAAGAGGGCATCTTTACTTCCATTTTCAGCCAAGTTCAAACAGGACAAGGT +TCCATGAAAACTATTTGAAAAGACAGACAGGGATTCTTATATTCCCAGAACCATTCAAGG +CCAGTAACTGGAATGTTCTACAGTTCACATCCTGAGGAAACCAAATCACAGCATCAAATT +ATGGGAAATCAAACTCTTTTTGTTCCCCTGCGAGGACAGCATTTTGCGACCTTGGCCGCA +CAGAGGAATGTTTCAAATAGTGACCCCTGTCCCATCCAGTCATTTTCTTCCAGCCGGGCA +GAGAATCCCCCGTGTTTAAAAATTTAATGTGAATCAGGGCTGAGAATCACTAACTGAAAA +GGACCCTACTTTTAAATTTATGAAATTAAACAAAGATGAATTTAATTATCATTAAGGGTT +GAAAGTTATAGACTAAACTATGTCCAGCCAGAGCAGAGGCCTGAGTAACTTCCAAAGTGG +TTTGGTTTTTTTTTTTTTGTTTTGTTTTGTTTTGTTTTTTGAGACACAGTCTTGTTCTAT +CTCACCCAGGCTGGAGCGCAGTGGCGCAATCTCGGTTCACTGCAACCTCCGTCTCCCGGG +TTCAAGCGATTCTCCTGCCTCAGCCTCCTGGGTAGCGGGGATTACAGGCATGCGCCAGCA +CACCTGGCTAATTTTGTATTTTTAGTAGAGGCAGGGTTTCTCCATGTTGGTCAGGCTGGT +CTCGAACTTCTGATCTCAGGTGATCCGCCGGCCTCGGCCTCCCAAAGTGCTGCGATCACA +GGCATGAGCCACCATGCCCAGCCCAAGTGTTCTTATTTTTATAAAATGTGTTCTTGCCTG +GACACACACACACGAGCGCATGCAAACATAGAGAAAAAAAATTTGCAAGCAATGCTCCAT +CTGGTTTGAAAAGGTTCTCAAGATCACTTTTAAATGGGTGTGATGTGTATTTTTTTTAAG +TAGCAGGTTCATTTTAAAACAAAAAAGGTTAGTGAAGACTCTGTCTTTCAAAACATAAAA +ATCTGCGATAAAACCAATTATTCCATACAGTGACTACGGTCAGTTCTGAGAAATGACACC +CAGGTTGGCGATGTGTCTCATGGTTGGCCTTCCATGGGGACAGTTCCAGGGGTGGTCCAT +CTCCCCCATGTGGGTGATCAGTTTCTTCATCTCGCTTGTGTTAAGAGCAGTCCCAATCAT +CACCTGAGTGTGAGACACAATGGTTCAACGTTTTAGTAGTTTTTTGACGTCAGAATGGCA +GCTCTTCAGAAGCATTCTTCTCTAAAATAAGGCTGGACAAGATTACAGCTCAAAAACTAC +CTTCCCTGAAAAACCTTCCCCCAGAGAAGCCTAGGTTCTAGATCTCAGCCCTCCACCCTT +CTGTGAAATCAGGCTCCTTGTGGCTCCTTCAAGGTGGCACCGCCTCCACTCCAGACGCCG +ACCACACCTGTCTCAGCAGCCACCCTGCCCTCTCACCCTGGCAGGTGCAGCAGCCTCCCA +GCAGGCCTCCCTGCCCCACTGCGACCCCTCCGAGCCGCTCTCCACTCAGCAGCCAGTGAT +TACTTTTAAAGGGCTGTCAGGTTATTCATTCCACTTCACAGCTCTCCCCCTCACCTGAAT +AAAAGCCCCCGTCTGTCCCCTGACTTGGCCCTCGCTGGGCTGTGCCTGCACCCCCACCTC +CAAGCACGAATGCCTCCCTTCCTCACCCCAGCTGCACTGCTACTCCCTTCCTCTTGCACA +GGCCCATCACGCAAACACCTGCCTTGGGACTGTGGCACTCCCGGGACCCTCTCCCCCAAT +GGGTGCAGGCGTCACTCCCCCTCTGTCGAGCTCCGACCTGCTGCCCATAGCACTCCAGCC +CTGGCCCTGCTGCCTCCCTGCCATGGGTCCTCTGACAGGAAGGAGAGGACACAAGCCTGA +AGCCCAATGTCACCTTCTTTCTTCCTGCAGCACCCTGAGGGCTCGCCATGTGCCAAGCAC +AGTCAGAAGGCTGGGGTGACAGCAGGTTGGAGAAGGACAGACAATCAACAAGTCAACAGA +GAACCAAGACAGGTGGCACCAGGCGAGGCGGCCTGCTCAGGTGTGGGGATGGGGTGAAGG +GTGACGGTGGCAAACCCAGGTAGAGAGGAGAGTAGGGAGAAAGGGTGTAAGGCAGGGAGG +AGACTGAGGCGAGCGTGGAACTGGAAGGCAGCTACATGGCTGGAAGCTACATGGTGGGGA +GATGGGGCTGGAAGGGTGGGCAGGGCTCAAAGCAGGAGCCTCCTGGGCAGGCAGTGACAA +CACCGGAGATGGACGGGTAGGCCAGGGCGAGAGGGAAGGAGCAGCCTGTGGTTCCCCGGG +CCACTGAGTCACACTAAACTCAGGACATCAAAACTGCCCGGCTATGAGCTCAGCTCCACG +CTCTCACTCACAGACTCCAAGACTGGAAGATCCATATTATGTCTTTTATTTTGGTGAGGT +CAGGGGTGGTGGAGAGACTCTGTCTCCCAGGCTGGAATGCAATGGTGCGATCTCAGCTCA +CTGCAACTCCGCCTCCCAGATTCAAGCAATTCTCCCGCTTCAGCCTCCCGAGTAGCTGGG +ATTACAGGCGCCCACCACCATGCCCAGCTAATTTTCGTACTTTTAGTAGGGATGGGGTTT +CACCATGTTGGCCAAGCTGGTCTCAAATTCCTGACCTCAGGTGATCAACCCACCTCCGCC +TTCCAAAGTGCTGGGATTACAGGTGTGAGCCACCACGCCCAGCCCCTATTAGGTCTTTAT +CCAAGAAACACTGTGGCTAGAAGTCAGACTCTGGGCCCTCTTCTAATTAAACTCTGCCCT +TGAGTCATTTCATCTAATCTCATGGCTGTAAATTACACCTGAAGCTCACACAGCAGGCTC +CATCCCACCCACTCCCCACGTGGCCCCCAGCTGCTGCTCTCCTCAGCGGCCGCAGCCACC +GCACCCCTTCCAGTCTGTTCTCTCTCCAGCAGCTGCAATCACGGGACTCCTTCCCGTCTG +TTCTCTCCAGTGGCTCGTGCCACACACAGCACAGACCCCCAGGGTCTAGGTATGACCGGC +AACACTCTACGTGGCTGTCCTCTGGACGCCGCTCTGCTCACTCCCTTCCCCTCTCCAGGG +ACACAATCAGCCTCTGGCTTCAGTCTTGCTACTTCCTTCGCTTGGAAAGTTCTTACCCAA +GAGGGCTCCATTCTACCTTTTTTTTTTTTTTTTTTGAGACAAGGTCTTACTCTGTCACCC +AGGCTGGAGTGCAGTTGCGTGATGTTGGCTCATTGTAACCTCGACCTCCCTGGCTCAAGT +GATCCTCCCACCTCAGCCTCCTGAGTAGCTGTGACTACAGGCACATGCCACCACACCTGG +CTAATCTTTTAATTTTTTGTACACATGGGGTCTGCCTGTGTTGCCCAGGCTGGTCTCTTA +ACTCCTGGCCTCAAGCAATCCTCCTGCCTTGGCCTCCCAAAATGCTGGGATTACACGTGT +GAGCCACCATGCCTGGCTTCCATCCCACCTTTTAGATGGCAGCTGAGATGCCACCTGCCC +AGATGCCATTCCCTGACCACCATCTCACCTGGTCACCATGTTTTTCTCTTGTCATTTCCT +GCCCCAAAACGCTGTTTTAGGCCAGGTGCGGTGGCTCACGCCTGTAATCCCAGCACTTTG +GGAGACCAAGGCGGGCAGATCATGAGGTCAGGAGATCAACACCAGCCTGACCAACATGGT +GAAACCCCATCTCTATTAAAAATATAAAAATTAGCCAGGTGTGGTGGGTGCCTGTAGTCC +CAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGCGGAGGTTGCAGT +GAGCTGAGATCACATCACTGCACTCCAGCCTAGTAACAGAGCGAGACTCCGTCTCAAAAA +CAGACAAAGAAAAATGCTGTTTGAATCTCTTGACTGTGCTTACTGGCATCTAATGCGTGT +CATTTATTTGTGGTGATGCCTATTTCTCCCCACTGTCTGCTCCACAGGGGCAGGGGCTGC +AGCCGCCTTGTTACCTCTGTGTCCCGAGCACCTGGAGCAGGGCGGGCCCCACATCAGGGG +CTCAAGGAGCACCTGCTGAATAAATAAAGGAATGGCGTCCTGGCCCTTCCCAGTGGCCAG +CTGATACACAGTCACTTTTCTTGGACATCAGGCTAATCCCCACTGCAGGCAGAACCACTG +CTGCCACCTTCCCACACCAACCGAAGCAGCGGCAGTGACGCCACGTGCAATGACAACCAC +GGCACCCCGTGAAGCACCTGCTGCCTCGATGACTCTGCAGAATCGTGTCCAATGTCGCCG +AGTCCTGGCAGCAGCAAATCTTTATCTCCCAATGTTGTTATGACCCATAAGGTCCATAGA +CGAACAAGGTACCTCAAACGCTAACTGCGTTGGAGTCAACCAAAGCTCGGAGATAGAATA +CTGGCCGGGCCAGGCACAGTGGCTCATGCCTGTAATTCCAGCACTTTGGGAGGCTGAGAC +AAGGGGCAAAAGGAGACCATGTTTCTACAAAAAATTTAAAAATTAGCTGGGCATGGTGGT +GCATGCCTGTGGTCACAACTACTTGGGAGACAGAGAGAGGAGGATCGCTTCAGCCTGGTA +CGTCAAGGCTGCAGTGAGCTGTGATTGTGCCACTGCACTCCAGCCTGGGGGACAGAGGGA +GAATCTGTCTAAAAAAAAAAAAAAGAAGAATTCTGGGTTTTTTTTGTTTTTTTGAGACGG +AGTCTCGCTCTGTCGCCCAGGTTGGAGTGCAGTGGTATGAACTTGGCTCACTGCAAGTTC +CGCCTTCTGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTATAGGC +GCCCGACACCACGCCCGGCTAATTTTTTTGTATTTTTAGTAGAGACGGGCTTTCACCATG +TTAGCCAGGATGGTCTCGATCTCCTGAACTCGTGATCCGCCCGCCTCAGCTTCCCAAAGT +GCTGGGATTAGAGGCTTGAGCCACCATGCCCGGCCAAGAATACTGCTTAACAGAGGTAAC +AAAAGAGCAATAATTATGAGTTCAAGGTCACAGAGAACGCAGACGACACAGATGCTCAGC +TACGACGCTGCACGTAGCTCTCTGTGTAAAATGACCCCTGGCAATCACAAAGGCGTTTAC +AACCTTGACCAAATCAGGAGCTGGGCTGAGACCTTCCTCGACTGCAAGCTTGAGCAGCTG +AGCTGACAGCCAGGCTTTCTTTACTTACCGACTTCCGGCAGGCTCTGGAGGCAAACATCT +GCTTGACTCGGGAAGGCCGGCACATGACCCCAGGGCTGTCGCTCAGCATGAAGATCAGTT +CATCGACGTCCTGGGGTCCGAAGGTCCAGTTTTTACTAGTTGGCAAGGAAATCAGTTTAG +CCCTTTCAGTGACTGGAGCTAAAAGAATACAATTTTGAGAAAAATCCATGACTTGACAAA +CACGTTTCACTTGAAAGCTACTTAGGATGAACATCTGAGGCCGGGCGTGGTGGCTCACGC +CTGTAATCCCTGCACTTTGGGAGGCTGAGGCCAGCGGATCATGAGGTCAGGAGATACAGA +CCATCCTGGCTAACATGGTGAAACCCCGTCTCTACTAAAAAATAGAAAAAATTAGCTGGG +TGTGGTGGCAGGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGTGTG +AACCTGGGAGGCGGAGCTTGCAGTGAGCCGAGATCGGCACCACTGCGCTCCAGCCTGGGC +GACAGAGCAAGACTCCATCTCAGAAAAAAAAAAAGTGAACACCTGAAAGAGAGGAAACTC +ACAAAATGCTTTTTGGAGGAACTTTTTAATCTTTTATAAAATTAAAAAAAACTGGTCTAT +ATGACCTGAAAGATTATTCCCAGCTCTAAAAAGACAAGAATCTATAGTTCTGATTTTTTT +TTTTTTTTGAGACAGAGTTTCACTCTTGTTGCCCAGGCTGGAGTGCAGTGACGTGATCTC +GGCTCACTGCAACCTCCACCTCCCGGGTTCAAGCGATTCTCCCACCTCAGCCTCCTGAGT +AGCTGGGATTACAGGCACCCGCCACCACGCCCGGCTACTTTTTGTATTTTCAGTAGAGAT +GGGGTTTCACCATGTTAGCCAGGCTGGTCTCAAACTCCTGACCTCAGGCGATCTGCCCGC +CTTGGCCTCCCAGAGTGCTGGGATTACAGACGTGAGCCACCACACCCAGCCGCTATAGTT +CTAATTAATAACTTACCATTTTCATCGATAACAAAATCAAAGCCATTCTTTCTAAATATT +TCCAGATTTTCTATCAGAACAGCTTCATTAACAGCAGTTAAGTTGAGAGTCTGAGGTCTG +AAAAACACAAAAATGATTCAAACCATATCCTGAAGTCAAACATTTAGCTTTACAGCAGAA +ATGAAATGAAAACAACAATACTGTATTTTGAATTCATGTCAAAATAACAACACAAATAAC +AACACTACTCAGCTAAGTGTCACAAAACTTCCTGAGAAGTTCCTTTTAATTTTCTCTTTC +TTAAAGTTCTTTTTAGAAGTTAAAGTAGCTACAGGCCAGGTGCGGTGGCTCACGCCTGTA +ATCTCAGCACTTTAGGAGCCCGAGGCAGGCAGATCTCTTGAGGCCAGGAGTTTGAGACCG +GCCTGGTCAACACAGCGAAACACTCTCTCTACTAAAAATATAAAAATTAGGCCAGGCATG +GTGGCGCACGCCTGTAGTCCCAGCTACTTGGGAGGCTTAGGCATGAGATTCGTTTGAACC +CAGGAGGGAGGCAGAGGTTGTAGTGAGAGCCAAGATCACGCCACTGTACTCCAGCCTGGG +CGACACAACAAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAAAACACCCATAAAAACAAA +AATTAGCTGGAAGTGGTGGCTCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCACT +AGAATTGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCAAGATCACACCACTGCACTC +CAGCCTGGGCAACAGGGCAAGACTCTGTCTCAAAAAAAAAAAAAAAGTTCAAGTAGCTAC +AAAAGTAGTTTGCTTTTTCCTCAGCCTGCCACGCCAATGACTCCCACTTTTTCTGAATCC +TTTCCTTAAGGATAACAGTATCACAAAAATGCTATTTTTCCTCCTTCTAATACAGAATTT +GAAACACTGGGTTAGGTCATTGCCAGCATTTGTAAACAGAATGAACAGACAGCTTTTATT +TTGCTATCCTGTTCCTTCCTCTGCCTGTATTATATCTCCATCCCTCTCTCCTCCTGGATT +TACTGTTTGTTTTTTTTTAACCTTTCGTTATTTTTTTCAAAGATAGAGACAGGGTCTCAC +TATGTTGCCCAGGCTGCTCTCAAACTCCTGGGCTCAAGCGATCCTTCCACTTCAGCCTCC +CAAAGTGCTGGGATTACGAGTGTGAGCCACTGCATCTGGTCCTGAGTGCTGGATAAGACA +AACACTGCTCAAGGCAGGAGACAGCTGGTGAGCAAACACAGGCTTGGTCCTGGAGCCAAC +AGATTACCGGGGAAGAAAGACGTTGAGCAAATACTCAGGCAAGTCGATTATGATGAGAAA +CGACAGGAAGGTCAGGAAGAAAAAGCAGCCAGTGTCATAGAGAGACTTCACCCTGGGGTC +AGGAAGTGACCTTTGAGCTGAGTCCGGGGATAAGAGAGTTAATCAGGTAACGGGGGAGAA +GTGGGTGCAGGGTGCAGGACAAGTATTCTAGACAGGGACAACAATCTGTGCCAAGCCCCA +GGATGGACCGCTGATGTTTGCAGAGCCACACATAAAGATATCCTTCATTCTGCTTAGTGG +CCACATAGGGATTCATCAGGCCTGCCAAGGGAAAAAGAAAGACAGCCAGAGAGGCTTGAG +TACAGGGAACAAGGGGAAGACGAACGGGAAAAGCTACAGAGGTCAACAGGGCCACCCTAC +ACAGGGGCTGGCAGGGCAGGGTGGAACTGGGTCTTTATCCTTTAATCTTGAAGTGTGGTC +CATCTGGGACCCTAAGTGGTCTGTGATTACCTGGGCCAACTTGAAATGTCACAGATGAGA +GGTTATCTTTGCCGAATGGCTAAAAAATACAAGACCTCAGCCGGGCATGGTGGCTCACAC +CTGTAATCCCAGCACTTTGGGAGGCTGAGGTTGGTGGATCACCTGAGGTCAGGAGTTCAA +GACCAGCCTGACAAACATGGCAAAACTCCCTATCTATTAAAAATCCAAAAATTAGCCAGG +TGTGGTGGCGGGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTG +AACCCGAGAGGCAGAGGTTGCAGTAAGCCGAGATCACATCACTGCACTCCAGTATGGGCG +AAAGAGTGAAACTCTGTCTCAAAAACACAAAAACAAAAAACCTCTATTAAGAGGAACAGG +GAAGGGATATAAAGTAGCTTACTAAATGTCTATTATTACCATTGCCTCCTACTGAGAATA +AAAACAATTCACGCATTCCACAGGAGAGTACTCAGCAAACTACACAGGAGAGTACTCAGC +AAACTACACAGGAGAGTACTCAGTAAACTACACAGGAGAGTACTCAGCAAACTACACAGG +TTCAGTGGTACATTTCTCCATGTGGGATCTACTTGTTGGGATCTGAGTTTACTTCACTAC +GTGGTTTAATTTCCCACACGAAAATCCATGACCTCTTCTTCTAACTTTGCTGAAGACAAG +ACTTTGGTTTTACATGATACTATCACACCTGACCTTTGTGAAGTAGTCAGGGTAAAACAT +TCCAGTTTGGCCGAGGAGAGAGAAATACCAAATTCTGCAGTGACTATCTTAAAATAATTT +TTAAATTTTATTTTATTTTATTTATTAATTTATTTGTGAGACAGAGTCTCACTGTCTCAC +TCTGTCACCCAGGCTGGAGAGCTGTGCAGTGGCACGATCATGGCAGCCTCAATCTCCTGG +GCTCAAACGATCCTCCCACCTCAGCCTCCCGAATAGCTGGGACCACAGGCACACACCATC +AAGTCTGACTAATTTTTTACATTTGTTGTAGAGACAAGGTTTCACCATGATGCCCAGGCT +GGTCTCAAACTCCTAAGCTCGAGGGATCTGCCTGCCTCAGCCTCCCAAAGCTCTGGGATT +ACAGGCGTGTGCCCCTGCATCCAACCTGCAGTGACTATCTGACTTCTGATTACTCTACTG +TCAATCAACACTGGCGCACAGGCTGTCTGTCTTTCTGAACACACACATTCCATACACTAT +GCATACTAATACTCCATACTATCAATTGCCCTCATCAGAAGGATCTTCTGGCTAACCAGT +GATCAACATTTTTAATAGCGAAAAATACCTGATACTTAGAGAACATGTTAACCACGTGAA +CTGGGGCAGGTTACTCAACCTCTCTGCATGTGCCTCAGTTTTATCGCTTGTGGAATGGTG +ATGGTAACAGTAACAACCCCATAGGTTTTTGAGGATTAAAGGAACTAATACACATACATT +ATTTCAACAGTGCCTGGCAGATTCTAGGCACTGAATAAATGGTAACTATCACTATTATGT +AAAAAGTATAAAAATCTGCTATATGAATACTTATGGAAAAATACATATATACATATAGAC +ACACATATAAACTATTAGGTCTCTTTTTTTTTTTGAGATAGAATCTCGCTCTGTCACCCA +GGCTGGAGTGCAGTGGTGCGATCTCGGCTCACTGCAACCTCTGTCTCCCAGGTTCAAGCG +ATTCTCCTACCTCAGTCTCCTGAGTAGCTGGAATTACAGGCGTGCACTGCCATGCCCAGC +TAATTTTTTGTATTTTTATTTTTTATCATTATTATTATTTTTTGAGACGGAGTTTCACTC +TTGTTGCCCAGGCTGGAGTGCGATGGCACGAACTCGGCTCGCTGCAAACTCCGCCTCCCG +GGTTCAAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGCGCCCGCCAC +CACGCCTGGCTAATTTTGTATTTTTAGTAGAGACAGGTCTCACCATGTTGGTCAGGCTGG +TCTCGAACTCCCGACCTGAAGTGATCTGCCCACCTCGGCCTCCCAAAGTGCCGGGATTAT +AGGCGTGAGCCACCGTGCCTGGCCTTTTTGTATTTTTAGTAGAGGCGGGGTTTCACCACG +TTGGCCAGGCTGGTCTCCAACTCCTGACCTCAGGTGATCTGCCTGCCTCGGCCTCCCAAA +GTGCTGGGATTACAGGTGTGAGGCACCGCGCCTGGCCAACTAGATATTTTTTATTTTTTA +CACCCCTCCTTCCTAGATCTCTTCTTTTTTAAAGTAGATACAAGGTCTTGCTGTGTTGTC +CGGGCTGGTCTCAAACTCCTGGCCTCTTGTGATCCTCCTGCCTTGGCCTCTATTAGATCT +TCAATTTGAGGGGGAGTCTGGGAATGAACACTAAACACACTCACGCTATGAGCCTCTGCC +CCTGGAGCACGGTGTGCTGCTGCAGCATCTCGAAGTTATACTTCTCGTCCGTGGCATGCT +GGTCCACTATGAAGATATCCTCATTCAGTTTGGTTATTATAAATCCCAGGTTAAACTGAC +CAATGATTTCCATTTCTGCAAACATCGTTTTACTGCAGGTAGAAAATGTTAATTATCAGA +CATTTTACAAGATTATTTTTCTGATTATGTTATAGAACACTGTAATAAAAAAAAAGTCAA +ACAATACAAAAACAAAATAAAGTCCCTAGCCATCCCGCTTTCTTTTTTTTTGAAACAGAG +TCTCGTTCTGTCACCCAGGCTACAGTGCAATGGCACAATCTTGGCTCACTGCAACCTCCA +CCTCCCGGGTTCGAGTGATTCTTCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGTGC +GCCACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGACAGAGTTCCACCATGTTGGTC +AGGCTGGTCTCGAACTCCTGACCTCATGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGG +ATTACAGGCGTGAGCCACTGCGTTCGGCTTAACCATCCCACTTTCTAAAGATAACATTAA +TTATTCATTCATCCAACTCTCCGGAGAAGACATCAGTTGCTACTATTAACGATTTAAATG +GAATATATCCTTCTAGACCCTTGTCTCCATATATAATTTTTTTTAATTTTAAAAAACAAA +AATGGAATCTTAATTCTCCATTCTGTCATCACTTAATGCATCTGAAACAAGTTTTCAGAC +CTGTACACATAGATCTACTTCATTATTTTTTCTTTTTTTTTTTCTTTTAAGACGGAGTCT +CACTCTCTGTTGCCCAGGCTGGACTGCAGTGGCGTGACCGTGGCTCACTGCAACCTGCGC +CTCCCAGGTTCGAGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCATG +CACCGCCAAGCCCGGCTAATTTTTTTATTTTTAGTAGAGACAAGGTTTCACCATGTTGGC +TAGGCTGGTCTTGAACTCCCGACCTCAAGTGATCCACCTGCCTCGGCCTCCCAAAGTGCT +GGGATTACAGGCGTGAGCCACCGCACCTGGCCTACTTCATTCTTTTTAATGGCCACATAG +GAATTCATTGCATGGATGTACCATAATTTGTCTGACAAATCCCCTACTAAAGGACATTTC +AGTTGTTTCCAATTTCAATAGCGCACTCAAAGCTGCAACAAATACTTCTGTGCATAAACC +TACTCATCTGTGGGTGCATTTCCGTGAGACAGACGCTAGAGGGAGAACTACATATGTACT +TCTTGACGGGAAATCTGTGAAAAGTCACACTCCCACCAATGGTGTGTAAGAGCACCTTTC +TGCCAATGCTGGATATCAATCCTTCTCATCTTTGCCAGGCCCACCACTGGGTCCTTTGCG +GGTGGCTTCAACATCTAATGTCATTAAATACTAACTTAGTCAATCTGGACAAAAGACAGA +CACCACCGCTAACCTTCACACGAGAAATTGACACTGTCATTCTCAGTCCCACACAATTAA +ATCCGGTGAAAATGGATTTTCCGCAGTATCAGCGCGGTGATGACAAGAAATGGCTCTGTT +AAAGCAGCCATGGACGTTTTCTGGTTCTCACCTGGTGGCCTGAGCTGAGGATGAAAGCAG +CTGTAATGTAATCCCAGCACTTTGGGAAGACAAGGTGGGCAGATCATTTGAGGTTGGGAG +TTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATAAAAAAATTAG +CCAGGCATGGTGGTGGACACCTGTACTCCCAGCTACTTGGGAAACGAAGGCAGAAGAATC +GTTTGAACCCGGGAGGCAGAGCTTGCAGTGAGCTGAGATCGTGCCAGTGCACTCCAGCCT +GGGCGAAAGAGTGAGACTCTTGTCTCAAAAAAAAAACAAAGGAGCTGATATTGTTGTTTC +TTTCTATAAGTGCTCCAGGAAGACCCGGTCCCATGCCACCATGCTCGTCACCATCACAAT +CAACCACAGGGGACAGTTTGGTGAACTGTGAGACCTCCACATGGCATGGATTACTGAGCC +CACATTTCCTATGGTGAGGGGCTCCACACAGAGCTCAAATCCAAGTCATAACCAAACCAG +TCCCCAAATCCTATCTTTGAGGGTCTGTTTCCTGGTACCAATTCCAGATCAGGCAGAGTG +CAATCAATCAAGAGACAAAAACCACACCAGTGATTTTAACAGGGACTTTTTTTTTTAAGA +CAGGGTCTTGCTCTGTCACCCAGGCTGGAGTGCAATGGCATGATCATAGCTCACTGCAGC +CTCAAACTCCTGGGCTCAAGTGAGCCTCCTGCCTCAGCCCCCTGAGTACCTGGGACTACA +GGCGTACAGCAATGTACTTAGCTAATTTTTTTTTTTTTTTTTTTTTTTTAGAGATGGGGC +CTCATTATATTGCCCAGGCTGGTCTCAAACTCCTAGCCTCAAGTGATTCTCCTGCCTCAG +CCTCCCAAAGTGCTGGAATTACAAGGTGTGCACCACCATGTTAGGCCTGAGGAGGAAAAA +TGTATAATAAGGCATTACACAAACTAGTAAAAGGTGGTTAACTACTATGCTAAGAAATAC +AGGAATGGAAAATGCTACTATCCTAGGGAAGAGGGAGAGTCCTCAGAAAAGGAACTCTTT +TTTTCTTTTTTCTTTTTCTTTTTTTTTTTTTTGAGATGGAGTTCGCTCTTGTTGCCCAGG +CTGGAGTGCAATGGTGCAATCTCGGCTCACCACAACCTCCACCTCCCGGGTTCAAGCAAT +TCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGCACCACCATATCCCACTA +ATTTTGCATTTTTAGTAGAGAAAAGGTTTCTCCATGTTGGTCAGCCTGGTCTCGAACTCC +CAACCTCAGGTGATCCACCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCC +ACCATGCCCAGCAGAAAAGGAACTCTTGTAAGAGGCTCCTACCCACTCAGGCTGAGTTTC +AGACCTCCTTGGAGCAGGAGTGGCCGCAGCCTGCTGGATGGAGAGAAGCTGCCAGAGTGA +GTGATGACACAGGAACTCCTGCCGCACAGGAGGGAAGGAAAAGAACATCCCAGAAGCATC +CCAGATGCCAGCACAAATACCACCTCCCCTGGCGCCGATCCCAGGCTCTCCCAGGAATTG +TCTGAATATGCCCTGGTTCCCAGTACATAGATAATCTGCTCAAAAGCTGGTGCTGGCCTA +AAAGACCCAAGTCTTCCATGTGTTTGGAGTCTGTGTCCTGCCACAGAGAACAGGATCTGG +CCAGGCGCAGATGCCGGAATTACAACTGCGCACTACCGCGCCCAGCCAATTTTATTGTAG +AGACGAGGTCTCCCTATGTTGTCCAGGCTGGTCTTGAACTCCTGGGCTCAAGTGATCCTC +CCTCCTTGGCTTGGCCTCCCAAAGTACTGGGATTACAGGTGTGAGCCACCACACCTAGCC +TCAAAATACTCTTAAGAAAAAACTTTACCTGGCCGGGTGTGGTGGCTCACACCTGTAATC +CCAGCACTTTGGGAGGCCAAGGTGGCTGGATCACCTGAAGTCAGGAGTTCGAGACCAGCC +CAGCCAACATGGTGAAACCCTGTTTCCACCAAAAATATAAAAATTAGCCAGGCATGGTGG +CGTGCACCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAAACGCTTGAACCCGGG +AGGTGGAGGCTGCAGTGAGCCAAGATCATGCCACTGCATTCCAGCCTGCGCAACAGAGCA +AGACTCTGTCTCAAAAAAATAAAAAATAAAAATAAAAATTTTAGATAAAAAGAGAAAAAG +TAAAAAATTAAAACTTTACCTTATCTCTTTTCTTAGTTCATCTTCGGCTGCTTGATTTTC +TCCAGGACAAATCTTTGCCCTAAACTTCCTGTAATTCTGTTCCCCTTCACTTTGCTGTGC +TTCATGATGTAACTGCTTTATTCGTTTAGCTAAAGAACTCATAGAAAAGTCCAGGGGCAC +AACTTTCTTATTAATTTTCACAGCTACATCAACCTGAGAGGCTGACATGTCCTGAGTATT +TACTAACTTTTGACAAATGTCAGAACTGGAAAGAATTTCTTCTTTTTTAAAACGCTTTGT +GTTTGGGGTTGCGAGATTAGTTGGCTGAGGCAAAACTCGAAATTTACATCCGGTATCTTC +CTGGTTTGAATGGCAGTCCACATCTGAAAAAGAGTCGTCAGTTTTAGGCGCTTTCTCCTG +AGAGTCCACATGTTCCTGCGAGCCCCTGTCCCCTGGGGAGCTGGCCGCATACTCGCTGCT +GCAGTGACTGCCCGTGTCTGGGATGCTGAACCCCTCAGAATCCACGGAAGTGCTGCCGTG +CCCCGAGTCCTTCTCCACCTCCGCTCTGTCCGTAGGGTCACTGGGTCCGTGACTGGAACT +CACTGCCTCTTTCTGAGGTCTCAGGACGCCTTTGTCAGAGATGGCACCTGAAGTGCTAGA +AGACAGCATACCCCTTTTCTGTCCTAGAGGGCTCCTTCTTGGTTCTGGAGTCTTTGGGCT +GTGAGGCTTGTTCTCTGTTGTGTGACGAAGAGAAAAGGCCTCTCGCAGTCTGGAAATGGA +CACGTCTTTTTTTTCTTCTCCAGTCCTTAATGAAGGGGATTGATCCTGCTTTTCTACCAT +GGGCTTTTCCAAATCCGCTGCATGCATTTTTATTAAGTTACCTAAGCAAACGTGGACGGA +GAAGAGGGTCAGGGACTATCCTGAAATGGTGAGAGGACGTGCTTATGTGAACAGATACTT +CACAAAAGAGGAGATCCACATGCTAATTACACAGATGAACACAGTTCAATGTTCAAAATA +AAACTATAATATGGGCCAGGTGTGGTGGCTTACGCCTGTTATCCCAGCACTTTAGGAGGC +CAAGGCAGGGGGATCACATGAGGCTAGGAGTTCAGGACTGGTCTGGACAACATGGTAAAA +CCCTGTTTCTACTAAAAATACAAAAATTAGCCGGGTGTGGTGGCATATCTGTCATCCCAG +CTACTTGGGAGGCTGAGGCACGAGAATCCCTTTAGCCCGGGAGGCAGAGGTTGCAGTGAG +CCAAGATGCCACCACTGCTCTCCAGCCTGGGTGACAGAGCAACACTCTGTCTCAAAAAAA +AAAAAAAAAAAAAAAAAAACCACAACACAATGCAATATGGCCATATACTCACCAGAATGG +TAAAATTAAAAAAACAACAAATGCTCACAAAGATCAGGATCAAGAGGAATGCCTGAATAC +CTCTGGTAGGAATGAACCTGGTACAGCTGCTTTGAAAAGTTCTCTGGGAATACCTCCTAA +ATCTGAATGTATGCACACCTGCAACCCAGCATAGCTACTCCTATCAGAAGTGCCTATTGG +CCGGCACAGTGGCTCACGCCTGTAATCCTAGCCCTTTGAGGTCAGGAGTTCAAGACCAGC +CTGACCAACATGGTGAAACCTCATCTCTACTAAAAATACAAAAAAAAAATTAGCAGGGCA +TAGTGGAATGCACTTATAATCCCAGCTACTAGGGAGAATGAGAATGAGGCAGGAGAATCA +CTTGAACCTGGAAGGCAGGGTTGCAGCGAGCCAAGATCACTCCACTGCACTCCAGCCTGG +GCGACAGAGTGAGACTCCCTCTCAAAAAAAAGAAAGAAGTGCCTATCTATGCTCGTCAAA +AAGACGTGGATGAGGATGTTCATGACAGCATTCTTCATTATAGCCCCAAACTGGAAACAA +TTCAAATATTCACAAATGATGATATCTGACTATAATGGAACACTGTATAGCGAACGAATA +AATGAATTTTGCCACATGACTTGGGTGAATCTCACAAACAAAATAATGAGAGAAAGAAAC +AAATCACAGAAAAGGACAGACTGAATAACTTCAAATTAAAAACAGATTAAACTATACCGT +TTTGGGTTTTTTTTGTTTGTTTGTCTGTTTTTTTGAGACGGAGTCTCGCTTTGTCACCCA +GGCTGGAGTGCAGTGGCACAATCTTGGCTCACTGCAAGCCCCTCCTCCCGGGTTCACGCC +ATTCTCCTGAGTCAGCCTCCTGAGTAGCTGGGACTACAGGCGCCCGCCACCACGCCCGGC +TAATTTTTTGTATTTTTAGTAGAGACAGAGTTTCACCGTGTTAGATAGTCTCGATCTCCT +GACCTCGTGAGCCGCCCGCCTTGGCCTCCCAAAGTACTGGGATTACAAGCATGAGCCGCT +GCGCCTGGCCTAAATTCTACTGTTAGAAGTCAGGAAATCCCAGCATTTTCAGAGGCCAAG +GCTAGAGGACTGCTTGAGCTCAAGAGTTTGAGACCAGCCTGGGCATCATGGAGAAACCCC +ATCTCTAATGACAATACAAACATTAGCCAGGTGTGGTGGTGGGCGCCTGTAATCCCAGCT +ACTCAGGAGGCTGAGGCAGGAGAATCTCTTGAACGTGGGAGGCTTCAAGGTTGCAGTGAG +CTGAGATCGCATCATTGCACTCCAGCCTGGGTGACAAGAGCGAAACTCCATCTCAAAAAA +ACAGACATGACAAGGGAGTTAAAAATGCAGTCACTGCAGACTTCTTCTAATCATATATCT +TATATGACTTCATCCGTTTACAGTTTACAAAAAACTAGAGGTACTTGGAGGCAGCTACTT +GGGAGGCTGAGGCAGGAAGATGGCCTGAGCCCAGGAGTGTGATGCTGCAGTGAGCTACAA +TGGCACCACTGCAGTCCAGCCTGGGTGACAGAGCAAGTCCCTGTCTCAAAAAAGAATTAA +AAATGATAAAATAATATAAGAGACTTTGTTTTCATGTCAAAAAAAAGTTTACTTGGAAAA +AATAAGGAAACACATTAGCTAAAAGCTTTAGAAGCTGTTTGTACACTGTATTTTTCTTAC +CTTCAACATCCAGCAGTGGCTGCTGACTGACATTTAGCTTGTTGACATCACTATCAAACA +TTCCTATCAAAGAGGTCTTTAAAACTGCCAACAAAAGCTTTTCCTCTTGTAGCAAAATTT +GCCTTTTATCTGGAGTAACATTGATATCAACGCATTCTAAGGCAAAAAAGAAAACATATT +TATTATGTTTAAATTCACTTTTATTTTATTTATTAATTATTATTTTCAGACAGCGTCTCA +CTCTGTCGCCTAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAACCTCCGCCTCC +TGGGTTCAAGTGATTCTCCCTGCCTCAGCCTCCGAAGTAGCTAGGATTACAGGCAAGTGC +CACCACACTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCGTGTTGGCCAGG +CTGGTCTCGAACTCACAACCTCAAGTGATCCACCCGCCTTGGCCTCCCAAAGTTCTGGGA +TTACAGGCGTGAGCCACCGCGCCCAGCCAAATTCACTTTTAACAATAGAAATTTCCCCAT +CTATTATTTCATTCACTTGTATTTATCACAAGTGCTATTAAAAACATTACAGTGTCCAGG +TTAAGATTCATAAGTTATGAAATCAGCTTTTTCAAATAAATGAGCAAAAGACAATTTTTG +AATAGACAAAATACGGAAGGGCTTAATTAGGTAAATTGTTAAAGGAAAAGCAAATAAACA +CATAAAAATAATTTTAAATATGCAAACTAAAATAAGATATTTTAATCCCCTACTGAATTT +AGCTTAACAATTATAATACCTAGTACATTATAGGTAGGTGTGTAAATTGGTACATCTAGA +ACACAATACAGAAAAAGCCTTAAAATGATCATATTCACTGACCCAGTAATTCTACTCCCG +GCAATTTATATTCAGAATAATTAAAGATGTAGGCTGAGTGCAGTGGCTCACACCCTGTAA +TCCCAGCACTTTGGGAGGCTGAGACGAACAGATAAGTTAAGGTCAGGAGTTCAAGACCAG +CCTGGCCAACATGGCGAAACCCCGTCTCTACTAAAAACACAAAAAATTAGTCAGGCATGG +TGGCAGGTGCCTGTAATCCGTGCTACTCAGGAGGCTGAGGCGGGAGAATCGCTTGAACTC +AGGAATCGGAGGTTACAGTGAGCTGATATTACACCACTGCACTCCAGCTTGGGAGACAGA +GCAAGACTCTGTCTCAAAAAATATCTAATAATAAAGATGCAGATAATGATTTAATTATAA +GGAAAGTATTTATAATTTCCAAAAACTAAAAACAATTTAATTTTGAAAAATTTAAAAATT +AAAATACCAAACTATAACCATGCATTGGAATATAATTCACCTATTAAAACCACATTTCTG +ATCAATTTCTAATAACATGGAAAAGAAAACATTCACATCTAAGGATAAAGAGCAGTATAC +AAAATTATTTTCTCATCCCAAAGAATATGGGAGTAGGGGAGAGAGAGAGAAAGAGAGAGA +GGACAGAAGATATTTTTTAAGGTATGTACATATGTGTTTCTAAGTATCTAGAAAAAATAC +TCAATTACAATAAACCAAAATTTTAACAATCAGAAAAAAAAATCTATATGAAATGAATTA +TTTATGAAATTAGGAAGAACATTTCATCTACTTTCTCCCTTGGTTGACATTAAAAAAAAT +TACATTTTCCTAACAATATAATTAACATAGTCTCAAGTAGAAAGCGGGAACTCTGTTTAA +AAAAAAAAAAAATTATAGGGCCAGGCACGGTGGCTCATGCCTGTAATCCCAGCACTTTGG +GAGGCCAAGACGGGCAGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTG +AAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCAGGTGTGGTGGCACGCACCTGTAG +TCCCAGCTACTTGGGAGGCTGAGTCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGC +AGTGAGCCGAGATCACGCCACCACACTCCAGCCTGGGTGACAGAGCAAGACCCCGTCTCA +AAAAAAAACAAAAACAAAAAACTTACATGACCATAAATTGTTATCTCATTCCAGTCATAG +CAGAGCTGTAGAATTTCATTTTATTCTTTGAGGCATTAGTCACTAGTTGTACTGAAATGC +CAATGGAACTTACCTGAATCAACAGAAATGTTAAGAACAACAAATGGATACTGGTGTCGA +TTATACATGTGGTAGACCTCATTCACGAGTCTGCAGACCTGCACAAAATACAAGGAGTAG +AAAAGAATAAATGACAAATGTTCCCAGCCCCCCGCATTCTAACAACATTCTATTCTAACC +AACCAGCATGTTCTTAGAAGGGGATACTTTTTTGTTTTTTTTTTTTTTGAGTCAAGGTCT +CGCCTTGTCACAGCCTGGAGTGCAGTGGAGCAATCATGGCTCACTGCAGCCTCAACCTCC +CAGGCTCAAGTGATCCTCCTGTGTCAGCCTGACATGTAACTTGGATTACAGGCAGGATTT +TTTTTTCTTTTTTTTTTTCAACGGAGTCTCGCTCTTGTTACCCAGACTGGAGTGTAATGG +CACGATCTTGGCTCACTGCAACCTCTGCCTCCGGGGCTCAAGTGATTATCCTGACTCAGC +CTCCAGAGTAGCTGGGATTACAGGCACACGCCACCATGACCAGATAATTTTTGTGTTTTT +A diff --git a/paraphase/data/pms2/pms2_ref.fa.fai b/paraphase/data/pms2/pms2_ref.fa.fai new file mode 100644 index 0000000..c9a4c62 --- /dev/null +++ b/paraphase/data/pms2/pms2_ref.fa.fai @@ -0,0 +1 @@ +chr7_5967000_5992500 25501 22 60 61 diff --git a/paraphase/data/rccx/cyp21_diff_sites.txt b/paraphase/data/rccx/cyp21_diff_sites.txt new file mode 100644 index 0000000..6ee7852 --- /dev/null +++ b/paraphase/data/rccx/cyp21_diff_sites.txt @@ -0,0 +1,10 @@ +chr6 32038514 C T P31L +chr6 32039081 C G IVS2-13A/C>G +chr6 32039132 GGAGACTAC G G111Vfs +chr6 32039426 T A I173N +chr6 32039807 T A I237N +chr6 32039810 T A V238E +chr6 32039816 T A M240K +chr6 32040110 G T V282L +chr6 32040421 C T Q319X +chr6 32040535 C T R357W diff --git a/paraphase/data/rccx/rccx_config.yaml b/paraphase/data/rccx/rccx_config.yaml new file mode 100755 index 0000000..3123981 --- /dev/null +++ b/paraphase/data/rccx/rccx_config.yaml @@ -0,0 +1,40 @@ +gene: rccx +use_supplementary: True + +# data files +data: + reference: rccx_ref.fa + snp_file: cyp21_diff_sites.txt + homopolymer: rccx_homopolymer_sites.txt + +# coordinates +coordinates: + hg38: + nchr: "chr6" + nchr_old: "chr6_32013240_32049000" + nchr_length: 170805979 + extract_region1: "chr6:31980000-32046800" + extract_region2: "" + + clip_3p_positions: [32046127] + clip_5p_positions: [32013270] + + deletion1_size: 6367 + deletion2_size: 120 + + noisy_region: [[32029159, 32029159], [32022483, 32022483]] + + left_boundary: 32013300 + right_boundary: 32046000 + + # 120bp sv + del2_3p_pos1: 32043700 + del2_3p_pos2: 32043723 + del2_5p_pos1: 32043828 + del2_5p_pos2: 32043865 + + # 6.3kb sv in C4B + del1_3p_pos1: 32017400 + del1_3p_pos2: 32017433 + del1_5p_pos1: 32023788 + del1_5p_pos2: 32023798 diff --git a/paraphase/data/rccx/rccx_homopolymer_sites.txt b/paraphase/data/rccx/rccx_homopolymer_sites.txt new file mode 100644 index 0000000..e65f800 --- /dev/null +++ b/paraphase/data/rccx/rccx_homopolymer_sites.txt @@ -0,0 +1,1764 @@ +chr5 32014377 A +chr5 32014378 T,G,C,A +chr5 32014379 0 +chr5 32014380 0 +chr5 32014381 0 +chr5 32014382 0 +chr5 32014383 0 +chr5 32014384 1,A +chr5 32014566 G +chr5 32014567 T,G,C,A +chr5 32014568 0 +chr5 32014569 0 +chr5 32014570 0 +chr5 32014571 0 +chr5 32014572 1,G +chr5 32014672 C +chr5 32014673 T,G,C,A +chr5 32014674 0 +chr5 32014675 0 +chr5 32014676 0 +chr5 32014677 0 +chr5 32014678 1,C +chr5 32014743 G +chr5 32014744 T,G,C,A +chr5 32014745 0 +chr5 32014746 0 +chr5 32014747 0 +chr5 32014748 0 +chr5 32014749 1,G +chr5 32014858 G +chr5 32014859 T,G,C,A +chr5 32014860 0 +chr5 32014861 0 +chr5 32014862 0 +chr5 32014863 0 +chr5 32014864 1,G +chr5 32015075 G +chr5 32015076 T,G,C,A +chr5 32015077 0 +chr5 32015078 0 +chr5 32015079 0 +chr5 32015080 0 +chr5 32015081 1,G,C +chr5 32015082 T,G,C,A +chr5 32015083 0 +chr5 32015084 0 +chr5 32015085 0 +chr5 32015086 0 +chr5 32015087 1,C +chr5 32015093 G +chr5 32015094 T,G,C,A +chr5 32015095 0 +chr5 32015096 0 +chr5 32015097 0 +chr5 32015098 0 +chr5 32015099 1,G +chr5 32015273 G +chr5 32015274 T,G,C,A +chr5 32015275 0 +chr5 32015276 0 +chr5 32015277 0 +chr5 32015278 0 +chr5 32015279 1,G +chr5 32015394 T +chr5 32015395 T,G,C,A +chr5 32015396 0 +chr5 32015397 0 +chr5 32015398 0 +chr5 32015399 0 +chr5 32015400 T,1 +chr5 32015603 G +chr5 32015604 T,G,C,A +chr5 32015605 0 +chr5 32015606 0 +chr5 32015607 0 +chr5 32015608 0 +chr5 32015609 1,G +chr5 32015614 T +chr5 32015615 T,G,C,A +chr5 32015616 0 +chr5 32015617 0 +chr5 32015618 0 +chr5 32015619 0 +chr5 32015620 T,1 +chr5 32015689 C +chr5 32015690 T,G,C,A +chr5 32015691 0 +chr5 32015692 0 +chr5 32015693 0 +chr5 32015694 0 +chr5 32015695 1,C +chr5 32015722 C +chr5 32015723 T,G,C,A +chr5 32015724 0 +chr5 32015725 0 +chr5 32015726 0 +chr5 32015727 0 +chr5 32015728 1,C +chr5 32016121 G +chr5 32016122 T,G,C,A +chr5 32016123 0 +chr5 32016124 0 +chr5 32016125 0 +chr5 32016126 0 +chr5 32016127 1,G +chr5 32016129 C +chr5 32016130 T,G,C,A +chr5 32016131 0 +chr5 32016132 0 +chr5 32016133 0 +chr5 32016134 0 +chr5 32016135 1,C +chr5 32016231 C +chr5 32016232 T,G,C,A +chr5 32016233 0 +chr5 32016234 0 +chr5 32016235 0 +chr5 32016236 0 +chr5 32016237 1,C +chr5 32016768 C +chr5 32016769 T,G,C,A +chr5 32016770 0 +chr5 32016771 0 +chr5 32016772 0 +chr5 32016773 0 +chr5 32016774 0 +chr5 32016775 1,C +chr5 32016834 G +chr5 32016835 T,G,C,A +chr5 32016836 0 +chr5 32016837 0 +chr5 32016838 0 +chr5 32016839 0 +chr5 32016840 1,G +chr5 32016893 G +chr5 32016894 T,G,C,A +chr5 32016895 0 +chr5 32016896 0 +chr5 32016897 0 +chr5 32016898 0 +chr5 32016899 1,G +chr5 32017523 G +chr5 32017524 T,G,C,A +chr5 32017525 0 +chr5 32017526 0 +chr5 32017527 0 +chr5 32017528 0 +chr5 32017529 1,G +chr5 32017531 G +chr5 32017532 T,G,C,A +chr5 32017533 0 +chr5 32017534 0 +chr5 32017535 0 +chr5 32017536 0 +chr5 32017537 1,G +chr5 32017588 T +chr5 32017589 T,G,C,A +chr5 32017590 0 +chr5 32017591 0 +chr5 32017592 0 +chr5 32017593 0 +chr5 32017594 0 +chr5 32017595 T,1 +chr5 32017837 G +chr5 32017838 T,G,C,A +chr5 32017839 0 +chr5 32017840 0 +chr5 32017841 0 +chr5 32017842 0 +chr5 32017843 1,G +chr5 32017931 T +chr5 32017932 T,G,C,A +chr5 32017933 0 +chr5 32017934 0 +chr5 32017935 0 +chr5 32017936 0 +chr5 32017937 T,1 +chr5 32017981 C +chr5 32017982 T,G,C,A +chr5 32017983 0 +chr5 32017984 0 +chr5 32017985 0 +chr5 32017986 0 +chr5 32017987 0 +chr5 32017988 1,C +chr5 32017991 T +chr5 32017992 T,G,C,A +chr5 32017993 0 +chr5 32017994 0 +chr5 32017995 0 +chr5 32017996 0 +chr5 32017997 0 +chr5 32017998 0 +chr5 32017999 0 +chr5 32018000 0 +chr5 32018001 0 +chr5 32018002 0 +chr5 32018003 0 +chr5 32018004 0 +chr5 32018005 T,1 +chr5 32018241 A +chr5 32018242 T,G,C,A +chr5 32018243 0 +chr5 32018244 0 +chr5 32018245 0 +chr5 32018246 0 +chr5 32018247 1,A +chr5 32018250 T +chr5 32018251 T,G,C,A +chr5 32018252 0 +chr5 32018253 0 +chr5 32018254 0 +chr5 32018255 0 +chr5 32018256 0 +chr5 32018257 0 +chr5 32018258 T,1 +chr5 32018283 T +chr5 32018284 T,G,C,A +chr5 32018285 0 +chr5 32018286 0 +chr5 32018287 0 +chr5 32018288 0 +chr5 32018289 T,1 +chr5 32018290 T,G,C,A +chr5 32018291 0 +chr5 32018292 0 +chr5 32018293 0 +chr5 32018294 0 +chr5 32018295 0 +chr5 32018296 0 +chr5 32018297 T,1 +chr5 32018315 T +chr5 32018316 T,G,C,A +chr5 32018317 0 +chr5 32018318 0 +chr5 32018319 0 +chr5 32018320 A +chr5 32018321 T,G,C,A +chr5 32018322 0 +chr5 32018323 0 +chr5 32018324 0 +chr5 32018325 0 +chr5 32018326 1,A +chr5 32018517 T +chr5 32018518 T,G,C,A +chr5 32018519 0 +chr5 32018520 0 +chr5 32018521 0 +chr5 32018522 0 +chr5 32018523 0 +chr5 32018524 T,1 +chr5 32018631 A +chr5 32018632 T,G,C,A +chr5 32018633 0 +chr5 32018634 0 +chr5 32018635 0 +chr5 32018636 0 +chr5 32018637 1,A +chr5 32018661 T +chr5 32018662 T,G,C,A +chr5 32018663 0 +chr5 32018664 0 +chr5 32018665 0 +chr5 32018666 0 +chr5 32018667 0 +chr5 32018668 0 +chr5 32018669 0 +chr5 32018670 T,1 +chr5 32018675 T +chr5 32018676 T,G,C,A +chr5 32018677 0 +chr5 32018678 0 +chr5 32018679 0 +chr5 32018680 0 +chr5 32018681 T,1 +chr5 32018753 T +chr5 32018754 T,G,C,A +chr5 32018755 0 +chr5 32018756 0 +chr5 32018757 0 +chr5 32018758 0 +chr5 32018759 T,1 +chr5 32018774 T +chr5 32018775 T,G,C,A +chr5 32018776 0 +chr5 32018777 0 +chr5 32018778 0 +chr5 32018779 0 +chr5 32018780 T,1 +chr5 32018860 T +chr5 32018861 T,G,C,A +chr5 32018862 0 +chr5 32018863 0 +chr5 32018864 0 +chr5 32018865 0 +chr5 32018866 T,1 +chr5 32018946 T +chr5 32018947 T,G,C,A +chr5 32018948 0 +chr5 32018949 0 +chr5 32018950 0 +chr5 32018951 0 +chr5 32018952 T,1 +chr5 32018955 T +chr5 32018956 T,G,C,A +chr5 32018957 0 +chr5 32018958 0 +chr5 32018959 0 +chr5 32018960 0 +chr5 32018961 0 +chr5 32018962 0 +chr5 32018963 0 +chr5 32018964 0 +chr5 32018965 0 +chr5 32018966 T,1 +chr5 32018977 T +chr5 32018978 T,G,C,A +chr5 32018979 0 +chr5 32018980 0 +chr5 32018981 0 +chr5 32018982 0 +chr5 32018983 0 +chr5 32018984 0 +chr5 32018985 T,1 +chr5 32019119 T +chr5 32019120 T,G,C,A +chr5 32019121 0 +chr5 32019122 0 +chr5 32019123 0 +chr5 32019124 0 +chr5 32019125 T,1 +chr5 32019143 T +chr5 32019144 T,G,C,A +chr5 32019145 0 +chr5 32019146 0 +chr5 32019147 0 +chr5 32019148 0 +chr5 32019149 T,1 +chr5 32019291 T +chr5 32019292 T,G,C,A +chr5 32019293 0 +chr5 32019294 0 +chr5 32019295 0 +chr5 32019296 0 +chr5 32019297 0 +chr5 32019298 0 +chr5 32019299 0 +chr5 32019300 T,1 +chr5 32019307 T +chr5 32019308 T,G,C,A +chr5 32019309 0 +chr5 32019310 0 +chr5 32019311 0 +chr5 32019312 0 +chr5 32019313 T,1 +chr5 32019348 T +chr5 32019349 T,G,C,A +chr5 32019350 0 +chr5 32019351 0 +chr5 32019352 0 +chr5 32019353 0 +chr5 32019354 G +chr5 32019355 T,G,C,A +chr5 32019356 0 +chr5 32019357 0 +chr5 32019358 0 +chr5 32019359 0 +chr5 32019360 1,G +chr5 32019514 G +chr5 32019515 T,G,C,A +chr5 32019516 0 +chr5 32019517 0 +chr5 32019518 0 +chr5 32019519 0 +chr5 32019520 0 +chr5 32019521 1,G +chr5 32019550 T +chr5 32019551 T,G,C,A +chr5 32019552 0 +chr5 32019553 0 +chr5 32019554 0 +chr5 32019555 0 +chr5 32019556 0 +chr5 32019557 0 +chr5 32019558 0 +chr5 32019559 0 +chr5 32019560 0 +chr5 32019561 0 +chr5 32019562 0 +chr5 32019563 T,1 +chr5 32019636 G +chr5 32019637 T,G,C,A +chr5 32019638 0 +chr5 32019639 0 +chr5 32019640 0 +chr5 32019641 0 +chr5 32019642 1,G +chr5 32019748 T +chr5 32019749 T,G,C,A +chr5 32019750 0 +chr5 32019751 0 +chr5 32019752 0 +chr5 32019753 0 +chr5 32019754 T,1 +chr5 32019816 T +chr5 32019817 T,G,C,A +chr5 32019818 0 +chr5 32019819 0 +chr5 32019820 0 +chr5 32019821 0 +chr5 32019822 0 +chr5 32019823 0 +chr5 32019824 0 +chr5 32019825 0 +chr5 32019826 0 +chr5 32019827 T,1 +chr5 32019896 G +chr5 32019897 T,G,C,A +chr5 32019898 0 +chr5 32019899 0 +chr5 32019900 0 +chr5 32019901 0 +chr5 32019902 1,G +chr5 32019949 G +chr5 32019950 T,G,C,A +chr5 32019951 0 +chr5 32019952 0 +chr5 32019953 0 +chr5 32019954 0 +chr5 32019955 1,G +chr5 32020049 A +chr5 32020050 T,G,C,A +chr5 32020051 0 +chr5 32020052 0 +chr5 32020053 0 +chr5 32020054 0 +chr5 32020055 1,A +chr5 32020082 T +chr5 32020083 T,G,C,A +chr5 32020084 0 +chr5 32020085 0 +chr5 32020086 0 +chr5 32020087 0 +chr5 32020088 0 +chr5 32020089 0 +chr5 32020090 T,1 +chr5 32020157 T +chr5 32020158 T,G,C,A +chr5 32020159 0 +chr5 32020160 0 +chr5 32020161 0 +chr5 32020162 0 +chr5 32020163 0 +chr5 32020164 T,1 +chr5 32020178 T +chr5 32020179 T,G,C,A +chr5 32020180 0 +chr5 32020181 0 +chr5 32020182 0 +chr5 32020183 0 +chr5 32020184 0 +chr5 32020185 0 +chr5 32020186 T,1 +chr5 32020187 T,G,C,A +chr5 32020188 0 +chr5 32020189 0 +chr5 32020190 0 +chr5 32020191 0 +chr5 32020192 0 +chr5 32020193 T,1 +chr5 32020194 T +chr5 32020195 T,G,C,A +chr5 32020196 0 +chr5 32020197 0 +chr5 32020198 0 +chr5 32020199 0 +chr5 32020200 0 +chr5 32020201 T,1 +chr5 32020270 T +chr5 32020271 T,G,C,A +chr5 32020272 0 +chr5 32020273 0 +chr5 32020274 0 +chr5 32020275 0 +chr5 32020276 T,1 +chr5 32020279 T +chr5 32020280 T,G,C,A +chr5 32020281 0 +chr5 32020282 0 +chr5 32020283 0 +chr5 32020284 0 +chr5 32020285 T,1 +chr5 32020307 A +chr5 32020308 T,G,C,A +chr5 32020309 0 +chr5 32020310 0 +chr5 32020311 0 +chr5 32020312 0 +chr5 32020313 1,A +chr5 32020359 T +chr5 32020360 T,G,C,A +chr5 32020361 0 +chr5 32020362 0 +chr5 32020363 0 +chr5 32020364 0 +chr5 32020365 T,1 +chr5 32020393 T +chr5 32020394 T,G,C,A +chr5 32020395 0 +chr5 32020396 0 +chr5 32020397 0 +chr5 32020398 0 +chr5 32020399 0 +chr5 32020400 T,1 +chr5 32020451 T +chr5 32020452 T,G,C,A +chr5 32020453 0 +chr5 32020454 0 +chr5 32020455 0 +chr5 32020456 0 +chr5 32020457 0 +chr5 32020458 T,1 +chr5 32020464 T +chr5 32020465 T,G,C,A +chr5 32020466 0 +chr5 32020467 0 +chr5 32020468 0 +chr5 32020469 0 +chr5 32020470 0 +chr5 32020471 T,1 +chr5 32020475 T +chr5 32020476 T,G,C,A +chr5 32020477 0 +chr5 32020478 0 +chr5 32020479 0 +chr5 32020480 0 +chr5 32020481 0 +chr5 32020482 0 +chr5 32020483 T,1 +chr5 32020499 T +chr5 32020500 T,G,C,A +chr5 32020501 0 +chr5 32020502 0 +chr5 32020503 0 +chr5 32020504 0 +chr5 32020505 T,1 +chr5 32020517 T +chr5 32020518 T,G,C,A +chr5 32020519 0 +chr5 32020520 0 +chr5 32020521 0 +chr5 32020522 0 +chr5 32020523 T,1 +chr5 32020533 T +chr5 32020534 T,G,C,A +chr5 32020535 0 +chr5 32020536 0 +chr5 32020537 0 +chr5 32020538 0 +chr5 32020539 0 +chr5 32020540 0 +chr5 32020541 T,1 +chr5 32020552 T +chr5 32020553 T,G,C,A +chr5 32020554 0 +chr5 32020555 0 +chr5 32020556 0 +chr5 32020557 0 +chr5 32020558 T,1 +chr5 32020562 T +chr5 32020563 T,G,C,A +chr5 32020564 0 +chr5 32020565 0 +chr5 32020566 0 +chr5 32020567 0 +chr5 32020568 T,1 +chr5 32020570 T +chr5 32020571 T,G,C,A +chr5 32020572 0 +chr5 32020573 0 +chr5 32020574 0 +chr5 32020575 0 +chr5 32020576 0 +chr5 32020577 0 +chr5 32020578 T,1 +chr5 32020579 T +chr5 32020580 T,G,C,A +chr5 32020581 0 +chr5 32020582 0 +chr5 32020583 0 +chr5 32020584 0 +chr5 32020585 0 +chr5 32020586 T,1 +chr5 32020588 T +chr5 32020589 T,G,C,A +chr5 32020590 0 +chr5 32020591 0 +chr5 32020592 0 +chr5 32020593 0 +chr5 32020594 0 +chr5 32020595 0 +chr5 32020596 T,1 +chr5 32020597 T +chr5 32020598 T,G,C,A +chr5 32020599 0 +chr5 32020600 0 +chr5 32020601 0 +chr5 32020602 0 +chr5 32020603 T,1 +chr5 32020611 T +chr5 32020612 T,G,C,A +chr5 32020613 0 +chr5 32020614 0 +chr5 32020615 0 +chr5 32020616 0 +chr5 32020617 0 +chr5 32020618 0 +chr5 32020619 0 +chr5 32020620 T,1 +chr5 32020626 T +chr5 32020627 T,G,C,A +chr5 32020628 0 +chr5 32020629 0 +chr5 32020630 0 +chr5 32020631 0 +chr5 32020632 0 +chr5 32020633 0 +chr5 32020634 T,1 +chr5 32020689 T +chr5 32020690 T,G,C,A +chr5 32020691 0 +chr5 32020692 0 +chr5 32020693 0 +chr5 32020694 0 +chr5 32020695 T,1 +chr5 32021043 A +chr5 32021044 T,G,C,A +chr5 32021045 0 +chr5 32021046 0 +chr5 32021047 0 +chr5 32021048 0 +chr5 32021049 1,A +chr5 32021092 T +chr5 32021093 T,G,C,A +chr5 32021094 0 +chr5 32021095 0 +chr5 32021096 0 +chr5 32021097 0 +chr5 32021098 T,1 +chr5 32021186 T +chr5 32021187 T,G,C,A +chr5 32021188 0 +chr5 32021189 0 +chr5 32021190 0 +chr5 32021191 0 +chr5 32021192 T,1 +chr5 32021213 A +chr5 32021214 T,G,C,A +chr5 32021215 0 +chr5 32021216 0 +chr5 32021217 0 +chr5 32021218 0 +chr5 32021219 1,A +chr5 32021254 T +chr5 32021255 T,G,C,A +chr5 32021256 0 +chr5 32021257 0 +chr5 32021258 0 +chr5 32021259 0 +chr5 32021260 T,1 +chr5 32021296 T +chr5 32021297 T,G,C,A +chr5 32021298 0 +chr5 32021299 0 +chr5 32021300 0 +chr5 32021301 0 +chr5 32021302 T,1 +chr5 32021386 T +chr5 32021387 T,G,C,A +chr5 32021388 0 +chr5 32021389 0 +chr5 32021390 0 +chr5 32021391 0 +chr5 32021392 T,1 +chr5 32021408 T +chr5 32021409 T,G,C,A +chr5 32021410 0 +chr5 32021411 0 +chr5 32021412 0 +chr5 32021413 0 +chr5 32021414 T,1 +chr5 32021519 A +chr5 32021520 T,G,C,A +chr5 32021521 0 +chr5 32021522 0 +chr5 32021523 0 +chr5 32021524 0 +chr5 32021525 1,A +chr5 32021559 C +chr5 32021560 T,G,C,A +chr5 32021561 0 +chr5 32021562 0 +chr5 32021563 0 +chr5 32021564 0 +chr5 32021565 1,C +chr5 32021807 T +chr5 32021808 T,G,C,A +chr5 32021809 0 +chr5 32021810 0 +chr5 32021811 0 +chr5 32021812 0 +chr5 32021813 T,1 +chr5 32022004 T +chr5 32022005 T,G,C,A +chr5 32022006 0 +chr5 32022007 0 +chr5 32022008 0 +chr5 32022009 0 +chr5 32022010 T,1 +chr5 32022132 T +chr5 32022133 T,G,C,A +chr5 32022134 0 +chr5 32022135 0 +chr5 32022136 0 +chr5 32022137 0 +chr5 32022138 0 +chr5 32022139 T,1 +chr5 32022168 T +chr5 32022169 T,G,C,A +chr5 32022170 0 +chr5 32022171 0 +chr5 32022172 0 +chr5 32022173 0 +chr5 32022174 0 +chr5 32022175 T,1 +chr5 32022201 A +chr5 32022202 T,G,C,A +chr5 32022203 0 +chr5 32022204 0 +chr5 32022205 0 +chr5 32022206 0 +chr5 32022207 0 +chr5 32022208 1,A +chr5 32022258 G +chr5 32022259 T,G,C,A +chr5 32022260 0 +chr5 32022261 0 +chr5 32022262 0 +chr5 32022263 0 +chr5 32022264 1,G +chr5 32022326 T +chr5 32022327 T,G,C,A +chr5 32022328 0 +chr5 32022329 0 +chr5 32022330 0 +chr5 32022331 0 +chr5 32022332 T,1 +chr5 32022385 T +chr5 32022386 T,G,C,A +chr5 32022387 0 +chr5 32022388 0 +chr5 32022389 0 +chr5 32022390 0 +chr5 32022391 0 +chr5 32022392 0 +chr5 32022393 T,1 +chr5 32022413 C +chr5 32022414 T,G,C,A +chr5 32022415 0 +chr5 32022416 0 +chr5 32022417 0 +chr5 32022418 0 +chr5 32022419 1,C +chr5 32022594 C +chr5 32022595 T,G,C,A +chr5 32022596 0 +chr5 32022597 0 +chr5 32022598 0 +chr5 32022599 0 +chr5 32022600 1,C +chr5 32022668 T +chr5 32022669 T,G,C,A +chr5 32022670 0 +chr5 32022671 0 +chr5 32022672 0 +chr5 32022673 0 +chr5 32022674 0 +chr5 32022675 T,1 +chr5 32022727 T +chr5 32022728 T,G,C,A +chr5 32022729 0 +chr5 32022730 0 +chr5 32022731 0 +chr5 32022732 0 +chr5 32022733 T,1 +chr5 32023083 T +chr5 32023084 T,G,C,A +chr5 32023085 0 +chr5 32023086 0 +chr5 32023087 0 +chr5 32023088 0 +chr5 32023089 T,1 +chr5 32023344 G +chr5 32023345 T,G,C,A +chr5 32023346 0 +chr5 32023347 0 +chr5 32023348 0 +chr5 32023349 0 +chr5 32023350 1,G +chr5 32023742 T +chr5 32023743 T,G,C,A +chr5 32023744 0 +chr5 32023745 0 +chr5 32023746 0 +chr5 32023747 0 +chr5 32023748 T,1 +chr5 32023898 C +chr5 32023899 T,G,C,A +chr5 32023900 0 +chr5 32023901 0 +chr5 32023902 0 +chr5 32023903 0 +chr5 32023904 1,C +chr5 32024028 C +chr5 32024029 T,G,C,A +chr5 32024030 0 +chr5 32024031 0 +chr5 32024032 0 +chr5 32024033 0 +chr5 32024034 1,C +chr5 32024354 G +chr5 32024355 T,G,C,A +chr5 32024356 0 +chr5 32024357 0 +chr5 32024358 0 +chr5 32024359 0 +chr5 32024360 1,G +chr5 32024406 G +chr5 32024407 T,G,C,A +chr5 32024408 0 +chr5 32024409 0 +chr5 32024410 0 +chr5 32024411 0 +chr5 32024412 0 +chr5 32024413 1,G +chr5 32024457 C +chr5 32024458 T,G,C,A +chr5 32024459 0 +chr5 32024460 0 +chr5 32024461 0 +chr5 32024462 0 +chr5 32024463 1,C +chr5 32024715 G +chr5 32024716 T,G,C,A +chr5 32024717 0 +chr5 32024718 0 +chr5 32024719 0 +chr5 32024720 0 +chr5 32024721 0 +chr5 32024722 0 +chr5 32024723 1,G +chr5 32025005 G +chr5 32025006 T,G,C,A +chr5 32025007 0 +chr5 32025008 0 +chr5 32025009 0 +chr5 32025010 0 +chr5 32025011 1,G +chr5 32025051 G +chr5 32025052 T,G,C,A +chr5 32025053 0 +chr5 32025054 0 +chr5 32025055 0 +chr5 32025056 0 +chr5 32025057 1,G +chr5 32025100 C +chr5 32025101 T,G,C,A +chr5 32025102 0 +chr5 32025103 0 +chr5 32025104 0 +chr5 32025105 0 +chr5 32025106 1,C +chr5 32025329 C +chr5 32025330 T,G,C,A +chr5 32025331 0 +chr5 32025332 0 +chr5 32025333 0 +chr5 32025334 0 +chr5 32025335 1,C +chr5 32025443 C +chr5 32025444 T,G,C,A +chr5 32025445 0 +chr5 32025446 0 +chr5 32025447 0 +chr5 32025448 0 +chr5 32025449 0 +chr5 32025450 1,C +chr5 32025575 G +chr5 32025576 T,G,C,A +chr5 32025577 0 +chr5 32025578 0 +chr5 32025579 0 +chr5 32025580 0 +chr5 32025581 1,G +chr5 32025859 A +chr5 32025860 T,G,C,A +chr5 32025861 0 +chr5 32025862 0 +chr5 32025863 0 +chr5 32025864 0 +chr5 32025865 1,A +chr5 32025956 G +chr5 32025957 T,G,C,A +chr5 32025958 0 +chr5 32025959 0 +chr5 32025960 0 +chr5 32025961 0 +chr5 32025962 1,G +chr5 32026103 G +chr5 32026104 T,G,C,A +chr5 32026105 0 +chr5 32026106 0 +chr5 32026107 0 +chr5 32026108 0 +chr5 32026109 1,G +chr5 32026123 T +chr5 32026124 T,G,C,A +chr5 32026125 0 +chr5 32026126 0 +chr5 32026127 0 +chr5 32026128 0 +chr5 32026129 T,1 +chr5 32026143 C +chr5 32026144 T,G,C,A +chr5 32026145 0 +chr5 32026146 0 +chr5 32026147 0 +chr5 32026148 0 +chr5 32026149 1,C +chr5 32026444 C +chr5 32026445 T,G,C,A +chr5 32026446 0 +chr5 32026447 0 +chr5 32026448 0 +chr5 32026449 0 +chr5 32026450 1,C +chr5 32026899 C +chr5 32026900 T,G,C,A +chr5 32026901 0 +chr5 32026902 0 +chr5 32026903 0 +chr5 32026904 0 +chr5 32026905 1,C +chr5 32027110 A +chr5 32027111 T,G,C,A +chr5 32027112 0 +chr5 32027113 0 +chr5 32027114 0 +chr5 32027115 0 +chr5 32027116 1,A +chr5 32027157 G +chr5 32027158 T,G,C,A +chr5 32027159 0 +chr5 32027160 0 +chr5 32027161 0 +chr5 32027162 0 +chr5 32027163 1,G +chr5 32027277 G +chr5 32027278 T,G,C,A +chr5 32027279 0 +chr5 32027280 0 +chr5 32027281 0 +chr5 32027282 0 +chr5 32027283 1,G +chr5 32027479 C +chr5 32027480 T,G,C,A +chr5 32027481 0 +chr5 32027482 0 +chr5 32027483 0 +chr5 32027484 0 +chr5 32027485 1,C +chr5 32027619 G +chr5 32027620 T,G,C,A +chr5 32027621 0 +chr5 32027622 0 +chr5 32027623 0 +chr5 32027624 0 +chr5 32027625 1,G +chr5 32027793 G +chr5 32027794 T,G,C,A +chr5 32027795 0 +chr5 32027796 0 +chr5 32027797 0 +chr5 32027798 0 +chr5 32027799 1,G +chr5 32027827 C +chr5 32027828 T,G,C,A +chr5 32027829 0 +chr5 32027830 0 +chr5 32027831 0 +chr5 32027832 0 +chr5 32027833 1,C +chr5 32028098 G +chr5 32028099 T,G,C,A +chr5 32028100 0 +chr5 32028101 0 +chr5 32028102 0 +chr5 32028103 0 +chr5 32028104 1,G +chr5 32028302 G +chr5 32028303 T,G,C,A +chr5 32028304 0 +chr5 32028305 0 +chr5 32028306 0 +chr5 32028307 0 +chr5 32028308 0 +chr5 32028309 1,G +chr5 32028366 G +chr5 32028367 T,G,C,A +chr5 32028368 0 +chr5 32028369 0 +chr5 32028370 0 +chr5 32028371 0 +chr5 32028372 1,G +chr5 32028564 G +chr5 32028565 T,G,C,A +chr5 32028566 0 +chr5 32028567 0 +chr5 32028568 0 +chr5 32028569 0 +chr5 32028570 1,G +chr5 32028631 G +chr5 32028632 T,G,C,A +chr5 32028633 0 +chr5 32028634 0 +chr5 32028635 0 +chr5 32028636 0 +chr5 32028637 1,G +chr5 32028908 C +chr5 32028909 T,G,C,A +chr5 32028910 0 +chr5 32028911 0 +chr5 32028912 0 +chr5 32028913 0 +chr5 32028914 1,C +chr5 32028943 G +chr5 32028944 T,G,C,A +chr5 32028945 0 +chr5 32028946 0 +chr5 32028947 0 +chr5 32028948 0 +chr5 32028949 0 +chr5 32028950 1,G +chr5 32029093 C +chr5 32029094 T,G,C,A +chr5 32029095 0 +chr5 32029096 0 +chr5 32029097 0 +chr5 32029098 0 +chr5 32029099 1,C +chr5 32029109 T +chr5 32029110 T,G,C,A +chr5 32029111 0 +chr5 32029112 0 +chr5 32029113 0 +chr5 32029114 0 +chr5 32029115 T,1 +chr5 32029193 T +chr5 32029194 T,G,C,A +chr5 32029195 0 +chr5 32029196 0 +chr5 32029197 0 +chr5 32029198 G +chr5 32029199 T,G,C,A +chr5 32029200 0 +chr5 32029201 0 +chr5 32029202 0 +chr5 32029203 0 +chr5 32029204 1,G +chr5 32029464 G +chr5 32029465 T,G,C,A +chr5 32029466 0 +chr5 32029467 0 +chr5 32029468 0 +chr5 32029469 0 +chr5 32029470 1,G +chr5 32029489 G +chr5 32029490 T,G,C,A +chr5 32029491 0 +chr5 32029492 0 +chr5 32029493 0 +chr5 32029494 0 +chr5 32029495 1,G +chr5 32029777 G +chr5 32029778 T,G,C,A +chr5 32029779 0 +chr5 32029780 0 +chr5 32029781 0 +chr5 32029782 0 +chr5 32029783 1,G +chr5 32029823 G +chr5 32029824 T,G,C,A +chr5 32029825 0 +chr5 32029826 0 +chr5 32029827 0 +chr5 32029828 0 +chr5 32029829 1,G +chr5 32030177 G +chr5 32030178 T,G,C,A +chr5 32030179 0 +chr5 32030180 0 +chr5 32030181 0 +chr5 32030182 0 +chr5 32030183 1,G +chr5 32030340 G +chr5 32030341 T,G,C,A +chr5 32030342 0 +chr5 32030343 0 +chr5 32030344 0 +chr5 32030345 0 +chr5 32030346 1,G +chr5 32030385 A +chr5 32030386 T,G,C,A +chr5 32030387 0 +chr5 32030388 0 +chr5 32030389 0 +chr5 32030390 0 +chr5 32030391 1,A +chr5 32030639 T +chr5 32030640 T,G,C,A +chr5 32030641 0 +chr5 32030642 0 +chr5 32030643 0 +chr5 32030644 C +chr5 32030645 T,G,C,A +chr5 32030646 0 +chr5 32030647 0 +chr5 32030648 0 +chr5 32030649 0 +chr5 32030650 1,C +chr5 32031337 G +chr5 32031338 T,G,C,A +chr5 32031339 0 +chr5 32031340 0 +chr5 32031341 0 +chr5 32031342 0 +chr5 32031343 1,G +chr5 32031652 G +chr5 32031653 T,G,C,A +chr5 32031654 0 +chr5 32031655 0 +chr5 32031656 0 +chr5 32031657 0 +chr5 32031658 1,G +chr5 32032156 C +chr5 32032157 T,G,C,A +chr5 32032158 0 +chr5 32032159 0 +chr5 32032160 0 +chr5 32032161 0 +chr5 32032162 1,C +chr5 32032578 T +chr5 32032579 T,G,C,A +chr5 32032580 0 +chr5 32032581 0 +chr5 32032582 0 +chr5 32032583 0 +chr5 32032584 0 +chr5 32032585 0 +chr5 32032586 T,1 +chr5 32032600 T +chr5 32032601 T,G,C,A +chr5 32032602 0 +chr5 32032603 0 +chr5 32032604 0 +chr5 32032605 0 +chr5 32032606 0 +chr5 32032607 0 +chr5 32032608 0 +chr5 32032609 T,1 +chr5 32032727 T +chr5 32032728 T,G,C,A +chr5 32032729 0 +chr5 32032730 0 +chr5 32032731 0 +chr5 32032732 0 +chr5 32032733 T,1 +chr5 32032867 A +chr5 32032868 T,G,C,A +chr5 32032869 0 +chr5 32032870 0 +chr5 32032871 0 +chr5 32032872 0 +chr5 32032873 0 +chr5 32032874 0 +chr5 32032875 1,A +chr5 32032991 C +chr5 32032992 T,G,C,A +chr5 32032993 0 +chr5 32032994 0 +chr5 32032995 0 +chr5 32032996 0 +chr5 32032997 1,C +chr5 32033177 G +chr5 32033178 T,G,C,A +chr5 32033179 0 +chr5 32033180 0 +chr5 32033181 0 +chr5 32033182 0 +chr5 32033183 0 +chr5 32033184 1,G +chr5 32033284 G +chr5 32033285 T,G,C,A +chr5 32033286 0 +chr5 32033287 0 +chr5 32033288 0 +chr5 32033289 0 +chr5 32033290 1,G +chr5 32033413 C +chr5 32033414 T,G,C,A +chr5 32033415 0 +chr5 32033416 0 +chr5 32033417 0 +chr5 32033418 0 +chr5 32033419 1,C +chr5 32033705 A +chr5 32033706 T,G,C,A +chr5 32033707 0 +chr5 32033708 0 +chr5 32033709 0 +chr5 32033710 0 +chr5 32033711 1,A +chr5 32033732 G +chr5 32033733 T,G,C,A +chr5 32033734 0 +chr5 32033735 0 +chr5 32033736 0 +chr5 32033737 0 +chr5 32033738 1,G +chr5 32033975 G +chr5 32033976 T,G,C,A +chr5 32033977 0 +chr5 32033978 0 +chr5 32033979 0 +chr5 32033980 0 +chr5 32033981 1,G +chr5 32034195 C +chr5 32034196 T,G,C,A +chr5 32034197 0 +chr5 32034198 0 +chr5 32034199 0 +chr5 32034200 0 +chr5 32034201 0 +chr5 32034202 0 +chr5 32034203 0 +chr5 32034204 1,C +chr5 32034314 C +chr5 32034315 T,G,C,A +chr5 32034316 0 +chr5 32034317 0 +chr5 32034318 0 +chr5 32034319 0 +chr5 32034320 1,C +chr5 32034488 C +chr5 32034489 T,G,C,A +chr5 32034490 0 +chr5 32034491 0 +chr5 32034492 0 +chr5 32034493 0 +chr5 32034494 1,C +chr5 32034502 C +chr5 32034503 T,G,C,A +chr5 32034504 0 +chr5 32034505 0 +chr5 32034506 0 +chr5 32034507 0 +chr5 32034508 1,C +chr5 32034629 G +chr5 32034630 T,G,C,A +chr5 32034631 0 +chr5 32034632 0 +chr5 32034633 0 +chr5 32034634 0 +chr5 32034635 0 +chr5 32034636 1,G +chr5 32034752 G +chr5 32034753 T,G,C,A +chr5 32034754 0 +chr5 32034755 0 +chr5 32034756 0 +chr5 32034757 0 +chr5 32034758 1,G +chr5 32035125 C +chr5 32035126 T,G,C,A +chr5 32035127 0 +chr5 32035128 0 +chr5 32035129 0 +chr5 32035130 0 +chr5 32035131 1,C +chr5 32035135 C +chr5 32035136 T,G,C,A +chr5 32035137 0 +chr5 32035138 0 +chr5 32035139 0 +chr5 32035140 0 +chr5 32035141 1,C +chr5 32036053 C +chr5 32036054 T,G,C,A +chr5 32036055 0 +chr5 32036056 0 +chr5 32036057 0 +chr5 32036058 0 +chr5 32036059 1,C +chr5 32036499 G +chr5 32036500 T,G,C,A +chr5 32036501 0 +chr5 32036502 0 +chr5 32036503 0 +chr5 32036504 0 +chr5 32036505 1,G +chr5 32037024 C +chr5 32037025 T,G,C,A +chr5 32037026 0 +chr5 32037027 0 +chr5 32037028 0 +chr5 32037029 0 +chr5 32037030 1,C +chr5 32037062 G +chr5 32037063 T,G,C,A +chr5 32037064 0 +chr5 32037065 0 +chr5 32037066 0 +chr5 32037067 0 +chr5 32037068 0 +chr5 32037069 0 +chr5 32037070 1,G +chr5 32037228 C +chr5 32037229 T,G,C,A +chr5 32037230 0 +chr5 32037231 0 +chr5 32037232 0 +chr5 32037233 0 +chr5 32037234 1,C +chr5 32037328 G +chr5 32037329 T,G,C,A +chr5 32037330 0 +chr5 32037331 0 +chr5 32037332 0 +chr5 32037333 0 +chr5 32037334 0 +chr5 32037335 1,G +chr5 32037505 T +chr5 32037506 T,G,C,A +chr5 32037507 0 +chr5 32037508 0 +chr5 32037509 0 +chr5 32037510 0 +chr5 32037511 T,1 +chr5 32037562 T +chr5 32037563 T,G,C,A +chr5 32037564 0 +chr5 32037565 0 +chr5 32037566 0 +chr5 32037567 0 +chr5 32037568 T,1 +chr5 32037597 T +chr5 32037598 T,G,C,A +chr5 32037599 0 +chr5 32037600 0 +chr5 32037601 0 +chr5 32037602 0 +chr5 32037603 0 +chr5 32037604 T,1 +chr5 32038114 G +chr5 32038115 T,G,C,A +chr5 32038116 0 +chr5 32038117 0 +chr5 32038118 0 +chr5 32038119 0 +chr5 32038120 1,G +chr5 32038176 A +chr5 32038177 T,G,C,A +chr5 32038178 0 +chr5 32038179 0 +chr5 32038180 0 +chr5 32038181 0 +chr5 32038182 1,A +chr5 32038226 T +chr5 32038227 T,G,C,A +chr5 32038228 0 +chr5 32038229 0 +chr5 32038230 0 +chr5 32038231 0 +chr5 32038232 0 +chr5 32038233 0 +chr5 32038234 T,1 +chr5 32038661 G +chr5 32038662 T,G,C,A +chr5 32038663 0 +chr5 32038664 0 +chr5 32038665 0 +chr5 32038666 0 +chr5 32038667 1,G +chr5 32038768 A +chr5 32038769 T,G,C,A +chr5 32038770 0 +chr5 32038771 0 +chr5 32038772 0 +chr5 32038773 0 +chr5 32038774 1,A +chr5 32038820 G +chr5 32038821 T,G,C,A +chr5 32038822 0 +chr5 32038823 0 +chr5 32038824 0 +chr5 32038825 0 +chr5 32038826 1,G +chr5 32038827 T +chr5 32038828 T,G,C,A +chr5 32038829 0 +chr5 32038830 0 +chr5 32038831 0 +chr5 32038832 0 +chr5 32038833 0 +chr5 32038834 T,1 +chr5 32038840 A +chr5 32038841 T,G,C,A +chr5 32038842 0 +chr5 32038843 0 +chr5 32038844 0 +chr5 32038845 0 +chr5 32038846 0 +chr5 32038847 T +chr5 32038848 T,G,C,A +chr5 32038849 0 +chr5 32038850 0 +chr5 32038851 0 +chr5 32038852 0 +chr5 32038853 0 +chr5 32038854 0 +chr5 32038855 0 +chr5 32038856 0 +chr5 32038857 T,1 +chr5 32038983 G +chr5 32038984 T,G,C,A +chr5 32038985 0 +chr5 32038986 0 +chr5 32038987 0 +chr5 32038988 0 +chr5 32038989 0 +chr5 32038990 0 +chr5 32038991 1,G +chr5 32039074 C +chr5 32039075 T,G,C,A +chr5 32039076 0 +chr5 32039077 0 +chr5 32039078 0 +chr5 32039079 0 +chr5 32039080 1,C +chr5 32039300 C +chr5 32039301 T,G,C,A +chr5 32039302 0 +chr5 32039303 0 +chr5 32039304 0 +chr5 32039305 0 +chr5 32039306 1,C +chr5 32039481 C +chr5 32039482 T,G,C,A +chr5 32039483 0 +chr5 32039484 0 +chr5 32039485 0 +chr5 32039486 0 +chr5 32039487 1,C +chr5 32039592 A +chr5 32039593 T,G,C,A +chr5 32039594 0 +chr5 32039595 0 +chr5 32039596 0 +chr5 32039597 0 +chr5 32039598 1,A +chr5 32040182 T +chr5 32040183 T,G,C,A +chr5 32040184 0 +chr5 32040185 0 +chr5 32040186 0 +chr5 32040187 0 +chr5 32040188 0 +chr5 32040189 0 +chr5 32040190 T,1 +chr5 32040657 C +chr5 32040658 T,G,C,A +chr5 32040659 0 +chr5 32040660 0 +chr5 32040661 0 +chr5 32040662 0 +chr5 32040663 1,C +chr5 32040777 G +chr5 32040778 T,G,C,A +chr5 32040779 0 +chr5 32040780 0 +chr5 32040781 0 +chr5 32040782 0 +chr5 32040783 0 +chr5 32040784 1,G +chr5 32040785 G +chr5 32040786 T,G,C,A +chr5 32040787 0 +chr5 32040788 0 +chr5 32040789 0 +chr5 32040790 0 +chr5 32040791 1,G +chr5 32041096 G +chr5 32041097 T,G,C,A +chr5 32041098 0 +chr5 32041099 0 +chr5 32041100 0 +chr5 32041101 0 +chr5 32041102 1,G +chr5 32041103 G +chr5 32041104 T,G,C,A +chr5 32041105 0 +chr5 32041106 0 +chr5 32041107 0 +chr5 32041108 0 +chr5 32041109 1,G +chr5 32041203 C +chr5 32041204 T,G,C,A +chr5 32041205 0 +chr5 32041206 0 +chr5 32041207 0 +chr5 32041208 0 +chr5 32041209 0 +chr5 32041210 1,C +chr5 32041355 C +chr5 32041356 T,G,C,A +chr5 32041357 0 +chr5 32041358 0 +chr5 32041359 0 +chr5 32041360 0 +chr5 32041361 0 +chr5 32041362 1,C +chr5 32042175 G +chr5 32042176 T,G,C,A +chr5 32042177 0 +chr5 32042178 0 +chr5 32042179 0 +chr5 32042180 0 +chr5 32042181 1,G +chr5 32042220 C +chr5 32042221 T,G,C,A +chr5 32042222 0 +chr5 32042223 0 +chr5 32042224 0 +chr5 32042225 0 +chr5 32042226 1,C +chr5 32042375 G +chr5 32042376 T,G,C,A +chr5 32042377 0 +chr5 32042378 0 +chr5 32042379 0 +chr5 32042380 0 +chr5 32042381 1,G +chr5 32042464 C +chr5 32042465 T,G,C,A +chr5 32042466 0 +chr5 32042467 0 +chr5 32042468 0 +chr5 32042469 0 +chr5 32042470 1,C +chr5 32042667 C +chr5 32042668 T,G,C,A +chr5 32042669 0 +chr5 32042670 0 +chr5 32042671 0 +chr5 32042672 0 +chr5 32042673 1,C +chr5 32042846 G +chr5 32042847 T,G,C,A +chr5 32042848 0 +chr5 32042849 0 +chr5 32042850 0 +chr5 32042851 0 +chr5 32042852 0 +chr5 32042853 0 +chr5 32042854 1,G +chr5 32043005 G +chr5 32043006 T,G,C,A +chr5 32043007 0 +chr5 32043008 0 +chr5 32043009 0 +chr5 32043010 0 +chr5 32043011 1,G +chr5 32043036 G +chr5 32043037 T,G,C,A +chr5 32043038 0 +chr5 32043039 0 +chr5 32043040 0 +chr5 32043041 0 +chr5 32043042 0 +chr5 32043043 1,G +chr5 32043270 G +chr5 32043271 T,G,C,A +chr5 32043272 0 +chr5 32043273 0 +chr5 32043274 0 +chr5 32043275 0 +chr5 32043276 1,G +chr5 32043349 C +chr5 32043350 T,G,C,A +chr5 32043351 0 +chr5 32043352 0 +chr5 32043353 0 +chr5 32043354 0 +chr5 32043355 1,C +chr5 32043379 C +chr5 32043380 T,G,C,A +chr5 32043381 0 +chr5 32043382 0 +chr5 32043383 0 +chr5 32043384 0 +chr5 32043385 0 +chr5 32043386 1,C +chr5 32043435 C +chr5 32043436 T,G,C,A +chr5 32043437 0 +chr5 32043438 0 +chr5 32043439 0 +chr5 32043440 0 +chr5 32043441 1,C +chr5 32043481 G +chr5 32043482 T,G,C,A +chr5 32043483 0 +chr5 32043484 0 +chr5 32043485 0 +chr5 32043486 0 +chr5 32043487 0 +chr5 32043488 0 +chr5 32043489 1,G +chr5 32043558 G +chr5 32043559 T,G,C,A +chr5 32043560 0 +chr5 32043561 0 +chr5 32043562 0 +chr5 32043563 0 +chr5 32043564 1,G +chr5 32043959 G +chr5 32043960 T,G,C,A +chr5 32043961 0 +chr5 32043962 0 +chr5 32043963 0 +chr5 32043964 0 +chr5 32043965 0 +chr5 32043966 1,G +chr5 32044056 G +chr5 32044057 T,G,C,A +chr5 32044058 0 +chr5 32044059 0 +chr5 32044060 0 +chr5 32044061 0 +chr5 32044062 1,G +chr5 32044114 G +chr5 32044115 T,G,C,A +chr5 32044116 0 +chr5 32044117 0 +chr5 32044118 0 +chr5 32044119 0 +chr5 32044120 1,G +chr5 32044189 C +chr5 32044190 T,G,C,A +chr5 32044191 0 +chr5 32044192 0 +chr5 32044193 0 +chr5 32044194 0 +chr5 32044195 0 +chr5 32044196 0 +chr5 32044197 1,C +chr5 32044296 C +chr5 32044297 T,G,C,A +chr5 32044298 0 +chr5 32044299 0 +chr5 32044300 0 +chr5 32044301 0 +chr5 32044302 1,C +chr5 32044323 C +chr5 32044324 T,G,C,A +chr5 32044325 0 +chr5 32044326 0 +chr5 32044327 0 +chr5 32044328 0 +chr5 32044329 0 +chr5 32044330 0 +chr5 32044331 1,C +chr5 32044344 C +chr5 32044345 T,G,C,A +chr5 32044346 0 +chr5 32044347 0 +chr5 32044348 0 +chr5 32044349 0 +chr5 32044350 1,C +chr5 32044602 C +chr5 32044603 T,G,C,A +chr5 32044604 0 +chr5 32044605 0 +chr5 32044606 0 +chr5 32044607 0 +chr5 32044608 1,C +chr5 32044731 G +chr5 32044732 T,G,C,A +chr5 32044733 0 +chr5 32044734 0 +chr5 32044735 0 +chr5 32044736 0 +chr5 32044737 1,G +chr5 32044793 C +chr5 32044794 T,G,C,A +chr5 32044795 0 +chr5 32044796 0 +chr5 32044797 0 +chr5 32044798 0 +chr5 32044799 0 +chr5 32044800 1,C +chr5 32044805 C +chr5 32044806 T,G,C,A +chr5 32044807 0 +chr5 32044808 0 +chr5 32044809 0 +chr5 32044810 0 +chr5 32044811 1,C +chr5 32044845 G +chr5 32044846 T,G,C,A +chr5 32044847 0 +chr5 32044848 0 +chr5 32044849 0 +chr5 32044850 0 +chr5 32044851 0 +chr5 32044852 1,G +chr5 32044950 C +chr5 32044951 T,G,C,A +chr5 32044952 0 +chr5 32044953 0 +chr5 32044954 0 +chr5 32044955 0 +chr5 32044956 1,C +chr5 32045075 G +chr5 32045076 T,G,C,A +chr5 32045077 0 +chr5 32045078 0 +chr5 32045079 0 +chr5 32045080 0 +chr5 32045081 1,G +chr5 32045382 T +chr5 32045383 T,G,C,A +chr5 32045384 0 +chr5 32045385 0 +chr5 32045386 0 +chr5 32045387 0 +chr5 32045388 T,1 +chr5 32045512 C +chr5 32045513 T,G,C,A +chr5 32045514 0 +chr5 32045515 0 +chr5 32045516 0 +chr5 32045517 0 +chr5 32045518 1,C diff --git a/paraphase/data/rccx/rccx_ref.fa b/paraphase/data/rccx/rccx_ref.fa new file mode 100644 index 0000000..86afb50 --- /dev/null +++ b/paraphase/data/rccx/rccx_ref.fa @@ -0,0 +1,598 @@ +>chr6_32013240_32049000 +GCAGTCATCTTTGTCTTCAGACCAAATGCACAAGTACTTTGTTAAAGGTATCCCATCTGC +AGCTCAAGCCTGCAGCCCCTCACCTTTTGGTGGCTCCTCAGGCCTCTAGGCCTTATTCAC +CTTTCCCCTTTCCTGTGCCACTTCTCCTCTAGGGCGCCAGGCTGTCCTTGGCATGGTCCG +GAAGGCAAAGTACCGGGAGCTGCTCCTATCAGAGCTCCTGGGCCGGCGGGTGCCTGTCGT +GGTGCGGCTTGGCCTCACCTACCATGTGCACGACCTCATTGGGGCCCAGCTAGTGGACTG +GTGAGTCTTTCCCTGGCCTCTGGCAGATTATGGAGCAATGACCCAAAGTGGGATTTCCTC +CCAGCTCATGCTTAGTTTCCTAGTGAAGGCCAGTGGCTCTCATTCTTCTCTGGAACCCGG +GAGCACCCCTTCCCAAGTTCTAAGTTCTCCTCACAGCTTGAGCCTAGGCGTCTGGCTCCA +GCCTTGTCTTTCTCCTGCACAGCATCTCTACCACTTCAGGAACCCTCCTCCGCCTGCCAG +AGACATGAAGATTCTGCTCATCATTGCTCAGCTCCTCAGAGTGGGCCGGGAGGGGACTAG +AAGAGCTGCATGATGGTGGCTGAGACAGGGTCACCTTGGGAAGGCTTGGGAGCCAGGATG +AGTGTCGGGCTCTCGTGTGTGCAAAAGGTCAGATGTGACTGCTGCTGTTTGCCTGGTTTC +TGACCCAGTGGTGGGGTTTGAGCAATGCTTCTCTGCCCTTCCATGGAAAGTGGAACCAGA +AATGGTGCCAAGGCTGTGGCTGTTCCCTTTCGTGTAAAATGGTGCTGTTATTACTCTGTC +TTGAAATAGGAAGGTGGGATTTCTGGGGAGGCTGGTGAAGGAGGGCAGGGTTCTTTTCTC +TACGTGTCATGTTAAAATTGCCAAATAAAGTACCTCTGCCTGTGATATTTTCTGGATGTC +CTTTATTTACTGTGACGTGTGTTTGGGTGCCTTGTTTAGGGGTAGAGGTGAAGTCTGAGC +TTTGCCTCATTCAGAGAGGAAAGGGGTCAGGGGTTCACTCTGACGTTCAGGCCATTCTCC +CTGTGGAGTGGTGAGGGTGTACCTAATCTCCTAAACCACGGAATTTCTGTTAGGGCCTAA +AAAAGCAAAAGCCTAGTATAGTTCAATTTGTGTTGGAATGAAAGTAAGAGACAAGTGTCT +TAGAAGCCTGTCATTGTTTTGTGAGGGCCTTTAAATATCCTGTACTCGTGGGCCATGTTG +GGCCCTTGTACGCCCAGGTATACATGAGCTTGTGTGCACCTATACCCTGATACAGATATA +CCTGGTAGGGGGAGGTGCTCAGGCACTGGAATGAGAGGAGTTAACGGGGAAGGACAGGGT +TATTTCTGGGCCAAGATTCAGAGTTTCCCATGGACACCCAGGTGTCCGGGGTGCCCCCAC +AACTCTGGGCCTGAGGCCAGTTGCACTTCTTGGCTGTCACGTGGTTTCCCAGCTTAGCTG +GGCTGGGGGAGGAGCAAGGTCCAGAGTCAACTCTGCCCCGAGGCCTAGCTTGGCCAGAAG +GTAGCAGACAGACAGACGGATCTAACCTCTCTTGGATCCTCCAGCCATGAGGCTGCTCTG +GGGGCTGATCTGGGCATCCAGCTTCTTCACCTTATCTCTGCAGAAGCCCAGGTCCTGGAG +GCGGGATGCTGGGTGCTTGGATTGGGGCAGGGCTGGCATCGGGACCCGATTCAGGAGTGA +GGGAGAGCAGGGGTGGAGGTGTCAGAGCGAAGTCTGACTGCTGATCCTGTCTGTTCTCCC +CAGGTTGCTCTTGTTCTCTCCTTCTGTGGTTCATCTGGGGGTCCCCCTATCGGTGGGGGT +GCAGCTCCAGGATGTGCCCCGAGGACAGGTAGTGAAAGGATCAGTGTTCCTGAGAAACCC +ATCTCGTAATAATGTCCCCTGCTCCCCAAAGGTGGACTTCACCCTTAGCTCAGAAAGAGA +CTTCGCACTCCTCAGTCTCCAGGTAACCAGACCCCATGCCCTCCTGCTGCTTGTGGGGGC +CTCCTGCCCTGTTCCCATCTGTCTTGTAAGTGTCATCATCTTCCCACTGGCCTCCTCCCC +TCCTGTCTTCCCACCCTGGCATTCTCCTTCCACGTTTCTCCCTTGGTCTCTGTCCTTTTT +GGTCAGCTGTCTCTTGCTCTGTGACCCGCTCCCTCTCCCTCTCCCTCTCCTGACAGGTGC +CCTTGAAAGATGCGAAGAGCTGTGGCCTCCATCAACTCCTCAGAGGCCCTGAGGTCCAGC +TGGTGGCCCATTCGCCATGGCTAAAGGACTCTCTGTCCAGAACGACAAACATCCAGGGTA +TCAACCTGCTCTTCTCCTCTCGCCGGGGGCACCTCTTTTTGCAGACGGACCAGCCCATTT +ACAACCCTGGCCAGCGGGGTGAGTCTCAGCCCCAGGGCCTCAACCTTTAACCCCCTCCGA +GCCCTCTCAGGATGAGTTTGGTGCCCCCTAAGTGAGATAACCTGAAAGAAAGTGCCACAC +AGAAGGGGTGCTTAGGAAACATTTGTCCCCTGCTCCCTCTGTGGAGTTTGACCCACCCTC +CCCTTGCACATGGACCCCTGCTCACCTCTCTCCTCCTCCACTCCCAGTTCGGTACCGGGT +CTTTGCTCTGGATCAGAAGATGCGCCCGAGCACTGACACCATCACAGTCATGGTGGAGGT +GAGTCCCCGACCTCTGGCCTTCCTGATCCTGGCCACTGATGTGACCTCCTGCCTGTGAGC +ACTTCTCCCCTTGCAGAACTCTCACGGCCTCCGCGTGCGGAAGAAGGAGGTGTACATGCC +CTCGTCCATCTTCCAGGATGACTTTGTGATCCCAGACATCTCAGAGTGAGCGCTCCCAAT +GTGGGGGCTGCCCCCAAGCTACACCACCCCAATTCCTGTTAGGCTCTCCACCTCCCACAC +AGAGGCACGTCCCCAGATGCCCTGACCCTCAGCCTCCTGAGCCTCTGGTTAACCCCCACA +GTCCTCTTCCCAGGGAAGCAGGCTGCTGGCTCTCCGTGCCCCACTGTACAGATGGGCTGA +GCCCCTTCCTTGTCCATTCTCAGGCCAGGGACCTGGAAGATCTCAGCCCGATTCTCAGAT +GGCCTGGAATCCAACAGCAGCACCCAGTTTGAGGTGAAGAAATATGGTGAGAGCTGGAAA +CTGGAGGGACAGGCAGCTGCTTTCCTGAAGGAAATAAGGGTGGAAGGAGAGGTACTGGGA +GCAGCTCAGGGCAGGGAGATATGGGTGCCACAGCCCTGAGCAGAGGGGAGTCTTTGAGCT +GGAGTCTGACCTGCCTATCCCTTCACCCTGGGTCAGTCCTTCCCAACTTTGAGGTGAAGA +TCACCCCTGGAAAGCCCTACATCCTGACGGTGCCAGGCCATCTTGATGAAATGCAGTTAG +ACATCCAGGCCAGGTAATACCTCCCTCCCCACCTCTGCCCACCAGCACCGGGTCCTGCTC +CCTACTCAGTATGAATGGGCTCCTGCTTCCCTGCCCTCGGGCCATTATTCCCCCCAGCCC +TTGGCCCACCCTCTTCTCTCTGCCACGACAGGTACATCTATGGGAAGCCAGTGCAGGGGG +TGGCATATGTGCGCTTTGGGCTCCTAGATGAGGATGGTAAGAAGACTTTCTTTCGGGGGC +TGGAGAGTCAGACCAAGGTAGGAAGGAGAATAGGGGCTGGGGAGGGGAAGGGGCAAGGGA +GGTGAGGTGGGAGACTCAGTCTCACCCTATGTCCTGTTTCTTTCTATGCCCCAGCTGGTG +AATGGACAGAGCCACATTTCCCTCTCAAAGGCAGAGTTCCAGGACGCCCTGGAGAAGCTG +AATATGGGCATTACTGACCTCCAGGGGCTGCGCCTCTACGTTGCTGCAGCCATCATTGAG +TCTCCAGGTGGGTGACTTTCCCTTATTGTAACCCCAGACCCTTGCCTCTGACCTCTGAGC +TAACCCTCTGTCCTCCGGCACCAACACCACCCCACTTCTCACATCTCATCTCAGACTCAA +AACCAGGAAACACCCAGGAGACCTGGTTTCTCTCCAACTCTGTCTCTGTGACTCGGCCCT +TTTCCCTGGCTGAGTTTATTTATTTCTTTGCTCGTTCTGCTCATTCCTTCACTCCTCCAG +TGGACATGTGTTGTTCAATGCCCCGTGCTAGGCCTCAGCATGCACAGACATGTTGGGGAC +CAGCCTCAACGCCACCCGTAGGGTTCCTGAAGTCCATTGGTGACACAGGAATGAGAAGAG +ACAGGTTAAGAGTTCATAAAGAGTGGGGGCCAGGGGGCCAATTGCAAAATGGAGGCTGCA +AAAGGCTCAGAGCTCTGGTCTCCACACTATTTTTTGAGTACAGTCACTCAGATCTAAGAA +GCAGATGTTCAGGGAGAAACAGTGAAAGGGAGGCAGTGGGTCATAGGCGTAATCTATAGC +AATAGAGTTTTAAATGAATCTCCTTTGTGCTCAAACAGCATGTCTTTAAATTATCGGAGA +GTAGCTGGTGGAAGTGGGCTTAGCTAGAAGACTGCATGTCTGTCCAATGCTTCAAAGGAG +GGTCTTTCTCCTTGAACAGAGTGTTTACAGATAAGACAGGGGGTCTCACTCTGAGCATGG +GAACATGATGGCAATTAGGAGGCTTTTCTTCTCAGAGGCCTCTTGTGGCTTTCCACAACT +TATTGTCTCATATTTTTATGGACAGTTTATACAGGCACCCCACAAGTCCTTTTCCCAACA +TGCCCCCCTCCCTTTTTTTTTTTTTAACCGCTATTGCTATTATGGCTTATTTGTGGTGTT +TGGTCTGTTTTCAGAAGTGTCTTTTGCATCTGTAGACTAAAAGTAAACAGCATAAACAGA +TACACATTAAAGTAAAATTTGTAATAGTTGATCCTTTAATGGTCTTAATCTGTTTAAGAG +GATTTATGTTTGAAAGTCCGTCAGTAGCTCCAATGAGAATGTCAGTCTCAGGCAGGAGGG +TTAAATGAGCCTGAGATGCTTTAAAAACCTGTTTTTTTAAAATTTGGTTATATTTAATGT +TAAATTTTTATTTTTTTCTTTTAGATGATGTCTAACTTTTTAAAAATGATGTTTAGTAGT +ATTATACGAATGGGGAGTTATGTAGAAATTGGAAGTATTTCAATTACATTGTACTTCTAA +TTGATGTTTTAAGTTTATTGTACGATCTTCCATTTAAATAACAGTCTGTCTAAGATCATT +TGTTTGATTTGTCAATTGTTGGTCTATTTGGGTCTGAGAATTCCACAATTTTGAGGAATT +TTTTGTTAACTATTTATATATTTTGTAGTTTGAACAGAGGAGTGTAAAGCAATTCCAGCA +GCCGCAGCAGTAGCTGTGACTGCAATAAGGCCCATAAGACTGTTATAAGGGTAAAAATAA +ATCTCTTTGTTTTGGTAAACACTTTTTTTTAAAACATTTTTGTGACAATATGAATGGAAG +GAGAGGCTTTCTAAGGTCTATTGAGGGAAACCAGTATCCAAACTCCTTTCTTAGTTTTTA +TCAGTAACACAGATGTTTTTACACCGAACGTGGAATTAATACAGGTGAAAAGGTGACAGT +TTTGACAAGTAATAGTTTGAGAATTAGGTCGAATGTCAATATTTTTGACCATTAACATAA +AAGGAGGGTTGACACAACTCTGAATGGGCACTGTTTTGTTGGAAGAAAACTGATACGCAA +ATTGAAGTTTTTAACCTTTTTTTTTTAAAGATAATATATTTTTTTCTAAACTTAAATATG +AGATTGGGCCATTATTAACTTTCATAATTTGGAGTGTTTAGGGCCTATTATTGGATTAAT +TATTTTGGGATGTGGGCCAGCTGTACTAAAATTGGTCCAAATTATGGGAAAATGAGCACG +TTTTTCAGTGTAAGTAGTGTTACCTTTTTGATAGTATAGTTTCTGTTTTAGTTTTGTCTT +GTATTTATTATTTTGATGGGTACAATTAACTGTAAAGGTCCCCTCAGGGGACCAATTAAT +GACAATTTCATAGGAATTATTTTGTAGTACCATAGTGTGATCAGAGATGTAATTTTTTTT +AATTAATATTTTTAAATTATTTGACCATTGTTAAGGTTGTTGGCACCTCTTTTTTGGGGG +CTTAAACTGTTAATTGAATTGAACTCTGTGAATGATCCGGGCTCCATCCAGAAAATAAAT +GATAGGATACTGGTCTTTGATTATGACCTGGAATTTTAACTAGTCAATGTTGTCGGTAGC +CTTTTAGGCAACCGATAGTTGGCCTTATGTAAAGAGGGGGGAACTGATAACCTATGGACA +CATTTATTAACTTTTTTTTTTTTCCTTTGGGTGAGAGGGCCCATGAGTATTTGTAGGCTT +AGGGATCCAAACGCTATTATTAACATAAACTTCAACTGGGGGTTTTAACCATGTGACAGG +CCTAATTAAAGGCAGGAATGGGACACATGCCCAATAGGTATAATTTTGGGCTGTTGTAGC +CACAGGTTTGTTAGGCGAGGAGGTCACTGTTTTTATTTTGGCTTTGTATTCTAGGATTAG +TAAATAACAGAAGACAAACATGAGTATAATTAGTAACTTTTTTTTTTAGTAAAAGAGTGA +CCTGTAGTGTTACTTGGCATCTTAGTTTACTATATGTTATTAATGAGGAACCCCACTGGG +GGTATGTTAATTTATTCTAGCTAAGCAGTTATGTTATTAGAAGCTGAGAAGGGGGTGTTT +GTTAAAGTAACAGGGCAGAAGAAAGGCGGATTTAAGATACGAGCTTAATACAGTGTAGCA +GGTATAGGTAGTAGGCAAAGTGAGAGAATTAAAAATGAATAAATTATTTGGCTTAGACTT +TTGTTTTTTTAGTATAATGTCTGAGGCCTGTGTTGTTTGTGGAAGTCGCATTGTTGAGGC +TGTAGTTCCTGTAGGGTCTTTTTTAGGCTGGTTCAAATGTTTTTTTATTTTTTAATTTTT +TATCCTTTGATGAGGATGTAGTCTTTAGGCTGGTACTGGAAATTTTAGGAGTGGCGTCTG +TGTTAAGAGACTTTTTACAATTTTTAAAGAGCAGGTTAGTGTTTTAAGAAAAACTTGTGT +TTTATTTTAATGTTTAGTTTATAGAAAACTGGATGATATCTTTTTAACTTTAGTAAATAC +GTTTACACACGGAATTTTTTACAATTATCATTTTAAAACTTGTTTAGATCTTTAAAACAA +AATTAAACAACCTTTTTTGTATAAATTTTTTATAACTTTTTTTATGACTTTTACAGACAA +TTTTTAACATGTCTTAACTTTTTATGTTTTATAATTTTTTTACTAAAGGTACATTTTTAT +AACTTTTTAAATTTTTTTACTTTTTTGTATTTTTTTGATTTTTGTCTTAGTCTTTTTTTT +ACTTTTATTTTTTTAAATGTGTAATAATTAGATGAGTGTTGGTAACAATGGATGTATGTA +CATATTTTAGTTTTTAAAATTTAGGGATGTGTTTAACATCTGTTTGCCAGAACTGACTAG +GTTCCAATTCTTTACGGTTAACACCTATTGAAGGAGGGTATGTGCCTGTGAGCTGGTAAT +CTGGGCATTGTGGGATAATTTGTTTAGCCAGCCTCTGTGTAAGTTGAAATTATTTAGATA +AGTTTCTCCAATTTTGGTGGAATAATCGATGTGATTGGGTGGCTTGGTCAAGCAGTGATG +TCATAACCTGAAGGTCTGCTTGATTATTGCCGTAAGCCAATGGGCCAGGCAGAGAGCTGT +GGGCTCGAATGTGTGTAATAAAAGTAGGATGTGTACCTTGGTCTAGTAATTGTTGAAGTT +GAAGAAAAAGACCACACAGAGTGGGCTCCAGAGCAAACTTAAGGCTGTAATAGTTTTTAA +ATAAATACACAGAATAACCTTAGCTCTCTGAATGTTAGTAAATTCAGATCAAGTGATTGG +ATTATGTGGTCTCCACCAGACTGTTGCTTTTTCATGTTTACCAGACCCACCAGTAAAAAC +AGCTATGGCTCCTTCCAAAGGGGCATCACAAGTAATTTTTGGAAGAACCTATGTAGTTAA +TTTTAAGAATTGAAAAGTTTTTAGGATAATGATTATTAATACATCCAACAAATTTTGTTA +AATTAATCTGTCATGTAACTGAGTTAATAAATGCCTGTTTAACCTGATTTTTATTTATTG +GAACTATAATTTTTATTGGGCTCAGTGCCACAAAATTTAATAATTCATATATGAGCCTGT +CCAATTAGAATTGCCATCTGATTTAAGTATACTGTAAGTGCTTTTATGGTATTATGTGGC +AAAAAGGACCATTTAACTAAATCATCATTTTGAACAATAACCCCCATTATTGTGTGGTTA +GTGTGAAGTAGGGAACACAATGAATTATAAAGGCAAGTCTGAGTCAATCCTACTGACCTG +GGCTTGCTGAATTTTGTTTTCAATTACTGATAACTCTTTCATGGCCTCGGGTGTTAGTTC +TCTGTTACTGCGTAAGTTGGTATTTCCCCTCAATATTGAGAAGAGATTAGACATAGCATA +AGTAGGAATTGCTAAATTGGGCCAAATCCAATTAATATCTTCTAACAATTTTTGAAAATT +ATTTAAGGTTTTGAAAGAATCTCTTCTAATTTGAACCTTTTGAGGCTTAATGGCTCTATC +CTGTACTTGTATTTTCAAATACTGAAAAGGAGTGGTTGTTTGAATTTTGTCAGGTGCTAT +AAGTAATTCAGCATTTGTAATTGTCTTTTGCAAAGATTAATAATATTGAATAAGTTGGTC +TCTACTTTTTGCTGCACAAATCTGGAAACTGATCTCTAACAGGCTGGATAGTTCTGCCTA +CAAAAGTTTGACAAACTGTGGGACTATTTAACATACCCTGGGGCAAAACTTTCCAATGAT +ATTTGGCTGCAGGTTTTTTGTTATTAACGGCAGGAATGGTAAAGGCAAATTTTTTGAAAT +CTGCCTCTGCTAAAGGAATTGTAAAAAAGCAGTCTTTTAAATCTATAATAACAAGCGGTC +AGTCTTTAGGGAGCACAGTGGGGGATGGGAGCCCAGGTTGTAAGGCTCCCATCGGTTGAA +TTACAGCGTTGACGCCATCTACCGGACTTTTTCTTAATTACAAATACTGGGGAATTCCAA +GGAGAGAAAGTGGGTGAAATATATCCTTTTTTTAGTAGTTTATTTTATAAAGCACCCCCA +ACTTTTCCTTAGGGAGCGGCCACTGTTCAACCCAGACGGGGCGCCGGGTCATCCATTTTA +AGGGAAATTGCTCCTTCACTGTAATAACTGTAGGGTGAACCTGAATTGCCCCATCTCCAT +AATGAACTGTGGGTCGGGCAATAATGGGCACGGTGAGCCAAGTCTCGGGCTCCCTCCCCC +TGCACCCACTCGGCTGAGGAGGAGGTGGCCATTCTGGACATTTCTCTACAGGAACCGTGG +GCTGAACAATTTTTTGAGTAGGTTTAGGGAGACTGGGGAGATTGGCATAAATCATCTTCA +GACTCTCCTTTTTGTTAGTACTCGGTAGAGGTGGTTCAGAGTTCTGATTATCAAACTCCT +CTCTCTCCTCCTCTGACTCAGCCTCATTATCTGTCTGAAAAGGCTCCAGTGCTGCATGCA +CCAATGACCAAAGCGACCAAACAGGCAAAGGAATTTCCTTTCCTTCTCTATATGCTCTTT +TAAGGTCCTTTCCAACTCCTTCTTAATGTTTTAATTTCAAAGTTTCCTGTTTTGGGAACC +AAGGGCAAAATTGTTCCATAGCATGAAACAAATCCATAAGATTTTCCGTATCAACTTTTA +CCCCACCATGCATGCTTGAAGAGCTGCCGTAGGAAGCTCAAATACGTGGTGTACTTACTT +TCAGTTTTTCCCATTGTGTCCCTAGCTTTCTCTGGGCGCCCCGCTTACCTGTAGAGGTTA +AAACTTTTATGTCCTTGGGAGTCCTTTGTTCGTTGGTCCTCTGTTTCACATGCTTGAGCG +TTTCCTCACCAGATTCTTTTGGGCCCCACGTTGGGCGCCAGAATGTTGGGGACCAGCCTC +AACACCACCTGTAGGGTACCTGAAGTCTGGTGGTGACAAAGGAATGAGAAGAGACAGGTT +AAGAGTTCATAAAGAGTGGAGGCCAGGGGGCCAATTGCAAAATGGAGGCTGCAAAAGGCT +CAGAGCTCTGGTCTCCACACTATTTATTGAGTACAATAACTTAGATCTAAGAAGCAGATG +TTCAGGGCAAAACAGTGAAAGGGTAGCAGTGCGTCACAGGCATAATCTACAGCAGAAGCG +CTTTAAATGAATCTCCTTTGTGCTCAAACAGCATATCTTTAACTTATCGGAGAGTAGCTA +GTGGGAGTGGGCTTAACTAGGAGCCTGCACGTCTGTCCACATTCCAATGCTTCAAAGGAG +GGTCTTTCTCCTTGAATACAGTGTTTACAGATAAGAGAGAGCAGGTCTCGCTCTGAGCAT +GGCAATTAGGAGGCTTTTCTCCTCAGAGGCCTCTTGTGGCTTTCCACAACTTATTGTCCC +ATATTTTTATGGCCAGTTTATACAGGCACCCCACAAGTCCTTTTCCCAACACAGACAGGA +ATACGGCAGCCTGTGCCCTGGGAGCTCACTGTCTTGTGGGAGGGAACCACTCAAGCCACT +CCCCACTTGTCCTCCTGTCCCTCTCTTCTTGGGCTCTGTCCCCCACCTCTCTCTGTCCTT +TGTCTTGCAGGTGGGGAGATGGAGGAGGCAGAGCTCACATCCTGGTATTTTGTGTCATCT +CCCTTCTCCTTGGATCTTAGCAAGACCAAGCGACACCTTGTGCCTGGGGCCCCCTTCCTG +CTGCAGGTTTCTTCCAGAGGGGAAGGATGAGTAGGGAGGATGTGGTAGTTAGGAGGGCTC +AGGGTCTGACCACTCTCTTTTGCCTGCCCTCCTTTACCTGCCTAGGCCTTGGTCCGTGAG +ATGTCAGGCTCCCCAGCTTCTGGCATTCCTGTCAAAGTTTCTGCCACGGTGTCTTCTCCT +GGGTCTGTTCCTGAAGTCCAGGACATTCAGCAAAACACAGACGGGAGCGGCCAAGTCAGC +ATTCCAATAATTATCCCTCAGACCATCTCAGAGCTGCAGCTCTCAGTAGGACTCCTCGGA +CCCCTGGGAGATGGTGGGGGAAGGGGAGGAGGGTGAGCTGGGGTCCCAAGGATCCATGGC +CTGACTTGGGGGGAAGGTGGGGTACTTGGCTCTGAGCTACTACCCTATTCGCACCTGACC +CCCTCTCCAGGTATCTGCAGGCTCCCCACATCCAGCGATAGCCAGGCTCACTGTGGCAGC +CCCACCTTCAGGAGGCCCCGGGTTTCTGTCTATTGAGCGGCCGGATTCTCGACCTCCTCG +TGTTGGGGACACTCTGAACCTGAACTTGCGAGCCGTGGGCAGTGGGGCCACCTTTTCTCA +TTACTACTACATGGTGTGCATGAGCTGGGGAGTCACGGAGGGCTGGGGTGCAGGGAAGAG +CCCTCTGGGTGGGGCTGGGGGGGTTCAAGGCTGAGGCTGTCCCATGAAGAGGCAACCACT +CTTGTCCCTCCCATTCTTGGCCCAGATCCTATCCCGAGGGCAGATCGTGTTCATGAATCG +AGAGCCCAAGAGGACCCTGACCTCGGTCTCGGTGTTTGTGGACCATCACCTGGCACCCTC +CTTCTACTTTGTGGCCTTCTACTACCATGGAGACCACCCAGTGGCCAACTCCCTGCGAGT +GGATGTCCAGGCTGGGGCCTGCGAGGGCAAGGTGACCGGGGTCAGGAGAGATGGCACTTG +TGCCGAGGGGGTTGAGGACAGGGTGATTGCCAACAGGGCATGGATTTAGCTTGGGGGCAG +TGAGGATACCGGGACTGAAGGAAGCTCTCCCACTCTGACCGCCCCCACCTGCCGCCCCTG +CCAGCTGGAGCTCAGCGTGGACGGTGCCAAGCAGTACCGGAACGGGGAGTCCGTGAAGCT +CCACTTAGAAACCGACTCCCTAGCCCTGGTGGCGCTGGGAGCCTTGGACACAGCTCTGTA +TGCTGCAGGCAGCAAGTCCCACAAGCCCCTCAACATGGGCAAGGTTTGTCCAGACCCTCT +CCACAGCTCTCTCACCCCTCCATGGCTCATCCCCCTGCTTCCCTGAGCCTTGGGCGCAGC +CCCTGGATCCCACTGAGGCTCCCCACAGTCTCTTCCCCACTTGGCCCTGTGGTCTCCATC +TCCTGGCTCTGTATCCTTTCCTATCCCCCCATGTGCTGCCCTCTCACCTGTGCCGAGTGC +TCAGTCCTGCCCCTCAGCCACACTTGGCTCCTAGCATTCCTGCCTTTCTTGCAGGTCTTT +GAAGCTATGAACAGCTATGACCTCGGCTGTGGTCCTGGGGGTGGGGACAGTGCCCTTCAG +GTGTTCCAGGCAGCGGGCCTGGCCTTTTCTGATGGAGACCAGTGGACCTTATCCAGAAAG +AGTGAGAACAGAGAAGGAAGGGGAGTGGGTGGCGGGAAGATAAGGAAGGAGGAAGGGCCT +GAGGGGACCAGCTGGAAGAGTCCGGGCAGGAAGGGCTGGGCAGGGGAAGGGGAGGAGGGG +AGGAGGCCGAGTGCCTGACGGCTGGACTGCAGCCTTTCTCTCTACCAGGACTAAGCTGTC +CCAAGGAGAAGACAACCCGGAAAAAGAGAAACGTGAACTTCCAAAAGGCGATTAATGAGA +AATGTGAGTTGCGGGTGCCTAGGCAGTAGCTTGGGCTCTCCACCTGGGATCCGGGTTGGG +GGTCTGCCTCTCTGCCCCTCGGCTCCTTGCTGAACCCACGTGTGGTATTTGGGGCCAGAG +ATCCGAATTCCGGGATTACGAGTGGAAGGTGGGCAGCTCTCTCCAGCAGCCTCTCTTATG +TTGCTGGTCTCAAGGGGTCGGGGCGGGGGCTGAGGTGTATGTCCTTTTTGTCCTCTCATG +CTCACCCCCACCTGGCCCTGCAGTGGGTCAGTATGCTTCCCCGACAGCCAAGCGCTGCTG +CCAGGATGGGGTGACACGTCTGCCCATGATGCGTTCCTGCGAGCAGCGGGCAGCCCGCGT +GCAGCAGCCGGACTGCCGGGAGCCCTTCCTGTCCTGCTGCCAATTTGCTGAGAGTCTGCG +CAAGAAGAGCAGGGACAAGGGCCAGGCGGGCCTCCAACGAGGTGAGGGGCTGGGTGGGGC +TAGGGCACAGGTGGCGGCGCTTGGAAAGGCAGAACGGTCCCCTCCTCACTCCCGTCCACC +GTGGTCCCCCAGCCCTGGAGATCCTGCAGGAGGAGGACCTGATTGATGAGGATGACATTC +CCGTGCGCAGCTTCTTCCCAGAGAACTGGCTCTGGAGAGTGGAAACAGTGGACCGCTTTC +AAATGTGAGAGTGTGTGCCGGCCCGGCCTTTTCTCTGTGCTGTGTCTCGGGGCCAGCCGG +GGTAGACGGGCCTTCTCTGCCTTTCCCTACACAGATTGACACTGTGGCTCCCCGACTCTC +TGACCACGTGGGAGATCCATGGCCTGAGCCTGTCCAAAACCAAAGGTGATGTCACCCTGT +CTGGGCCTCAGGTGACCCTGCTTCCATTTCCCTGTACCCCAGCTCCCTGTTCCCTTTGCT +CTTAGTGTAGGAAGAGGGTCCAGTGATCTGGGGAGGTCTGTGCCAGCGTGCAGCTGGCGT +GGGCCAGAGGGCAGAGGCGGACTGAGACAGAGCTGGGTCACCCCCACCCCTCCCTCCTGT +GGCCCTGAAGCTTTGATGGCCCCTCTGATCTCTGCCCCTGTGCCCACGCTTCCTTTCCCT +CAGGCCTATGTGTGGCCACCCCAGTCCAGCTCCGGGTGTTCCGCGAGTTCCACCTGCACC +TCCGCCTGCCCATGTCTGTCCGCCGCTTTGAGCAGCTGGAGCTGCGGCCTGTCCTCTATA +ACTACCTGGATAAAAACCTGACTGTGAGGCCCCATGGGAGCCTGAGCATACAGGAGTTGG +GGGAGCCAGGGCCCAGTGAGGGGTGGGGAGGCTAACCGGGCCAGGACTCTGGCCATCCTC +GTTTTCCTGCCCTCAGGTGAGCGTCCACGTGTCCCCAGTGGAGGGGCTGTGCCTGGCTGG +GGGCGGAGGGCTGGCCCAGCAGGTGCTGGTGCCTGCGGGCTCTGCCCGGCCTGTTGCCTT +CTCTGTGGTGCCCACGGCAGCCACCGCTGTGTCTCTGAAGGTGGTGGCTCGAGGGTCCTT +CGAATTCCCTGTGGGAGATGCGGTGTCCAAGGTTCTGCAGATTGAGGTGAATGGAGCACC +CCTGAATATAAGTCCCCGGGCCCCCAGCTTTGTCCTCCACCCTCAGCACTCTCTCTGCTG +GCCAGGCCAGGGGCCCAACACCCAAACCAATGCCTTGGTCTGTTCCCATCTTCTACAATT +CTGATCCAACTCTGTCCCTGGAGTTGAAACTCAAAGTTCTGGGGGAGTCTGCGCTAGCAG +GGCAGGCTGTAGTCCTGTGTGACCTCACAACCATGTTTTCCCTGAGACAGAAGGAAGGGG +CCATCCATAGAGAGGAGCTGGTCTATGAACTCAACCCCTTGGGTGAGTGACCCTCTACCT +CCAGCCATTGGTTTCCTAAGTGGGTACAGGTGGTGGGGGATGTGGACAGCAGGACAGGCT +GCCAACTTCCCCCATTTCCCCAGACCACCGAGGCCGGACCTTGGAAATACCTGGCAACTC +TGATCCCAATATGATCCCTGATGGGGACTTTAACAGCTACGTCAGGGTTACAGGTGGGAG +TGCCCTTTAGTCCCTTCCCAGTGGCCACCTTCGGATTCATGTGGGACTTGTGGATCCCTG +CTTGGTCCCACTCCCCGTGAGCCTCTGACACAGAGTCCTCAGACCTCCACCCTCTCCCTC +CCATGTAGCCTCAGATCCATTGGACACTTTAGGCTCTGAGGGGGCCTTGTCACCAGGAGG +CGTGGCCTCCCTCTTGAGGCTTCCTCGAGGCTGTGGGGAGCAAACCATGATCTACTTGGC +TCCGACACTGGCTGCTTCCCGCTACCTGGACAAGACAGAGCAGTGGAGCACACTGCCTCC +CGAGACCAAGGACCACGCCGTGGATCTGATCCAGAAAGGTTCTGGGTGCAAGGGCAAGCA +GGAGGGGGGCCAGGAAAGGACAGTTACTGGAAGATGGACAGCCCAGGAGGCTACAGAGGG +AAAGAAAGGGGGCCCCTGATGAGGATGGGGAGCATGGCCTTGGGCTCAAACAGCAGAAGG +GTGAGTGTCACCTGAGCGGCCACCTCTCCTCTCCAAGGCTACATGCGGATCCAGCAGTTT +CGGAAGGCGGATGGTTCCTATGCGGCTTGGTTGTCACGGGGCAGCAGCACCTGGTGAGCT +TGGGAGAGTGGTTCCAGGGTTCTGAGGGGGTCAGGGCTGGGGCAGGGGTGGGACAGAGCT +GGTATGATGGGAGGGTGGATAACCAGGCACCTGGGGGCGTGGGCATAATGAGAAGCAAGT +CCTTATCCCCAACCCTCCTTTCCTGCCCTCCAGGCTCACAGCCTTTGTGTTGAAGGTCCT +GAGTTTGGCCCAGGAGCAGGTAGGAGGCTCGCCTGAGAAACTGCAGGAGACATCTAACTG +GCTTCTGTCCCAGCAGCAGGCTGACGGCTCGTTCCAGGACCTCTCTCCAGTGATACATAG +GAGCATGCAGGTGCGGGCATGCTGGGGCTGGCCCGAGAAGCGCCTGTCGGAGGACTCTCT +TTGCCCCTTCCCCCTCCTGTTTGACATCTTTTCTCCCCTTACTAGGGGGGTTTGGTGGGC +AATGATGAGACTGTGGCACTCACAGCCTTTGTGACCATCGCCCTTCATCATGGGCTGGCC +GTCTTCCAGGATGAGGGTGCAGAGCCATTGAAGCAGAGAGTGGTAAGTTCAGTGGCGTTT +CTGCCCTCTGCTGGCCCCCAGCTCTCTCCCTTTTTCCTCAGGAACCCAGGGGTCCAGGCC +CAAGACCCTCCTCCCGTTTTCTTCCAGGAAGCCTCCATCTCAAAGGCAAGCTCATTTTTG +GGGGAGAAAGCAAGTGCTGGGCTCCTGGGTGCCCACGCAGCTGCCATCACGGCCTATGCC +CTGACACTGACCAAGGCCCCTGCGGACCTGCGGGGTGTTGCCCACAACAACCTCATGGCA +ATGGCCCAGGAGACTGGAGGTGAGGGGTGAGGGGCTCTGGCAGTGAGCCTGAGGCCCAGG +GGACCTTAGGATCCCTGAGTGTGCCCAGAGGGAGAGGCTGGATGAAGACTCAGAGGAGGA +ATGAAGTTATAAGCAGGGGTGGGTTGGGGGAGACTCAGGAGAGCCCAGCAGGGGGTGGCT +AAGGGCCAGGGGACCAGGCTCTTCTCCCTGCCTTCCTGTTTACTCGTGGTCTCCCTTCAC +TTTCAGATAACCTGTACTGGGGCTCAGTCACTGGTTCTCAGAGCAATGCCGTGTCGCCCA +CCCCGGCTCCTCGCAACCCATCCGACCCCATGCCCCAGGCCCCAGCCCTGTGGATTGAAA +CCACAGCCTACGCCCTGCTGCACCTCCTGCTTCACGAGGGCAAAGCAGAGATGGCAGACC +AGGCTGCGGCCTGGCTCACCCGTCAGGGCAGCTTCCAAGGGGGATTCCGCAGTACCCAAG +TAGGGGCCGTCCCCGGGCTCTGGCGGGGGTGGGTAGTCCTCAGACCAAGGGCTTGCTTGA +GTCCTGGCTCAACCTCCCTAGGACACGGTGATTGCCCTGGATGCCCTGTCTGCCTACTGG +ATTGCCTCCCACACCACTGAGGAGAGGGGTCTCAATGTGACTCTCAGCTCCACAGGCCGG +AATGGGTTCAAGTCCCACGCGCTGCAGCTGAACAACCGCCAGATTCGCGGCCTGGAGGAG +GAGCTGCAGGTGAACCACTCCCTGGTGAACCACTCCCTCGCCTGGGTAGCCAGGACACCT +GGGCCTCGTGGCCAGGCCAGAAGCCGTCCCCACCCTCCCACCCGTGGAATCCCCGCAGCA +CTTCTTCCTGGGGTCTTCGGGGGAAGACTGACTTCCTGGCTGCGTGACCTGGAGCTCTGA +GCTTCAGTTTTCTCACTTGTAGAGTAACATACACAGAGTTCACCCTACAGGGTCGTTAGA +AGGCTGAAGTGAGATAATTCATGTGCTGGTATAAACTTTGTGGAAATGTGAGGTGGGGAG +AGGGGGTGGGGCTGTTTTGAGGAAGGAGATAAGTTATTGGAGCCGCAAAAACAGGTTTGC +TTGTGCCCTTCTAACATCGCCTTCCCTTTTCTGTTGCTGAAGTTTTCCTTGGGCAGCAAG +ATCAATGTGAAGGTGGGAGGAAACAGCAAAGGAACCCTGAAGGTGAGGGCCAGGGAAGGG +GTGGGGCCAGGCACTGGTGGAGGAGAGGGTGTGGAGTGAGAGGCCTGTGGGCAGAGGCAC +ATGGTCCGGGGAAGGAGGCAGACACCTCAGGGTTGGTGTCCCGTGCTTCCGTCCTGGGTG +TTTTTCCCCCTGCTTGCTTTCGCTTGCTCTCCCCATCTCTGGGTACCTGTTGTTTCCTTT +ACCCGCCTCAGTGCTGGTGGCTCCGAATCCCACTCCTCAGCCCAGGCCTCTTCCCTGAAC +CATGGGCCCCACTCGTCCCACTCCCACAGCACCTCAGACGAGGCATGTCCCAAAGCCCTT +CTTCATTCTGTGTCTCTTGTCTGGCTGGTGGGAGCCCCTCCCAGCCAGGAGCCCAGCCAC +TACTCTAGAGGCCGTGTTAGTGGCCCCTCTCCCAAGCCTGTCCTTATGTCCCTAGTGACT +CCTCCTCTGCTCCCCTGCTGCCTGTGGCCCTTGGTGCTGCATCCTAGATTCTGTGCTGAG +ACGGCCTTCTCCCTACCTGGAACTTCTCTCTACCTCCTGTCTCCCCTGTCTGATCCACTG +TCCACACGGCAGTGACACTGACCTTCCAAAAGCCCCAGCCAGATCAGCCTTGGGGAAAAG +TCACTCCCCGCTGCCCACGGCTCAGATGGCTGGGCCTCTGCCCACCCCTCCGGCCAGACA +GCTCTCCTTGTCTACACAGATCCCCTTGCCTTTCCTGTCCTTCCCTGCTTCTTGGCCCAC +AGGACAAGCTCTTTCTTCTCCTTCAAGCCTTGGCCAGAAGCCTTTCCTGAGCTTTTCAGT +CCAGCCTCTTCCCAGCACAGTCTGGAGTGTTGGCCTCTGGGGGCAGGCCCCTGCTTCTTT +ACCTCTCTGTCTCGCCTGACGCCTGTGGCGAATGTGGTGCCACTCGTGTGTGTGGACTGT +GCAGTGACGGGGAGGAAAAGGGGCTGAAGGCCTCAAATCCTGTAGCCCAGGGAGATGCCC +TTAGGTATGGCACCAGAGAGGTCTGTGGCCTCACATGTCCCACGTCCTCTCCCTGCCCCT +TGCTGAGCCAGGTCCTTCGTACCTACAATGTCCTGGACATGAAGAACACGACCTGCCAGG +ACCTACAGATAGAAGTGACAGTCAAAGGCCACGTCGAGTACACGAGTGAGTGTGGGGGTT +GGGAGGCCTTGGGGCCAGGCAGGGGCTGGCGCAGGGAGCCGGGTGGCCATCCCAGCCCTC +CTCACAATGCTTCCCTGTGCAGTGGAAGCAAACGAGGACTATGAGGACTATGAGTACGAT +GAGCTTCCAGCCAAGGATGACCCAGATGCCCCTCTGCAGCCCGTGACACCCCTGCAGCTG +TTTGAGGGTCGGAGGAACCGCCGCAGGAGGGAGGCGCCCAAGGTGGTGGAGGAGCAGGAG +TCCAGGGTGCACTACACCGTGTGCATCTGGTGGGCGCCGGGAGCTGCCCTGGGCCAGGGG +AGGGAGGGCAGGACCCAGGCTGGGGCTGGGCTTCTGGAGCCCGCGCAGGCAGAACCTGGA +CGACAGCTCACACGTCTCCACAGGCGGAACGGCAAGGTGGGGCTGTCTGGCATGGCCATC +GCGGACGTCACCCTCCTGAGTGGATTCCACGCCCTGCGTGCTGACCTGGAGAAGGTGTGG +TCAGCCACCCAGGGCAACCCCCTCTGTCCCAGGTACTGAGCCCTGTCATGTGCAGGGCCT +GTGACCAACTCCCCTTTTCCACAGCTGACCTCCCTCTCTGACCGTTACGTGAGTCACTTT +GAGACCGAGGGGCCCCACGTCCTGCTGTATTTTGACTCGGTGAGTGGGGAGAGATGAGGC +AGGAAGGGACTCGATGGCACCGGGTTTACTGAGTATGCGTTAGGAGGTTTCTCAGGAGAC +AGCTGTGTCAGCGGCTGGTGCTCTTGAGAACTTGTGATGTCATCAGAGAGAAGGACAAGA +ATGTGAGCCCGTGAGACACAGCAGAGTAAGGGGCAGACCTGCAGGCGGCAGGGACCGATG +CCAGTCAGCAGGGACCCTCAGGGTTTGAGAGGGAGTCTTTCCTAATGCTGGTTTTATTCA +GCTTGAGGGGCTGCCTTTGTTTTTTTGTTGAACTTCCTATCTTTTTTTTAATATTAAAGC +GTATTTTCCTTTACAAAGTGATGGTGGCCATAGATGATAGTTGTATTTGTCTTTTCACGA +CCTTATTTGGCTAAAATAGTTATCAACCCTCTTACGGCTCTCAAAACATTTTTATTTATT +TATTTAGTAAAGACAGGGTCTCGCTCTGTTGCCCAGGCTGGTCTTGAACTCCCGGCCTCA +AGCGATCCTCTGGCCTAGGCCTTTCAAAGTACCGGATTTACAGGCCAGAGCCACCATGCC +CGGCCTTCAAAAAAAGTTTTGGAACATTTACTGTAACCTCTGGGAGAAAATGTGAGAAAG +GTGTGGTGGCTGTCATTAGCCAGCTGTTTGTAGGTCAGGGAGACCCCTACCCAGTGTGTG +CAGAGGGGCCAGCCCCCATCAGCTGGGGAAGCCTGGCTGACACATCTGGGTTGAACACAA +TAGAAAACACAGAGCCAACAAGATTCCCGGATAGGGAGCTGACGGTGCAGCAGCCTAGCT +CAGGAGGGACACTGGCACGGCACCGTGTGGACTGGGCCCGCGTGGGCACGAGGAGGGGTC +AGGCCTGGGACCTGAGTCGGGGGGTCAGGCAGGATGACAGAACCTGCAGTTAGGTTGTGG +CAAATAAAGGAGGACCCAGTTGTATCCATGACAAAGATGAGGCCGCGAGGAGGGCGAGTG +GGTTTGGGGGCAGGCAGAGTGCCTTGGAGAACTTACAGGTCCTGCCACAATCCTAATGCA +AGGATGGAGCTGCAAGTTCAGTTTGGGAATCATCAGCCTGGATTGGTTTGGTGGAAGCCA +GGGAGTGGTTGAGACCCCCACAGGGGAGCTCTGAGGAAGGAAGTTCCGAAGGAGGGAACG +TAAGAAATGACCAGGTCAGAACCAAGGGTGGTCCAGAAGCTAACCCTTAGCTTAGGGACA +GTTTCACAGAGAACACGTCCATGATGCAAGACTCTGCTGAGGGCCTGGAGCAGTGAAGAC +TGGGGCAAGGTCACCCTCTGGGAAGTGAAGTCACCAGAGACCTTGCGGAGCAGCTTTGAG +AGTTCTCTGAGTAGGAAGGTAACAGAATGTGAAGGACACTGGAGAGAAGGCCAATAGGAA +GCAAACAAAAACAGGCCAAGGAAACCCAGTACAGGGGGCTGCAGGGCCCAGGGAGTGGGT +CCCTCATCTCTCCTCCCCACGCTTGGCCAGGTCCCCACCTCCCGGGAGTGCGTGGGCTTT +GAGGCTGTGCAGGAAGTGCCGGTGGGGCTGGTGCAGCCGGCCAGCGCAACCCTGTACGAC +TACTACAACCCCGGTGAGCACTGCAGGACACCCTGAAATTCAGGAGAACTTTGGCATAGG +TGCCCTCCTATGGGACAATGGACACCGGGGTAGTGAGGGGGCAGAGAGCCCTGGGGCTCC +CTGGGACTGAGGAGGCAGAATGGAGGGGCCTGTGCCCTAACTCCTCTCTGTTCTCCAGAG +CGCAGATGTTCTGTGTTTTACGGGGCACCAAGTAAGAGCAGACTCTTGGCCACCTTGTGT +TCTGCTGAAGTCTGCCAGTGTGCTGAGGGTGAGACTGAGGGCCTGGGGCGGGGCAGTGGA +GGCGGGATGGCCGGGGCCCCCCCCACACTGTCTGATGGGTTCCCCAACTTCAGGGAAGTG +CCCTCGCCAGCGTCGCGCCCTGGAGCGGGGTCTGCAGGACGAGGATGGCTACAGGATGAA +GTTTGCCTGCTACTACCCCCGTGTGGAGTACGGTCAGTCTTCCCACCGAGGCCCTGGCCT +GACCCTCCCTCGGGGACCGGCCGTTTTGGTCTCTCTGGGTGTAGCCTGCTCCTCTTACAG +GTCATGCACGCAGCCTGTTTGCTCTGACACCAACTTCCTACCCTCTCAGCCTCAAAGTAA +CTCACCTTTCCCCCTTCTCCTCACCCCCTCTTAGGCTTCCAGGTTAAGGTTCTCCGAGAA +GACAGCAGAGCTGCTTTCCGCCTCTTTGAGACCAAGATCACCCAAGTCCTGCACTTCAGT +ATGAAGCAAACCGGAGAGGCGGGCAGGGCTGGGGGGAGACAGGGAGGCTGAGGTGTGGCC +GAGGACCTGACCATCTGGAAGTGTGAAAATCCCCTTGGGCTGTCAGAAGCCTTGGGCTTG +GCCATAAATAGGGAGGCAGTGGCACCTCTCCATGGGGGTGGCGAAGGTGGAATGAGAGGA +TCTACACAGAGTCCCCAGCCTGGGCTCACCCTGCACCTTCTCTTCCCCTCTGACCACTTT +TGCGCACGTCATCCCCGCAGCCAAGGATGTCAAGGCCGCTGCTAATCAGATGCGCAACTT +CCTGGTTCGAGCCTCCTGCCGCCTTCGCTTGGAACCTGGGAAAGAATATTTGATCATGGG +TCTGGATGGGGCCACCTATGACCTCGAGGGACAGTGAGTCATCTGGTCCCCTCAGTCTCT +TGTCCTCCCCATGCCTCGCCACCTAGGCCTTGCCCCTCAGAAGCCAGATGCCTGTGCTCT +CCGTTTCCACCTGCCATCCTCCCGAGCCCTGCTGACTGCCCCTTTGCCCCCTGCAGCCCC +CAGTACCTGCTGGACTCGAATAGCTGGATCGAGGAGATGCCCTCTGAACGCCTGTGCCGG +AGCACCCGCCAGCGGGCAGCCTGTGCCCAGCTCAACGACTTCCTCCAGGAGTATGGCACT +CAGGGGTGCCAGGTGTGAGGGCTGCCCTCCCACCTCCGCTGGGAGGAACCTGAACCTGGG +AACCATGAAGCTGGAAGCACTGCTGTGTCCGCTTTCATGAACACAGCCTGGGACCAGGGC +ATATTAAAGGCTTTTGGCAGCAAAGTGTCAGTGTTGGCAGTGAAGTGTCAGTGTGTGTTG +CTAGGGCTGAGAGCAGTGCCCCTGCCCGATGCAGTTCTGGGCAGGCCAGGTTGACATAAC +CTTAGACTCTCTGAGCCCTGATGACCCTTGGGCTGTTCAGCTCTGCTAGAACCTCCCAGA +TGACCCGCTAGGAGTCTAGTGCTTCACAGGACCACCCCGAGCAGAACTGGGACCCAAGAG +CCTGCACCCCAAGGACCAGAGTCCATGCCAAGACCACCCTTCAGCTTCCAAGGCCCTCCA +CTGCCCGGCTGTCGCCAGTCACCACGGCCTCAGACAGGGCTTGTGCTCAGCTGACACCTG +TGACACAGCTCTTCTGCCTCATGAGCTGTTGTCCAGCTACACCTCCCCGACTCTGTCCTC +GTGCTGCTGGCGGTTCTGAGGTCTGCAGATTTTAGCTGAGTTCCGGGCTGTTGAAAGCCT +GCTGACGCTTGGTTCTGTTATCAGTGGAATGAGGTGACTTTCCCGGAGTTGTGCAATCCT +CAGGTCCGGCAGTGTCTTCTTCCAGTTACTGGTTTCAAACAAGCCAAAAGTCTGACTTTG +GTGTGTTTGTGAATCCTCTGAGGAAGCCGCTGTTCTCCTGGGGTCTCCCCTTCCCACCGG +ACCTGCCTAACTTTCCCCCATTTAGTGGCACACCTGGGGTCTTCAGAGATGACTCCGCGT +CTGTCCAAAGAAGTTTGGTGAGATCAGTTTCCGTAGAGGTCATGACAGTTCAGCAGCCTG +CCATCCAGTCATTCGACAGAAATTCGGGAATCTTTCACTTCATGCCATGCCCTGTGCCAG +GTGCCAGAGATACAGCTGCTCACTCCAGGGCTCATCGCTGGGGAGACAGATAAGAGGACG +GGCAGTCCCCACCCTCTGTGAAAGATGTGATGTCAGGGAGCAGTGTGGTCCTGTGGGGCA +TCTAACCAAGTCAGGGGCATTGCCAGGCAGGGACAGGGAAGGCTTCCTGGAGCAGGTGGC +CTCCAAGTGGGGCTCTGAAGACTGAGAAGGAGCCAGGAAAAGAGCAGGGGTAGATGAGGG +CATCTGGGGCAGAAGGAGAATATACAAAGGCCCAGAGGCCGGGGGCAGGACAGGGTACCT +TTGGGGACATTGCATGTAATTGACCACATTCGGAGTTTGGATTTGGAAGTGGTGGAAGAG +ATGGAGATGGTGAGACAAGTAGTAAGCACGTCAGCCTTCCAGGTGCGCTCCTTTCCGATG +AGCACTGTCTTATCCCACGTAACTTTGAGAAGTTTGGGCCTTTCCCACTGTGGCAGAGGT +TTCCTGAGGCTCTTGCATACATGGCCCTATGGTTGCTCATCAGATCTTTCTCCCAGTAGC +TGCTCAGCATGGTGGTGGCATAAGCCCATTTTCCGGAGCCAGGGATTCAGTTGCAGCAAG +ACATGGCCCGGTCTGGGAGGTCAACCATGAAGAAGGCAGTAGCTGTCATTGCCCAACCCC +AGAAATCCCAATCCTGTTTTCTCCCTCTCAGTCCTGATCATGGATTCAGCAGCAGCGAAC +TCGCCAATGTAGTGGGTGGCACAGCCAGGGTCTTGACTCTGGCTCTGCAGTAGCACAGTC +TGGAAAAGCTCTGAGGGGAGAGAGACCCCCACTGGTCCGAGGGTCTGGCACAGAGCCAGA +AATGGGGGGGAAGGTATGGGGCTGGGTCGCCTCTGACCTCTCAGGTACCATCCAGGAGGC +CCTGGCCTCTCACTGAACCCGGCCACTCCTCTTTGGCATGGCCTCTTCCCAAATCCCCAA +ACTGCCTCCTTACCCACAAAAGTGGTCTCTGAGTGTCAGTCCAGTGGGACCCCCACCCCT +TATGGCTTCAGTTCCCCAAATAGGGCTGGACCCTTGATCCTGATCCAGCTGTGGCTATCC +AGCCCCTTCCTGGGGACTTTGGACTTTGAGGGGGGCATGCCCAGTTGTGCTGGGAATCCA +TACTTTCCCTGGCTGGAGTAGAACCTGTGGACTGTAGTCCTGAGGGCAGTCATGTTCTGC +CTGTGCCTGGAAACACAAGAAACTTGACTGCAGAGAGAAGAAAGAGGAGAGAGGAACAGA +GCGAGGAAACCGCCCGTCTCCGGGGCTTTTTCTGTTCCCTATCCTTGACTTTCTAAGACC +AGTGGGGTCCCCTCCTCTGCTTCTTTTTCCTGAGTTCTGTGAAATTCCCCAATTCTTATT +TTTTATCTCAAACCAGCTCAAGGTGGGCTGTTTTCCTTTCAACCAAAGAAAGGTGCTCCT +GGTGGCTAAAGGTACATATTCGACAGCTAGATTTCCAGGCTGGAATCCTGCCCTCCACAA +CATGCGAACAATACCCGTGTTGCATATAGAGCATGGCTGTGAAGAGTTGAGTGAGTGCCC +ACAAAGCACTTAGAGCAGTGTCTGGTACATGCTATTACTCCGCAGCGGGAAACCACTTCC +TCCTTTGTCTTCTGGGCACTTTTGTGAGTGAAAGGAGGCACTAATAACAATCACACTGGG +ATACCTGTATATACTGGAATGCCCCAGGCAAACCAGGCTTAAACTGTATTACTCTATCTG +TAGCTTAAACTAACAAACAACCCACACAAATCACATTTTGTTCTTCAGGCGATTCAGGAA +GGCCTATTAGGCAGGGACTGCCATTTTCTCTCTGAGACAAACATCATGCCAGTAAACTGG +CCCACGGTGGGGTGGCAGAGGGAGAGGGCCCAGGTGGGGGCGGACACTATTGCCTGCACA +GTTGATGTGGAACCAGAAAGCTGACTCTGGATGCAGGAAAAAGGTCAGGGTTGCATTTCC +CTTCCTTGCTTCTTGATGGGTGATCAATTTTTTTGAAATACGGACGTCCCAAGGCCAATG +AGACTGGTGTCATTCCAGAAAAGGGCCACTCTGTGGGCGGGTCGGTGGGAGGGTACCTGA +AGGTGGGGTCAAGGGAGGCCCCAAAACAGTCTACACAGCAGGAGGGATGGCTGGGGCTCT +TGAGCTATAAGTGGCACCTCAGGGCCCTGACGGGCGTCTCGCCATGCTGCTCCTGGGCCT +GCTGCTGCTGCTGCCCCTGCTGGCTGGCGCCCGCCTGCTGTGGAACTGGTGGAAGCTCCG +GAGCCTCCACCTCCCGCCTCTTGCCCCGGGCTTCTTGCACCTGCTGCAGCCCGACCTCCC +CATCTATCTGCTTGGCCTGACTCAGAAATTCGGGCCCATCTACAGGCTCCACCTTGGGCT +GCAAGGTGAGAGGCTGATCTCGCTCTGGCCCTCACCATAGGAGGGGGCGGAGGTGACGGA +GAGGGTCCTCTCTCCGCTGACGCTGCTTTGGCTGTCTCCCAGATGTGGTGGTGCTGAACT +CCAAGAGGACCATTGAGGAAGCCATGGTCAAAAAGTGGGCAGACTTTGCTGGCAGACCTG +AGCCACTTACCTGTAAGGGCCGGGGGCATTTTTTCTTTCTTAAAAAAATTTTTTTTTAAG +AGATGGGTTCTTGCTATGCTGCCCAGGCTGGTCTTAAATTCCTAGTCTCAAATGATCCTC +CCACCTCAGCCTCAAGTGTGAGCCACCTTTGGGGCATCCCCAATCCAGGTCCCTGGAAGC +TCTTGGGGGGGCATATCTGGTGGGGAGAAAGCAGGGGTTGGGGAGGCCGAAGAAGGTCAG +GCCCTCAGCTGCCTTCATCAGTTCCCACCCTCCAGCCCCCACCTCCTCCTGCAGACAAGC +TGGTGTCTAGGAACTACCCGGACCTGTCCTTGGGAGACTACTCCCTGCTCTGGAAAGCCC +ACAAGAAGCTCACCCGCTCAGCCCTGCTGCTGGGCATCCGTGACTCCATGGAGCCAGTGG +TGGAGCAGCTGACCCAGGAGTTCTGTGAGGTAAGGCTGGGCTCCTGAGGCCACCTCGGGT +CAGCCTCGCCTCTCACAGTAGCCCCCGCCCTGCCCGCTGCACAGCGGCCTGCTGAACTCA +CACTGTTTCTCCACAGCGCATGAGAGCCCAGCCCGGCACCCCTGTGGCCATTGAGGAGGA +ATTCTCTCTCCTCACCTGCAGCATCATCTGTTACCTCACCTTCGGAGACAAGATCAAGGT +GCCTCACAGCCCCTCAGGCCCACCCCCAGCCCCTCCCTGAGCCTCTCCTTGTCCTGAACT +GAAAGTACTCCCTCCTTTTCTGGCAGGACGACAACTTAATGCCTGCCTATTACAAATGTA +TCCAGGAGGTGTTAAAAACCTGGAGCCACTGGTCCATCCAAATTGTGGACGTGATTCCCT +TTCTCAGGGTGAGGACCTGGAGCCTAGACACCCCTGGGTTGTAGGGGAGAGGCTGGGGTG +GAGGGAGAGGCTCCTTCCCACAGCTGCATTCTCATGCTTCCTGCCGCAGTTCTTCCCCAA +TCCAGGTCTCCGGAGGCTGAAGCAGGCCATAGAGAAGAGGGATCACATCGTGGAGATGCA +GCTGAGGCAGCACAAGGTGGGGACTGTACGTGGACGGCCTCCCCTCGGCCCACAGCCAGT +GATGCTACCGGCCTCAGCATTGCTATGAGGCGGGTTCTTTTGCATACCCCAGTTATGGGC +CTGTTGCCACTCTGTACTCCTCTCCCCAGGCCAGCCGCTCAGCCCGCTCCTTTCACCCTC +TGCAGGAGAGCCTCGTGGCAGGCCAGTGGAGGGACATGATGGACTACATGCTCCAAGGGG +TGGCGCAGCCGAGCATGGAAGAGGGCTCTGGACAGCTCCTGGAAGGGCACGTGCACATGG +CTGCAGTGGACCTCCTGATCGGTGGCACTGAGACCACAGCAAACACCCTCTCCTGGGCCG +TGGTTTTTTTGCTTCACCACCCTGAGGTGCGTCCTGGGGACAAGCAAAAGGCTCCTTCCC +AGCAACCTGGCCAGGGCGGTGGGCACCCTCACTCAGCTCTGAGCACTGTGCGGCTGGGGC +TGTGCTTGCCTCACCGGCACTCAGGCTCACTGGGTTGCTGAGGGAGCGGCTGGAGGCTGG +GCAGCTGTGGGCTGCTGGGGCAGGACTCCACCCGATCATTCCCCAGATTCAGCAGCGACT +GCAGGAGGAGCTAGACCACGAACTGGGCCCTGGTGCCTCCAGCTCCCGGGTCCCCTACAA +GGACCGTGCACGGCTGCCCTTGCTCAATGCCACCATCGCCGAGGTGCTGCGCCTGCGGCC +CGTTGTGCCCTTAGCCTTGCCCCACCGCACCACACGGCCCAGCAGGTGACTCCCGAGGGT +TGGGGATGAGTGAGGAAAGCCCGAGCCCAGGGAGGTCCTGGCCAGCCTCTAACTCCAGCC +CCCTTCAGCATCTCCGGCTACGACATCCCTGAGGGCACAGTCATCATTCCGAACCTCCAA +GGCGCCCACCTGGATGAGACGGTCTGGGAGAGGCCACATGAGTTCTGGCCTGGTATGTGG +GGGGCCGGGGGCCTGCCGTGAAAATGTGGTGGAGGCTGGTCCCCGCTGCCGCTGAACGCC +TCCCCACCCACCTGTCCACCCGCCCGCAGATCGCTTCCTGGAGCCAGGCAAGAACTCCAG +AGCTCTGGCCTTCGGCTGCGGTGCCCGCGTGTGCCTGGGCGAGCCGCTGGCGCGCCTGGA +GCTCTTCGTGGTGCTGACCCGACTGCTGCAGGCCTTCACGCTGCTGCCCTCCGGGGACGC +CCTGCCCTCCCTGCAGCCCCTGCCCCACTGCAGTGTCATCCTCAAGATGCAGCCTTTCCA +AGTGCGGCTGCAGCCCCGGGGGATGGGGGCCCACAGCCCGGGCCAGAGCCAGTGATGGGG +CAGGACCGATGCCAGCCGGGTACCTCAGTTTCTCCTTTATTGCTCCCGTACGAACCCCTC +CCCTCCCCCCTGTAAACACAGTGCTGCGAGATCGCTGGCAGAGAAGGCTTCCTCCAGCGG +CTGGGTGGTGAAGGACCCTGGCTCTTCTCTCGGGGCGACCCCTCAGTGCTCGGCAGTCAT +ACTGGGGTGCGAGAGAGGTGGGCAGCAGCTCAGCCTCCCCCCGCTGGGGAGCGAAAGTTT +CTTGGTCTCAGCTTCATTTCCGTGAAGGGCACCGAGAACTCGAAGCCCTTCCAGTGGTAC +CAGCTCACTCCCTGGGAAAGGGGTTGTCAAGAGAGAGTCAAAGCCGGATGTCCCATCTGC +TCTTCCCGTTCCCCTTAAGGAGGTAGCTCCCAGCACTCAACCAACCTCCCCGCAGAGCTC +CCTTCCTGACCCTCCGCTGCAGAGGATTGAGGCTTAATTCTGAGCTGGCCCTTTCCAGCC +AATAAATCAACTCCAGCTCCCTCTGCGAGGCTGGCATGATTGTTCCATTTCACCCAGCCA +CTCAGTCCCTTGCCTGTTACACTGTGGGGCTGAAACCTAGGCAGGCCGAGCCCCAGCCAC +CCCAGCTCTGAGCCGCCTCCCCACCCCTCACCTGATGGTCCACTGTGCTCCCGTAGAGCC +CGTTGAGGTTGGCGTAGTGGCAGTTCCTGTACCACCAGGCCCCTCGGTAGGAGACAGCGC +AGGAGATGAGCAAGCTGTTGGGGTCCCGATCACGGGCAGAGAAGACACTGCCGCTGTGGT +AGCTCATGGAGTCCCCTGGGCAGGGTGGAGGAAGGAGCCATGAGGGCCTCCCCTCCCAGC +CTCACCCTCCCAGCCTCACAGCCTCTGCTTACCTGCGGTGCCGTGGTAGCCCTCCAAGTG +GAGGCGGTAGTACTCCGCAGCCGAGTCTACGTGGAAGGAGTCGTACTGGGCGAACACAGC +CTCGTCCCCAGCCCGCAGGTCCACGCGCATGGAGTAGTCACCTGCCTGTGTCAGGCTGTG +CAGGGCCTCATTGCCTGGGGGTGGGATACGTGCCCTCATCAGGGTCCTGGTGTCCACAGG +GCCCCCATCCCCATCCGTAGTTCCCCAGTCCCTGTGAGGCACTGACCCAGCCAGAACTCT +CCAGAGATGTTCCCAAAACCATGGGCATAGTCCTCCCAGTCCCTCCAGAAGTCTGTCTGT +CCATCCATGCGGCGCTGGAACACCTGGGAAGCAAGTGGGGGCACCATCAGCCTCTGGCTC +CCGGGGCAACAGACCCTGCCCTGCACAGACCCCTGGGCTTCCCAATGCCACCCACCAGCC +AGCCGCCCCCATCAGTCTCCATGTCGCAAAACACGTTCAGGGGCCGCTCGCGGTTGCCGT +TGAGGAAGATGGTGCTGGTCCTGGAGGCACCGGCTCCGTTCTGCATCTCCTCCCCGCAGT +CCCTGGGGAAGGGGATCCGCAGCCCACCTGGGAGAGGAGAGCAGGGGCCAGTCCTTTTCC +AAGCCTTAGGCCCTGGCTGCCCACCCAGCCCCCGGCCCCGGGCCCGTGCGTCCAGGTACC +CGTGGTGAAAGAGGTGGACACGGGCGGCAGGAGGCTCTGGCCCCACATGGCCTGGAGCCG +TGCATTGTAGGAGGTGGAGGGAAAGAGGCCAAGGAGCTGGTGAGATGTGATCCCTCCTGG +GAGCAGGATCTCCTGTGGGACAGACAAGGGGGGGTCAGGGGAGAGGGAGGTGGAGACCCT +CCGGGAGGGCCAGAGGCAGCACCTCCTGGAATCACCCAGGGAGGGGAGTTGGGTCAGTGG +GGCCGGGGCACCTGGTTCTGTCCACCAGGGGTGTGGAAGCTGAGCAGGTAGCCTGCGGGC +CGGACTGGGGGCTCAGTCCAAGTGAGCAGGGCGGTGCGGGGGGTCACTTCCTTGGCCTCC +AAGTCCCGAGGGGCCTCTAGCCCTAGGAGGGAAAGCAGGAAGAGGAGATGGGGATGAGGC +CCAACCTGGCTCCCTCTACCTCCTCTCCCTGTCCCACACACCCCACAGACCCTACCTGTG +GTGAAGGTGATGCTGGCTGGGGAAGTGAGGTTGGGGCCCCGCAGGCCACGCACTGTGGCG +GTGTAGTTGGTGTGGAGGACAAGGTCATGCAGGGGGTAGTCCACCGCGCTGCCTGGGGTC +TCCGCCTGCAGAGGCGGGGCTGGGAGTGTAGAGAGGGGCATCAAGGCCTGCCCCCTCCAT +CCTCGGCCAGAGTCCAGCCTCCCCCCTGCAATCCCCACCCTGAACAAGTCCCCTCCAGAG +GCCTCAGGCCTGCTCACCCCCAGGGGCTGTGACCTGGACGTCATAGGTGTCCACAGGATT +CTGGGGGGGCTTCCAGTGCAGCACGGCGAATCCCTCGGTCAAGTTCAGTGCACGCAACTG +TGTGGGACCGTCAGGAACTGGGGGAAGGGGAGGGGCTCAGAAGGGTCCCCGCGGCTCTCT +CTACTCCGTGCCTCCCCAGACTCCACTGGCCTCCCGTCCGCAATCGGAGCCTCCACCACC +TCCCTTTCACCCTCCTCGTTCTCTCTCAACTCCCACCCATGCCGTTTTCTTGGCTCCCAC +CTCTTGCCCCGGGTCCCAGTCCATCTCACCCGTGGTGAGGAAGCCTGTGAGAGGCTCACT +CTCCTCAAAGCCTCGGACCGAGACCACGGTCACCTCATAGCGAGCGCCTGGGATCAGCCC +CTGGAGTTTCTGGGTCCGGGCCTGGCCATCCACCTGCACACTCTGAGGCTCCCCTGAAAA +CATTGGGGATCGAGGGTTACCCAGGGAACCCCAGGGCAGCTGGAGGGTGGGCAGAGTGCA +GGGGGGAGAGGAAATGCGAGGCGATGAGCACATGGCAAAGGCACCACCTCCGTCCGCCAG +CTGGTAGGAGACTTTGAAGCTGTCCGCCCGGGATGGTGGGGGCATCCAGTTGACCTTGGC +TGAGGTCTCCCTGATTTCACTGAATTGGAGGTCACGGGGGCTCTCCAGAACTGCAGAGGG +GTCAAGGAACAATGACGCAGGCAGGGGCAGGGAGGCTTCTCCCTACGAGTCCCCCCCTCG +CCTCTGCTCCAGCACAGGCTCACCACCCCTTTTCCTCTAGTCCCCAGGAATGGAAGTCGC +TCTGCAGATTCCTCCAGGCCCACCACCAACTCGCCCACCCCCACCGCTGGCTGAGGCACT +AGGTCCCCCCCGTGAAGTACAAAGACCCCCACTTTGGGGCAGAGTGTGTGTGGGTCCTTA +CCTGGGCTGAGGGTGCGGGCGGTTCCCTGGATGCTGTCGGCCTTGTGGGGTCCTCGCAGC +CCATACAGTGTCAGGCTGTACAGAGTCCCGGAACGCAGGTCCCGGAGCACGGCCGAGTGC +CGCGTCCCCGGCACCATCAGCTCGCGCTGCAGCAGTGGACGCGGATGCGGCTCCAGAGTG +CTTGGTGATGGAACCCCAAAGCGGAGCAGGAAGGAGTCGAAGGCCCCCGGTGGGGCCTCC +CAGTTGAGCCTCAGTGAACTGGTGGTCACGTCAGTCACAGACAGCTGGGACAGGCGGGGC +CTTGACTCCTCTGAGGTCTGACCAGCAGGAGCCAGCCCTGCACGGAGTGGGTGGGGGAGA +AGGGATTGGAGACAGAAGCACACCAGCTTGGTGACCCAGAGCACGTCCCTTCCACCCCCC +TCCCTGCCCCCGTTTCTCTATCTGTAACCAGGGACTTGCAGCCACAGGGGGGTCCTGTGG +GGCAGAGCTAAAGGCCACTCGCATCCAGCCCATCCATCCTCTCTCCCTGGTACCCGCCTC +ACGCTCTTTCCCTGCGACCACCCCTTCTGAGCCCCCGTTTCTCCCTTCTGAGTCCTAGGC +TAGAGGCCGGAGACGCCTGGTGGTACCTGTGGTGCCCTCAGCTGAGAGGGGCCCCAGGCG +CTTCCCTTCATGGAGGCCATAGAGGAGGAACCTGTAGGGGGTGCTGGGCTCCAGGCCTGA +GATGAGGATCTTGCTCTGGTCGCCGTCCACGAGCAAGGCCTGGGGCTGCCCGTTCGTGTC +CTCATACTGGACCACGAAGGAATCAAAGGGGCCCTGGGCCACGCTCCACGAGAGGCGCAT +GGAGTCTGGGGTTGTGTCGGTCACGGTCAGCACTCCTAGGCGGGGCTCTTCAGGAGGCTC +AGGGGCCTCTGGGGCTAACTCTGGGGCTGGTGTGTCCTCTTCTGGGGCTGCGTGGGAGAA +GCCCAGGGGAGAATCTGAGTGAGGGGCGCCATGGGGTGCTCCATTTTTATCTTCCAGGCT +TGGCCCAAGGCTGAGGTGGGAAGTTTATAGGTCCAGGCCCAGTCAGACAATGAAGTCGCT +GTGGCCTCGTGACTCCTGCGAGCTCCCGCGCTGTCTGAGTCAGGTGCTCGCTTCCCCCTT +CCACACCCCGGTGTCCTGCCGAGCCCACCTCGAGATATCACAGGCTCTGGCCCCACCCAT +GCCGGGATACATTCACTGAGCTTGAGGAGTGTGGTGCTCCCTTCTGAGAGAAGCTGAGGG +TGGAACTGGCTGGTTGAGGTGACTGGCAAATCCCACCAGCCGTGCCGTGGTCAGGCCTGT +CTGAGGTGGGCATCAGCGAGCTCTGGAAGAGGAGCCTGTACCACAAATGCAGCCACTGCT +GTTGGTTTCTGTGTCCCCGCTCATTTTGTTTTCCAGTGATGTTCCTCTTAAGAAAATGCT +CCTGACTCATCCACGGCAGGGAGGTTTGCCGCTATCTGGACAAGGCCACCCTTCGGGGAG +GCGACAGCAGCCCCAGCGAGTAATGAGGAGCAGTGGCAGTGACGGGGCAGAGTCGGGGCT +GGGAGATTAGAGAGCCCCTCCCAGGGCCTTTCCCTCCCGCCTGGCCTGGCTCCTGCTCTG +GACTCCTTGATGGATGTTGAAGCCCACAGGGCTGCAGACTCCTCCTCCTTCCTGGGGACA +GGCCAGGGCGCCCCACTCCGGCCTGCCCACTCCTGCAGTCATCTTTGTCTTCAGCCCAAA +TGCACAAGGAAACCCACACAAGCTGGCTTGCTATAGCCAGGCACAGCAGCCTCACCTGTC +ATTCCCAGGGCAGAGACCGGGCCCAGGCGCTTTCCCCCAAGGAGCCCGTAGAGCAGAAAC +TTGTATTTCTTGCCAGGCTCCAGGTCCTCTACGGTGACTGTGCGCTGGTCTGCGGCCACA +GGCACTGCCCTGGGCTGCCCGTCCGTGTCCCTGTACTGGACCACGAAGGAGTCAAAGGGG +CCCTGGGCTACCGTCCAGGACAGGCGCAGAGAGCTGGAGGTCTCCTCAGCCACGGTCAGT +TCCCCCAGGTGGGGAGGTAGCTCCTTCTCCAGGGGAGCTGTGCAGAGGGAGGAGGGAAAG +CTCTTAGTCACATGCTGCCTTTGCCTAAGCCCTGGCAGCCTCCCGGAGGTGTGAGGTTCT +GGGAAATGGTCCCTCCAGTGTAGCCCCAGGGACAGCTCCTTGAGGAGACACACAGGCCTG +CTCCCGCCATGCCCCACAGGAATGAGGGAGAACAGCCCCCTCCTCCTCTGGAGGCTGCTG +CCCAAACTCCTTCCTGCCCCGCCCCTTCCCTGCTGTGATCGAGGATGCGCCAAATTCATT +ACAGATCATCTCCCGAGGGATGGGTGGCTGGGGGTGCAGAGAGGGCCTTTGTTTACCCTG +ACCCCCAGCCCCTGAGCAAGAATGAGGCCAGAGCTGAGAGAGACTCCCCGGAGGTCTCTG +GGTTGTCACGGAGACACCCCAAACATCGAGAGCTGGTCTGGGCAGCCGGCCAATGCACGG +CTCCCATCACTGCCAGGCTGTGATCTCCCCCTTGTCCCCTTGTGGCCATCAGCCTGAACA +TCCGTGCCTCCTGCTTCCCCAGCCCCACACTGACCCCACTGGGCCGGGGCAGCCAGGGTG +GGGCAGGGAGAAGACAGGGGATTAGCTGGGAGAACAGAGGGCAGAGCAGAGGCTTGCCCG +GGTGGGGCTGGGGCCGATGGGTGGGGATCTGTACCCCGTCCCCACAGTGAGGGTTTGGGA +AGAGAATTACGGAGTCCCAGGGACCCAGGCCCAGACTGGCCGGCTGCTCTGTCCTCCTCT +GGGCATAGTGACTCATGGTCCTGGGAGTGGGGTGAGGGTCGGTGACCCACCACACCCCTT +CCTCAGGGAGCTGAGTCATAGGCATAGTGACACCAGGTTTTTCCATCGTCTTTCCATAGC +CAAGCCCTCCCTTTTCTTCCACCCCTCGGCTCCGAGTCAGGGAGGAGGGAGGAGGATGGG +AACCACTACTGAGTCCAGCGCCATTCCCAGCATTATGCAGGTGAGGACACTGAGGTCCCG +GGGATGAAGCGGCTTGTCCATGGTCACCCTGGCAAAGGCTAGGACTGGAACTGGAACACA +GATCTGCTGGCCCCAAAGCCCGTGTCCCTTTTATTTCCTCAGCAGTCAGCGAATGAAAGG +AAGTAATGCATATGCTTCAGAACTGTGCCTGACACACAGAGGGACTCACTTTCGGAGTTA +AGATGGTTGTGTCAGGGCTGATAGAGGGAATCTCACGGGAAGGCTGCAGGGCCAGCTCTG +AGGGCTCGGATGAGAGGCAGCTCTGGAAAAGGTGGAGGCTGGACTGGGACTCACCTGTGG +TGCTGTCAGCAGAGATGGGGCCCAGTCGTTTCCTGCCTGACAGACCATAGAGCAGGAACC +TGTATTTCCTACTGGGCTCCAGGCCCTGGACTGTGACCTCCCGCTGGTTGGCTGCCACCG +GCACCACCTGGAGCCGACCATCCTTATCCTTGTACTGGACCACGAAGGAGTCGAATTCGC +CCTCAGGGACCGTCCACGAGAGGCCCACGGAGTCAGGGGTCGCATCTGTCACAGTCAGCT +CCCCCAGGCGGGGAGACGGTTTGGTGTCTGGGGCTGGAAAAGACAGTGAGGTGCATGGAG +AGTGGGATGGAGGCAAAGGGGCCACGGAGCTTCCTGGGCTGCTATGGCTCTGTGAGCCGG +TCCCAGGAACGGGAGGGTGACTGGGCCAGGAGTAGGAATAAAAGAGGAGCCAGACAAGAA +AGCAAGTGTCCCCTGGGGTGCAGGGAAAGTAGGGAGAGGGATGAGTGTGAGTGGGAGAGG +AGAGCTCAGGGCCTGGGTTTTCCTGGACCCAATAAATCAGTGGGTGCTGAGGACTGGAGT +GTGGGGCACAGAACGTGAAATTCCAACAGGTGCCACAAGGGGGCGAAGGCTCTGGCCGCG +GGAGGCCTCCAGCCCTCACTCACCGGTCCTGGCCTCCACAGGGACTGGGCCGTGGCGTTT +CCCATTCTGGAGTCCAAAGAGCAGGAACTTGTACTTGCGGGCCGGGTCCAGCCCCGAGAC +GGCGACCGCTCGGAGGTCTCCGCTCACAGGCACTGCCTGGGGCTGCCCCTGCGCGTCCCT +GTACTGTACCAGGAAGGAGTCAAAGGGGCCCTGGGCCACCGTCCATGAGAGGCCCACTGA +GTCCGAGGTCACGGCCGCCACCGCCAGCTCCCCCAGGCGGGGCTCCACCGGCAGTGGTGT +GGGCAGGGGCGCTGAAAAGAGCAGAGCAGGCCCATGGGTCAGGAGGCAGGACCCTGCGCA +AGGGAGGCAGTGCTCTCCCAGGACTGGAGTGAGCATTTCTTAGCGGCCTCCTCTAAAACG +CTTGTTTTAGAATCTGTGCCCTGCATTGCTGTAAGCAGCTCACAAACAGTGGTGCATTTA +ACCCTCGCACAACATATGAAGTGGGTGCCATTATTATCATCACCCCAACTTTGCAGGAAT +CTGAAGCACAAGGTTAGGAAACGCCTGCAAAGTCGCACAATCACTACATTCGAAGGCACA +TGCAGATCTGGGCAGCTGGATCTGAAGCACTTTCTGAGCCACTAAAATACTCCTTAAGGG +A diff --git a/paraphase/data/rccx/rccx_ref.fa.fai b/paraphase/data/rccx/rccx_ref.fa.fai new file mode 100644 index 0000000..83cd953 --- /dev/null +++ b/paraphase/data/rccx/rccx_ref.fa.fai @@ -0,0 +1 @@ +chr6_32013240_32049000 35761 24 60 61 diff --git a/paraphase/data/smn1/ref_smn2.fa b/paraphase/data/smn1/ref_smn2.fa index 90e5411..ffa51c8 100644 --- a/paraphase/data/smn1/ref_smn2.fa +++ b/paraphase/data/smn1/ref_smn2.fa @@ -1,2901 +1,802 @@ ->chr5_70026528_70200526 -AATTTAATTCTAAAGTCACTGACGTTGTGATTGAAGCAACATTTTGGGCTGGGTGTGTTG -CCTCATGCCTGTAATCCCGGCAGTTTGGGAGAGTCGGGAGAACTGCTTGAAGCCAAGAGT -TTGAGACCAACTTGGACAACATAGCCAGCCCCTGTATCTACAAAATATTTTTTTAAATTG -GCCAGGCATGGTAGCACATGGCTGTGGTCTCGGCTACTCTGGAGGCTGAGGCGGGAGAAT -CGCTTGAGCCCAGGAGGTTAAGGCCGCAGTAAGCTGCGATTGCACCACTGCACTCCAGCC -TGGATGGCAGAGTTAGACCCTGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAGAAGGCCTC -ATTTTGGGGAACAGAAAGCATTTTGTTAAGCCCTTGGTAGAACAGGGCCTAATGATTTGT -GCCAGGCGGACTAAAACCACGTGGGGTAGACATCCCAACATATAGATAAAAACGTAAAGC -TCTGAAGCTATTATTTGTTTCACAGAGACTCATGCAGCTCCTCCACAACCATAAGAACTT -TTTATAGGCTGGGCGCGGTGGCTCACGCCTGTAATCTCAGCACTTTGGAAGGCCAAGGTG -GGTGGATCACCTGGGGTCAGGAGATCGAGATCAGCCTGACCAACATAGTGAAACCCTATC -TCTACTAAAAATACAAAATTAGCTGGGTGCAGTGGCACATGCCTGTAATCCCAGTTACTT -GGGAGGCTGAGGCAGGAGAATCGCTTGAAACCGGGAGGGGGAGGTTGCAGCGAGTGAAGA -TTGTGCCATTGCAATCCAGCCTGGGTACTGAGCGGGAAACTCTGTATCAAAAAAACAAAA -CAAAACAAAAAAAAACAACTTTATTCAGCAAAATAACATCTTCTATATGCAAAACACTGT -GAGGTGCTAGAGTTACAACATTTTCAAAGTAGACAGCCTACCCAAACTACTCTGAATGAC -AAGGGACTCAATTATTAATATATAATGATAATAGTTCTCAAGAAGATACAAAAAAGTATA -TGCATAATAGCTAGCTGTGCTGATTTCTGAAGATCCATTGCATTGGAGAGAATTCATGTA -CATAGCCTTAATATATGACTATATGTGCCAATGTAAAACTGCTACAGAAATACTTTAGAC -TGCAGCTTAAGTAAAAAAAAGTACACTCATGTTTCTAAAAGAGCTAATCAAAGCTTAATT -TTATTCTCAAATGATTTTGTCCATATGGAACTTGGAGGTTAAGCGAATAACTGACTGCAT -GTGCTTCAGTGTGGCTTGTTAGGGGTTCTCAATCCTGGCTGCACATTAGAATCACCTGGG -AAACCTTGACAGCTACTCAAGCCTTGCGTTATGCTCAGTTTTGATTTTTTGTTTTTTTAA -AAAATTGAATTACAATAGTTGTACATATTTTGGGGGTACATGTGATCTTTTAATACCTGT -ATGTGGGCTGGGTAGTCCCAGCCACTTGGGAGGCTAAGGCAGGAGAATCACTTGAACCTG -GGAGGCGGAGGTTGCAGTGAGCCGAGATCCTGCCATTGCATTCCAGCCTGGGTGACAGAG -TGAGACCCTGTCTCAAAAAAACAACAACAAAAAGAAACTGGCTTGGCGTGGTGGCTCATA -CCTGTTAGCCCAGCACTTTGGGAGGCCGAAGCGGGTGGATTACCTGAGGTTGGGAGCTCA -AGACCATTCTGACCAACATGGAGAAACCCCATCTCTACTAAAAATACAAAATTAGCCAAG -TGTGTGGCCGGGCGCGGTGGTTCACGCCTGTAACCCCAGCACTTTGGGAGGCCCAGGCGG -GCGGATCACGAGGTCAGGAGATCGAGACCATCCTGGTTAACACGGTGAAACCCCGTCTCT -ACTAATAATACAAAACTTAGCCGGGCGAGGTGGCAGGCGCCTGTAGTCCCAGCTACTTGG -GAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGCGGAGCTTGCAGCGAGCCGAGATC -GTGCCACTGCCCTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAAAAAAAAAAAAGC -CAACTGTGGTGGCGAACACCTGTAATCCTAGCTACTCGGCAGGCTGAGACAGGAGAATCA -CTTGAACCTGGGAGGCGGAGGTTGCGGTGAGCTGAGATCTCGCCATTGCACTCCAGCCTG -GACAACAAGAGTGAAACTCCGGCCGGGCGCGATGGCTCATGCCTGTAATCCCAGCACTTT -GGGAGGCCAAGGCAGGAAGATCACGAGGTCAGGAGATCGAGACCACGGTGAATCCCTGTC -CGTACTAAAAATACAAAAAATTAGTCGGGCGCAGTGGCGGGCGCCTGTAGTCCCAGCTAC -TCGGGAGGCTGAGGCAGGAGACTGGCGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCTGA -GATCGCGCCACCGCACTCCAGCCTGGGCGACAGAGCGAGACTCCGTCAAAAAAAAAAAAC -CTTTATGTGTACAATGTGTAATGATCAAATCGGGGTAATTGGGATATCTCTATGCTCAAA -CATTTATCTTTCATCCAGTTCTGATTTAATTGGTCAGAGGTCGAGCATTAAAAAGCACCC -TAGGTAAATTTTACTGTACTTAGGTTATGCCTTTTTTTTTTTTTTAAAGGCAGAGTCTTA -TTCTGTTGCCCATGCTGGAGGGCAGTGGCGTGATCTCGGCTCACTGCAACCTCCACCTCC -TGGGCTTAAGCGATTCTCCTGCGTCAGCAATCCAAGTAGCTGGAATTGCAGGCGTCCGCC -ACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGACTGGGTTTCACCGTGTTGGCCAGG -CTGGTCTCAAACTACTGACCTCAAGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGA -TTACAGGCGTGAGCCACCACGCTGGCCCAGTTATACCTTTTTTTTTTTTTTTGAATTTTT -TTTTTATTATTACGCTTTAAGTTCTAGGGTATATATGCACAACGTGCAGGTTTGTTACAT -AGATATACATGTGCCATGTTGGTTTGCTGCACCCATCAACTCATCATTTACATTAGGTAT -TTCTCCTGATGCTATCCCTCCCCCAGCCTCCCAGCACACCCAGTTATACCTTAAACTGAA -CTTAAAACAGCTCCCAGGTGATTCTAATGTGCAGCCACTATTAAGAGTCATTGATAAATG -AGATTAAAGACCTTAATTTAAGGCAAAGGTCCTGACACCTTTTTTTTTTTTTTCCCAGAT -ATGGCGTCTTACTCTGTGACCCAGACTGGAGTGCAGTGCCACAGTCTCGGTTCACTGCAA -GCTCTGCCTCCCAGGCTCAAGTGACCCTCCCACCTCAGCCTTCTGAGTAGCTGGGACTAC -AAGGGCACACCACCAAGCCCAGATAGTTTTTATATTTTTTGTAGAAACGGGGTTTCATCA -TGTTGTCCAGGCAGGTCTTGAACTTCTGGGGTCAAGTGATTTGCCCACCTCAGTCCCCCA -AAGTGCTGGAATTACAGGTGTGAGCCACTATGCCCGGCCCTAACATTTATTATTAAAGTG -ATAAGCTTTGTCTTCAATTTCTGTTGACTCACATTAGAGTAAAAATGAACATGGTATGAA -TCAGTGACCCTGCAATAGTATTTTTATTGGAGAACCTAGTCTAGCTTGGTTCAGAAATTG -TCATTGTTTACCAGATATGCACTCCTTATAAAATTCTATGCTAGACATTCTATATACATT -ATTCTTTATTCATCATAACTCTGAAAAATGGTATTAGCACTAATCTGTAGAATAGGAAAC -TGAGGCTCTGAACTTCAGTAACATTACTAAAGTTACACAGCAAGCACAACAGAGCTTGGT -TTCAAATAGAGAAGTAACTGTCATGGTTCTTTTTCCACTGTACTTCATTTCTTTATAGCT -ATGTTTTTGTTTTTGTTTTAGTGAAAGCAAGTTTATTAGGAAAGTAAAGAAATAAATATT -GGCTACTTTATAGGCAGAACAGCCTGTAGCTGTGTTATTTTGCCTTTCTTCTTTATTTTT -ATTTTATTTTATTTTATGTTTTTGAGACGGAGTTTCGCTCTTGTTGCCCAGGCGGAGTAC -AATGGCGCAATCTCAGCTCACCGCAACCTCCACCTCTCGGGTGCAAGTGATTCTCCTGCC -TCAGCCTCCCAAGTAGCTGGGATTACAGGCATGCACCACCATGCCTGGCTAATTTTGTAT -TTTTAGTAGAGATAGGGTTTCTCCATGTTGCTCAGTCTGGTCTCAAACTCTCGACCTCAG -GTGGTCCGCCTATCTCAGCCTCCCAAAGTGCTGGGGTTACAGATGTGAGCCACTGGCCTA -TTTTGCCTTTCTTCTATTCCCTTGTTTTTGCTATTGGCTTTACAGAAATATCTTACCATC -ATGGCTGTGGAAATCAGTTTAGCATTTCCTCAAAAAGTTAAAACAGGCCAGGCTCAGTGG -CTTACACTTGTAAATCCAGCACTTTGGGAGGCCGAGGCGAGCGGATCAACTTGAGGCCAG -GAGTTCGAGACAAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAT -TAGGTGGGCTTGGTTGCACATGCCTGTAATCCCAGCTACTTGGGAAACTGAGGCAGGAGA -CTCGCTTGAACCAGGGAGGTGGAGGTTGCAGTGAGCAAAGATTGTGCCACTGCACTGCAG -CCTGGGCAACAGAGCAAGACTCTGTCTCAAAAAAATAAATAAAAAATTAAAAAGTTTGAA -ACATGAGGTTAATAAGTCAGAGTTGTGGGACTTTAACCAGAGCTGGTAGAGTGCTTGACA -CACAGTAGATGTTGAATGCATGGCCGTTTAGTCTGTTTTTAAAATATGGGTCCATGGACT -CATGAATCACCTAACCCAGGTAGTCTGGTCATAGTTCACAGTTTATATGGCATGTGTCAG -TTTGACTGGAGATGAGAAGGGTTAAGGGCTGGGCGCAGTGGCTCACACCTGTAATCCCAG -CACTTTGGGAGACTGAGGCAGGCGGATCACAAGGTCAGGAGTTCGAGACCAGCCTGGCCA -ATATGGTGAAACCTCGTCTCTACTTAAAATGCAAAAATTAGTCGGGTGTGGTGGCGCATG -CCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGAAGAACTGCAAGCAGGAAGTGGAGGT -TGCAGTGAGCCAAGATCGTTCCACCGCACTCCAGCCTGGGGGACAAAGCGAGACTCCGTC -TCAAAAAAAAAAAAAAGTACCCTGAACATCCAGCTTTTCTTTATTGTAATCCAGTTTTAG -TGACTAGCTTTTGGGCTTTTTTGCTTGTAAGAAACTGAAATCCTTCATAATATCTATGTT -CTAGTCGTAGATACCAGTTAGGATACCTAGGAGAGTTCTTAAATGCCTCTTCTTTCAAGG -AAAAAAATTAGAATGAATTAACTAGTTAACAACCAAGAGTAATAGCTATTATTTCTACAT -GTAGCTGCCTAATGTCACACACTTTTATATATTCATAGTGATTCCTTGTAACAACCTTGA -TACATATTGCCCGGATTTTGAAAGGGCTTTGAGGTTATTTGCCCAAGGTCATGATAACAG -AGATGGTATTCAGAATAAGATCTGACTTCAAAGCCTTTCCAGTCTGTCTTTCCATTTTGT -CTCCAGCCATGAAAATGGAAAGACAAAAAAGTTTATATCCTCTTTTAAATTGTCTTCTAA -TACACTGAATGGGTTATGTGTAGAAACCAAGTGAGAATATATAATTGGTTTTTCTGTAAC -AACTTATAGACTTTTCCCTCATTGTAGGGACTCTGAGATCATGCAAGAAAAGCAGAAGGC -AGCTAATGAGAAGAAGTCTATGCAGACAAGAGAAAAGTGATGACTGGCTATTTGGAAAAC -CTGGGTGCTACTGCCAACTGGGTGTATCATAAGCTCTAAGATCAAGATTTTGTAGAGTGG -ACAGTCATTACATATGTTATAACTTATCCTTTAAAAACTATTTTAAACTTTATCCTTTCA -GCTTTACTTAGTGCGATGTTTTAGAAGCAGTCTTCAAAGAATAAAACACTAACCATGCAT -GTGACATATTGGTGAACATTATTTTTATTATTGAACATTCATATATAATTTATTAGGTAA -TATGATCAGATAATAGGATCTCTTATATAATAAAGAATCTTTGTCATCAGCTTTGTTAAC -ATAGTTTTTTTTTCCTCACAGTTTCTAAGGATAAGGATAAAATAGATCTTTGAAGTAAAC -TTAAATATATAATAGAAGTTAGGGTCCATTTGTATAATTTTGCTTTGAAATCAAGTTAAA -GGGCCAGGTGCGGTGGCTTATGCCCACAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGG -ATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGACCAACATGATGAAACCTCATCTTTAC -TAAAAATACAAAAAAAAAAAAAAAAAAAAAATAGCCAGGTGTGTGGTGGCACATGCCTGT -AATCTCAGCTACTCGGGAGGCTGAGGCAGGAAAATCGCTTGAACCCAAGGCAGAGGTTGC -AGTCAGCAGATATTGCACCACTGCACTCCAGCCTGGGCAACAAAGCAAGACTCTGCCTCA -AAAAAAAAAAAAAATCAACTTACAAAGCTTGCTTGAACAATTTACAACAGATACTTCAAA -CCACTGGAATAGAAACTAAGTGGATGTAAACTGAGGTCTCAGTTCTACTTATAGCTTTAA -CATTTTTTGGAATGAGTACCATATTTTCTGTTCTCAGCCTCTTCTAAAACTTGAGTCTTG -ATGGTAGTTATAAATTTGGAAATATGTAACCTAGAGAAATTAAGGTTTGAGACCTTGCTG -CACTCTGAAGTAAACACAAAACTATGTCAGAGAGAATAAAAATGCCATTGTAGTAGTAAA -TAGAATAACTTAAAGTATTCTACAAATACTTGATTTTTCACATAATGCAATTTAACAAAT -TTTTCTGATCACCCAATATGTCAACCACTATCTGAATGGACAGATCTTGAAGTTAGCCTA -ACACAATATCTTGTGATTTGTCTCTTACCAGTGGTACCACCCATAAATAGGCTAGAATTT -TTTGTGTCTAATACTGAATTCGACAACCAGGAAGTTTTTTGGGTTTTTGTGGGGTGTTTT -TTTTTTTTTTGAGACAGAGTCTTGCTCTCTCGCCCAGGCTAGAGCGCAGCAGTGCCATCT -CGGTTCACTGCAACCTCCGCCACCTGAGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAG -GAGCTGGGACTACAGGCGCCCGCCACCACGCCCGGCTAATTTTTGTGTTTTTAGAAGAGA -TGGGGTTTCACCATTTTGGCCAGGCTGGTCTCAAACTACTAACCTCAGGTGATCCACCCT -TCTTGGCCTCCCATAGAGCTGGGATTACAGGCGTGAGCCATCCCACCCGGCTGCAGTTTT -TTAGCCTGAGTTTCTATCTTCATATTAGCCTAGATTTTTCATTAAATTAAAACATTGTTC -TGGATCTTTGGTTAACTTTAGTCTTCAGAATATTCTATGATGGTAGTCACAAAGGCAAAA -ATTAAGTAGCTTAAGTTACATTCTAATAAAAGAAATAATAAAGAAATCTGATTGTACCAC -AAAGATTCTTTGTGGGCCTGGTTTCTGTAATTCTGTCTCCAGAATTTCTACACAGACTAA -TAAGCCATAAGTACAAAAAAACTTTTCATGCTTTAAGCTCTTTTCTTTGCCTTTTTTTTT -AAATGAATAATTTCTTTAGTTTATCCTGTGGAATGGAAGAACTTTAGACCTTTTAATTCT -TATAAATCGAGGGAAAGCTACGTTTCCAAAATAAAATGGATATTAGAATAAGGAAGATCT -CTAGTTTGTAATCAATCATTAGTACTTTTTTTTTTTTTTTTTTGAGGCAGGATCTCGCCT -TGTCACCCCGGCTGAGTGCAGTGGCACGAACGTGGCTCACTGCAGCCTCAACCTCCTGGG -CTCAAGTGATCCTCCTGCCTCAGTCTCGTCTCCCAAGCAGTTGGGACTACAGGCGCATGC -CACCACGTCAGGCTGATTTTTATATTTTTGGTAGAGATAGGGGTTTTGCCATGTTGCCCA -GGCTGGTCTCCAACTCCTGAGCTCAAGTGAGCCACCTTCCACCTCTGCCCAAAGTGCTGG -GATTACAGGCGTGAGCCACCATGCCTGGCCATTCTTGATTAATTTTTATGGCATTTAATT -AAATAAATTTATTGTTAAGAGGTTTGATTTTTAACTGCAATATGACCAGATGTTTCCTCA -AAGCAGGCGGAAAAATTATCGGAGAGGAAGAAAATTAAGTCTAATTGTTGGAGTATATTG -ACACCTATCATGTGGTATATTGTAATATATATATATATGCAATACATTGACACCAATCAT -GACACCACCATGTGGTATAGTTAAGGTAACTAAAAGTAGCTGAACTTATAAAGAGGGAAA -CAGGTCAATTATAGGATACTAAGGGAAAATACAGGTGAATAGCTTTTTTTTTTTTTTTTT -TTTTTTTTTTTTTGAGACTGTGTCTTGCTTGCTCTGTTGCTGAGGCTGGAGTACAGTGGC -ACAATCTCGGCTGACTGCAACCTCTGCATCCCGGGTTCAAGCAATTCTCCTGCCTCAGCC -TCCCAAGTAGCTGGGATTACAGGCGTGCACTACCACACCCAGCTAGTTTTTTTGTATTTT -TAGTAGAGACAGGGTTTCACCATGTTAGCCAACCTGGTCTCAAACTCCTGATCTCAAATG -ATCTGCCTGCTTCGGCCTCCCAAAGTGCTGGGATTACAGATATGAGCCACCATGCCCATC -CCTGGAGAATAATTTTAATTATTATTATTATTATTATTTTTTTTTTTTTTTGAGACGGAG -TTTCGCTCTTGTTGCCCAGACTGGAGTGCAGTGGCGTGATACTGGCTCACCGCAACCTCT -GCCTCCCGGGTTCTCCCACCTCAGCCTCCCGAGTAGCTAGGATTACAGGCATGAGCCACC -ACGCCCGACTAATTTTGTGTTTTTAGTAGAGACGGGGTTTCTCCATGTTGGTCAGGCTGG -TCTCGAACTTCTGACCTCAGGTGATCCGCCCACCTCGGCCTCCCAAAGTGCTGGGATTAC -AGGCATGAGCCACCACGCCCAGCCTTAGGAGAATGATTGTAAAAAGTAAATTCATGTAAT -GATTTTATTTAGTTTGGATATTGTTAGGGCTTGTTGCTAAAGAAAGATAAAATTATTAGG -TGAGATAGTACCAGATTTAGAATATAATTTGGAAAATACCAAACTCCATGGAACCCTCCC -TTTAAACATCAAAAATCGTATTTTGCATCATTCTTAGGAGGTAGTGCGTTATCATTAGCA -ATTTTCATTAAGTCCTGCTGAAAATGAGAAGCAGCAGCCATTACTGCCCAAGATACACTG -TGGTCAGTTTTATCAGTTACTTTTTTTTTTTTTTTAAACAGAGTCTCGCTTTGTCATCCA -GGCTGGAGTGCAATGGTGCGATCTCCGCTCACTGCAACCTCCACTTCCGTGCCTGGCTAA -TTTTTTGTAGTTTTAGTAGGGATGGGGTTTCACCATCTTGGCTAGGCTGGTCTCGAACTC -CTGATCTCAAGTTATCCACTCGCTGGCCATCAGTCATTTATTTTTGAATGCCTCTTCTAT -TAGTAGCATGTGTAAGAAATTGTGATCCATTTATCAAACTAGCCAGTTTTTGAAAATAGG -GCTAAAAGGAAACGTTGATTTCTGACATTTTCCAAAAACTTAAAAAATTTTTATATAGGC -TGGGCACAATAGCTCACGCCTGTAATCCCAGCATTTTGGGAGGCCGAGGCAGGTGGGTCA -TTTGAGCTCAGGAGTTTGAGACCAGCCTGGGCAACACAGAAAAACCTCATCTCTACCAAA -AAAAAAAAAATTAGGTGGGTGTGGTGGTGCACGCCTGTAGTCCCAGCTACTTGGGAGGCT -AAAGTGGGAGGATCACCTGAACCCAGAAGGTCAAGGCTGCAGTGAGCCGAGATTGCACCA -CTGCCCTCCACCCTGGGTGATAAGAGTGGGACCCTGTCTCAAAACATACACACACACACA -CACACACACACACACACACACACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCAAA -AACACTTGGTCTGTTATTTTTACGAAATTGTCAGTCATAGTTATCTGTTAGACCAAAGCT -GAGTAAGAACATTTATTACATTGCCTCCTACAACTTCATCAGCTAATGTATTTGCTATAT -AGCAATTACATATTGGAATATATTATCTTTAGAGATGGCCAAGTCATAAAACTGTCACTG -AGAAAAGGAGAATGACAATGTGTATGCTCAAATGTACTTCCCTATAAATTTCCAAAAGAC -ATGAAACTTACTACAGGTTTGTTTTTTTCACACCTTCACTTCTTAAAAACAAAAAAACTT -TTACATAGCAGTAACTAATGCACATTAAAAGTTTATAAATAGCCTGCTATTGGATCATTT -GCTTGGAAAAGTTGAGATTTTCAAATTTGATTATAACATAACTTTTGTAGAAATACACGG -CCAGGTGCAGTAGCTCACATTTGTAATCTCAGCACTTTGGGAGGCTGAGGTGGGAGGATC -GCTTGAGGCCAGGAGTTTGAGACCAGCCTGGGCAACATGACAAAACCCCATCTCCTCAAA -AAGCACAAAAATTAGCCAGATGTGGTGGTGCACACCTGTAGTCCCAGCTACTTGGGGGAC -TGAGGTGGAAGGATGGTTTGAGTCTGGGAAGTTGAGGATGCAGTGAGCCAAGGTCATGCC -ACTGCACTCCAGCCAGGGTGACAAAGTGACACCCTGTCTCAATATAATAATTTTAAAAAG -GTGCCTGTAATCCTAGCACTTTGGGAGGCCAAGGCGGGCGGATCACGAGGTCGGGAGTTC -AAGACCAGCCTGGCCAATATGGTGAAACCCGTCTCTACTAAAAATACAAAAATTAGCCAG -GTATGGTGGTGTGTGCCTGTAATACCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTT -GAACCCGGGAGGTGGAGATTTCAGTGAGCCGAGATTGCACCACTGCACTCCAGTCTGGGT -GACAGAGCAAGACTTCATCTCAAAGAAATAAATAAATAAAAAACAAGGCCGGGCATGGTG -GCTCATGCCTATAATCCAGCACTTTGGGAGGCTGAGGCTGAGGTGGGCAGATCACCTGAG -GTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACA -AAAATTAGCCAAGCGTGGTGGTGGGCGCCTGTAATCCCAACTACTTGGGAGGCTGAGGCA -AGAGAATTTCTTGACTCTGGGAGGCAAAGGTTGCAGTGAGCCGAGACTGCACCACTGCAC -TCTAGCCTGAGCAACAGAACAAGACTCTGTCTCAAAAAAAAAACAAAAAAAAAAACATAC -AAACCGAATTTCCATTCCACATACTACTCTTGCTGTTTTACCACTTGGACAAGACTGCTT -GCTGGTACATAAGTTCTGGAACACTTCCTTGCAGCAGTCTGGCTGAGCCTTGGTATTTAA -AAGAAATTTACCTACCAGCCTGGCTATAATTGACATAATCCTATTAAATACTTGCCTTTT -ATGAACATATATCACATGACATAAGTTTTTGTCAAATACTTTTTTTTTTGGTCAAAGACT -GTAGCCTTATACCACTCAAGGGGGCTGTTAGGGTAGCTTATGAATGGATATTTCATACAG -AGTTACGTATTTAACCCATTTCCTGTTTAGAAAATAAAAGTGGCCAGGCGCCGTGGCTCA -CGCCTGTAATCCTAGCACTTTGGGAGGCCGAGGTGGGTGGATCACGAGGTCAGGAGATCG -AGACCGTCCTGGCTAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAACATTAGCCG -GGCATGGTGGCAGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAAAGGCA -TAGACCCGAGAGGCGGAGCTTGCAGTGAGCCGAGATCATGCCACTGCACTCCATGCACTC -CAGCCTGGGCGACAGAGTGAGACTCCGTCTCAAAAAAAAAAAAAGAAAAAAAGAAAAAAA -AAGTGCAGCTGGCTGCCAGCGCTCATTTAATTTTACATAAACACACTCTGAGGTTGACGC -AAATTAATTTTCAATGTGAAAATACACAAACTGTTCTTAGAGTTATTTCTAAACAGAACT -TGTCTCTAATCCTAATATAATGGAAATGTATATAATGTTACATTAGGATTAGAGGCAAGA -GTATTCTTGGGGCAAACGGGAAATGGGTTAGTATATACTTGAAGTAATATAACCACATCT -AACCTGATTTCATGATATATTGGAATTTTTGGTTGCAAGCAACAGGATCAGATTAATGAA -CTTACTGATAAAAATCTTATTTGAAAGAATGAGAGACTCCAGAGTCATGGAAAAGTTGAG -GACTCAAGGCTTTGGAAAGGTCAAGAATTGTAACAGTTTCAGGTATCTTAAGAGTAGGTG -TTATGAGGTGTTTGTTTTTGTTTTTTTTTCTTACAGTGCTGATATCAGGTTGAACTTCTG -ACATGTATATTTAGTTTTTATGTCACTTGGAGTGCCTCCAACTGAAGTTACATGGGTTAG -GTACCCATCCCTCAGCAAGAGGAGGGCAAGTTGATGGTAAAACCAACTCTAGTAAGGAGT -AGGTGGTCCCCTGTATTAGTCCGTTCTCATGCTGATAATAAAGACATACTCGAGACTGGA -TAATTTATAAAGGAAAGAGGTTTAATTGACTCACAGTTGTGTATGGCTGGGGAGGCCTCA -GGAAGCTTACAATCATGGCAGAAGGGGAAGCAAATACATCCTTCTTCACATGATGGCAGG -AAGGAGAAGTGCAAAGCAAAAGGGGAAAAGCCCCTTATGAAACCATCAGATCTCCTGAGA -ACTCACTCACTATCATGAGAACAGCATGAAGGTAACCACCCCCATGATTCAATTACCTCC -CACCAGGTCCCTCCCATGACACATGGGAATTATAGGAGCTACAATTCAGGATGAGATTTG -GGTTGGGGACACAGCCAAACTATATCATCCCCCAAAAAATTTAGGGTTCTTTGCCAAAAG -GAAGGGAGAATGGATGCTGAGCAGACTAAAATAATATATTTTAATCCCTCTTATTGAAAA -GCTAAGACTAAATTTTTAAAATTGTATGTTTTTGTTTTTTTGAGACGGAGTGTCGCTCTG -TCGCCCGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAACATCCACCTCCCGAATT -CAAACAATTCTCCCACCTCAGCCTCCTGAGTAGCTGGGACTACAGGAGCATACCACCATG -CCCAGCTAATTTTTGTATTTTTAGTAGAGATGAGGTTTCACCATATTGGTCAGGCTGGTC -TCGAACTGACCTCAGGCAATCTACCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGACA -TGAGCCACCACGCCCAGCTAATTCTGTATTTTAAGTAGAGACGGGGTTTCACCATGTTGG -TCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCGCCAGCCTTGGCCTCCCAAAGTGC -TGGGATTACAGGCGTGAGCCACCGAGCCTGGCTGTATTTTTAAATACAAAAATTTGCCAG -GTATTGTGATGGGTGCCTGTAATCTCAGCTACTCGGGAGGCAGAGGCCGGAGAGTCACTT -GAACCCAGGAGGTTGCAGTGAGCTGAGATCACACCACTGCACTGCAGCCTGGGTGACAGA -ATGAGACTCCATCTAAAAAAAGAAAAAAATGTATTTTTTAAAATACATGATATGGCAAAT -CATAAAGGTGGTAAGCATCTGAAGTGTGAGAAATAATCATGCAGCTTATGCAGTTTTTGT -CTCTACCAATCTTTCAAACCCTCAAACTGCAATGCTTGCTCATTATAGAAAATCTGGGAA -AATACAGTAAGTTCTAAATTTCCATAAATTCTAAATAGCATCTCTTTGAAAACTAACTCC -TTGGGCATTTATCCCAGTGAAATGATAATAAACAGCCAGCATTTGAAATGTATATTTCAA -TGACTCGTATATGAATACTATGTTCAGATCACAATAAAGGACATTTCCAGCACCCTCTTA -GATAATGCCTCCCATCTGATTATCACCATAGATTAGTTTTGCCTGTTTTTGAATTTTTTG -TAAATGGGATCACACGGTATGTGCTGTAGTGAGCCTAGCTTGATTTGCTCAACGTTACGT -CTAAAAGAATTCTCTATTTTGCTGCTATGCAGCAGTTCAGCCTTTTTCACTGTTGTGTAG -CATTCTGTTATATGAATATACCATAATTTAGTTACTCTACATTTCATGGACCTTTGAATT -GCTCCCACTTTGGGGCTGTTATGAATAATGCTACCGTAACATTTTTATATATCTTATGAT -GGTCTTAAACACGCTATCTACTGAGTATATGCCAGGAATGGAATTGCTGGGTCATAGGGT -GTGTGTCTATACACACACATACACACATTTTTTAGTAGGTACCATCAAACAATTGCATTT -TTGTTGTTTTTGTTTTAGTTTGTTTTTAATGACCATACCATTTTACCACCAGCAATAAAG -GAAGGTTCCATTGCTCCATACCAGCATTAGGTATTATCAGTCCTTTTATCTTTAACCATA -TTGGTGGGAGGTAGTGGTATCTCATTGTAGTCTTAATTTACTTCCATATACTTAATTTAA -ATACACTCAATTTATATACACACACGTATACTCTCTCAGCCTTCAGTAAGCCAGTCCCAT -TCCCCTGAACTAACCAATGTTAATAATGGTATACTTCTCTCAGTGCTCATGCAAACATGC -ATACACACACATACACATAAATAAGGGGTGTGGGAATGTATTTGATTCTGAGAGCTGCTG -TAACAAATTACCACAAACTTAGTGGCTTAAAACAACAGAAATATTTCTGTTCACAGTTCT -GGAGGCCAGAAATCTGAGAGCAAGATGTCAGCTGGGCCACACCTCCCTCTGAAGGCTCCA -AGGGAGAATCCTTGCTTTCCTCTTCCAGCTGCTGGTGGCTCCAGGTATTACTTGGCTTAT -GGCAGCATAACTCCTATCTCTGCCTTTGTCTTTGTGGTCTTCTTTTCTGTCTTCTTCCCT -TCTTTTTATGAGGACTTTTGCTGTTGGATTTAGGTTCCATTCTAACCTAGGATGATCTCA -TTTGGAAATCCTTAATTTCATCTACAAAAACTGTTTTCCCAAATAGGTCACATTCACGCA -TATCAGATGGACAGATGTATCATTTTGGGGTCCACCATTCAACCCACTACAAGGAGTTTT -TTAAACAAAAATAGGAAACTTAGATGTAACTTAGCACTTTTTTTTTTTTTTTTTGAGATG -GAGTCTCACTCTGTCACCAGACTGGAGTGCAGTGGCGCCATCTCAGCTCCATGCAACCTC -TGCCTCCTGGGTTCAAGCAGTTCTCTTGCCTCAGCCTCCTGGGTAGCTGGGATTACAGGC -ACGCGCTGCCACACCCAGGTAATTTATTTATTTTTTTTTTGAGACAGAGTCTCGCACTGT -TGCCCAGGCTGGACTGCAGTGGCGTGATCTCTGCTCACTGCAACCTCCGCCTCCCGGGTT -CAAGCGATTCTCCAGCCTCAGCTTCCTGAGTAGATGGGATTACAGGCGCCTGCCACCACG -CCCAGCTAATTTTTTGTATTCTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGT -CTCCATCTCCTGACCTCGTGATTCACCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGG -CGTGAGTCACAGCCCCCGGCCATAATTTAGCACTTTAAAAAATAATAGCCATGTTGGGCC -AGGCGTGGTGGCTCATGCCTGTAATCTGAGCACTTTGGGAGACCAAGGCGGGTAGATCCC -TTGTGCCCAGGAGTTCAAGACCAGCCTGGGCAACATGGCGAAACCCCATTTCTACTAAAA -ATACAAAAATTAGCTGGGGCGAGGGGATAGGCCGAGTTCCGGGTGTAAGGGGGCCATTAG -GGAGAGCAGAGCGAGGCAGCTGATCTTCCGGATTGGGGGCCTTGCCCGGAAGCTGGACCT -CACGGAGATGAAACGGAAGATGCACGAGGATATGATCTCCATACAGAACTTTCTCATCTA -CGTGGCCCTGCTGCGAGTCACTCCATTTATCTTAAAGAAATTGGACAGCATATGAAGATT -GGACATCACATGTGAATGCATGATATGAAGAGCCTGGTTACAGTTTCTACTGTTCTCTGC -AAGTAAATAGGCCCAGAAAGGTATAAGAGACTCTTTGAATGGACATAAAAATTCTGCTTG -TTAAGAACAAGTTGAGCTCTGGTAACTGATCTTAATAGCTAAAATATAAAAATATTTGGG -AAGTCTGAAATGAGGTCTCCTGGCCCTGGTGTGCCCTTAATGCCTGTGACAGTTGGCCTC -TGTGAATATTGGTATAATTGTAAATAATGTCAAACTCCATTTTCTAGCAAGTATTAATAA -TTAAGGGAAGTATGTCTGAAATGGCACTGTCTTGTCAGTCATTTCTGTTTACCCTTCTGT -CTGGAGTGTATTTGTGAAGAGTCCCTTATAACTTATGTTTTATGGACATCAGCACATAAC -CACAATGACATTGAAGCACAGGATCATTAGTCTATATTTTATTTTATTATTTTATTTATT -TATTTATTTATTTTTGAGATGGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCAC -AATCTCGGCTCACTGCAAGCTCTGCTTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTC -CCGAGTAGCTGGGACTACAGGTGCCCACCACCACACCCGGCTAATTTTTTGTACTTTTAG -TAGAGATGGGGTTTCACTGTTTTAGCCAGGATGGTCTCGATCTCCTGACCTCATGATCCA -CCCGCCTTGGCCTCCCAAAGTGCTGGGATTATAGGTGTAAGTCACCATGCCCAGCCCATT -AGTCTATATTTTTAAGTAAACATACCAATTAAGAAAGAAGCCAAAAACCAAAATTAGCCA -GGTGTGGTGGCACGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGTGTGAGGATCACT -TGAACTCAGGAGGCAGAGGTTGCAGTGACCCAAGATGGTGCCGCTGCACTCCAGCCTGGG -TGACAGAGTGAGACCCTGTTTCCACAAAAAGAAAAAAAAAATAGCCGTGCCTGTACTTCA -GTACTTACAAATTTAACTTTAGTATAGATGTACAGTAATTTATTCAATCATTTCCTTACT -CATAGACAATTAGGATGTTGCAACTTTTGCCACTACAAACAATTCTGCGATGTGGATTAT -CGTACTTATTCCCATTTATTGGTGCTTTCATTTCTATAAGAATGGATTTTTAAAGATAGA -ATTCCTTGGGAATAGTTATGTCAAAGCCAAATATAATATAGAGACAAATCTCTAAAAACA -TTTTATTTGGTAAGCAAGAGCTGCAATTCATGGCATACACACAGACCGGGCTGATCATTG -GTATGATCAGGAGAATAAAGGGAAGGTTGCGGCCAGGTGTGATAGCTCATGCCTGTAATC -CCAGCACTTTGGGAGGCCGAGGCGGGCAGATCACCTGAGGTCAGGAGTTTGAGACCAGCC -TGACCAACATGGAGAAACTCCGTCTCTACTAAAAATACAAAATTAGTTGGGCATAGTGGC -GCATGCCTGAATCCCAGCTACTCAGGAGGCTGACACAGGAGAAATGCTTGAACCCGGGAG -GTGAAGGTTGCAGTGAGCCGAGATCGCGCCATTGCACTCCAGCCTGGGCAACAAGAGCAA -AATTCCATCTCAAAAAAAAAAAAGAGAAGGTTCCGGGTTTTATGAGAAAGAACAGTATTA -CATACTGTTTTGGAAGAAAGCTCATTCACACTAGAGCTTGTGGGAGCTAGCAAGCTCTGA -TTGGTGAGCGATGGTGGTAGGTAAAACCAGTCTTAGAGTCATGGCAGTTCATTTTAGCAG -CTATTAGGTAAAACTGGTCTTAGGGATACAGAAGGCTGGTTCAGCAGTTGGACTTGTGGA -AAATTTAATTCTTGAAGCAGATGCTGTGTGCCCCGAATGCTTCTTCCCCCTGGCCCTTCA -ACTCTGATTTAGTTGAGTATTTCAAGAATGACCCAATTTATGTAATCAACTTTCACAGGT -ATACATGTCTTAAACTTTAAACAGATGTTTTGGGTTTTGTTGTTGTTGTTTTTGAGACGG -AGTCTCACTCTGTTGTCCAAGCTGGAGTGTAGTGGTGTGATCTCGGCTCACTGCAACCTC -CGCCTCCAGGGTCAAGTGATTCTCCAGCCTCAGCCTCCTGAATAGCTGGGATTACAGGCG -CCCGCCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTGGGGTTTCACCA -TGTTGGCCAGGCTGGTCTTGAACTCCTGACATCAAGTGTTCTGCTCACCTCAGCCTCTGA -AAGTGCTGGGATTACAGGCGTGAGCCACTGCGCCCGGCAGTCTTTCCTTCTTTTTTTTTT -TTTTTTTTTTTTTTTTAATGACATGGGGTCTTACTTTATTACTCAGGCTGGTCTCAAACT -TCTGGCCTCAAGGAATCTTCCCACCTTGGCCTCCCAAATTGCTGGGATTACAGGCATAAG -TCATCATGCCTGGCTACAAACAGATATTTTCAATAAGAGGATAAAAGTTCATTTCCCCAT -ACTTTGCTAACATCAAATGTTATTAATTCCTAATAGTTTTGCCAAACTGAGAGGAAAATG -GTATGTTAGTTTTTCTGGGTTTTCTTTCTTTTTAATTTTTTTTCTTTTTTATTCATCGCA -ACACTATTCACGATTTTTTTATTTTTTATTTTATTTATTTATTTATTTTTTTTTGAGACA -AGGTCTCCCTATGTTGCCCAGGCTGGTCTTGTACCCCTGGGCTCAAAGGATCCTCCTGCC -TCAGCCTCCCAAAGTGCTAGGATTACAGGCATGAGTCACCACGCCTGGTTCACAATTTCT -TTTTGTTTTTACCAAAGGCAGGTATATTCCTGAAATTTTTTGTTTTTTTGTTTTTTTTTT -GAGATGAAGTCTCACCCTGTCACTCAGACTGGAGTGCACTGGCACGATCTCAGCTCACTG -CAACCTCCGCTTCCTAGGTTCAAGCGATTCTCCTGCCTCAGTCTTCAAAGTAGCTAGGAT -TATAGGCGCCGCAACCATGCTCAGCTAATTTTTGTATTTTTAGTAGAGACAGGATTTCAC -CATGTTGGCCAGGGTGATCTCAAATCCTGACCTCAAGTGATCCGCCTGCCTCAGCCTCCC -AAAGTGCTGGGATTACTGGCATGAGCCACCGTGCCAGGCCCTGAAATGTTATCTTAGTTA -TTAATTTGCAATTCCTTGGCTCTAGAGGTTGGGCATCTTCTCAGATCTCTAGTGGACATT -TGGATTTTCTTTTTGGTGAACTGTCCAGTTTTTCTCTCTGCTTTACAATCTTTATTATAT -GCAATCTTCACATGTAGGTACTACCATTTTTTTAGTTTGTTTTTGAAACAGCATATTGCT -CTGTTGCCCAGGCTGGAGCACGGTGGCAAAAACATGGCTTACTGCAGCCTTTGACCTCCT -TGGCTCAAGTACTCCTCCTGTCTCAGCCTCCTGAGTAGCTGGTACCACAAGCCCATACCA -CCATGCCCAGCTAATTTATTTTTGTAGAGATGGGGCCTGACCATGTTACTTGGGCTCAAA -TGATCCTCTCCCACTCAGCCTCCCAAAGTGCTAGGATTACAGGCATGAGCCACCATACTT -GGCCCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGACACAGTCTTGCT -CTGTCTCACAGGCTGGTGTGCAGTGGCACGATCTCAGCTCATTGCAACCTCCACCTCCCA -GTTTCAAGTGATTTTTGTGCCTCAGCCTCCCTAGTAGCTGAGATTACAGGCATGCACCAC -CATGCCTGGCTGACTTTCATATCTTTAGTGTTGCCATGTTGGCTAGGCTGGTCTCAAACT -CCTGACCTCATGTGATCCACCTGCCTCGGACTCCCACAGTGCTGGGATTACAGGTGTTAG -CCACCACCCCGACATTATTTGAAACTTTTATTTTATCATGAGAGAGTTCCAGGAGTCAAC -TGAAGAGAGATTTTTGGTATGAAAATTACATATGCAAAAAGACTGATTCCAGTACATGAA -ATTAAATTCAACATTTACATTAAATGCCTTCAAATATGGTAAAATGGTTTCTTTTGGCAG -TTTACCTCATTATGTTTTGAATGATTTGTCTATCATATGAAATAACTTTTATAAATATAG -TAACTCAGGCCTGGGCACAGCGGCTCAAGTGGGAGGACTGCTTAAGCAACCGAGTTTGAG -ACCAGCGTGGACAACATGGGGAGACCCCTTCTCTCCCAAAAAATAGCTGAGCATGGCAGC -GCACTGTTAAAGGAAACAGAGTTTCTTTGGTGGGTGATTAAAATGTTCTGGAGTTAGATA -GTAGTGATGGTTGCACAACCTTGTGAATATATTAAGGTTTCCGCTCTATCTACCATTCAA -TTGTACTCTCTAAAACGATTAATTCTATAGTATATCAATTATATCTCTAAATAATAAAAA -CAAAAAGAAATGGCTGGGTGCGGTGGCTCATGCCTGTAATCGCAGCACTTTGGGAGGCTG -AGGCGGGCGGATCACAAGGTCAGGAGTTTGAGACCAACCTGACCAACATGGCAAAACCCA -GTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGCACACGCCTGTAATCCCAGGT -ACTCGGGAAGCTGAGACAGGAGAATCACTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCC -ACTGCACTCCAGTCTGGGTGGCAGAGCGAGACTCCGTCTCAAAAAGAAAAAAATTAAAAA -ACAAAAAGAAACCTGGTTCTATATTTTGTTTAAATTTATTTTTTTAACCATCATGTAATA -TGTCCAGGTAATTTGTTTAAATTTTGACATCAAATGCAATTGTGAGAATTTTTATGATTC -AGAAAAATCTAAGCAAGCTTTATAAAAACATACTTTTTTTTTTACTTTTTTTTTTTTTTC -TGAGACACAGCCTCACTCTGTCGCCCAGGCTGGAGGGCAGGTTTTCATGTTTATCTGTGA -GATGTACCTTTGGCACATTACTTTCCTGACATGAGATTTAAATTTTTTTTTTTATCTTGT -GACAATTTAACTTTTTTGACACATAAAAATTGTACATATTTATTTGTTTGAGATGGAGTC -GCACTCTGTCACTCAGGCTGGAGTGCAGTGGCGTGATCTTGGCTCACTGCAACCTCCGCC -TCCCGAGTTCAAGTGATTCTCCTGGCTCAGCCTCCCAAGCAGCTGTCATTACAGGCCTGC -ACCACCACACCCGGCTGATTTTTGTATTTTTAGGAGAAACAGGGTTTCACCATGTTGGCC -AGGCTGGTCTTGAAGTCCTGACCTCAAGTGATCCACCCACCTTGGCCTCCCAAAGTGCTG -GGATTATAGGCATGAGCCACCGTACCAGACCCCTAAAAATTGTATATATTTAAGGTGTAC -CATTTGATGTTTAGATATACATTGTGAAATGATTACATTCCACATATTACCTCTACAGAG -TTACCATTTTTGTACACTTGGTCAACATCATCCCATTCTCCCCTTCCTCCACAGATATTT -CTTGTATACTATATAGAAGCCAAGGGTATTTTGGGGGAAGAGCTCAAAGTTCCTTTCGTG -GAGTTAAAAATATATATATACTATGTACATATAAGCCATTTAGCAACCCTAGATGCTTAA -TAAAGAATACTGGAGGCCCGGTGTGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGG -CCGAGGCGGTCGGATTACGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAAC -CCCATCTTTACTAAAAATACAAAAATTAGCCGGGTGTGGTGGTGGGCGCCTGTAATCCCA -GCTACTCGGGGGGCTGAGGCAGAATTGCTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCT -GAGATCACGCCACTGCATTCCAGCCTGGGTGACAGAGCAATACTCTGTCGCAAAAAAAAA -AAAGAATACTGGAGGCTGGGCGAGGTGGCTCACACCTGTAATCCCAGCATTTTGGGATGC -CAGAGGCGGGCGGAATATCTTGAGCTCAGGAGTTCGAGACCAGCCTACACAATATGCTCC -AAACGCCGCCTCTACAAAACATACAGAAACTAGCCGGGTGTGGTGGCGTGCCCCTGTGGT -CCTAGCTACTTGGGAGGTTGAGGCGGGAGGATCGCTTGAGCTCGGGAGGTCGAGGCTGCA -ATGAGCCGAGATGGTGCCACTGCACTCTGACGACAGAGCGAGACTCCGTCTCAAAACAAA -CAACAAATAAGGTTGGGGGATCAAATATCTTCTAGTGTTTAAGGATCTGCCTTCCTTCCT -GCCCCCATGTTTGTCTTTCCTTGTTTGTCTTTATATAGATCAAGCAGGTTTTAAATTCCT -AGTAGGAGCTTACATTTACTTTTCCAAGGGGGAGGGGGAATAAATATCTACACACACACA -CACACACACACACACACACACACACTGGAGTTCGAGACGAGGCCTAAGCAACATGCCGAA -ACCCCGTCTCTACTAAATACAAAAAATAGCTGAGCTTGGTGGCGCACGCCTATAGTCCTA -GCTACTGGGGAGGCTGAGGTGGGAGGATCGCTTGAGCCCAAGAAGTCGAGGCTGCAGTGA -GCCGAGATCGCGCCGCTGCACTCCAGCCTGAGCGACAGGGCGAGGCTCTGTCTCAAAACA -AACAAACAAAAAAAAAAGGAAAGGAAATATAACACAGTGAAATGAAAGGATTGAGAGAAA -TGAAAAATATACACGCCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAA -GTTACTACAAGCGGTCCTCCCGGCCACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCGGC -GGAAGTCGTCACTCTTAAGAAGGGACGGGGCCCCACGCTGCGCACCCGCGGGTTTGCTAT -GGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGAGGATTCCGTGCTGTT -CCGGCGCGGCACAGGCCAGGTGAGGTCGCAGCCAGTGCAGTCTCCCTATTAGCGCTCTCA -GCACCCTTCTTCCGGCCCAACTCTCCTTCCGCAGCCTCGGGACAGCATCAAGTCGATCCG -CTCACTGGAGTTGTGGTCCGCGTTTTTCTACGTCTTTTCCCACTCCGTTCCCTGCGAACC -ACATCCGCAAGCTCCTTCCTCGAGCAGTTTGGGCTCCTTGATAGCGTTGAGTGGAGGCCC -TGCCGCGACTTGGCAGTAGCTTATTTTGTTCACTCCTCTCTGGCTGGTGTGGGGGAGGTG -GGGGCATTAGGCCAGGGTGAAGCAGGGGAACCACTTAGGAGTCTGTTAAGATGATCTGAA -CTTCAGAACAAGATGTTATTAACAGAGTGAAAGTATTTGGATTCTGGGTATATTTTGAAA -TCGGAGGCAACAGGTTTTTCAGATAGATTCGATAACGGAGGTTATCCTGAATAGTTGAAA -AGATAAAGTTGCCTTTTGCTGAGGTGGGAAAGAGAAGATTGCCAGTAGAGCAGGTTTCTC -AGGAGTTCAGTCTTGGGCATAGCATGGTAGGGGTGAATTTGGCTGGAGTGAGTTGGAGAG -TAGGAGAAGAGAAATCCAAGGCAACATTTGACCAGCCTGGGCAACATAGTGTGACTCCGA -GTCTGCAAAAATTAGACGGGTGTTGTGGTGCGCGTCTGTGGTCTCAGCTACCTGGAAGGT -TCAGGCCTTGGAAGGCTCAGGGAGGTGGAGGCTGCAGTGATCTGTGATTGCGCCTCTGCA -CTCCAGCCTGGGCGACAGAGCCAGACCCTGTCTTAAAACAAAATAAACGGCCGGGCGCGG -TGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGCCGGATCACAAGGTCAG -GAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTACAAATACAAAAAA -TTAGCCGGGCGTGGTGACGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAG -AATGTCATGAAGCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCA -GCCTGGGCGATAGAGCAAGACTCCGTCTCAAATAAATAAATAAATAAATAAATAAATAAT -AAAAACATCGGTAGGCATATTTCAAGGAATTCTATTTAAAAAAAATTTTTTTAGAGACAA -GTTCGCTCTCTGTGGCCCAGGCTGGAGTACAGTGGCATGATCCTAGCCCATGGCAGCGTT -GATCTCTTGGCCTCAAGCGACCCTCCTTTGGAGTCGCTGGGCCTAAAGGAGTGAGCCACC -ACGAAATTTTATTATAAATGGAGGGTAGAGAAATTGGGCAATAAATGGAGGGGGAAGTGA -GTTAAGAGGAATTTTAATTATGTGTGTGTGGTTTTAAAAGAGGGGGGTCTTGCTCTGTTG -CCCAGGCTGCTGGGGTGCCAGTGGCGCAATCATGAATCACTACAGCCTTGGACTCCTGGC -CTCAAGCTATCCTCCCACCTCTGCCTCCCAAAGTACTGGGATTACTAGTGTGAGCCACTG -CACTAAGATAGGAGCAACATGTTTCAGCATGTTTGTGGGTTGATAGGAAAGATGAGAATG -GGAAAGTTGATGTCGGAAAGAAGACAATGGCTAGAGCAATGTCCTAGAGTAGGTAAGAAG -GGATGGATTTGGCCTTTGTTGGAAACATTAGCGGTTCTTTTGGTGACAGCTATATAGTTA -ACACATCTATGATACGTGAATGGGCAGATAGGATGGCAGGAGATTTTGAAAGTTCTCTTG -ATTCTTACTGTTCTCTTAGTGAAAGAAGCAAGGTTATCAGCTAGAAGCTGGGATGGGAGA -GGAAAGAGAAGATGGGAAGTAGATAGTTCTTTAGAAGAGTGGGCAAGGGTTGGACTAGGG -AAGTTTAGTGGAAATATTGCTAGGCAACATAAAGAGCCTACTTGAGATTCGTGGTCATGA -GTTGAAGGAGACCAGACAGCAAGATTGTGTATGAGGGCACCCACAGAGTAAATGGAGAGT -TGAAATTAATGCAGTTGTGATTTTACCACGTGGATATGAAGAAGTGAGGGGGAGAAGTAC -AAAGGAGTTCTCTTAATGATTGACCATGGAATTTAAGCTGGCTAAGAAAGGAAGTGAGAG -GCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGACTGAGGTGGGTGGAT -TACCTGAGGTCAGGAGTTTGAGACCAACCTGGCCGATATGGCGAAACCCCATCTCTAATA -AAAATACAGAAAAATTAGCCGGGAATGGTGGCAGGTGCCTGTAATCCCAGCTACTCAAGA -GGCTGTGGCAGGAGTATCCCTTGGACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCAC -GCCACTGTACTCCAGCCTGGACGATATAGTGAGACTTCACCTCAAAAAAAAAAAAAAAGA -AAGGAAGTGAGGATTTTAAGACCCTGAGAGACAGTTTAAAAAGTGGGAGGATCGGCCGGG -CGCTGTGGCTGACACCTGTAATCCCAGCACTTTGGGAGGCCGAGTTGGGCAGATCACAAG -GTCAGGAGTTCGAGACCAGCCTGGCCAATATGGTGAAACCTTGTCTCTACTAAAAATACA -AAAATTAGCCGGGCATGGTGTCACGTGTCTATAATCCCAGCTACTCGGGAGGCTGAGGCA -GAAAAATTGCTTGAACCTGGGAGGCAGAGGTTGCAGACAGCTGAGATCACTCCATTGCAC -TCCAGCCTGGGCAACAAGAGCAAAACTTTGTCTTTAAAAAAAAAAAAAAAAAAAGAATAC -AAAAATTAGCCGGGCGTGGTGGCGCGTGCCTATAATCCCAGCTACTTGGGAGGCTGAGGC -AGGAGAATCAGTTGAACACGGGAGGCGAGGTTTGCAGTGAGCCGAGATTGCGCCACTGCA -CTCCAGCCTGGGCGACAGAGCAGGACTCCTCTTGGAAAAAAAAAATTAGCTGGGCATGGT -GGCAGGTGCCTGTAGTCTCAGCTACTAGGGAGGCTGAGGCAGGAAAATCACTTGAACCCG -GGATGTGGAGTTTGCAGTGACCCGAGATCGTGCCACTGTACTCCATCCTGGGCGACAAAA -TGAGACTCTGCCTCAAAAAAAAAAAAAAAAAAAAGTGGGAGGATCAATGTACTGCCAGTC -CTAATGAAGTGGAATGATTGTCCCCATCAAATCACTAGTAGGAGTAAGTTGCAGAGCCTA -GAAGGTGATGGTTAAGAGAGTGGGATTCTTGAAACTGCATTTATGGAGAGGTTGTGGTTA -TTGGTTATAATAAATAAATACAGTTGAAGTGAGTGAGTAGCTGAGATTTGGGGATGTATC -AGTTCATTCTTACACTGCTACAAAGACATACCTGAGACCAGGTATTTATAAAGATAAGAG -GTTTAATCAGCTCACAGTTCTGCTGCCTGTACAGGCTTCTCTTGTGGAGGCCTAAGGAAA -CTTACAGTCATGGTGGAAGGTGAAGGGGAAACAAGCACAGTCTTCACATGGCCAGCAGGA -GAGAGAGAGAAGGGGGAAGTGCTACATACTTTAAAACAACCAGATCTTGTGAGAACGCTT -ATCAGGAAACAGCACTTGGGGATGGTGCTAAATCATTAGAAATCACCCCCATGATCCAGT -CGCCTCCTACCATGCCCACCTCCAACACTGGGGATCACAATTCAGCATGAGATTTGGGTA -GGAACACAGAGCTGCACCACATCAGAGGATGTACAAGATTGTGGTGGAGAGGAGTTTAGA -GACCTGCAAATATAGGGTAATTGAAGGGATCATCTACATGGATATTTAAATCACCAAAAA -TTATGACAGGAGTAGTGTTGGAGAGAGAACTGCGATGTAAACATTAAGGAATGAGGAAGA -GTGACTCGGTAGGCTGTAGGTGACTGCAATAGGAAACGATAATAGACTGTGAGTCTGGTG -ACAAGATTTTCCTTCTTTCTTTTTTTCCCCCCCCCCGAGACAGGGCCTCTTTTTGTTGCC -CAGGTGGGAGTGCAGTGGCGCGATCACGGCTCACTACAACCTCCTCCCAAGCTCAAGGGA -TTCTCCCACTTCAGCCTCTCAAGTAGCTGGAACTACAGGTGCTGACCACCATGCCTGGCT -ACTTTTTGTCAGGATTTTCAAGGCTGGGAATTTTGAGAGGGGAATGGAGGAGAATAATCT -GAAAGTGCAAGTAAGGAGCAGGGAAGATTTCTTTTTTCTTTTTTTTTTTTTTTTTTGAGT -CGGAGTCTGGCTCAGTCGCCCAGGCTGGAGTGCAGTGGCGAGATCTCCGCTCACTGCAAG -CTCCGCCTCCCGTGTTCACGCCATTCTCCTCCTTCAGCCTCCCGAGTAGCTGGGACTACA -GGCGCCCGCCACCACGCCCAGCTAATTGTTTTTTTGTATTTTTAGTAGAGACGGGGTTTC -ACCGTGTTAGCCAGGATGGTCTCAATCTCCTGACTTTGTGATCCGCCCACCCCGGCCTCC -CAAAGCGCTTGGGATTACAGGCGTGAGCCACCGCGCCAGCCAGAGCAGGGAAGATTTCTT -CCCCACATCTCCAGTAGGTACAGTGATATGAAGTGTGTGGAGGAGAAAAGAGGAAACATC -TATCATTTGAGATGGCTGCGAAAGGAAAAGGCATCCTCAGGGAGCTAGATTTTACTTAGA -GCAAGAAATGAAGGGATGATTCAGAGGTTAAAAGAGTGGATTTTATGAATTACTCAAGGG -AGCACAGTGGAAGTTTCAGGAAGTGGTAGGAGAAGGTAGAAGATGGCAGGGTGTTGGGAA -TAATTTGAGAAATCTGAGCTACTGGAAATGACTGAGAATCAGATATAAAGGCAGTCCTGG -TGGTCCGTTCTGGCTGCCGTTGCTGTGTAACGAATCTGCCAAAACTTAGTGGCTTGAAAC -AACAAAGAACATTTTATTATCTCTCATTGTTTCTGTGGGTTAGGAATTTGTGAGAGCCGT -GCTGGGCAGTTTTCGTGCGGCTGTCTCGTGGTTGCACCTACATAGTTGCTAGAGCTACAG -TAGCTGGGGACTGAGCAGCTAGGGATTGGCAGGCTATCTCTTTTTTTCATGTAGTCTCAT -GAAGATTTCTTTATGTGGTTTCAATGTGTGGGCTGGTTTGGATTTCCTTATAGCATGGTG -GCCTCAGTTGGATTGCTGTTTTGTGATCCTTTTCATCCCTCCTTGTCCTGTCCCCAGACA -ACCACTGATCTACTTTCTGTCACCATAGATTAGCCTGCATTTTTAAGAATTTTTATAAAC -GTGGAATGATAGAGTACCTTTTTTGTCACGTTTCTTTTATTTATCATAGCTATTTTGATT -TTCATCCATTTTATTGCTGAGTAGTATCCCATTGCATGTATATACTATACTGTATTCATT -CGCTTGCTTGTGAACATTTGGGCTTTTTCCAGTTTGGGACTGTTAACAAGTAGAGCCACT -ATGAATATTAGTGTATAAGACTTCATATAGCCAAGGCTGGCAGATCGCTTGAGCCCAGGA -GTTTGAGACCAGCCTGGGAAACATGGTGAAACCTCTATTTTTATTTTAAAATCAAAAATT -AAAAATTTTCTATAAAAAATTTTAAAGAAGACTTTGTATAGACATACGCTTTCATTTTTC -TTGAGTGAATACTTAGGTCTCAGGGTAGATGTATTTTAAGTCTTTAAGGAGCTGTCAAAC -TCTTCCTCAAAGTGGTGGTTGTACCATGTTACTTTTTAATATAACAGAGATTAATTGAGC -AAAGAAAAATTCAAAAGTTGGACAGCCCCCACAACTAAATAGGTTCAGAACAGCTCCCCC -ATTTTGCATTTTGACCAGCAATGTATGAAAGTTCCATTTGCTCAGTGTCCCTGCAAACAC -CTGGTATGGTCAGTCTTTTTAATTTTAGGCATTATAATAGATATAGTGGCTTCTTGTGAT -TTTAATTAGCATTTCCTAATGACCAGTGCTGCTGTTGATCATTTCATGAGTGTATTTGCC -ATCCGTATATCTTTTTTGGTGAAGTGTCTATTCAAATCATTTGGGTTTTTTTTTTTTTTG -TTTTTTTTTTTTGGAGACAGTGTCTCACTCTGTCACCCAGGCTGTTGTGCAGTGGTGCAA -TCACACAGCCTACTGCAGCCTCCACCTCCTGCGCTCAGTCTTCTTGTCTCAGCCTTCTGA -GTAGCTGAAATTACGAGCACACGCCACAATGCCTGGCTAATTTTTTAAAATTTTGTAGAA -ACAAGGTCTCATTATGTTGCCTGGGCTTGTCGTGAACTCCTGGGCTCAAGCAATCTTCCT -GCCTCAGCCTCCCAAAGATTGGGATTGCAAGTATGAGCCACTGCACCCGGCCAACTTACC -CATCTTTTAATTGAATTTTTTTGTTGTTGAGGTTTGAGAGTTCTTCATGTTTGCTGGGTA -CAATATCTTTATCAGATAGGTAACTTGCATGTATTTTCTCCCGGTTTACACTTTGGTTTT -TCATTTTGTTAACAACGTCTTTTTAAGAACAGAAAATCTTAATTTTGCTGAAATCTAATT -TTTCAGTTTTTTCTTTGATGGTTTTGAGAGAGGAGGTAAAAAAAGACTAGGTAAGCCGAT -AGTTAGACAGAGTCCTCGGTAGAACTTCCCTTCTAACAAAAAGCAGCCCAAGAAATCACT -TCTCTTCTAACAAGGAGCAGCCTGGAAGATCGGGCTGTAAACATGTATAAGGAAGCAGCT -CTGGCACAGAGGGGGAGCTTCCTGGGTAATCAGCAAGCTTCACATACGTAAGGTGGGTAT -GTGAAGTAAACACAGTATGTGAAGTAAACACAGTGGACCTTAGTACATACTCAGATAAGG -AAGCTGGAAGCTTGCATGTTGTGAGTTGTTGGGGTTGCCTGCAGCTGCACGGAGAGAAAG -GGGTACCTGGGGCCAGGCATGTCCACCATGGTGGCTCCACCTCCCCTTATTTAGCACATG -CACAATAGGAAAGAGATAAGCAATGTGGAGTAGCTCAGGCCAAGGACCTGCCTGCATAAT -AAAAGGTTGGGGTGGGGGATGCCAGAGATTCACGCTCTGTGCAGATGGCAACACCTGGTC -CTAACTGGTTTTTTGCTCCCTATGTGTAGATAAGCTACCCCCTTCCCATTAGCTCATTTA -TAAAAATGCTTGCATTTCACTGTGGAATGGGAACTCTTTTCAGGACCTCTCTCTGCAGGA -GAGAGCTAGTCTCTTTCTTTTGCCTATTAAACTTCTGCTCTAGCCTCACACCCTTGGTGT -GTCAGCGTCCTTGATTTCCTCAGCGTGAGACCAAGAACCTCGGGTGCCACCCCAGGCAAC -AAGGCCATTTCAGTTTGTTCTTTTGTTATAGGCAATCCATGATCACAGATTTTTCTCTCT -TTTTTTTTTTTACACAGTTTAGAGTTTTAGTTTTACACTTAGGTCTGTAATCCATTTTGT -ATTAATTCTTATATGTGGCTCAGTGTAGGTGGAAATTTGGTTTGTTTTTGCATAAGGATT -TCCAATAGTTTTACCACCATTTCTTGAAACTACTATGCTTTCTCTATTAAACCACATTTG -TAACTTTAGTTAAAATCAGTCACATATATCACAGGGCTATTTCTGACTCTCAATTCTGTT -ACATTGTCTATTAGTGTATATTGATGTCAGTACTACACTTTTAATTACTATTGCTTCAGG -GTATGTCTTGTAAACCAAAAATAAAATTATAGGCCCCCCCCGCCCCTGCACAACCAACTG -AATGGACCCATCCTCTCAGCCAAGGGCATTCCAAAATTAACCTGAAAAACTAGTTCAAGC -CATGATGGGAAGGGGGAGTTGGACATGTCTCATCACACCCTACTACCTTTTGGAATTACT -GATAGAACAGACTCTTAAAGTCTGAAAAGAAACATTTACAACCTACCCTCTCTGAAGCCT -GCTACCTGGGAGCTTCATCTGCATGATAAAACCTTGGTCTCCACAACCCCTTATGGTAAC -CCAAACATTCCTTTCTGTTGATAATAACTCTTTCAACTAGTTGCCAATTAGAAAATCTTT -AAATCTTCCTATGACCTAGAAACCTCCCTACCCCCACTTTGAGTTGTCCTGCCTTTCCTG -ACAGAACTCATGTACATCTTACATATATTGATTGATGCCTCATGTCTCCCTAAAATGTAT -AAAACAAAGCTGTACCCCACCACCTTGGGGACATGTCATCAGGACCTCCTGTGGCTGTGT -CATAGGAGCGTCTTTAACTTTGGCAAAATAAACTTTCTAAATTGATTGAAACCTGTCTTA -GCTACTTCTGGTTTACAGTCTTAAAGTTAGATAATGTAAATTGTCCAGCTTTGGTTTATT -TTTGTCCTTAGTAGTTCCATATAAATTTTAGAATCAGCTTTTCAATTTAATACACTACTT -TCCTCTTAGATCCACAATTAAATATATTTGATGCTAACAATTCTGTTTTATGTTTTTCGT -TTTTTTTTTTTGAGACAAGAGTTTCGCTCTTGTTGCCCAGGCTGGAGTGCAGTGGCGCGA -TCTTGGCTCACCACAACCTCCACCTCCCAGGTTCAAGCAATTCTTCTGCCTCAGCCTCCC -GAGTAGCTGGGATTACAGGCATGCGCCACCACGCCCGGCTAATTTTGTATTTTTAGTAGA -GACGGGGTTTCACCATGTTGATCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACC -CACCTCGGCCTCCCAAAGTGTTGGGATTACAGGCGTGAACCACCATGCCTGGCCAGTTCT -GTTATTTTTAAAACCCAAGTTTCCCTGGTCATATCTTGGTTGGATGAAGCGTATTTTCAA -TAGATTACCCTGGAAAGGCTAGTGAGTACGGTATTCTTCTACATTTTAGACTTTTCTTAG -TCTTGCTACTTCAAGGACAGCTAGGCTGCATATAAAATTCTTGGCTCATACTTTTTCCCC -ATAAATTTCTATGAGAAAGTCTAATGATAACTGATTTTCTTTATTTTGTAACTTAGTCTT -TTTGCTTAGAGGCTCTCTGAGGATGGGAGGGGGTTCTTCCTCCCATCCCTAGGAATTTTT -CTTTTTTTTAAATTCCTAATCACTAGACCACCAGGAAGATTGTTTGTTTTGTTTTGTTTT -TATTCTTCAGGGACCCCATTTATACATACGTTAAATAAATACTGTTTGCCAATGTATCAA -CCATTTTGCTTCTTATTTATTTTTGTTCCTTTGGTTCTTTTTCATGGCTTTGCTTTGGTG -CTCCTTAGATTTTCAGTCAGATGTATTTGTCCTTGGGTACCTTGTAATCAGTATTACCTT -TTCTTCTGTCGCTTTGTTTTCTGTTCGTTTTGAAATTACTTGTTTCCTGGTCTGGCAATA -ACAGTTGAGATATGAGGAGTTTGAGCTGCCATCTGTCTATGTATCTTGCTTTAAGACTGC -ACTCTTCTATTGATATCACTGGCCTTGATTTTGTGATTTCTTTATTTCTTCAGGACCACC -CTTCATTTTCTACTGTTTGCTTCCTTTTTTTTTGAGATGGAGTCTCACTCTGTCACTCAG -GCTGGAGTGCAGTGATCTTGGCTCATTGCAACCTCTGCCTCCCGGGTTCCAGCAATTCTC -CTGCCTCAGCCTCCCAAGTATCTGGGACTACAGGTGTGCACCACCATGCCCGGCTAAGTT -TTGTATTTTTAATAGAGACGGGGTTTTGCCACATTGGCAGGCTGGTCTCAAACTCCTGAT -GTCAAGTGATCCACCCACCCCACCCACCTCTGCATCCCAAAGTGCTGGGATTACAGGAAT -GAGCTGCCGTGCCCAGCCTCCCCCCTACCCCCCTTTTTTTCTTTCGAGACAGAGATTATA -GGTGTGAGCCACTGGACCCAGCCTGTTTTTATTCCTTTTACCAAATCTCCAAGGAATATC -TTCCCTTCCAAGTGCGAATGTAACCTTAAGTCAGTTAACCTCTTTGTGATTACTTTTCTT -ATCTGCAAAGTGACTTAATGATCTTAAGTACTTTTTTTTTTTGAGACAGGGTCTCACTGT -CACCCTGGCTGGAGTGCAGTGGCACGATCTCTGATCTCCACTCACTGCAATCTCCTCTTC -CCTGGTTCAAGCGGCCCTCCCACCTTAGCCTTCTGGGTAGCTGGGACTACAGATGTGAAC -CACCACGCCCAGCTAATTTTTGTACTTTTTGTAGAGATGGGGTTTTGCCATGTTGCCCAG -GCTGGGATTATTAAGTACTTTTTATCATACAGCAAGATTGACATTTTATATTGGAATACA -TTTGTCTCTATATAACGGAGATTAACAGGAAAATGACAAGCCTGGGTGCGGTGGCTCATG -CCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGAGGATCACTTGAGGTCAGGAGTTCG -AGACCAGTTTTGCCAAGATGATGAAAGCCCATGTCTACTAAAAATACAAAAATTAGCCCA -GCTTGATGGTGGGCGCCTATAATCCCAGCTATTTGAGAGACTGAGGCAGGAGAATCACTT -GAACCTGGGCAGCAGAGGTTGCAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGGT -GGCATAGCGAGACTCTTGTCTCAAGAGAAAACAAAACAAAACAAAAAAAAAACAGGAAAA -TGACAAAAAGTAATATTACAACTCAGTGAATTTTATAACAAACTTTTTTGGAATTCATTG -ACTAATACTATACCAAATCCAAAATACTCTCTAGTATACCAAATCCAACTCTACCCTATA -GTATAAATTGGATTCTATTTGGACTTGTCTCACTAATCCCTCATACAGTGTGTTTTATTT -TTTATTGAAGTAAAAAAATTTGTCATTTTAACCATTTTTAAGTATATAGTTCAGTAATAT -TAAGTATGTTCATGTTGTTGCGCAATAGATCTTCGGAAGTTTTTCGTCTTGCAACCTGAA -ACTCTACCCATTAGCAAATTCCCATTTCTCCTTACACTTAGCCCTTGGTAATCATCATTC -TTTTTTTTTTTTTTTTGAGATGGAGTTTTACTCTTGTTGCCCAGGCTGGAGTGCAATGGT -GCAATCTCGACTCACCACAACCTCCGCCTCCCAGGTTCAAGCAATTCTACCTCAGCCTCC -CGAGTAGCTGGGATTACAGTCATGCACCACCACGCCCGGCTAATTTTGTATTTTTAGTAG -AGAAGGGGTTTCTCCATGTTGAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCTGCCCA -CCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCGCCTGGCCCATTCTTT -CTAATTCTATAAATTTGACTACTTAGTTACCTTACATAAATAAATTCTTATAGTTAGTGT -TATTTTTGCTTCCATGCCTTTTTTGTTGTTGTTCATGCTCTTACTTGGAATGCGTTCTAT -TTTGTCTACCTATGCACATCCTGTTGGGTTTTTTTTTTTTTTGGGGGTTTTTTTTGTTTT -TTTTTGTTTTTTTTTCCCAGACAAGGTCTCAATTTGTTACCCAGGCTGGAGTGCAGCGGC -GCCATCTCCACTCACTGCATCCTCAACTTCCTGGGCCCAGGTGATCCTCTCGCCTCAGCC -CCTGCAGGTAGCTGGGACTATAGGCATGTGCCACCATGCCCAGCTAAATTTGGTTTTTTT -GTTTGTTTGTTTTTGAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGCACA -ATCTCAGCTCACTGCAATCTCTGCCGCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTCC -CAAGCAGCTGGGATTACAGGTGACTGCCACCACGCCAGCTAAGTTTTGTAGTTTTAGTAG -AGATGGGGTTTCACCTTGTTGGCCATGCTGGTCTCGAACTCCTGACCTCGTGATCTGCCT -GCTTCTGCCTCCCAAAGTGCTGGAATTACAGGCATGAGCCACCACGCCCGGCCAGAATTT -TTGTATTTTTAGTAGACACAAGGTTCTTACCCTGTTGCCTAGGCTGGTCTGGAAGTCCTG -GACTCAAGCAATTCACCTGCCTTGGCCTCCCAAAATGCTGGGATTACAAGCCACCATGCC -CGGCCTAAATCCTGTTGTTTTGTTTTGTTTTATTTTGTTTTGTTTTGTTTTGTTTGTTTT -TTGAGACAGAGTCTCGCTATGTCTCTCAGGCTGTAGTGCAGTGGCGCGATCTTGGCTCAC -TGCCACCTCTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGG -ATTACAGGCATGTGCTACTATGTCCGGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTT -CACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCGTGATCCACCCACCTCGGCCAC -CCAAAGTGCTGGGATTACAGGCGTGAGTGGTTTTTATTTCTTAGGCCGGTTTCCTCCATA -TGATCTTGCAGTAGACATTAATTTCTTTCCTTTTTAATTAAAATACTGTTTGTATTTCAC -ATTTTGATGTTTGTTAAGATTTGTTTTATATTGTTTTTTGTTTTGTCTTGTGTGATAGTC -TTAAATCCCTAGTTAGATAATAACTGGAGAGTACCATGTTTCTATATATCTCTCAGTGAC -TTGCACAGTGCTAGCAGATAGTGCTAAAAAATTATTTATTATTATTATTATTTTGTTATT -GTTGTTGTTGTTGTTAGACAGGGTCTTCCTCTGTCACCCAGGCTAGAGGGCAATGGGATG -ATCATAGCTTACTGCAGCCTCCAACAACTGGGCTCATGTAATTCTCCTGCCTCAGCTTCC -CAAGTAGCTGGGATTACAGGCATGAGCCACCATGTCTGGACAAAAATATTTCCAGGTGCA -GTGGCTCATGCCTGTAATTCCCACACTTGGGAGGCCGAGCGAGGCTGGAGGATCACTTGA -GCCTAGGAGTTCAAGACCAGCTTGGCTAAGATGGCGAGACCCCGTCCCTACAAAAAATTT -TAAAAACTAGCCAGGCATGGTGGCATGCACCTATATTCCCAACTACTCAGTGGGCTGAGG -TGGGAGGGTCATTTGAACACAGGAATTTGAGGGGAGAAAAAAAGAAGAGAGAAAGAGAAG -TGAAGGAAGGAAGAAAGGAAGGAGGGAGGGAGAGAAGAAAGAAACGAAAGAAAGGAAAAG -AAAAGGAAGGAAAGAAAATTGGTACCAGGAAAGCAGGAAAGGGAAATGGAAGTAAAAAAA -TAATAATAATAATAAAATGAAAATTGGTTAGTCACTATTAACAATTTGTATCCTTATAAT -CTGGAAACATTATAATTTCAAAAGAAAAAATATTCTTTGGATCATAGGTTCTGAGGTCAG -AACAGCATTCCCGTAGTCTAGATGAAGTCAAGTTTTATCTGATCTTAATTGAAATAAATA -TAGCTGGCCTTGAACAAATCTACTCATGGTATGTGGATAGGAATTAAATTGTAGGGGCAT -TCACTTGATGGCATTCATTCTTAGAACATTTACCTATGTCTAGCTTTTGGAGTAAAGTCA -CATAACCTCTAACCAGGTAAGTTTCCTGTGGCTTTATTTAGGATTTTAAATACTCATTTT -CAGTGTAATTTTGTTATGTGTGGATTAAGATGACTCTTGGTACTAACATACATTTTCTGA -TTAAACCTATCTGAACATGAGTTGTTTTTATTTCTTACCCTTTCCAGAGCGATGATTCTG -ACATTTGGGATGATACAGCACTGATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGG -TATGAAATGCTTGCTTAGTCGTTTTCTTATTTTCTCGTTATTCATTTGGAAAGGAATTGA -TAACATACGATAAAGTGTTAAAGTACATGTTATTCAGTTTTCATTTTGAAGATTAGATGG -TAGTATGAGTTAGTTAAATCAGGTGATATCCTCCTTTAGAAGTTGATAGCCTATATATGT -CATCCTTTGTGGAGGCAATTTAAATAAAATTTAAAACATTTATTCCTGGCTGGGTATGGT -GGCTCACTCCTGTAATCCCAGCACTTTGAGAGGCTGAGGCGGGTGGATCACCTGAGGTCA -GGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTTTACTAAAAATACAAAAA -TTAGCCAAGCATGGTGGCACGTGCCTGTAATCCCAGCTGCTTGGGACACTGAGGCAGGAG -AATTGCTTGAACCTGGGGGGCAGAGGTTGCAATGATTGCACCACTGCACTCCAGCCTGGG -CGATAGAGTGAGACTCCATCTCAGAAAACGAACAAACAATGTATTCCTTTTAGTATTTTT -ACATTGTATCAAACTATGGAAGTCCTCTAATTGAGATTAATAAGAAAAAGACAATCTGAA -TTATAATTTTAAACATTTAACAAGCATGTAGTAAAATAATGATGAAGATAAATAGCATTA -GTACAGCAATTAATATTTGTAGCATGCTGACAGTGCTCTGTGTGCGTTTCATATATTAAA -TTACTCTAATCATCCCAAATCCTGTAAGTTGGGTATCAATTCAAGTGTTCCTATTGGGTA -GGAATATACAGTTCTTTTAGGAAATGTAGTATGGTTCTGTGTCTCAAACAGGACACTTAC -ACAGTTGGCCAACATCATCACCTTCTCCATTCTCTGAGATGTTTAGTCTTACTGAGCACT -AAATATGGGTCATCAATAGTCCAGACTACCTTGAGCAAACAATAGTCCAGACTACCTTGA -GCAAACAGAGCATATACTCATACAGTGTATAAAGAGCACCAAGCATACAGATTTCATGTC -TTTCTCATAGTTACTCTTGTAACATGAGCTAAAGATCAGACCTCTATGTCACCTTTGTAA -CTGATTTCTAGATTTTTTTTTTTTTTTGAGATGGGGTCTTGCCCTGTCACCCAGGCTGGA -GTGTAGTGGCGTGATCATGCCTCATTGGAGCCTTCAACTCATGAGCTCAAACAATCCTCC -TACCTCAGCTTCCTGAGTAGTTGGGACCACAGGTGTGTGCCACCACACCCAGCTCATTTT -TGTATTCTTTGTAGAGATGCAGTCTCACCCTGTTGCCCACGCTGGCCTGGAACTCCTGAG -CTCAAAAGATCCCTCCGCCTTGACCTTCCAAAGTGCTGGGATTACAAGCATGAACCACTG -CACCCGGCCTAGATTTTTAAATGTGCTTTCCAGTATACACTGAAACTAGAAGTCGACTAA -AGAATTACCAAGAGAATTCTATAAAATAGAGATTGAAATGGGGCTCGATGTGGGATGGGT -TGGTGATATTGCAGGGAGAAGTAATCTGAGTAAAGGAGGAAAAGAACTGATTTGGGAAAA -CGATAGTTTTAGTAGTGAGTTTGAGTATGAATTAAGTTGAGATTGAATTTGAATTAAGTT -GAGGTTGAATATGAATTAAGTTGAGGTTGAGTTTGAGGTATGAATTAAGATGTGAAATTG -ATCATTGGAAATGTTAGATTGAGAAAAGTCACAGCTGGATTAATAGCTTCAGAAGTGTGT -TTGCAGACAGTTGCAACTAAAGTAATAAGAATAGATGGCCTTGGCCGGGCGCGGTGGCTC -ACGCCTGTAATCCCAGTACTTTGGGAGGCTGAGGCGAGCAAATCACGAGGTCAGGAGTTC -AAGACCAGCCTGGCCCACATGGTGAAACCCCGTCTTTATTAAAAATACAAAAATTAGCTG -TGCACAGTGGTGCACGCCTGTAATCCCAGCTACTCGGGAGGCTGAGACAGGAGAATCGCT -TGAACCTGGGAGGTGGAGGTTGCAGTGAGCTGAGATCAGTGTGACTGCACTCCAGCCCGG -TGACAGAGTGAGACTCTGTGTAAAAAAATAAAATAAATAAAATAATGGCCGTAAGCAAGT -AAAGAAGGATGGCCAGCTCTTATTGGGAATGCCTAAATCTAAGGCTTGATCAGAAGTAAT -GAAACCGTTGGGGCCCTACATTGCTATGACATCCAAAGGGCCATGAATATCAGGAAGAAA -GATAATTAACAGGGTCTAATGTTACAGAGAGGTTGAGAGCAAGGAGATTTGATTAAAAGG -GTCTTTAGAGCTGATGTCAGGTGTATGATGCCTTTAAGAGCAGTTTTTATAGTGCAGGGG -GTGGTCAAAAGAGAAAATAGGTGCTTTCTGAGGTGACGGAGCCTTGAGACTAGCTTATAG -TAGTAACTGGGTTATGTCGTGACTTTTATTCTGTGCACCACCCTGTAACATGTACATTTT -TATTCCTATTTTCGTAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTTCGGGTAAAC -CAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAAGAATACTG -CAGCTTCCTTACAACAGGTTATTTTAAAATGTTGAGATTTAACTTCAAAGGATGTCTCAT -TAGTCCTTATTTAATAGTGTAAAATGTCTTTAACTTAAGTGATTAGTACAGTGTTTCTAT -TGACATATACTTATACAACTTCAAAAACAACTATTAAATTTTCTGTTATTTAGGAACATG -CATATTAGTCATGAAAGTATAAAGAATTAGATGGGAATGATAAATGCTAAAATCAGGACA -TGTGTTCCATTTGTGAATGGAAGGCAGGGAGAAGGTGCCGTTTGGAAGGAGTACCCAAGA -GCCGTAAGCTGAATTGGCAGTGTTTTACATCTTAAGCTGAGAGATAGATTTTTTTTTCCC -CTTTTTCTTTAAAAACTCTAAAACTGTTAATTCCAAGGAACCCAGAAGTCTAGGTAGATT -ATTTCTGCTAGTTAAAAGCAGTAGTCCTGAAAGCTGAATATTTTGGTGTCTTTTGAGCCA -ACTTTAGTTTCATCATTACCAAGGGGGAAGAGAGCTAACAGTTGATGAGCACTTGCTCTA -GGCCAGTCCAGAGTGCTGGGCACCATACGCATTTTATCTCCCTCCCGCTATTCACAACAA -ATATGGGAGGTAGTTTATATTATAGCCATCTAATAAGATGGGGAAACTAAGACTCAAAGA -GATTCAGAAACTTGTCCATGATTATAAATGTAAGAGAGTTGGAATTCAGATTTATGTATT -TAGACCCCAAGCCTTTCTCATTACATCATTTTGCCTTCCAAATCTCTACCCTCTATCCTT -CACCTCCCCACTGATCAAAACGAGATGATAGTTTGCCCTCTTCAAAAGAAATGTGTGCAT -GTATATATCTTTGATTTCTTTTGTAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGT -CAGAAGACGGTTGCATTTACCCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCT -GTGTTGTGGTTTACACTGGATATGGAAATAGAGAGGAGCAAAATCTGTCCGATCTACTTT -CCCCAATCTGTGAAGTAGCTAATAATATAGAACAAAATGCTCAAGAGGTAAGGATACAAA -AAAAAAAAAATTCAATTTCTGGAAGCAGAGACTAGATGAGAAACTGTTAAACAGTATACA -CAGTTGTCAGTTTGATCCACCGAGGCATTAATTTTTTCTTAATCACACCCTTATAACAAA -AACCTGCATATTTTTTCTTTTTAAAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAA -GTGAGAACTCCAGGTCTCCTGGAAATAAATCAGATAACATCAAGCCCAAATCTGCTCCAT -GGAACTCTTTTCTCCCTCCACCACCCCCCATGCCAGGGCCAAGACTGGGACCAGGAAAGG -TAAACCTTCTATGAAAGTTTTCCAGAAAATAGTTAATGTCGGGACATTTAACCTCTCTGT -TAACTAATTTGTAGCTCTCCCATGAAACTTTTGTAGCTTAAATACACAAGAATTTTTTGA -AAAGGAAATAAGATAATGATGCAAAATAGTTAATTTTTTAAAAAAATGTTAGACACTGCA -GTGGATGCAACAAAATACTTTATATGAAAGATTTATCCAGTTAACTTTTGTGGAGTATTA -GGTATTAGACTAATAATTAGCACACTTACTTAAGTTAGAAAGTATAATAATGCGCCGGAC -GCGGTAGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGCGGATCACAAGG -TCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCATCTCTACTGAAAATACAA -AAAAATTTGCCGGGCGTGATGGCGGGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGC -AGGAGGATGGTGTGAACCCCGGAGGCAGAGCTTGCAGTGAGTCAAGATCGTGCCACTGCA -CTCCAACCTGGGCGACAGAATGAGACTCCATCTCAAACAAAAAAACAAAACAAAACAAAA -AAAAGTGTAATAATAATTTATCATTAGCTGGATGATATGCTGTTGTTTCCCATGTCACCT -GTATAAGATATGTAAAATAAGAACACATTATTTACATCTAATATAGATAAAATCCTGAGG -CGCTCTCAGATTGTTTTGTAGAGTTCAAATGTAAATATTGTTTTCATTTATGGTCCTTTT -GGTTATAAGTAACAGAAATCAACTCTAAAAAGATTTTTATTATAGGTTAGATTATGTCAT -GGAACCTTAAGGCTTGTCCCTTTCTAGTTCTTTTGTGTAAAGCGGTGATTTCTTCCATGG -AGGGAATGGTATTTAGGCAATTTTTTTTTTTTTTTCGAGATGGAGTCTTGCTCTGTCGCT -CAGGCTGGAGTGCAGTGGCACCATTTCAGCTCACTGCAACTTCCACCTCCTGGGTTCAAG -TGATTCTCCTGCTTCAGCCTCCCAAGTAGCTGAGATTACAGGCACCCGCCACCACACCCG -GCTTATTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAA -CTCCTGACCTCAAGTGATCTCCCCACCTTGGCCTTCCAAAGTGCTAGGATTACAGGCGCC -TAGCCTAGGCAGTCATTTTCAAAAAACAAGCATGACTCACCAAAAGTTTTAAGATTTTCT -GTGATAATGTTCTTATTGAGGCTTACATTATATTACAGTTTCTTGAATCTAAAATGATGT -ACCCTCTTAGAATATATACATCATGCTTCATTGGTCTCAGGGGGCTGATTTTTATAAGGA -GAGATTTGCTAGTTTTCACAATATGTCCTCTAAGTTGGCATGTATAGCTAAACAGGCTTT -CATAAAAATATACAATTTAGTTAATGAAATTTGGGATATAGTCTTTTATGATTGAAATAA -TTTTGCTAAATAGACTGTCTCTGATTTATTAGGTAATCACCACTCTTATTTTGTTTTACT -TCCTTAATGTCTACATAGAAAGGAAATGAGAAAAATCCAGAGGTTGTCATTTGACTTATG -AGTCTGTTTGACTTCAGGATTTGGTACATGAAATTTCACTTAATCTTTTTGATATGTATA -AAACAAATATTCTGGGTAATTATTTTTATCCTTTTGGTTTTGAGTCCTTTTTATTCCTAT -CATATTGAAATTGGTAAGTTAATTTTCCTTTGAAATATTCCTTATAGCCAGGTCTAAAAT -TCAATGGCCCACCACCGCCACCGCCACCACCACCACCCCACTTACTATCATGCTGGCTGC -CTCCATTTCCTTCTGGACCACCAGTAAGTAAAAAAGAGTATAGGTTAGATTTTGCTTTCA -CATACAATTTGATAATTAGCAGAATAGAGGATTGTAAAATGTCATTGTAGAACATCCCTT -GGGCCAGATTCTAATGGGTAGAAATTTGAACTAAACCTCTGGGTTTTGTTTGTTTTTAAT -GCCTTTCTGTTACCCAGATGCAGTGCTCTTGTAGTCCCAAGTCTAAGCTCTAGGTTGCCT -TCTTTCCTGGCAGAAGTTGGTGTCTATGCCATAAGGAGGTAGTTCCTGTTAGAAGGGATT -TAATTATACCTTATATAAGGAATTAGTGTTTGCCCTTCTAGGTATAGTTGGATGTTAGCT -TCTGATGTAAACTGGATTTCTTTTTCTTTCTCTCTCTTTTTTTTTTTTTGTTTTGGAGGC -AGAGTTTTGCCCTTGTACCCCAGGCTGGAGTGCAGTGGTGTGATCTCAGCTCACAGCAAC -CTCCGCCTCCTGGGTTCAAGCAATTCTGCCTCGGCCTCCCAAGTAGCTGGGATTACAGGC -GACTGCCACCACACCCGGCTAATTTTTGTTTTATTAGTAGAGATGGGGTTTCACCATGTT -GGCCAGACTGATCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTTGGCCTCCCAAAGC -GCTGGGATTACAGGCGTGAGCTGCCGCACCCAGCTGTAAACTGGATTTCTAATGGTAGAT -TTTTAGGTATTAACAATAGATAAAAAGATACTTTTTGGCATACTGTGTATTGGGATGGGG -TTAGAACAGGTGTTCTACCCAAGACATTTACTTAAAATCGCCCTCGAAATGCTATGTGAG -CTGTGTGTGTGTGTGTGTGTGTGTGTGTATTAAGGAAAAGCATGAAAGTATTTATGCTTG -ATTTTTTTTTTTTACTCATAGCTTCATAGTGGAACAGATACATAGTCTAAATCAAAATGT -TTAAACTTTTTATGTCACTTGCTGTCTTTTCGTCCTCGTTAAATTTAATTTTGTTGGTCT -TTTGTTGTTATTGGTTGGTTTTCTCCAAATGCTAGCTATGTTAAGAAATTTAAGGCCAGG -TACAGTGGCTCATGCCTGTAATCCCGGCATTTTAGAAGGCTGAGGCAGGAGGATCACTTG -AGCTCAGGAGTTTGAGACCAGTCTGGGCAACATAGCAAGACCTCGTCTTTGTTTAGGGGA -AAAAAAAGAAATTTAAGTAGGAGATTATATAAGCAAAAATACAATTAATTTCCAGCATTC -ACTATATAATATAAATCTCCAGACTTTACTTTTTTGTTTACTGGATATAAACAATATCTT -TTTCTGTCTCCAGATAATTCCCCCACCACCTCCCATATGTCCAGATTCTCTTGATGATGC -TGATGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATCATACTGGCTATTA -TATGGTAAGTAATCACTCAGCATCTTTTCCTGACAATTTTTTTGTAGTTATGTGACTTTG -TTTTGTAAATTTATAAAATACTACTTGCTTCTCTCTTTATATTACTAAAAAATAAAAATA -AAAAAATACAACTGTCTGAGGCTTAAATTACTCTTGCATTGTCCCTAAGTATAATTTTAG -TTAATTTTAAAAAGCTTTCATGCTATTGTTAGATTATTTTGATTATACACTTTTGAATTG -AAATTATACTTTTTCTAAATAATGTTTTAATCTCTGATTTGAAATTGATTGTAGGGAATG -GAAAAGATGGGATAATTTTTCATAAATGAAAAATGAAATTCTTTTTTTTTTTTTTTTTTT -TTTGAGACGGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAATGGCGTGATCTTGGCTCA -CAGCAAGCTCTGCCTCCTGGATTCACGCCATTCTCCTGCCTCAGCCTCAGAGGTAGCTGG -GACTACAGGTGCCTGCCACCACGCCTGTCTAATTTTTTGTATTTTTTTGTAAAGACAGGG -TTTCACTGTGTTAGCCAGGATGGTCTCAATCTCCTGACCCCGTGATCCACCCGCCTCGGC -CTTCCAAGAGAAATGAAATTTTTTTAATGCACAAAGATCTGGGGTAATGTGTACCACATT -GAACCTTGGGGAGTATGGCTTCAAACTTGTCACTTTATACGTTAGTCTCCTACGGACATG -TTCTATTGTATTTTAGTCAGAACATTTAAAATTATTTTATTTTATTTTATTTTTTTTTTT -TTTTTGAGACGGAGTCTCGCTCTGTCACCCAGGCTGGAGTACAGTGGCGCAGTCTCGGCT -CACTGCAAGCTCCGCCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTCCGAGTAGC -TGGGACTACAGGCGCCCGCCACCACGCCCGGCTAATTTTTTTTTATTTTTAGTAGAGACG -GGGTTTCACCGTGGTCTCGATCTCCTGACCTCGTGATCCACCCGCCTCGGCCTCCCAAAG -TGCTGGGATTACAAGCGTGAGCCACCGCGCCCGGCCTAAAATTATTTTTAAAAGTAAGCT -CTTGTGCCCTGCTAAAATTATGATGTGATATTGTAGGCACTTGTATTTTTAGTAAATTAA -TATAGAAGAAACAACTGACTTAAAGGTGTATGTTTTTAAATGTATCATCTGTGTGTGCCC -CCATTAATATTCTTATTTAAAAGTTAAGGCCAGACATGGTGGCTTACAACTGTAATCCCA -ACAGTTTGTGAGGCCGAGGCAGGCAGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGG -CCAACATGATGAAACCTTGTCTCTACTAAAAATACCAAAAAAAATTTAGCCAGGCATGGT -GGCACATGCCTGTAATCCGAGCTACTTGGGAGGCTGTGGCAGGAAAATTGCTTTAATCTG -GGAGGCAGAGGTTGCAGTGAGTTGAGATTGTGCCACTGCACTCCACCCTTGGTGACAGAG -TGAGATTCCATCTCAAAAAAAGAAAAAGGCCTGGCACGGTGGCTCACACCTATAATCCCA -GTACTTTGGGAGGTAGAGGCAGGTGGATCACTTGAGGTTAGGAGTTCAGGACCAGCCTGG -CCAACATGGTGACTACTCCATTTCTACTAAATACACAAAACTTAGCCCAGTGGCGGGCAG -TTGTAATCCCAGCTACTTGAGAGGTTGAGGCAGGAGAATCACTTGAACCTGGGAGGCAGA -GGTTGCAGTGAGCCGAGATCACACCGCTGCACTCTAGCCTGGCCAACAGAGTGAGAATTT -GCGGAGGGAAAAAAAAGTCACGCTTCAGTTGTTGTAGTATAACCTTGGTATATTGTATGT -ATCATGAATTCCTCATTTTAATGACCAAAAAGTAATAAATCAACAGCTTGTAATTTGTTT -TGAGATCAGTTATCTGACTGTAACACTGTAGGCTTTTGTGTTTTTTAAATTATGAAATAT -TTGAAAAAAATACATAATGTATATATAAAGTATTGGTATAATTTATGTTCTAAATAACTT -TCTTGAGAAATAATTCACATGGTGTGCAGTTTACCTTTGAAAGTATACAAGTTGGCTGGG -CACAATGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAGGGCAGGTGGATCACGAG -GTCAGGAGATCGAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCTACTAAAAGTACA -AAAACAAATTAGCCGGGCATGTTGGCGGGCACCTTTTGTCCCAGCTGCTCGGGAGGCTGA -GGCAGGAGAGTGGCGTGAACCCAGGAGGTGGAGCTTGCAGTGAGCCGAGATTGTGCCAGT -GCACTCCAGCCTGGGCGACAGAGCGAGACTCTGTCTCAAAAAATAAAATAAAAAAGAAAG -TATACAAGTCAGTGGTTTTGGTTTTCAGTTATGCAACCATCACTACAATTTAAGAACATT -TTCATCACCCCAAAAAGAAACCCTGTTACCTTCATTTTCCCCAGCCCTAGGCAGTCAGTA -CACTTTCTGTCTCTATGAATTTGTCTATTTTAGATATTATATATAAACGGAATTATACGA -TATGTGGTCTTTTGTGTCTGGCTTCTTTCACTTAGCATGCTATTTTCAAGATTCATCCAT -GCTGTAGAATGCACCAGTACTGCATTCCTTCTTATTGCTGAATATTCTGTTGTTTGGTTA -TATCACATTTTATCCATTCATCAGTTCATGGACATTTAGGTTGTTTTTATTTTTGGGCTA -TAATGAATAATGTTGCTATGAACATTCGTTTGTGTTCTTTTTGTTTTTTTGGTTTTTTGG -GTTTTTTTTGTTTTGTTTTTGTTTTTGAGACAGTCTTGCTCTGTCTCCTAAGCTGGAGTG -CAGTGGCATGATCTTGGCTTACTGCAAGCTCTGCCTCCCGGGTTCACACCATTCTCCTGC -CTCAGCCCGACAAGTAGCTGGGACTACAGGCGTGTGCCACCATGCACGGCTAATTTTTTG -TATTTTTAGTAGAGATGGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGACCT -CGTGATCTGCCTGCCTAGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCACC -TGGCCTTAAGTGTTTTTAATACGTCATTGCCTTAAGCTAACAATTCTTAACCTTTGTTCT -ACTGAAGCCACGTGGTTGAGATAGGCTCTGAGTCTAGCTTTTAACCTCTATCTTTTTGTC -TTAGAAATCTAAGCAGAATGCAAATGACTAAGAATAATGTTGTTGAAATAACATAAAATA -GGTTATAACTTTGATACTCATTAGTAACAAATCTTTCAATACATCTTACGGTCTGTTAGG -TGTAGATTAGTAATGAAGTGGGAAGCCACTGCAAGCTAGTATACATGTAGGGAAAGATAG -AAAGCATTGAAGCCAGAAGAGAGACAGAGGACATTTGGGCTAGATCTGACAAGAAAAACA -AATGTTTTAGTATTAATTTTTGACTTTAAATTTTTTTTTTATTTAGTGAATACTGGTGTT -TAATGGTCTCATTTTAATAAGTATGACACAGGTAGTTTAAGGTCATATATTTTATTTGAT -GAAAATAAGGTATAGGCCGGGCACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGG -CCGAGGCAGGCGGATCACCTGAGGTCGGGAGTTAGAGACTAGCCTCAACATGGAGAAACC -CCGTCTCTACTAAAAAAAATACAAAATTAGGCGGGCGTGGTGGTGCATGCCTGTAATCCC -AGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAGGTGGAGGTTGCGGTG -AGCCGAGATCACCTCATTGCACTCCAGCCTGGGCAACAAGAGCAAAACTCCATCTCAAAA -AAAAAAAAATAAGGTATAAGCGGGCTCAGGAACATCATTGGACATACTGAAAGAAGAAAA -ATCAGCTGGGCGCAGTGGCTCACGCCGGTAATCCCAACACTTTGGGAGGCCAAGGCAGGC -GAATCACCTGAAGTCGGGAGTTCCAGATCAGCCTGACCAACATGGAGAAACCCTGTCTCT -ACTAAAAATACAAAACTAGCCGGGCATGGTGGCGCATGCCTGTAATCCCAGCTACTTGGG -AGGCTGAGGCAGGAGAATTGCTTGAACCGAGAAGGCGGAGGTTGCGGTGAGCCAAGATTG -CACCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCCGTCTCAAAAAAAAAAGGAAG -AAAAATATTTTTTTAAATTAATTAGTTTATTTATTTTTTAAGATGGAGTTTTGCCCTGTC -ACCCAGGCTGGGGTGCAATGGTGCAATCTCGGCTCACTGCAACCTCCGCCTCCTGGGTTC -AAGTGATTCTCCTGCCTCAGCTTCCCGAGTAGCTGTGATTACAGCCATATGCCACCACGC -CCAGCCAGTTTTGTGTTTTGTTTTGTTTTTTGTTTTTTTTTTTTGAGAGGGTGTCTTGCT -CTGTCCCCCAAGCTGGAGTGCAGCGGCGCGATCTTGGCTCACTGCAAGCTCTGCCTCCCA -GGTTCACACCATTCTCTTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGCCCGCCAC -CACACCCGGCTAATTTTTTTGTGTTTTTAGTAGAGATGGGGTTTCACTGTGTTAGCCAGG -ATGGTCTCGATCTCCTGACCTTTTGATCCACCCGCCTCAGCCTCCCCAAGTGCTGGGATT -ATAGGCGTGAGCCACTGTGCCCGGCCTAGTCTTGTATTTTTAGTAGAGTCGGGATTTCTC -CATGTTGGTCAGGCTGTTCTCCAAATCCGACCTCAGGTGATCCGCCCGCCTTGGCCTCCA -AAAGTGCAAGGCAAGGCATTACAGGCATGAGCCACTGTGACCGGCAATGTTTTTAAATTT -TTTACATTTAAATTTTATTTTTTAGAGACCAGGTCTCACTCTATTGCTCAGGCTGGAGTG -CAAGGGCACATTCACAGCTCACTGCAGCCTTGACCTCCAGGGCTCAAGCAGTCCTCTCAC -CTCAGTTTCCCGAGTAGCTGGGACTACAGTGATAATGCCACTGCACCTGGCTAATTTTTA -TTTTTATTTATTTATTTTTTTTTGAGACAGAGTCTTGCTCTGTCACCCAGGCTGGAGTGC -AGTGGTGTAAATCTCAGCTCACTGCAGCCTCCGCCTCCTGGGTTCAAGTGATTCTCCTGC -CTCAACCTCCCAAGTAGCTGGGATTAGAGGTCCCCACCACCATGCCTGGCTAATTTTTTG -TACTTTCAGTAGAAACGGGGTTTTGCCATGTTGGCCAGGCTGTTCTCGAACTCCTGAGCT -CAGGTGATCCAACTGTCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGTG -CCTAGCCTGAGCCACCACGCCGGCCTAATTTTTAAATTTTTTGTAGAGACAGGGTCTCAT -TATGTTGCCCAGGGTGGTGTCAAGCTCCAGGTCTCAAGTGATCCCCCTACCTCCGCCTCC -CAAAGTTGTGGGATTGTAGGCATGAGCCACTGCAAGAAAACCTTAACTGCAGCCTAATAA -TTGTTTTCTTTGGGATAACTTTTAAAGTACATTAAAAGACTATCAACTTAATTTCTGATC -ATATTTTGTTGAATAAAATAAGTAAAATGTCTTGTGAAACAAAATGCTTTTTAACATCCA -TATAAAGCTATCTATATATAGCTATCTATATCTATATAGCTATTTTTTTTAACTTCCTTT -ATTTTCCTTACAGGGTTTTAGACAAAATCAAAAAGAAGGAAGGTGCTCACATTCCTTAAA -TTAAGGAGTAAGTCTGCCAGCATTATGAAAGTGAATCTTACTTTTGTAAAACTTTATGGT -TTGTGGAAAACAAATGTTTTTGAACATTTAAAAAGTTCAGATGTTAGAAAGTTGAAAGGT -TAATGTAAAACAATCAATATTAAAGAATTTTGATGCCAAAACTATTAGATAAAAGGTTAA -TCTACATCCCTACTAGAATTCTCATACTTAACTGGTTGGTTGTGTGGAAGAAACATACTT -TCACAATAAAGAGCTTTAGGATATGATGCCATTTTATATCACTAGTAGGCAGACCAGCAG -ACTTTTTTTTATTGTGATATGGGATAACCTAGGCATACTGCACTGTACACTCTGACATAT -GAAGTGCTCTAGTCAAGTTTAACTGGTGTCCACAGAGGACATGGTTTAACTGGAATTCGT -CAAGCCTCTGGTTCTAATTTCTCATTTGCAGGAAATGCTGGCATAGAGCAGCACTAAATG -ACACCACTAAAGAAACGATCAGACAGATCTGGAATGTGAAGCGTTATAGAAGATAACTGG -CCTCATTTCTTCAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACTC -TTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATATTTTACTGGACTCTAT -TTTGAAAAACCATCTGTAAAAGACTGAGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGC -AGTTGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATT -TTATGAGCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAA -GCATTTAGGAATGAAGTGTTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAATGTA -TGTGAGGCGTATGTGGCAAAATGTTACAGAATCTAACTGGTGGACATGGCTGTTCATTGT -ACTGTTTTTTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATATTTAATTTTTTT -TTAAATTAGCTGTATCTGTGATTGTATTTCTTTTTTGCATATTATTTTGCCCTTTGGCCC -ATATTTTGATATGGATGCCACCATAGCATTTTGTGTATGTGCATGTGTATTCCCACTTAA -TGTCACATTTTTCATGTCTTTACATATTCTTATTTTTGTTTGTTTTTGAGACAGAGTCTC -GCTCTGCTGCCCACGCTGGAGTGCAGTGGTGCAATCTCAGCTCACTGCAACCTCTGCTAT -CCGGGTTCAAGCAGTTCTCGTGCCTCACCCACGTGAGTAGTTGGGATTACAGGCATGTGG -CACCATGCCCCACTAAGTTTTGTATTTTTAGTAGAGATGGAGTTTCACCATGTTGGCCAG -GCTGGTCTCAAACTCCTGCCCTCAAGTGATTCGACCACCCTGGCCTCCCAAAGTGCTGGG -ATTACAGCCGTGAGCCACCGCACACGGCCTCTCTATTTATTTCTATACATAGCTTTTCAC -ATTATATTATGTTTATATATTGTTTATATCTGTATTTCCTCTTTCATTAGAGAAAAGGTA -GTACATCTTATTCTTCATGGTGTCTACAATATCTGGCAGTTTTTGGAAGTCAAGCGTGAG -CTTAGAGCATAGACTGGTGGGATTGTCAAAGAAGAGGGCAACTGGAAGAGAACTGTCAGT -TATTTTTGGATCAGTCTTTAATTCATCATGACGGGTTAGGCATTAGTTGTATTTCTTGCT -AATTTTGAAGAAGACTTATTAACAAATCCTACATTAGGTAAATGGTTTTGAAAGTTGAGT -TAATCATAATGGTGTTTGACCTAGGACTATTTTTAGGCCCTATTTATCTTAATATCGAAT -AATGAAGCAGCTTCCCCCTTAGATATAGACAGAAAACATCAAAGCCACCACACTACCTGG -CTGGATTTATCCTAGTAATAAAATCAAAACTGAGCTAGTTCTCTGGCTTTCATTGTAATA -ATTGTCCTTGTGGTTGTAAGGAATCTAGATGAAAATTACATGGTCTGTTCTACAGCCACA -GCTGTACCTACATTCAGAAGACAGACAAAAGTTGCTGTGTTTGAAGAGATCCTTCATTAA -GGGATCAGACAGAGATTACTTTGAGACATATTCTAAGTTTAACTTTTCTGCAGGGTTGCC -ATTAACAGAAATAAACTACAGAGTTAATTTCTTTTTGTTTTTGATACAGTCTAACTCTCA -CCCAAGCTGGAGTGCAGTGGCGCAATTTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCA -AGCAATTCTCCTGCCTCAGCCTCCCGAGCAGCTGGGACTACAGGCATGTGCCACTATGCC -TGGCTAATTTTTGTATTTTTAGTAGTAGAGACGTGGTTTCGCCACGTTGGCCAGGCTGGT -CTGGAACTCCTGACCCCAGGTAATCCACCTGCCTCGGCCTCCCAAAGTGCTGGGATTACA -AGCTTGAGCCACTACGCCTGACCCAGAGTTAACTTTTTAAAAAAGTTTTTATGAACTTAA -GTCTTGTGATGTTTGAAATAATGGATTCAATTTAGACATCAAATTCCAGAAGTTACTAAG -AGCAGCTGGGCGCGGCAGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGT -GGATCACCTGAGATCAGGAGTTCCAGACCAGCCTGGCCAACATAGTAAAACCCTGTCTCT -ACTAAAAATACAAAAATTAGCCCGGCATGGTGGCACGCCCTGTAGTCCCAGCTACTTGGG -AGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGTGGAGGTTGTGGTGAGCCGAGATTG -TGCCACTGTACTCAAGCCTGGGCTAAAAAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAA -AAAACACGTTACTAAGAGCAACTCTGGGCCAGGCACGGTGGCTTACACCTGTAATCCCAG -CATTTTGGGAGGACGAGACAGGCGGATCACTTGAGCCCAGGAGTTCAAGACCAGCATAAG -CAACAACGCAAAACCCCTGACTCTACAAAACATGAAAGAATTAGCAAGGCATGGTGGTGC -ATGCCTGTAGTCCCAGCTACTGGAGAGGCTGAGGCAAAAGGATCACTTGAGTACAGGAGG -TTGAGGCTGTGTAATGAGCCGTGTTCACACCATTGCACTTCAGCCTGGGCAACAGACTGA -GACCCTGTCTCAAAAAAAAAAACCAAACCAAAGCAACAAACAAAAAACAAGAGCAACTCT -GCTTCTGTACACTTTTTTTTTTTTTTTTGGTAGTGACATGATCTATGTTGCCCAAGCTGG -TCTCGAGTTCCTGGGTTCAAGCCATTCTCCCACCTCGGGCTCCCAAAGTGCTAGGATTAC -AGGCATGAATCACCATGCCCAGCCCTTCTGTACACTTTTCACAGTGTACCCTTTTGTGTT -TTTTAAAATGTTTGTGTATACATTTATTGTGAATTTTTAAAAAACATGTAATTAAGGCCA -GGCATGGTGGCTCATACCTGTAATCCTAGCACTTTGAGAGGCTGAGGTGGGTGGATCACC -TGAGGTCGGTAGTTCGAGACCAGCCTGGCCCAACATGGTGAAACCCCATCTCTACTAAAA -ATACAAAAAAAAAATTAGCTCGGCATGGTGGTGGGCGCCTGTGATCCCAGCTACTGGAGA -GGCTGAAGCATGAGAATCACTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCCAAGATCGT -GCCACTACACTCCAGCCTGGGTGACTCAGTGACTGTCTCAAAAAGAAAAAAAGTAATTAA -GTTCTGTCATGATATATCATCATTACCCTTTTTGAACTTTTAAAATTTTTTATCTTTAGA -GGTAATTCATATAATGTTCTTCAATAGATAAGTGCTTTTCTGTCAATATATCTTGGAGAA -CACACCATATCAGTATTTAAAACTCTCATTCTTCCTATTTCTCCACATCCTCTCCAGCAC -CCGTTGTTTCCTGACTTTTTAATGATTGCCATTCTAACTGGTGTGAGATGGTATCTTATT -GTGGTTTTGATTTGCATTTCTCTGATGGCCAGTGATGGTGAGCATTTTTTCATGTGTTTT -TTAGCTGCATAAATGTCTTCTTTTGAGAAGTGTCTGTTCATGTCCTTCGCCCACTTTTTG -ATGGGGTTGTCTGTTTTTTTCTTGTAAATTTGTTTGAGTTCATTGTAGATTCTGGATATT -AGCCCTTTGTCAGATGAGTAGATTGCAAAAATTTTCTCCCATTTTGTAGGTTGCCTGTTC -ACTCTGATGGTAGTTTCTTTTGCTGTGCAGAAGCTCTTTAGTTTAATTAGATCCCAGTTT -TGGCTTTTGTTGCCGTTGCTTTTGGTGTTTTAGACATGAAGTCCTTGCCCATGCCTATGT -CCTGAATGGTAATGCCTAGGTTTTCTTCTAGGGTTTTTATGGTTTTAGGTCTAACATTTA -AGTCTTTAATCCATCTTGAATTAATTTTTGTATAAGGTGTAAGGAAGGGATCCAGTTTCA -GCTTTCTACATATGGCTAGCCAGTTTCCCAGCACCATTTATTAAATAGGGAATCCTTTCC -CCATTGCTTGTTTTTCTCAGGTTTGTCAAAGATCAGATAGTTGTAGATATGCGGCGTTAT -TTCTGAGGGCTCTGTTCTGTTCCATTGGTCTATATCTCTGTTTTGGTACCAGTACCATGC -TGTTTTGGTTACTGTAGCCTTGTAGTATAGTTTGAAGTCAGGTAGCGTGATGCCTCCAGC -TTTGTTCTTTTGGCTTAGGATTGACTTGGCGATGCGGGCTCTTTTTTGGTTCCATATGAA -CTTTAAAGTAGTTTTTTCCAATTCTGTGAAGAAAGTCATTGGTAGCTTGATGGGGATGGC -ATTGAATCTGTAAATTACCTTGGGCAGTATGGCCATTTTCACGATATTGATTCTTCCTAC -CCATGAGCATGGAATGTTCTTCCATTTGTTTGTATCCTCTTTTATTTCATTGAGCAGTGG -TTTGTAGTTCTCCTTGAAGAGGTCCTTCACATCCCTTGTAAGTTGGATTCCTAGGTATTT -TATTCTCTTTGAAGCAATTGTGAATGGGAGTTCACTCATGATTTGGCTCTCTGTTTGTCT -GTTATTGCTGTATAAGAATGCTTGTGATTTTTGTACATTGATTTTGTATCCTGAGACTTT -GCTGAAGTTGCTTATCAGCTTAAGGAGATTTTGGGCTGAGACAATGGGGTTTTCTAGATA -TACAATCATGTAATCTGCAAACAGGGACAATTTGGCTTCCTCTTTTCCTAATTGAATACC -CGTTATTTCTTTCTCCTGCCTAATTGCCCTGGCCAGAACTTCCAACACTATGTTGAATAG -GAGTGGTGAGAGAGGGCATCCCTGTCTTGTGCGTGTTTTCAAAGGGAATGCTTCCAGTTT -TTGCCCATTCAGTATGATATTGGCTGTGGGTTTGTCATAGATAGCTCTTATTATTTTGAG -ATACGTCCCATCAGTACCTCATTTATTGAGAGTTTTTAGCATGAGGGATTGTTGAATTTT -GTCAAAGGCCTTTTCTGCATCTATTGAGATAATCATGTGGTTTTTGTCTTTGGTTCTGTT -TATATGCTGGATTACATTTATTGATTTGCGTATGTTGAACCAGCCTTGCATCCCAGGGAT -GAAGCCCACTTGATCATGCTGGATAAGCTTTTTGATGTGCTGCTGGATTCGGTTTGCCAG -TATTTTATTGATGATTTTTGCATCAATGTTCATCAAGGATATTAGTCTAAAATTCTCTTT -TTTGGTTGTGTCTCTGCCTGGCTTTGGTATCAGGATGATGCTGGCCTCATAAAATGAGTT -AGGGAGGATTCCCTCTTTTTCTATTGATTGGAATAATTTCAGAAGGAATGGTACCAGTTC -CTCCTTGTACCTCTGGTAGAATTCGGCTGTGAATCCATCTGGTCGTGGACTCTTTTTGGT -TGGTAAGCTATTGATTATTGCCACAATTTCAGAGCCTGTTATTGGTCCATTTAGAGATTC -AACTTCTTCCTGGTTTAGTCTTGGGAGGGTGTATGTGTCGAGGAATTTATCCATTTCTTC -TAGATTTTCTAGTTTATTTGCGTAGAAGTGTTTATAGTATTCTCTGATGGTAGTTTGTAT -TTCTGTGGGATCGGTGGTGATATCCCCTTTATCATTTTTTATTGCGTCTATTTGATTCTT -CTCTCTTTTCTTCTTTATTAGTCTTGCTAGCGGTCTATCAATTTTGTTGATCCTTTCCAA -AAACCAGCTCCTGGATTAATTTTTTGAAGGGTTTTTTGTGTCTCTATTTCCTTCAGCTCT -GCTCTGATTTTAGTTATTTCTTGCCTTCTGCTAGCTTTTGAATGTGTTTGCTCTTGCTTT -TCTAGTTCTTTTAATTGTGATGTTAGGGTGTCAATTTTGGATCTTTCCTGCTTTCTCTTG -TGGGCATTTAGTGCTATAAATTTCCCTCTACACACTGCTTTGAATGTGTCCCAGAAATTC -TGGTATGTTGTGTCTTTGTTCTCGTTGGTTTCAAAGAACATCTTTATTTCTGCCTTCATT -TCGTTATGTACCCAGTAGTCATTCAGGAGCAGGTTGTTCAGTTTCCATGTAGTTGAGTGG -TTTTGAGTGAGTTTCTTATTCCTGAGTTCTAGTTTGATTGCACTGTGGTCTGAGAGACAG -TTTGTTATAATTTCTGTTCTTTTACGTTTGCTGAGGAGAGCTTTACTTCCAACTATGTGG -TCAATTTTGGAATAGGTGTGGTGTGGTGCTGAAAAAAATGTATATTCTGTTGATGTGGGG -TGGAGAGTTCTGTAAACTGCTTCAACCACTGTGGAAGTCAGTGTGGCGATTCCTCAGGGA -TCTAGAACTAGAAATACCATTTGACCCAGCCATCCCATTACTGGGTATATACCCAAAGGA -CTATAAATCATGCTGCTATGAAGACACATGCACACGTATGTTTATTGCGGCACTATTCAC -AATAGCAAAGACTTGGAACCAACCCAAACGTCCAACAATGATAGACTGGATTAAGAAAAT -ATGGCACATATACACCATGGAATACTATGCAACCATAAGAAATGATGAGTTCATGTCCTT -TGTAGGGACATGGATGAAATTGGAAATCATCATTCTCAGTAAACTATCGCAAGGACAAAA -AACCAAACACCGCATGTTCTCACTCATAGGTGGGAATTGAACAATGAGAACACATGGACA -CAGGAAGGGGAACATCACACTCTTGGGACTGTTGTGGGGTGGGGGGAGTGGGGAGGGATA -GCATTAGGAGATATACCTAATGCTAAATGAGGAGTTAATTGGTGCAGCACACCAGCATGG -CACATGTATACATATGTAACTAACCTGCACATTGTGCACATGTACCCTAAAACTTAAAGT -ATAATAATAAAATAAAATAAGAAAAATGCAAAAATTAAAAATTTAAAAAAAAGCTCTCAT -TCTTTTAAGCACTTACAGGATATTCTTACAGATGTGTACCACGCTTAATGAATTGAGCTC -TTGTGGATGAGAGTTTAATTTGTTTCTAATCATTTGTTATTTAATAGTACAGTCAGCATC -TTTAGGATTAAGTATCTAGAATTAGAACTACTGTGTTGAAGAGGCTATTGCATTTAAATT -GTTTTTTTTTTTTTTTTGATACGGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAATGGC -GTGATCTCAGCTCACCGCAACCTCCGCCTCCCAGGTTCAAGCAGTGCTCCTGCCTCAGCC -TCCTGAGTAGCTAGGATTACAGGCACACGCCACCATGCCCGGCTAATTTTTGTATTTTTT -TAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGATCTTGAACTCCTGACCTTGTGAT -CTGCTCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGTGCCCGACCT -ACATTTAAATTTTAAACAAAAGTTTGCTAAATTGTTTTCAGTAGAGGTTATATTAATCTA -TATTTATACCAACATGGAGAGTTTGTTTCCTGCAAAATAGCCAATAATTTATCAAACCTT -TGAATCTTTGTCAATTGAATAGTTAAAAATGATTATCTCATATTTGTACATTTTTATCTT -ATTGTGAAGTTCAGCACCTTTTCATGTGTTTAAGAACTTTTAATTTTCTGTTGTTTATAT -GGTCTTCCCATTACCATTTTAACTATTTGTTTTTATTTTCAGAGTTTTTGCTTATAAAAT -TTTATTTACAGTCAATTCTCTTTATTTGTAGAATCTGTATTTGTAAAGGCACCTACTTGC -TAAAATTTATTTGTAACCTAACATCAATACTCATGGCAGTTTCATGGTTTTTCATGGACA -TACACAGAGGTGAAAAATTTGAGAACCTTACCCAGATATTCCCAGCTGGGGTTGAACAGT -GCTCAGTTTTTTGTGTAGCTTTCTTACTATAAACAAGTGTCCTTTTTGAAAGCAGTTTAT -ATAGTTCTACATTTTTCACATTTTTGTGCCTTCTGTTTGTGATTTTACTGTTTAAAGTGA -TTCCCAAGCATTGTGCTGAAGTGCTATATAGTGGTATTCCAAGGTGCATGCGGGCTGTGA -GGTGCCTTAGAGAATACATGTGTTAGATAACCTTTGTTTAGTCATGAGTTATAGTGCTGT -TGAGTGGGAGTTAGATGATGATGAGTCATCACTATTTATTATTATATTTTTTGAGATGGA -GTCTCACTCTGTCACCCAGGCTGGAGTGCAATGGCATAATCTCGGCTCACTGCAACCTCC -TCCTCCCGGGTTCAAACGATTCTCCTGCCTCAGTCTTCCCAGTAGCTGGGATTATAGGCA -CCCGCCACTGCACCCAGCGTAATTTTTGTATTTTTAGTAGAGGTGGGGTTTTACCATGTT -GGGTAGGCCAGTCTTGAACTCCTGACCTCAAGTGATCCACCCACCTTGGCATCCCAGAGT -GCTGGGATTACAGATATGAGCCACCATGCCAGGTCTATATTTATTAAATAATGTGTCTTT -AAACAGAAACAGATATAAAACAAGCTTACGTATTTATAAGTTGGTGAAAATGTGACCAAA -GGCTTACAAGAACCTAACCCTGTATTTCTGTTAGGAGCAATGGCTCAGTATTCACTAATT -TGCGTGTTTGTGGCAACTTCATAGAACATAACTACCTCAAGTAATGAGAATTGACTGCAT -TCTTTTTCAAGTCATTTTATAAACAATTTACAGAAGAATAAAGGGATGGTGAAAATTAAC -TTTGTTAGCAATTTTAATGAGAATCCAAATATAGGAGACCCACATTTTTTCCCATATTTT -CCCAGTTTTGAATGTTTATGTATACCTAAAAGGCATTACATCCTTTGAAAGCAGCTGTCA -TTATGCATGAATCTGGAACATACCTACCTTTAAATACGGATTTTGGATTTCAAATGCATC -TCTACTATGTTCTACCTTATTATTTGTATTCTTCATGAACTCACTTTGTCAAAATGCAAT -ACTTTTTGTTTTTTAATTTATTTTTATTTTTTGTAGAAATAGGGTCTCACTGTGTTGCCC -AGGCTGGCCTTGAACACCTGGCCTCAAGTGATCTTCCTGCCTTCCAAAGTGCTGGGGACG -GTAGGCATGAGCCACCACACCTGTCCAAACTGCAATACTTCTGAAAACTTTAGGGCTCAT -AGTTTTGTTGAAGTGATAGATGATGGCTATATTCTTTGTTACATAACAGCAAAACATTTT -TGTTTTTACATTTATAAATACCAATTAGAATGACTTTCAGTGGATTGGTTTTCATTTTTC -ACATCATCTTTACCTTCCTGTTACTTTGTGTACATATCTGTCTTTCATACTTGTCCACTT -ACAAACTTTTTCAAGTAAATTCTGGTGTTACAAGCATAAAAGATGAAAGAACGTTGTCAC -ATGGTCACTTGTCCTTTTAGCAATTATGCGATGATTCAACTGTTCTAGGTACAACTAGAG -GGAGAGTATCCCAGGCAAGGGAGATAACAAATAGAAAGGCCCTAAGACACAAGTGTATTT -AACATGTTTGGGGAACAACAAGGAGTTAATCGTGGCTGGAGTGGAAGTAAGGAGGAGAGA -TTAAGGAGATGGAGCTAAGAGAGGTAGTCAAGGGCCAGGCCATATGTCAGCGATAGTAAG -GTCTTCAGCATTTACTTTTTTAAGCTGGGAGTCCATGGAAAGGTTTTGAACCCAAGGTAT -AGCATGATCTGACTTACAGAAAGAGACTTCTGATTGCTGTGTTGAAAATACACCATAGGT -TTGAAGGGAGGAAACAGGCTGACTAGTTAGAGCCAGTGTGGGTAGTGGTGGTTGGATCTG -AGTATATTTTCCAAGTGGAGCCACCAGGATTTTTCAGTAGATTGATTACATGTGGTGTAT -GAAAGAGGAGTGTCAAGTGTAACTCCGAGATTTTTGGCTTATGCAACTGGAAAAATAAAG -TTAGAATTTAATGAGATGGAGGTCTGCATAAGGAGTACTTTTGTGGCAGGAAAGAAATTG -GGTTTTGAACATGTGAAAATTGAGATGCCCATTAGTAGAAGTTGGATGTGAATAAAGAGT -CCAGGCCAGGTGCAATGCCTCATGCCTGTAATCTCAGCACTTTGGGAGGCCAAGGCAGGA -GAATCATGTGAGCCCAGGAGTTCAAGACCAGACTGGGCAACAAAGTGAGACCCCGTCTAT -ATTATAAAATAAAAAAATAGTTCAGAGGAGAGGTCTGGGCTAGAGATGGAAATGTAGAAG -TTAGTAAATTTAAAGCTGTTGAACTAGAGGAGATAGCTGAGGAAGTGCATTCAAATAGAG -AAGATGTCAGAGGAGAACTTTGGGGTTCTCTCAGTGGTTAGAGATAGGATATGAGGAAAA -ACAGTGCAGGAGACTAAGGAGGAGCTCTCATTGAGTTAGGAAAATCAAGAGGGATGCCCT -GGAAGCCAAATGAAGGCAGTGTTTTGAGGAAGAGGGGTGATGGGCCATGTGAAAGCCAAT -AGGTCACATGCTGCTAATGGGTCAACTAAAGTGAGGACTGAGAAGTATTCACCAATTTAG -CAATGTGGAGCTCATTGGTGACCCTCATAAGAGCTGTGTTGGTGGAATGGAGGAGGTAAA -ATCCTGGAGGGAGAGAACATAAGAATGAGAGAACAGTTGACAGTGCATGTAAACAACTCT -TTCACGGAACTTTGTATTTCTGAATTTTTGTTTATTTGGCTATTAATAAAATCATATCTG -ATATAGCTTTATTTTAGTAAGGTTTGTTTTTGTGGGACTTCAGTTGTGTATACACATATA -ATATGTGTGTGTATGTATGTGCGTATGGTGTTTTGATGTAAAATTTATTATTGTGGGTCA -TGGTTAAAAAAAAAGCTTGAGAATGAGGAGTTAGATCAAGAAATAGAAGGAAAGTTGACA -TAAGAAGTTGTGGATGTAGGAGATTCTACCATGTAGACACAGTGGAAGGATTTAGGGAGT -TGGAGCAGGTTGGGATATGTGATCAGAAAGCGGGAGTTTAGCTCTCTCACTTGCCCCTGC -TTTTACCATGTGATGTGTCTGCTACCCCTTCACCTTCCACCATGACTGTAAGCTTCCTGA -GGTCTCCCTAGAAGCCAAGCAGATACCAGCACCATGCTTCCTGTAAAGCCTGCAGAACCA -TGAGCCAATTAAACCTCTTTGTAAATTACCCAGTTTGAGGTATTTCTTTATAGCAGTGCA -AGAATGCCCCAATACAGAAAATTGGTACCGAGAAGTTGGGCATTGCTATAAAGATACCTG -AAAATGTGGAAACAGCTTTGGAACTGGGTAATGAGTAGTGGTTGGAAGAGTTTACAGGGC -TCAGAAGAAGACAGGAAAATGAGGGTAAGTTTCAAACTTTTTTTTTTTTTTTTTTTTTTT -TTTTTTTTGAGACGGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGTGATCTTG -GCTCACTGCAACCTCTGCCTCCCGGGTTCAAGTGATTTTCTGCCTCAGCTTCCCAAGCAG -CTGGGGTTACAGGCATGCACCACCATGCCTGGCTAATATTTTTGTATTTTTAGTAGGGAT -GGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATTCACCCAC -CTCGACCTCCCAAAGTGCTAGGTTTACAGGCGTGAGCCACCGCTCCCGGCAAGTTTGGAA -CTTCTTAGAGACTAGATAAGTGGTTGTGACCAAAATGCTGATGGTGATAGGGACAGTGAA -GTCCAGGTTGACAAGGTCTCAAAAGGAAACGAATTTATTGGGAACTGGAGCAAAAGTCAC -ACGTTATGCCTTAGCAAATAACTTGGCTGCATTCTGCTTGTGTCCTAGGGATCTGTGGAA -GTTTGAACTTAAAAACTATGACCTAGCGTATGTGGCAGAAGAAATTTCTAAGCAGCAAAG -CATTCAAGATGTGGCCTTCTGCTACTAACAGCCTGTGCTCAGATGTGGGGGCAAATGAAT -GACTTAAATTTGGAACTTACATTTAAACAGGAAGCAGAGCCTAAAAGTTGGGAAATTTTG -CAGCCTAGCCAGGTGGTAAAAAAAAAAACCATTTTCTCCAAGGAATTCAAGCAGGCTGTG -GAGCAACCACTTGCTGATATTTGCATAACTGAAAGGGATCCAAGTGGTAATATCCAAGAC -AATGGGGAAAAGGCCTCAAAGGCATTTCAGAGACCTATGGGGCAGCCCCTCCTGTCATAG -GCCCTGAAGCCAAGGAAGACTGAATATTTTCCTGGGCTGAGCCCAGGGCCCTGTTGCCCT -GTGCAGCCTCAGAACACTGCTCCCTGCATCCAGATGGCTCCAACTCCAGCAGGGGCTCAA -AGGGGCCTAGGTACAGCTTGGGCTGTTACTTTGGAGGGCATAAGCCATAGCCTTCACAGC -TTCCATTAGGTGGTAAGCCTGCAGGCACACAGAATGCAAAAATGGTGAATTCTTGGTAGC -CTCTGCCTGGATTTCAGAGGATGTATGGAAAAGCCTGGGTGTCCAAGCAGAACCCTGCTG -CAGGAGCAGAGCCCTCACAGAGAGCCTCTACTAGGGCAGCATGGAGGGGAAATGTGGGGT -TAAAGGCCCCACGCAGAGTCCCTACTGGGGCACTGCCTAGTGGAGCTGTGAGAAGAGGGC -TACTGTTCTCCAGAATGGTAGAGCCACTGGCAGCTTGTACCCTGCACTTGGAAAAGCCAC -AGACACTCAACCCAGCCTGTGACAGCAGGCTGAACTCTGCAAAGCTATAGGAGCAGAGCT -GCCCAAGGCCTTGGGAGCCCAACCCTCATATCAGCGTGCCACATGGAAACCAAGGAGATC -ATTGTGGAGTTTCATGATTTAATGACTGCCATGCTGGGTTTTGAACTTGCATGGGGCCTA -TAGCCCCCTTTTTTGGCAGGTTTTTCCCTAATGGGAATATTTCCCCAATCCCTGAACCCT -GATTGTATGTTGGAAGTAAATAATTTGTTTTTTATTTTATAGGCTCATAGGTGGCAGGGA -TTTGCCTTGTCTCAGATGAGACTTTGGATTCCTGAGTTAATGCTGGAATGAGTTAAGACT -TTGCGGCACTGTTGGGAAGGCATGGTTGTATTTTGCATTGTGAAAAGGACATAAGATTTG -GGAGGGGCCAGAGGTGGAATGATGTGGTTTGGATATTTATCTCTACTTATGTTGAATTTT -ATCCCGAGTGTTGGAGATGGGGCATGGTGGGAGGTGTTTGGATCATGGGGGCAGATCCCT -CATGGCTTGGTGTTACCTTTGTGTTGTTACTGAGTTCTCGTGAGATCTGGTCATTTCAAA -GTGTATGAAACCTGCCCCCTGCCCCCCCACTGTCTCTCACTTGTTTCTGCTTTCATCATG -TGACATGTGTGCTCACCTTCTGCCATGATTTTAGTTTCCTGAGGCCTCCCTAAAAGCCGA -GCAGATGCCAGCACCATGCTTCCTGTAAAGCCTGCAGAACCGTGAGTCAACTAAACCTCT -TTTCTTTATGAAAGAAAAGGAAGGAAGGGAGAGAGGGAAGGAGAAAAAGAGAGAGGGAGA -GATGGATGGAGGAAGGGAGGGAGGGCTTACAACCATGAGGACAGTTTTTAGGTCAATGAG -GGATGACTTGGGAGTCCTATGAAGACTGATGTAAACTAGAATAAAGGGCATGATGAGCTT -ATGATTCAAAAGTATTTTGTCATAGAAATAGTTTGTTTTCTGTAAAAGAACACAGTAAAT -ATTTTAGCTTTGTAGGCCACTGAGTCTCTGTTGCTTTAAAAAATGTGAAAACCATTCTTA -GCTTGAGGGCTGGACAGTCCAGGGCCATACTTTACTGACCGCTGCTTGAACTAAACGCTG -TTAGAAGCAGCTCTGGAAAAATAATTTGCATGGAATCTTATGATTTTTTTTTTTTTTTTT -TTTTTTGAGGCAGAATTTTGCTCTTGTTGCCCAGGCTAGAGTGCAATAGCGCGTTCTTGG -CTCACTGCAACCTCCGCCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCGAGTAG -CTGGGATTACAGGAAGGCACCACCATGCGTGGCTAATTTTGTATTTTTAGTAGAGACAAG -GTTTCTCCATGTTGGTCAGGCTGGTCTCGAACTCCCAACCTCAGGTGATCCACCCGCCTC -GGCCTCCCAAAGTGCTGGGATTACCAGCGTGAGCCACTGCACCTGGTCAAGTATTATGGT -TTTTTAATAGTATGCACACATGGGACAAAACTCAACTGGTATAAAAGGGTATGCAGGAGA -AAAAAAGCAAACTTCCCTCTCTCCCTTTTCTGTCCACCAGCCATCCTGTTCTCCTCCCTA -AACTCAATTATGGTTGCCTGTTTTTTATATAAGTTTTCCATGAATTTATAAATACATCAC -GTGCATATATCCTGTCAGTCAATATTAAGAAATTACTAGGTTATTTTGTGTTTATGTGTG -CACTATTAGATTTAATGAGTTATGCTAGTTGTTGCCTCTTATATCCACATTCAGTCTTCA -TTGTCTGTTCTGTAATAATAGATCTGGGCCCTGTAAATACCTCTCCCATGACAGTAAGCA -CAGAGTGAAACTTTGTCAATCGAGGGTGCTGCTGACACACTGAAGGGGCAAGGGCTGCTT -TTCCTGGTTCCATTGTGCTCCTCTAGGCAGACACCTGCAACACCTGTGCCATCTGCAATA -CCAGCTCCTGTAGCACATACACTCTGCCTCTGCAGCACCTCGTTCTGGCTGCACACTTCT -TGGGCGGTGCCTAACTTCAGCAGCACCCAATGGTCAGCAGCGCACAGTACCCCCACATGA -ATGGCTTCCCTTGACATGCACAAGGTCCCTTCTCTGCAAAGTGCCCCAAGCCCAGCACCT -TCTCCAGCTGCAACTCCACAGCCTCAGCAAACCTCTGTCTTTCACAGCTGTGTCCTCTCA -CACGAAGTCTGGATCTCAGCCCGGATCTCAGCCCTGAGCTTTCTTCTTTGAGTTGTTCTG -TCTCAGCCTGGGGTGAAAAGCCCATATCGGCTGTTCCCTGCATCTGCCCAGGCTTCTCTT -TATTCCTTACTACCCAATCCCCATTCCAATCCTCTGTTAATAACTCTTACGGACAGTCCC -CAACTCATGATGACTTGACTTAGGATTTTTCTACTTTGCAATGGTGCAAAAGTGATCCGC -ATTCAGTAGAAACTGTTCCTCAAGTACTCATACGACCTCTATTTTTCACTTTCTGTACAG -TATTCAATAAATTGCGTGAGATTTTCAATACTTTATTATAAAATAGGCTTTGTGTTTATG -ATTTTGCCCAACTGTAAGCTAATATAAGTGTTCTCAGCGTGTTTAAGGTAGGTCAGGTTA -AGCGATGATGTTTGGTAGTTTAGGTATATTAAATGCATTTCTGACATACAATATTTTCTA -CTTACAATGGGTTTTTCAGGATATAACCCTGTTGTAAGTTGAGGAGCATCTTATTTTATT -TATTTATTTATTTATTTGAAATGGAGTCTTGCTCTGTCACCCAGGCTGGAATGCAGTGGC -ACGATCTTGGCTCACTGCAACCTCTGCCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCC -TCCCAAGTAGCTGAGACTACAGGTGCACACCACCATGCCTGGCTTTTTTTTTTTTTTTTT -AATTTTTTTTTGTATTTTTAGTAGAGACAGGATTTCACCATGTTGGCCAGGCTGGTCTCG -AACTCCTGACCTCAAGTGATCTGCCCACCTCGGCCTCCCAAAGTGCTGGAATTACAGGCG -TGAGCCACTGCGTCAGGCCGAGCATCTGTATATTAAACTTTCCCCATTCAAATTTCTGTG -TGGTTTCTGTCTCCTGACTGGATTCTGATATAATGCTTAACAACCTTTCTAATTACAAAG -GTATTACATATAAAATCAGACAAGCAAGAAGACAATCCATCCCACCTTCTAGTACTCTTG -CCCTCCAGAGGTAGCTCCAGTTAATATTTTAGTGCTAAACTAGATTTATTTTTGTTTTAA -ATAGAAAAATAATGCAGGCACGAAAGTAAAACAAAAAACAGTACAGAATGGGAGAGACTG -AAAAGTAAGAATGGCTTCCAGGCCCACTTCCTAGAGGTACGCACTATTAACATTTTTAGA -TATAAACTTCCAGAAATTTTTTTCCAGTTTTATTTAGGTATAATTGACAAAATTATTTAT -ATTTCAGTTGTACAACATGGATGTTCAACATGTTTTGGTGTACATATACTTTCTGATATT -ATAAATGGTTACCACAAGCAAGCTCAGTAACATATTCAGAAATTCTTAATGTAGCTAGCA -ATATAAGTGGTTTTGTTTTTTGTTTTGAGACAGACAGGGTCTTGCTCTGTTGCCCAGGCT -GGAATGCAGTGGCGCCATCTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAAGTC -TTGCGTCTCAGCCGCCCTAGTGGCTGGGACTACAGGCATGTGCCACCACACCTGGCTAAT -TTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGCTGGCCAGGCTGGTCTCGAATTCCT -CACCTCAAATGATTCGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCA -CCGCACCCAGTCATAAGTGGTTTTCTAAACAAATGAGACCACACCATACATACTGTCCCT -ATATTTCATACTTGGGCAAGGGGAGGGGAGTTGACTTTTTTCTTAGTGAGAATAAAAATG -AGGATAAAAGTATGGTTGTTTACCAACTTATAGTAGTATCATGAATTTCGAATGGTCTTC -TGGCCGTTCAGAAAACTACTTAACTGGTAGGAACGAAATTCTGGACACTGACATTGATAT -AGACACTCATATCAAATATAATACTATGAAATACTATGATATGGAAATAATATGCAATCA -CTAGAGATAAAATATTTTCTACCCAAGTAGAGTGGATTCATAAGAAAATTCTAAATTATA -GCATATGTTGAACTCTGAGAAGCCTCTGGAATGAAGTCATTTTTCCCTAACCCCTGTTTC -CTCTTTATATTGGCAGTGGATAAATGGAAAGTAAGTTAACTCTACTGTACCAAAGCTAGT -TCAATATAGAAAACAGGTTCTACAAGGATTAAGGAACATCTCTTGGCCCACAGAAGATTC -ATGTGGATCCTGTGTTAAACCCGTTTCATCCATGTATGAAAGTGATTCAACCGTTAAGTT -AGCCATTTATTATATAAATTGAATACTTCTTCCATATTGTGGCTTTTAGATAGATTGGCA -GACCTGTCCCCAACCCCTTCCCTGTTGACCATGGACAATGGAGGGTTTGCTGTATAAACC -TGATTGAAGGGTTTGCCTTTAGCTGGGGTGGATTACTCAGGGACCTCAAAGGTATTGGTG -ATGATTTATTTCTTGAGCTTGGTGGTGAGTATGCAGGACTGTGTTTTGTTTTGTTTTGTT -TTTTAGCAAGCCTTACACATTTTCTTTTGTATGCAATATTTAATAAAATAATTTTGGATA -ATTTGGTTTTTAGCATTAATCAACAACTTTTTTTACATCCTCAATATGCCCCAAGACAAA -TTATTGATTCAGCAGTTTTTAGCTGAATCTTTTATTTCTGAATGATTGGAGAGAACGGCA -GTATCCATTTCTGGAGAATAGTTAAGTACTTAGATTGAGGATGTCTTTCTTTCATGACAT -TAAGCAATGCAATATCATCTGCATCCAAGAGCCAACTTAACATGTTCAGTCTAATGAGCC -TTGGTAGTCGTAACACACATTGACTCAAAGACTTGACTGTTGTGGCCTGAGCTTTGATAC -ACTCTGTGAAATGCCTGGAGATGTCCAACTCCTGCAAGTTTGGCATGTTGTCCAGTGCTT -GAAAGAAATTTCTGTATCCTTCCTCTGTAATCTTGTGATTGATTGAAAGCTTTAGGTTCT -CAAGTTTCTGGAAACCTCCACTGATTGCTACTTTGGCTACAAGAACAAAACATTCATGAA -AATAGAATCATAAGGACTTCCATTTCAATAATGGTAGACAAGGTTATTTGAACCAGCCTC -CTCTCCCACCACTACTGCTAGTAGGAAGTACTCAATATAATTTTGTTTTTTGAAATGGGG -TCTTGCTATGTTAACACAGGCTGGTCTCTTTTTCTTTCTTTCTTTTTTTTTTTTTTTTTT -TTTTTTTTGAGATGGAGTCTCGCTCTGTTACCCAGGCTGCAGTGCAGTGGCATGATCTCA -GCTCACTGCAACCTCCGCCTCCCAGGTTCATGCCATTCTCCTGCCTCAGCCTCCCGAGTA -GCTGGGACTACAGGCGCCCGCCACCATGCCCAGCTAATTTTTTGTATTTTTAGTAGAGAC -GGGGTTTCACTGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTTGTGATCCACCCGCCT -TGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCTGGCCATCACAGGCTG -GTCTCAAACTCCTGGACTCAAGTGATCCTCCTGCCTCAGCTTCCCAAGTAGGTGGGATTA -CAAGCACGTGTCACTGTGCCCAGCTTAATATAATATTTTGAAAATATCTCCTTAAAAACC -CCAAAGAGCTGATGGGTTAATAAAGAACCACCTGGCAAAAATCTAAGGGAGAAGCAGAAA -CCAAAGAAGTACAGCCAAGCCTAAAGCACTGACGCCATTGTGCTGAGAGTTTCACCATCC -TGGACAAATATGAGCTTCTCTTTTGGTCTCACAGGAGGTCACATGCCAAGGTACATCATG -CCTACAAACCAGACTAAATTGGCAAGTCACAGTGGCTCACGCTTGTAATCCCAGCATTTT -GGGAGGCCGAGGTGGGTAGATCACTTGAAGTCAGGAGTTCGAGACCAGGCTGGCCAACAT -GGTGAAACCCCATCTCTACTAAAAATACAAGAATCAGCCGGATATGGTGGTACATGCCTG -TAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCAGCTTGAACCTGGGAGGTAGAGG -TTGCAGTGAGCCAAAATCCCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCGT -CTAAAAAACAAAAAGAGAAAACAAAAAAAAAAAAATCCAGACTAAATTACAAGGGACTTC -AAAGAGTGTAGCAATTATGTCTTCCCCTTTTATAGAGAAGGGGGTGTGACATTCCTAAAG -CCATGTCATCCGACTGTCCACTGCTATGCCCTATTTCTGTTGCCCAGAAGGGACCCTCCT -GTTTTTGAGACTAAGGGCTCTGAAGGAAATGGAAGCCGGGCACACCCTGTGTTCTCAATG -AACACAGGCTGACTAGACTTTGGAATGGGTACCAAGCAGAGTTCTTGTTGTTTGGTTTAG -GGGTTTTTAAAAAAAAAAAAATTTTTTTTTTTGAGACAGGGTGTCACTTGGTTGCCTAGG -CTGGAGTGCAATGGTTCAGTTATAACTCACTGCAGCCTAGAATTTCTGGGCTCAAGCAAT -CCTCCCGCCTCAGCCTCCTGAGTCCTAGCTACTCAGGACTAGCCACCGTGCTTGGCTAAT -TTTTCAATTTTTTATGGAGACAAGGTCTTGCTATGTTGCCCAATCTTGTCTCAAACTCCT -GGCCTCAAGCAGTCCTTCTATCTTGGCCTCCCAACGTGTTGGGATTACAGGCATGAGCCA -CCATGCCCAGCCTTGTTTTTAATTGCTAAACCTCTTTTTTTTTCTAACTTGGGCAAAGGT -TAAGTTTGGTTTCAATCTAGAATCCATGGCTGTAGCTTGACTGAGGTTAAACAACAGAGG -TACTGAGAAAATGTCTTCAGCTATGTCCTGAATAGGTATCTTCAGTAACATTCAAGAGAT -ATTTCTCATTCCCCCACATGAAAGACACTTCTGGAGGGACTTGAAGAAAGACTCAGGTCT -TTCACATCCATTCCCTTCTTTCCCCTGTTTCAGCATAACTCCCACTTCATATTGTGTGAT -TAGCCGGTTCTGTGATGTGTCTGAATGCTGTCTCCTACAGGTAAAGTTTAAGCATTACTG -ACTATAGGCAAACAATGGCCTCTCAGCTGTCCATCAGAAGAAGCTACAGAAAAGTAAGTT -TTCTTTATTCAGTCAACAAATATTTACTGAGTTCCCACTATAGGCCAGGCATACTCTGCT -GGGCGCCGGGAAGAGAAAACACCTGCTCTCAGGAGGGAGAGCGACAGGGGTTACTGGCTC -AAATCTGTGTGTGAGATTGAAGTTCTAAGGAAGGCTTTGACCTAATGTAGTGAGAAGAAT -AAAACACAAATAATTTATAGTAAATGAGGATGAGAGAGACCACAAAATTTAATGTTATTA -AATTCTTCCTCATAAGGAGGAAGAAACAAGGCCTTTAAGAAAAACGTTGTAAGTTGTCAT -TGTTTTTTGTTTTTTGTTTTTTTTTTTGAGACGGAGTCTCGTTCTGTTGCCCAGGCTGGA -GTGCAGTGGCATGATCTTGGCTCACTTCAAGCTCCACCTCCCGGGTTCACACCATTCTCC -TGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGCCCGCCATCACGCCTGGCTAATTTT -TTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGA -CCTCGTGATCCATCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAAGCTTGAGCCACCGC -TCCCAGCTGTTTTAAATAACGTAAAATAACAGTGCTGAGCAGTAAGAAAATGAGATCCAG -CCCTTGTAACACACCCGACAAAGCCTCTCCAATTGGGCTTTTACTTTCCTCTCCAGTCTT -CTTCTCACAACTCACTCCTGCCTCTTGCTCCAGACATTCCTTTTTTTCCCCCAAGTTCTT -CAAATATATCTGGTTCTCTTTAACTCCAGGCTATGACACAAGTGATTACCTCTGTTTGAA -ATGATGTCTCAATTCCTCCTCTACCAACTATGCCTGGGCCATTCTCCCTTCAGGTCTCGA -CAGAAATCTCAGTTCCTCAGGCCAGGCACTGTGGCTCACTTGAGGTCAGGAGTTCGAGAC -TAGCCTGGCCAACATGGTGAAACCCTGTGTTTACTAAAAATACAAAATTAGCCAGGTGTG -GTGGCGTTCACCTGTAATCCTAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACC -CAGGAGGCAGAGTTTGCAGTGACCCGAGATCGTGCCATTGTACTCCAGCCTGGGCAACAC -GAGCGAAACTCTGTCTCAAAAAAAAAAAAAAGAAGAAAATAAGAAAATAAAATTCAGTGA -GCAGTACACATGATTTTTGTACTTTTCTGTGTGTATATTCATCTTCAGTTTTAAAAAGGA -GTACTTAGGAGAGTATGTGGCCAGGTGTCTAAAACACCAGGTGGCAAGAACGCAGATTTG -AGGGCTTGTATATCCACAAATGGGAGACCTTTTTGTACTTCCAAATCTGACCAGAATGCG -CCTAAATCCACAGAAGGACACTAGAAGGAACAGTATGATAGTGAAAATGAGGAAGCGGGT -TGAAAATTTCTAGAGGGGCAAATGTTTACATAGACATGTTGCAGCAGAAAGCTTGGCTAT -ACCACTGGCTTCCATGCAAGCTGAAACACTAGCTCACCAATTTCCACCACGCTGTCATCA -TTCAAAGTCTTGAAAAATGAGAGGACTCGGAGACAATGAAGCTGCTGACACTGCTGGATG -ATCAGTTTGGCCACTCGATAAATTCCATCCCCAGTAGGAAGGATCAATTCTTCCAGGTTA -CTAAGAGAACCTAAAATGTAGGCTGTCAGAAAAGACCAAAAAGCTATTCTCTTTGTACTT -TTTGTTCCTATGCAACAGTAATCTGAAAATCATGTATGGTCTAAACATCATGCACAGTCC -AGGAAGCAAGAGAAGGGCCAGCACATGCTTCCGTCTTCCTCGCTCCTCCAGACAATTCCT -CCACCACACCCTACCAACCAATCTCTCCTGCACTAAAGTCCAGGCTGCCAATTAAATCTC -CATTCCTCGTTTAGAATGAAGCTTTTCCTGACCTGCAGGTTCCTTTCCCAGCTCTGCAAT -GCCTTCAGCCACCTCCTTTCCCATTCCACCTCCCGCTGTCATGCTCAGTGATTCTGCACC -GGCCTCCTGGCCCTAAAGCCTCACAGTCCACCACTCTTAGAACCTTCCTTTTCACTTCGA -CTCCTTCCTAGCATGGCACACGATAGATCATTTGATCATTAGAAAGGTAACTTCTGAGGC -CTCACACATGGAAATATATTGAATAATTTCTAGCATAAATCAATGTCCCAGGCTTATCAT -TTTCTTTCCCTGTCTCAGTTCTTGAATCAGCCAATTTTGCAAGGACCCTGGTTCCTTTAC -GGAAGGACAGGATTTAGAAATCAGTATCTAGGCACTTGGTGTGTTCATTATTACTGCAGT -GCCTTCTAGTCTCTCCATTGAGCTGAGAAATATGTATGTGTGTATCTATAGACATGTGGA -GATCAATGTATATATATGGAGATCTCTATGTATGTTCATAGGTTAGAAAAAACCATGAGG -CCAGACCAATATTTCCAATTCTAATCCAACACCTCAAAGCTCTTTCTAGCCTCACCTTTC -AATATTTGTAAGTTCCTTTTCCAACTGTGAGGAAACGTGGCTCTCATTATACTCACTGTT -TTGTTCGTTTGTTTGTTTTTGAGACAGAGTCTTGCTCTGCCACCCAGGCTGGAGTACAGT -GACCTGATCTTGGCTCACTGCAATCTCTGCCTCCGAGGTTCGAGAGATTCTCCTGCCTCA -GCCTCCCAAGTAGCTGGGACTACAGGCATGTGCCACCACGCCCAGCAAATTTTTGTATTT -TTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGG -TGATCCACCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGCATCTG -GCTATACTCACTGTATTATTTGCTTAATCAACCTATGATGTATGCCAGCCATCCCCTTGG -CCCTGATCCTACCTCTGCCACCTCAGCCCCCACCCACCTAGCTGCCTCCAAGGAAGGGAA -GAGGAGGATCCTGGCAATCTTTATGTACGCTTATGTTTGAGAAGCACTTTGAATAGCACA -GCTCTACAGTCCCTATCTCCACTGCCTCCCTTTATTTTTTTTTTTTCCTGTTAGAATGTT -TTCCATGAGCATACATTACTTTCATAATCGTTTTTTAAAAATCAAAAAGTATTTAAATTC -TATAGCTATCATTTTATGAAATACACGTTTTCACGTATTGAGTTTTCCTAACCTTAAAAG -ATAGATTTTTACCTATATTCTAGTAACCTTTGAAACTATGAGGTCTTATATTTGGAACAG -ATACCTGACAATTATATTGCATAGTGCTATAATGACCAAATATGTACTCTAAGAAGTCAT -TCTGCTTTGAATGAGATCAAGTCTGCAGGTAAAACTGTAGTGAAGGCATTTGGTGGGTAG -AAGTTGAGCATGACTATCACTTCGTTCTGGTAAGCAAGGGTAGCTAATGCATATCTTGCT -TCCTTGTGGCTAATTCAGAATAGGAGAAAAAGGATTCTGGGCAGAAAGAGAATAAGAATA -CGTAAAAAGCACTATTTTGTAAACATACCAAATTTTTCTGATGTTTCCTCATCAGGAAAT -TGCTGGCCTTCAAGATTTAATATCTTCAGAGAAATAAAATTTGGCAAACTGGCAACTATG -AAAGGGAAAATAAAAATTTAGTTATGTCAGCTACATCTCTCACAGCAAAGTGTAACATCT -TTAAAATGAAGAAATTAGAAATTAATGTAATAATTCAGATTGAATTCATATGACTAAGTA -GATAAAACAAATTTTTGCTCAGAAAATAATTCTGATAACTAGAAATGTTGTTAAAATTCT -AGAAATTTCAAATCCAACTGAGCGTAAAGGTAAAAAAGAAAATTCTGGAAACACCTATAC -TTTATTATTTGAGAAACAATTTTTCTTTTCTTTCTTTCTTTTTTTAAAGAGACAAGGTCT -GTGTCACCCAGGCTGGAGTGCAGTGATACCATCACAGCTCGCTGCAGCTTCAACCTTCTG -GGCTCAAGTAATCTTCCTGCCTTAGCTTCCCTAGTAGCTGGGACTATAGGTACCACTGTG -CCCGACTGATTTTTTTAAAGCTTTTTAGAGGTGGAGGGCTTGTTATGTTGCCCAGACTGG -TCTCAAACCCCTGACCTCAAGCAAGCCTCCTGCCTCAGCTTCCCAAAGTGCTGGGATTAC -AGGCATGAGCCACCACACCCAGCCTCAAAATATGGCCCTTTTTAAAGAGTGCTTAATATG -ACTCTGTGTCCATAGAAACATTTTAAACCACCATTGTAATATTATAACTCTTACCAAATG -GGACGGCTTGAAAAAATGAATCCGAAAACTTAATTTCTGTGAGTTTCTTACAGGAAACAA -GCATAGTCATGAGAGACCCAAAATCCGAAAAGAAGTTACACTTCAGATGGAAAACATGAA -GGTTTGGAGAATTTTGAATTAATTTTACTGTAAAAGATCAAGGATTTTCAGAAATTAGAA -AATACTGCAAATTTCTATCAAAATTAGCCAAGTAGTTTATTATTTTGTTTCAATAATACT -TAATTAAAACCAGGTACCACGATCTCATGATCTAAGAATCAGGTCACTGGGCTTGGGTTC -AGGTTCTGCAACTAGCAAGTAATGTGAATTTTGACACATTATTGACCCTCTCTGGGTAAC -AGTTTTCTCTTCTATAAAATACAGGCATTGACTCAGTAGCATCACCTTCAGCTTTAACAC -CTAATAACTCTAAGTTTGTACATGACATTTACACAATAAGAATACAAAGAGGCCAGGTGC -TGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGTGGATCATCTGAGC -TCAGGAGCTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAA -ACATTTGCCAGGCGTGGGTGGCAGGCAGCTGTAATCCCAGCTACTCGGGAGGCCGAGGCA -GGAGAATCGCTTGAACCCAGGAGGCAGAGGTTGCAGTGAGTGGATATTGCGCCACCAAAC -TCCAGCCTGGGCAACAGAGCAAGACTGCATCTCCAAAAAAAAAAAAAAAAAAAAGAATAC -TATACAAAGAAACAGCTGGGCATAGTGGCTCACGCCTGTAATCCTAGTACTTTGGGAGGC -TGAGGTGGGCGGATCACCTGAGGTCAGGAGTTTGAGACTGGCCTGGCCAACATGGTGAAA -CCCCGTCTCTACTAAAAATATTAAAAAATTAGCCGGGCATGGTGATGGGCGCCTGTAATC -CCAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGACGGAGGTTGCAG -TGAGCTGACAACGGTGCCACTGCACTCCAGCCTGGGTGACAGATTGAGACTCTGTCTCAA -AAAAAAAAAAGAAACATAAAAGGAATGACAACTCTGAAGCAAAACCTAAATTTGTTGCAA -AATTTGCTGACAAATATAACTACCACCTCCAGCTTCGGTGTATATATGTATGCACACACA -CACACACACACATATATATATATATATATTTTTTTTTTTTTTTTGAGACAGGGTCTTGCT -CTATCACCTAGGCTGGAGTGCAGTGGCATGATCATAGCTTATTGCAAGCTTGAACTTCTG -GACTCAAGTGATCCTACCAGCCTCCTGAGTAGCTAGGGACTATAGGTGCACACCATCATA -CTTGGCTAATTTTTTATTTTTTTAGTAGAGACGATGTCTTGCTATATTGCCCAGCCTGGT -GCTGAACTCCTGCTCTCAGGCGATCCTCCCGCCTCGGCCACCCAAAGTGCTAAGATTACA -GGTGTGAGTCACTGTGCCTGGCCAACAATGAAGCTTTTGCATGCAAGTCTTTATGTCAAT -TCCATAGAGTTGTATATTTCTTCAATCTTTAGTGTTCAGTGTTTACTAAGTTAAGGAATG -ATGGTGCCTAAGTCATTTAGCTAAATGATGTATTTAAGAAAGATGGCTGCACCATTTTCC -ATGAACTATTAGGATAGGCTGGTGAGAAACAGGGAAATACTTCCAATGACTACGGATTAG -CAGATTTCCTTCCTGCTGAGCTGCCAGATCTGTAAGTTGCAATGTAAGACCAGCCTAACC -AAAAACAAAATAAAATAACCCTACAAATTATTTTGGAGTGGCAACATTATATTAGGGATT -TCTTTCTTTTTTTTTTTTTTTTTTTCTGAGATGGAGTTTTGCTCTTGTTCCCCATGGAGT -TTTGCTCTTGTTCCCCATGGAGTTTTACTCTTGTTCCCCAGGCTGGAGTACAATGGCGCG -ATCTCGGCTCACATTGCAATCTCTGCCTCCCAGGTTCAGGTAATTCTCCTGCTTCAGCCT -CTCAAGTAGCTGGGATTACAGGCATATGCCACCATGCCAGCAAATTTTTGCATTTTTAGT -AGAGGCAGGGTTTCACCATGTTGGTCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCT -GCCCTTCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGTCACCAGGCCCGGCCTAT -ATTAGGGATTAAGAACTCAGATTTTGGAGTCAAAATTCCTGTATTTGAGTCACAGATATA -CATTTCCTTAGCTGGATATTACGAATTACTTTATCTCTTTATGTCTCAGTTTTCCCAGCT -ACAAAATAGCATTAATAATAGTACTTTACTTTGGCCAGGCACGGTGGCTCATGCCTGTAA -TCCCAGCACTTTGGGAGGCCGAGGCGGGAAGATCATGAGGTCAGGAGATCGAGACCATCC -TGGCTAACACGGTGAAACGCCGTCTCTACTAAAAATACAAAAAATTAGCTGGGCGTGGTG -GCAGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGTGAACCTGAGA -GGAGGAGCTTGCAGTGAGCCGAGATCGTGCCACTGCACTCCAGCCTGGGCGACAGCGCGA -GACTGTCTCAAAAAAAAAAAAAAAATAATAATAATAATAATAATAATAGTACTTTACTTC -ATAGAGTGGGTATGAAGACTGAGTTCATATTTGTGAAGTGCTTAGAATACTTCCTAGTGT -GTAGTAAAGGCTCAATAATTACAAACAGCACTCTGCTTTCTTAATGAGAAAGAGTGCTAT -TCCTCACAATTTACCATGGATACAGGCTACACCCTTAGAACCACAGGCACTTTAACTCTT -AAATAAATTATTGGCCAAGTAGCTTTTCCAACTTACGTAAAAACAAGTATATTAAAGTGC -CATCCTTACCTAGTTTGGAAGGATCATACTCAGCTGAAATTTGGATCAATAATTTCTCCA -TATGGTGGAAGTTTGGAAATTCTTCAGGAATGACTGAAAAAACATTTATATTGCCCTCCA -GATCCACAGACAGTTCTTTCAGGCACAGGAACTTATCCAGATTAGGAAAGATTTGGTCTG -GAAAGCAGCACAGTTTCCCATTATTAATCTAAAGAGTTCTGAATGGACATTTTAAAACTG -TCATTTTGATTCATCCAGCTATTTTCACATGCAAACCTTCCACATACCATAAAACATTCT -TTTTTTTTTTTAAAGAATACATATATGAAGATATTGCTTTTTGCAGCTTATGCACTGTAT -GGGAAGCCCTGTGCTACTCTTCAGACTCACAAAAAGAAATACAGCATCTCGGCTAGGCGC -AGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCGGATCACGAGGTC -AGGAGTTTGAGACCAGTCTGGCCAACATAGTGAAACCCCGTCTCTACTAAAAATACAAAA -AAAAAATTAGCTGGGTATGGTGGTGTGCATCTGTAATCCCAGCTACTCAGGAGGCTGAGG -CAGGAGAATCACATAAACCTGGGAGACGGAGGTTGCAGTGAGCCAAGATCGCGCCATTGC -ACTCCAGCCCAGGCTACAGTGTGAGACTCCGTCTCAAAAAAAAAAGAAGAGAAAAGAAAT -ATAGCATCTCTTCAACAAACGGTTGGGGACAACTGGATTTGCACATGCGAAAGAATGAAG -TTGGATTCCTATCCCTCACCATGTAAAAAAAATCAACTCAAAATGGATCAACGACCTAAA -TATAAAAGCTGAAATCACACAACTCTTAGAAAAAACATAGGAGTTAATCTTCATGACCTT -GGATTTGGCAATGGATTCTTAGATAGGACACCAAAAGGACCAGCAATAAAAGAAAAAAAC -AGATAAATTGGACTTCGTCAAAATTTAAAACTTTCGTGCACAAAGGACATTATAAATAAA -GTAAAATGACAACCTATGGAATGGGAAAAATATTTTCAAACTGTGTATCTGATAACAGGT -TGAAATCCAGAATATACAAATAACTCTTACAATGCAACAAAAACAACAACAATTTTTAAA -TGAGCAAACATATTTTTTCAAAAAGTGAAAAGATACTTAACATCATTTTCATGATTTGCA -TTAGAGAAATGCAAATCAAAACCACAATGAGATACCACTTCACAACTACTAGAACGGCTT -TATGATAATCACAAAACAAAATGGGCTGGGTGAGGTGGCTCATACCTGTAATCCCAGCAC -TTTGGAAGGCCAAGGTGGGTGGATCATTTGAGCCCAGGAGTTCAAGACCAGACTAGGGGC -CAGGCACGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCA -CCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAA -AATACAAAAATTAGCTGGGTGTGGTGGCGGGAGCTTGTAATCCCAGCTACTTGGGAGGCT -GAGGCAAGAGAATGGCGTGAACCCAGGAGGCAGAGCTTGCAGTGAGCCGAGATTGCGCCA -CTGCACTCCAGCCTGGGGGACAGAGCGAGGCTCCATCTCAAAAAAAAAAAAGAAAGAAAA -AGAAAAAAGACCAGACTAGGCAACATAGCAAGAATCTGTCTCTACAAAAAATAAAAAATT -ATCCAGGCACGGTGGTGCATGCTGGTAGTCTCAGCTACTCAGGAGGCTGAGGCAGGAGGA -TCACCTGAGCTCAAGAGGTTGAGGCTGCAGTGAGCCATGATTGCACCACAGCACTCCAGC -TTGGGCAATAGAGCAAGACACTGTCTGAAAAACAACAATGAAAACAAAAACAGGTCGGGC -ACTGTGGCTCATGCCTGTAATCCTAGCACTTCGGGAGGCCAAGGTGGCTGGACTGCCTGA -GCTCAGGAGTTCGAGACCGGCTTGGGCAACATGGCGAAACCCCATCTCTACTAAAAATAC -AAAAGTTAGCCAGGTATGGTGGTGCACACCTGTAGTCCCAGCTACTCAGGAGGCTGAGAC -AGGAGAATTGCTTGAACCCGAGAGGTGGAGGTTGCAGTGAGCCAAGATCTCGCCACTGCA -CTCCAGCCTGGGTGACAGAATTAGACTCTGTCTCCACAAAAACAAAAATTAACAAGTGCT -GAAGAGGATGTGGAGTAATTGGAACCTTTGTACATGGATAGTGGGAATGTAAGATGGTGC -AGCTACTGTGCAAGTTCCTCAAAAAGTTAAACATAGAACTACCATATGAATCAGCAATTC -TGCTTCTAGGTATATACCCAAAATGATTAAAAGCAAGAACTTAAACCGATACTTATAATG -CCAGTGTTCATTGCAGCATTATTTATGATAGCCAGAAGGTAGAAACAACCCAAGTGTCTC -TCAGCAGCAGAATGGATAAACAAAATGTACTATATACATACCATGGAATATTAGCTATAA -AAAGGATGAAGTTCCTTTTCAAAGTTGATACATAATAATTGTACATATTTATGGAGTACA -TGTGAAGGAATGAAATTCCAATATAGGCTACAACATGATGTACCTTGAACAGTATGCAAA -GTGAAATAAGCCAGACAAGTGATAATGCTTATAAACAATATCTAGAAGAGGCAAATTCAT -AGAGACAGAAAATAGAAGAGAAGTTATCAGGGGCTGGTGGGAGGGAAGATTTTTTTTTTT -TTTTTTTTTTTTGAGACGGAGTCTCACTCGGTAGCCCAAGCTGGAGTGCAGTGGCATGAT -CTGGGCTCACTGCAACCTCTGCCTCCCAGGCTTAAGTGATTCTCATGCCTCAGCCTCCCG -AATAGCTGGGACTACAGGCGCATGCCACCACGCCCAGCTAATTTTTTGTATTTTAGTAGA -GATGTGGTTTCACCATGTTGCCCAGGGTGGTCTCAAACTCCTGAGCTCTGGCGATCCACC -CTCTTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCCCCGCGCCCGGCCCAATTT -ATTGTTTAATTGGGATGATGAAAAGGTTCTGGAGATGGATAGTGGTGATGGTTGTACAAC -ATAGTGAATGCTTAATGCCACTGAGTTGTACACTTAAAATGATTAAAATGTAAGCTTTGT -TACATGTATTTTACCATAATAAAACAGTACTTGAAAAAAGATGAAAAATTTTCTAAATTT -GGTAAATGTCAACCCACACATTCCAAAAAAGTTCAGTGCACCTCAAGCAAGATACATACA -AAGCAAAGCACACCTAGGCATATAACAGTCAAACTGCTTAAGACCAAAGCAATACTAGCA -ACAATTAGAAAATGAAAAAATATTTTTAATGACATTTACAATACTTTCAAAAGATATGAG -TATCTAGGAATAAATTTAATGAAAGATGGGTTAAGTCTACACTGAAAACTATCAAATAGT -GCTTAGAGGAGTTAAGACACAAATAGATGAAGATATTATTTCCCATTAATTTATTTATTT -CCCAGGGACTACAGGCCTTTCTTCCTTTAGGCAGCTAGGGTGAAGGTAATTTCTAAGCAT -CATCTTACATATAGCTAATTCTTTTACTAATAACAGATAATTCATGTCTTTATTAAGAAC -CTTCAATAATTTAATATAAATATTTTATTCATTTTGTCTGAGTTATTTGAAAACCATTCT -ATTATTCAAGGACTTTTCACTAATTCATGCTACTGTCAAAAAAAATTAGTGAAGGTTTAT -TTTATATCTGTTCTATCAATGAGCATGCATGCTTTCATGGCCTCAGAAGTTTTCAACCAC -TTAAAGTAAGAAAAAGAAATTATACATCAGAATAGTCATCCAAAATATATACAGGTATAC -CTTGTGACTGGATTGTCCCTGAGACTTCAAGAGATTCCAGGGAAGGCAGGGTGAGAAGCA -GTTCCTGTTCGGCTGCGCTGAGTTCCAACTTGCTTATGGAGCACTTGGTGACAGAGGCCT -TAGACAGCTCAAGAGCTGGGCGGATGCTTTCTATAAAGCCTCTGCTGTGGTTTAAATGGA -GTTCGATGCGCTGTGAAGCTGAGAAAACTGTCATTAGAATCTCAAGCATATCCTGGCCTA -CAACATCAATATCATTCACATCGACTTCTAGACAGGGAATCTTGTACTGCTTTGGAGAAA -GTTTCCAATAGCCAGTACTAAGGTCTGGTGATGCCCTGCGCTGCATATCCATATAGCTCT -TTACATTATCCTCTTTTTCAGCTAAATTTCGCTCCCATTCATTCATAGGTTCAAAGGCAG -AAGCATAGTCCTGATCTATAGTTGGCACCTGTGATTTGTCAAAACATGTTTCCAGAACTG -AAAAATGTGCTCTGGGTGATGTCTTATTTCCTCGTATTGGGAAGTGGATGCTCCTCAACA -ATGACAAGCTTTCTGGGTGGTCGAAAAAGTACTGTAAGTTAAGCGCACCCAAAGTCAGTG -TTCTCCCTTGAAGGAATTGCAAAACAAATGGAGAACACGCAGCAACAGTGTTGCTTTGAT -AAGCAGTTTTCAGGGCAAGAACCAGTAAATGTTCTGAAACCATTGAAAAGTAAGCTTGTG -GACAAATTTGCCACAATCCCCTAAGTAACTGCATCTGCAGTGAAATTTCTGGCTGGTGCT -TTAAGTAGTCATCATTTTCAGATATATTCTCCAATGACTCTTTGTTATCCACTAAATGGA -GCAAATGAGACACAATTTTGGGCCCTGCTTTTGTTGAAGGGAGGCTGGAGACATAGTTCA -AAAAATTGTTGTAGGCGCTTACAGTCATCATGGGTGAGTTGATTTGTTTCAAATGATACA -GTCCCAAATCTTGATGTTCCTGCCTATCTGAATCCAGGAGTTCAATCAGCCTCATCCCCG -CAAGAAATTCTTGGAAGGCAGGACTTAAAAACCGGTAGAATGGTCTTAGTCTCTGGGCTG -TAAATTTGCTCATCAAGCACATGGTTAGATCTTCATCTTCATCAACCCCTGCTTCTGCGA -GATCATCATCATTAAACTCAAAGCAACATGAAAAAAACCCTTTCAAGGCCAGCTCACCAC -AGGAGGACACAGTTGCTTTGAGAATTTCAGCTGTCGCTTTGTTCCTTAAGGAAAGGCGTT -CCATATAGGACTTGAAAACAGCCACATCATCAAAGGATGGGTCAAAAGGATACTGAAACC -AATGAGCACAGATCGCCGCCACAAAGAGAGGAGTTTTCTGTATCTTCTGCAAACTTTGGT -TCTTTCCAAAGTAAACCATAAACTTTCGCAGACGAGTCATATTATGTGAAAAGAGCTTCC -GTAATATACAGACAGTATTATAAAAGGGAAATGCTTTGATCTCTAGAATGGTCTCTAGGT -ATCGGCGGATGTCCCTGGCCCTGTTTGTACGGACAGCAATCAATAGGCAGGTCCGGGATA -AGTGGTTTTTTTGAATCAGTTTTCCTATGACTTGAGGGATTGAACATATTTCTTTGTAGT -CATCTAAAAGGAATAAGACCTGATTCTTTAACTGCTGGATAATGTTCCTCATGCACATTT -CAGTAACAGATCCTTCTTTCTCTAGGAGCTGGTCACAGATGATACTGGCCAGCCCCTCGT -CTGGTCTGGTGGAACTAAGGGAGAGGTAGAAAACCAGCTGGAACCTGTTTAACAGGGGAC -AGCATCCAGATGCCCACAGAAAAGCTATTTTCTTCAGGAGGACCGTCTTTCCACTTCCAG -CTTCACCCTCCACACACATGACAGAGTTCAAGTTGCCAAAGACCTCAGGCAGCACCAGAG -GTTCTTGCACAGGTTTGCTGATGTGTTTTGAAGCAATAGACAGATCACAGCCCAGCAAGT -GGTCCGTGGCCAGATCGGAAGAGATATCAAGCAAAGACATGTGGCGGAAACTGGCGCTGG -TATAAGCTGCTCTCAGCTGCTCATTCAGATTCTTTGCCTCTTGAAACCACTGGGCTTCAC -CCTGTGCCATTTCTGTGGAGAGAAAGAAAGGGGGGCACAACAGGGATTCATAGTCACATC -TCCCTCAGTCTGAACGCCATGCCTTTTCATTCCATGATTCTGCCTGTCTACTACGAATGT -GTTAGGATTTTCCACAGCCATCCATGATTCCCACATTGCAATCATCTCATAGGTTTTGGC -ACAAAATCGGAATGTGGAAAGCATGTGTCCAAAGTGCCACACTTGAAGCAGGGACCTAGA -CATAATGTGTGCTTATCATAAGCACCATGCATCTCAGGAAAGAGGCCAGGCAAAGTGACT -CATGCCTGTAATCCCAGCACTGTGGGAGACCGAGGCAGGCGAATTGCTTGAGCCTAGGAG -TTCAAGACCAGCCTGGCCAACATGGCAAAACCCTGTCTCTACAAAAAATACAAAAATTAG -CCAGGTGTGGTGGCACATGCTTGTGGTCCCAGCTACTGGGGAGGCTGAGATGGGAGGATC -ACTTGAGCCTGGGAGGTCGACGCTGCAGTGAGCCATGATCTTGCCATTGCATTCCAGCCT -GGGTGACAAAGTAAGACCCTGTCTCAAATAAAATAAAATAAAATAAAATGAATAAAAATA -AAAATCTCAGGAAAGAAGTTTACTGATTGGTGCTTCTAAGGACTGGTTTGCTTGTACCTG -AGACACGTTGCCTACTATCAGTTTGGTCCTGCCTGCACTCTGGAGAAGCCACAAGAATCT -TGACTTTTGCTCATACACGACACTGTTGCAATGCTGCTCCTCTTTGGAAGCTCTTTGGAC -AATTATAAATACTCTTTTTTTTGCAACTGCCCTGTATACAAATATATTTACAAATACATA -TAATCCCACGTGCTACTTCAAAGTTCTTACCTGGCACTATAGGACCAACTGCTATTGAAT -CTTCAAGATTGCTTTCACTTGTGGTTTCCTTTGAAAAATAAAATCTTTTCTTAAATCAAA -ATTTGTATAGGAGAGTGGTGCATCATGTTGTAATCATTGGAGACTAAACATCTTCTAAAC -ACAGCCCACCCTCAAATATTTATGCTGATTGAAGGGAGGGGTGCTTCAAATATAAAGCAA -TTAATAATCTGCAAACATCTGCAGGAATGTGTGTTCCTCAGTGGTTCATATATGATACAA -TCCATGGGTGATACTATCCAGGTAACGAGAATCATAGTGTCTTGAGCAAGAAGGTTCCTC -CAAAAGTCATTAGGTGCAGCCTCTGTCCTTGGTTAACTACATTTTTAATATTACACATCC -AGTACTTAGAAAGGTTATGTTTCTTCCTTAAAAAATAAGCTATTTCTATATCCCCAAACT -TTTTTCTTTAGTGCCATTTTTCAATTCATATAGAAAATGAATCACGTGAATAGCAAGTTG -GTTTTTGTGTGTGTTGGGGGTGGGGGGATGACAAACATAACATACCTAACACACAGCCTC -AAAATAAGGTGGTATATGTGTGCATTAAATTAGTAGTGGCTTAGATCCCAGAGAACTAGG -TAGGATTTCTCAGCTTAAAGTGGTCAGCCTATTACCTGGGTTTATGATTTTGTTGGGCAC -TTGCTTTTGTTAAAGTAACATCTTCTCAACCCCTAAAAAGGCCAAGTAGCAAATCCAGCC -TTATTGCCCTCCACATGAAGATCCAGAGAGGATCCATTTAGCCTGGATGCTAATCAACAA -AGTCTAAGAGAGTCTGGGCCTTGAAGTAGAAGAGATTGACTATACATTTCATGAAGCTAG -GTATTGTGCCCGACTGGTTTATTGTGATTTCCCCAATCTGTACCTAGAACATAACAGATG -TTTAGTAGATGTTTGTAGAATTTATTAACTTATTTATGATGAGACATTCCTGTTCAAAAC -AGTTTTCAATTATATGATCATGTGCTGGTAAAACAGACAAGATGACGGTGTTCATTACCA -GTAATTCACAAAGTTCACCACGGCTCTGAAGGTCTGGAGTCACTTCCGCAGAGGACTTCA -TATTTTGGAGAAATGGACAACTAAGTGTAAAAGTTAAAAGTTATAAAAATAGTTGTATGC -ATTTAAGTATTTGTGTTAGGTATTTTTGTTGTTGTTGTTGCTGTTGTTGTTTGTTTTGTT -TTTTCAGAGACAGGGTCTCACTATGTTGCCCAGGCTGTTCTGGTTTCAAACTTCTGGCCT -TGAGCAATTCTCCTCCCTAGGCATCCCAAAGTGCTGGGATGATAGGTGTGAGCCGCCAGG -CCTGGCCTTGTATTAGGTTTTTAAAAACACTATTAGAGTTTTAGTGACAAGAATTAAGTA -TAAAACTATCATTTCCAGTGATTCTCTCTGTCAGTGTCATCAGGTAAGGCACTTAGGTTG -TGGCTCTCTGCAAGAGAAATGTAAATGGTTTATAAAGTAATGGTAAAGGATATCAAGGGA -ATTTGGTGGAATGAGTGCGTATATGACACTCCTCCATGCAACCAACAAAAATGAACTTAA -AGAATCAAAAATAGGAAAAAAAAAACCCTCTATTTATGAATTCTGGAACAAAAAACAACA -TAGTGGAAAAACTGGTGAAATCCAAATAAACTCTGGATTTTAGTAAATAGTAATGTACAG -TATATGAATTGGTACACTGATGATTTTGACAAATATGCTAGTGTAAGATGTTAGCATTAG -GGTAAATTGGGTATGGCAATATATAGGAACTCTTTGTATTAGTTTGCACTTTTTCTGCAA -ATCTAAATAATTTCAAATAAAAATTTATTAAAAAAAGAAAAAAACTCTGTGAGTGCGGTG -GTTCACACCCAAAATCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCACTTGAGGCCAG -AAGTTTGAGACCAGCCTGGCCAACATGGCAAAACCCTGTCTCTATTAAAAAGAAAATTCC -AAAATTAACCAGGCAGTGGTGGCGCACACCTGTAGTCCTAGCTACTTAGGAGGCTGGGGC -ACAAGAATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCTGAGATTGTGCCACTGCA -CTCTAGTCTGGGCAATGGAGTGAGACTCTGTCTCAAAACAAAACAAAAAACAAACAAGGC -CAGGCATGGTGGCTCACGCCTGTAATCCCAGCAGTTCGGGAGGCTGAGGCGGGTGGATCA -CTTGAGGTCAGGAGTTTGAGACCAACCTGACCAACATGGTGAAACCTCACCTCTACTAAA -AATACAAAAATTAGCCGGGCGTTGTGGCGGACGCCTGTAATCCCAGCGACTTGGGAGGCT -GAGGCAGGAGAATTGCTTGAACCCAAGAGGCAGAAGTTGAAGTAAGCCGAGATCGCACCA -TTGCACTCTAGCCTGGGCAACAGGAGTGAAACTCTGTCTCAAAAACAAACAAACAAACAA -ACAAAAAACTAACCAACCACAAACCACTCCATTGCCAGGTGCAATGGCTCATGCCTGTAA -TCCCAGCACTTTGGGAAGCCAAAGTAGGAGTTTCACTTGAGGTCAGAAGTTCAAGACCAG -ACCAACCTGTGCAACATAGAGAGACCTCCTCTAAAAATTAGCTAGCAGGGTGGCATGCAT -GTATAGTCCCAGCTACTTGGGGGTGCTGAGGCAAGAAGATCACTTGAGGCCAGGAGGTTG -GGGCTACAGTGAGCCGTGATTGCATCACTGCACTCCAGCCTGGGTGACAGAGTAAGACCC -TGTCTTAAACAAACAAAAAATTAAAAAAGAAACCCTCCGTCAGTATCAAAAGAAAAGAAT -GGCCACAAACATACTCTCTAAAAACTACTTGCCAATCTCATGAAACTAGGACGCAAATAC -CCTCTAAACTCAGGTTTGATGTATGCTTGAAGAACAAGAGAGAAAGTTCAAAAAGAGCTC -TAGTTGCAATTATTAAAATGGACAGATGAGAACTATACATGTGAGTAAGTCAGTGGCCTA -TTCCATGCTGTAGAATCACTGGAGAGCAGGAGTAAAGCAAAGGGACACTTTTTTTTTTTT -TTTTTTGAGACAAGGTCTCACTCTTTTGTCCAGGCTTGAGGGTAGTGACACGACCAAGGA -TCAATGTAGCCTCCAACTCCCAGGCTCAAGTGATCCTCCCACCTCAGCCTCCCAAGTAAC -TGGGACTACAGGCACGTGCCACCATGCCAGGCTAATTTTTTTTTTTTCTGGGCTCAAGGT -ATCCTCCCACCTCAGTTTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCATGTGGC -CCAGATACTTCATTTGTATTGCCTTCAGGTGACTTGGCGATGAGTCCAGAAATAGAAGCA -TAGCTTCAGGAAAACAACAAGTAGAACTTTTAACGTTTCTGTCCAAAGTCAGCCATGTAG -AGGTAAATAAAAACAAACCCATAGGGAAGGGGTGAAGTGGCTTACAAAAGAAAAAAAATA -TTTTAATAGGCCCATCAAGGAAAAGAACTATGAAGGAAGGTAAAATATAAACTTATTCAT -ACAAACAAATGCCAAATAAAGTCAGTTGCCAGCATAACTGCACTACAAAAAATGTGAAAG -GGACCAGGCACAGTGGTTCACCCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGCAG -ATCATGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCTATCTCTACTA -AAAATACAAAAATTAGCCTGGCATGGTGGTGGGCACCTGTAATCCTAGCTACTCAGGAGG -TTGAGGCAGGAGAATCACTTGAACCCGGGAGGCAGAGGTTGCAGTCAGCCGAGATCGTGG -CGCTGAACTCCAGCCTAGGAGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAA -AAAAAAGTGAAAGGAAGCACATCATTTAAAAGGAAAATGATAGCAGATGGAAATTTGGTT -CTACTCAAAGGAATGAAAAGTACCAGGAATGATAAGATAACTAAGAGGGCAAATATGAAA -GACTTTTGCCGTTGTAAAAATGTACTTAAATTGTTTAAAGCAAAGATATAACATTATATT -GTAAGATTTATTAAAGTACATGGAAATAAAATGTATGACAATAGCACAAAGGATGAGAGG -GGAGAAATGGAAATATACTATTGTATGGTTCATACATTTTATGTCAAGTGTTATATATTT -TTTTGACCCAGAGTCTCACTGTGTCACCCAGGCTGGAGTGCAGTGGCACGATCTCAGCTT -TCTGCAGCGTCTGCTTCCTGGGTTCAAGCAATTCTCGTGCCTCAGCCTCCCAAGCAGCTG -GGATTACAGGTGTGCGCCACCACACCCAGCTAATTTTTTTGTATTTTTAGTAGAGACGGG -GTTTCACCATGTTGCCCAGGTTGCTCTGGAACTCTTGACCTCAAGTGATCTGCCTGCCTC -AGCCTCCCAAATTACCGGGATTACAGGCATGAGCCACTGCACCCAGCTGTTATAATATTT -TTGAAGATTACTATGATATGTTAAATAGGCATATGGTAAACTCTAGAGCAAGTAGTAAAA -AGGTAAAATAAGGATTAATAGCTAATAAGCTGACAGAAATAAAATGGAGTACAAAAAAAA -ATACTCAAGGAGGGGTAGAAAAAAGAAAAAAAAAAAAACCCTAAACCCTAGGAAGTCAGG -AAAAGAAAAAGAAACAAAGAAGTGATGAAATAAATAGAAAGCAAATGGTAAAATAGGTTT -AAATCCAACCATATTCATAATTGCATTAAATTTAAACGTTCTAAACATTCCAATTAGAAA -GCAGTTATTGTCAGACTCTTAAAAAGCAAGACCTGGCCAGGCGTGGTGGCTTACGCCTGT -AATCCCAGCACTTTGGGAGGCCAAGGCAGGTGGATCATGAGGTCAGGAGATCGAGACCAT -CCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCAGGTGTGG -TGGCGGGGTGCCTGTAGTCCCAGCTACTCGCGAGGCTGAGGCAGGAGAATGGTGTGAACC -CAGGAGGCGGAGCTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGGGCGACAG -AGCAAGACTCCGTCTCAAAAAAAAAAAAAAAGAGAAAACCTGGCTGGATGTGGTGGCTCA -CACCTCCATCTCAAAAAAAAAGCAAGACCTGCTGGGTTCAGTGGTCCACACCTGTAATCC -CAGCACTCTGGGAAGACAAGGCAGGAGAATTGCTTGTGGCTAGGTGTTCGAGATCAGACT -GGGCAACATAGTGAGACCTTGTCTCTATAAAAAACTAACAAACTTAGCCAGGCTTGGTGG -CATGTGCCTGTAGTCCCAGCTACTCAGGAGTCTGAGGTGGGAGGATTGCTTGAGCCTGGG -AAGTCCAGGCTGCAGTGAGTCAAGACTGCACCACTGCACTCCAGCGTAGGCAACAGAGCG -AGTCTGTCTCATAAACAAATAAAAAATAAAATAAAAGACCCCACTGTGTTGTTGCCTATA -ACAATTCACTTTAAGGCTGGGTGCAGTGGCTCATGCCTGTAATCTCAACACTTAGGGTGG -CAGAGGTGGGAGGACAGCTTGAGCCCAGGAGTTTGAGATCTGCCTGGGCAACATAGTGAG -ACCCCGTTACCCACAAAAAGGAAAAGGAAAAAACAAGAATTGACTTTAAATATAGTCACA -GATAGATTAAAAAGAAAATAATCTAAAAGATGTAACATGAAAAAACTAATAAAGGCCTAA -AAAATACTATCAAGGATAAAGAGGGATATTTCTGTTTTTTAGAGACAAAGTTTTACTCTG -TCACCCAGGCCACAGTACAGTGGCACAATCATAGCTCATTGCAACCTATACTCCTGAGCT -CAAGCGATTCTCCTGCCTCTGCCTCCCAGGTAGCTGGGACTACAGATGCATGCTACCACA -CCCTGTTTGTTTTAAAAATTTTTTGTAGAAATGGAGTCTAGCTATGTTGCAAAGGCTAGT -CTCAAACTCCTCGCCTTGTGCACTCCTCCCACCTCAGCCTCCCAAAGTGCTGGGATTATA -GGTGTGAACCACCATGCCTGCTTGGGATATTTAATATATTCTCTGGAATATGAAAGACCA -AAGGGCAAAAAAATAGCTAAGACACACTCTTGAAGAGAAAGAACAAGACTATTCTGCAGG -AAAATATGAAAATAAGCTCAACTGCCAGGCGCGGTGGCTCACACCTGTAATCCCAGCACT -TTGGGAGGCTGAGGTGGGTGGATCACCTGAGGTTGGGAGTCCGAGACCAGCCTGACCAAC -ATGGAGAAACCCCATCTCTACTAAAAATACAAAATTAGCTGGGCGTGGTGGCACATGCCT -GTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCGGAGG -TTGTGGTGAGCCGAGATCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCG -TCTCAAAAAAAAAAAAAGAAAGAAAAAAAGAAGAAGAAAATAAGCTTAACATTATTAGTA -ATTACACTGACAAAAATTAAAATTTGGGCAATACCAAGTTAGTGAGGAAGCAAATCAATA -GAAACGCATCTAGGCCAATGGGAATGTAAATCAGTGCAACCACTTGGGAAAAAGCTTTGC -ATTATCTAGTGGAGTTGAACACCCGCAAAGTTCTATGACTCTGCAATTCTTTACTTTGTT -ATGTATCCTAGAGAAACACACATGAGCACTGGAAAATATGTACAAGAATGTTCATAGGGC -ATTATTTGAATTTGCAACACTCTGAAAACGACCCACGAGGTTAATCAACAGTAAAATAAG -TTATTATATATTCATAAAATAATACACTATTTACCAATGAAAACAAGTGAACTACAACTG -TGTAGTACATATAAATATGGATGAATCTCAAAAACATCGTGGAGTAAAACCAGCCAATTA -CAAGAAGAATCATGCAGTATGCTTCTTATTTGAACTTCAAGAATAGACAAAGCTAAATAT -GTTTAAGGATGTATATGTAGTTGGTAAAACCACAAAGAGAAGCAAGGGAATAATTAACCC -AAACTGAGCATCACATTTACCTCTGGATTGGAGGGACAGGGATATAATCAGAATTAGGGG -GTGGTTGGCATGCAGAGTTGTTTTTTGTTTTTTGATTTTTTTTTTTTGAGACAGAGTCAC -GCTCTGTCGCCCAGGAGTGCAATGGCGCCATCTTGGCTCACTGCAACTTCCGCCTCCCAG -GTTCAAGCCATTCTCCTGCCTCAGCCTCCCTAATAGCTGGGACTACAGGCGTGTGTCACC -AGGCCCGGTTAAATTTTTCTGTTTTTTAACAGAGATGGGGTTTCACCATGTTGCCCAGGC -TGGTCTCGAACTCTTGAGCTCAGACAATCTGCCCACATCGGCCTCCCAAAGTGCTGAGAT -TACAGGCGTGAGTCACTGCACCCGGCCGCAGGGGTCTTTTAAGGCATTGATAATGTCCAA -TTTCTTGACTTTACTAGGAGGTTCATAGGTTGCTTTTTATTCATTCTTTAAAGCATACAT -AAAAATTTTAGGTAATCATTTGGAGACATACTGGTTTGCAGTTTTTTTAAGAGGCAAAGG -AAGAGTAAAAATCCAAAAAGGAGTTGGCTGGGAGCAGTGGCTCATGCCTGTAATCCAAGT -ACTTTGGGAGGCTGAAGCAGAAGGATCATTTGGAGCCAGGAGTTTGAGACCAGCCTGGGC -AACAAAGCAAGACCCCATCTCTACAAAAAAAAACTTTAAAAAATTAGTCGGGCATGGTGA -CACATGCTTGTAGTCCTAGCTACTTGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGG -AATTTGAGGCTACAGTCAGCTAGGATTGTACCACTGCACTTGCTCCAGCCTGGGTGACAG -AGCCGAGACCCAGTCTCTTAACAAAAAAACACTAAAGGCCAGGTGTGGCGGCTCACACCT -GTATTCCCAGCACTTTGGGAGGCTGAGGCAGGAGGATCACTTGAGGTCAGGAGTTCAAGA -CCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAGTACAAAAAATTAGCCAGGC -ATGGTGGGGAGGTACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTG -AACCCGGGAGGCGGAGGTTGCAGTGAGCCGAGATCACGCCACTGCACTCCAGCCTGGGTG -ACAGAGTGAGACTCCATCTCAAAAACAACAACAACAAAACACTAAAACTAATAATAATAA -TAATAGTATAAAAGGGAGTTGATCGATTCCAGAGTAAGTTCTAAATAAGACTAGACTGCA -TCCTAGCTTATCCTTCCAAGAATTAAGTAGAATGTCCCCATTGTTCTCAATAATTTATTA -TACACTAAGCCCAAATAAGAAAGAAAAATGAGGTAACTACTGCTATCAAAATACCTTCAA -GGCAATAAAATTAGATAGAAGTATTCATTTTGTTTTATTTTTGTTTTTACCACTATACAA -ATGAGCAGGAAGCATTCATTTTAAAATCTGTATGTGTTCATATTCATTTCTAAAAAAAAA -ACTCTTACTAATTACATAGTGAAAACACAAATTTCTTCTTGCAATTAAACATTTCTAAAG -AGTTTGATGGGTAAAAAAAAATTAAGTTTAAAGATTCATAGAAAAGAAATATTTCTTCAT -AAAATTTTAGAACAGATATTTTTCTGAAAGCTTCCAGCAGAGGAAAAAAAAAAATTTTGT -TTGCAGTAAAAGGATTGACAAGCAGAAAGGCATGGAACTTCTCGACAGCACATTAGGAAC -CAGTAGAAATGTAGCAGTGCCTCTACAATTTAGAATTAAAATGACTTCCAACCTATAATT -CTACACCTAGCTAAACTATCAAATAAGTGTGAGAATACAGGAAAAACATATATCTAGATA -GATCTATATGTCTGTATATGCATTATATGCAACTAAAAGTGTGTATTTCTTATGCAGTCT -TTCCCAGGGAACTCCGATGAAGTGTTCCAACAAAATGAGCGAGTGAACCAAGAAGAGGAT -GACATTAGATCCAGGAGATACAACAGAGGAGATAATCTCCAGGATGCCTGTGAAGAAAGA -TCCCTGGATCCCAGGATGATTATAGGACAAGTTGTTCATAATCCAGCAGGCCAGAAGACT -TCCAGGGAAACTCATTCAAGGAGGTGAAAATGATGGATGACTCCTCCAAGATGAAAATGG -ACCAGCCGCAGTGGCTCACGCCTGTAATACCAGCACTTTGGGAGGCTGAGGCAGGCGGAT -CACTTGAGGTCAGGAGTTTGAAACTAGCCTGGCCAACGTGGCAAAACTCCATCTCTATTA -AAAATACAAAAATTAGCCAGGCATAGTGGTGCATGCCTGTAGTCCCAGCTACTTGGGATG -CTGAGGCAGGAAGAATTGCTTGAACCTGGGAGGCAGAGTCTGCAGTGAGCCGAGATCATG -CCACTGCACTCCAGCCTGGGTGACAGAGCCAGACTCCGTCTCAAAAAAAAAAGAAAAAGA -AAAAAAAAATGATGACTCTTTCAAGAAATGAAAATGATGAGATATCTGGTAGGTCTGAAT -GACTTAAGAGGAGATTTAAACATTTGGGATAAGTTGAAGATGAGCTGGTGTTCATCTTCA -TTTATTTCATTTAAATAAATAAAATTATTAATACATGAATTTTATCTCAAGAAACAAAAA -TAAGCAATGTACATAAAAATTAAGCAGATGGCTGGCCGGGCGCGGTGGCTCACGCCTGTA -ATCAGAGCACTTTGGGAGGCTGAGGCGGGTGGATCACGAGGTCAGGAGATGGAGACCATC -CTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAAATAAATAAAAAATAAATTAGCCG -GGCGTGATGGCAGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCA -TGAACCCAGGAGGCGGAGGTTGCAGTGAGTGAGATCACGCCATTGCACTCCAGCCTGGGC -GACAAAGTGAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATTA -AGCAGATGGCTATAATTTTTTTAAAAATAGAAAAGTGTTGATGAGAAATGGGAAACCTCA -TACATTGTTGGTCAAACTGTATGCTTCCATTTAGAGGAAATAGTCAGAACAAATAAATCC -ATAGACACCAATTAGGTTGGTGTATCCCAGGGGCTGGGCATGGAGTGGGGTGGAGAGAGA -AGGAGGGCCTGCTTAGTGGATACAGAGTTTTCTTTGGGGGCGATGAAAGTGTTTTGGAAC -TAGATAGAGGGGGTGGTTGCACAACATTGTTGTTGGTGGGAATTTAAAATGGTGCAAGCA -CTGTGGAAAAAACAGTTTAGCATTTCCTCAAAAAGTTAAAACAGGCCAGGCGCTGTGGCT -CACGCTTGTAATTCCAGCACTTTGGGAGGCCAAGCCAGGTGGATCACTTGAGGTCAGGAG -TTTGAGACCAGCCTAGCCAACATGGTGAAACCCTAAAAATACAAAAAATTAGGCGGGCAT -GGTGGCAGACACCTGTAATCCCAGCTACTCAGGAGACTGAGGCAGGAAAATTGCTTGAAC -CTGGGAGGCGGAGGTTGCAGTGAGCTGAGATTGCACCGCTGCACTCCAGCCTGAGTGACA -GAGTGAGACTCTGTGTGAGAAAAAAAAAAAAAAAGTAAAAACATAGAATTACTATACAGC -TAGCAATATCGTTGTTAGGTATATGCCCCAGAGACTTGAATACAGTTACATGCTCCATCA -GATACCTGTACCCAAATGTTCCTATCGGTATTACTCATGGTAGCCAAAAGGTAGAAACAA -CCCAAATATCTACAAATAGATGAATGGATAAATAAAATGCAGTGTATCCATATGGAATAT -TACTTGGTCTCAAAAGGAAGGAAGTACTTATGCAAGCTACAACATGGATAAACTTCAAAA -CAATATGCCAAGTGAAAGAATCCAAATGCAAAAGGTCAAACGGTATGCTTCCATTTAGAG -GAAATAGTCAGAACAAATAAATCCATAGACACCAATTAGGTTGGTGTATCCCAGGGGCTG -GGCATGGAGTGGGGTGGAGAGAGGAGGGGGGCCTGCTTGATGGATACAGAGTTTTCTTTG -GGGGCGATGAAAGTGTTTTGGAACTAGATAGAGGGGGTGGTTGCACAACATTGTGAATGT -ACTATAATAAATGCCACAGAATTGTGTACTCTAAAATGGTTTAATTGCTGTGCATGGTGG -CTCACGCCTATAATCCCAGCACTTTGGGAAGCCAGGATGGGAAGACTGCTTGAGCCTAGA -AGTCTGAGAGCAGCCTGGGCAACATAGAGAGACCCTGTCTCTTAAAAAAAGAAAAAAAAA -TTAGCTGGGTGTGAAGACATGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGCGAGGAA -GATTGCTTGAGCCAGAGAGGTCAAGGCTGCAGTGAGCCATGATTGCACCACTGCACTCCA -ACCTGGGCAAGAGAGAGAACCTGTCACAAAAAATAATAAATAAATAAATAAAATGGTTAC -TACCTGAATTTTACCTCAGGAAAAAAAAATAAGCTAACATACCAACAGGACAGTTATTAC -TTCCTAAAAAAATAAAAGGATATACAGGAAGGGAAAAATAAATAAAAATTTACCACAAGC -TTCAGCTCCACATAGCATTTGTATAGTCATGATAATGTAAACATGTAATGTGAATATATG -AATCTAGCCAAAACTATGCCATAACTATAAAGAGGGGAAGGCTAGTACAGGAAGGGGGTC -ATGGAGCAAAGGGATGAAAGACATGAAGACTCATCCTTCATAGCCTGAATCCGAGGAGTG -GATAAAGACTCAATCTAAAGATAAAATAAGGCAGGAAATGAGGAAAAAGAAAAAAACTGT -TGAAGTGCATCCAAAGTTGCAGATGGTTAACATTCATTCCACTCACTTGGGAAAACATCT -GGTGTGATCGTCTAATGGGTCATCACCTTCCTGCCATTTCTCTAAACACCCTCCACAGGA -AAAGCACTGGACGATGTCCTTTATACCTAAAAGTAAGGAAACTTGATCAGTGCCACTGGC -ATGGGCATCTGTCCATTAACATGCAGATAATAACCACCAGACCTGTAATAGTGAAAGCCT -ATTCAGTCTCCAGTTGGGTTTTGTGACAGTCAGAAGTTGGTTACCAGTGAGGCAATTTTC -TATATAAGACTCTGTCCACCAATGGGGTAACTGGCAAGTAGTCATTGAATGCTCCTACAC -ACCATGCACTTTGATGCACACCATCCCTCTGCCCCATTCTCCTTTGATCAACAAACAGAT -TGGCAACCAGAATCTGGAATTGAAGCTCCATGAGGGGGCTGGGCGCAGTGGCTCATGCCT -GTAATCCCAGCACTTTGGGAGGCCAAGGCCAGCGGATCTCCTGAGGTCAGGAGTCTGAGA -CCAGCCTGGCCAACACGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCTGGGCA -TGGTGGCACATGCCTGTAATGCCAGCTACTCAGGAGGCTGAGGCACAAGAATCGCTTGAA -CCCAGGAGACGGAGGTTGCAGTGAACCAAGATAACGCCATTGCACTCCAGCCTGGGCAAC -AAGAGTGAAACTCTGTCTCAAAAAATAAAAATAAAAATAAGCTCTATGAGGGTAGAGGTT -TTTGCTCACTAATGAATGACATGAACCTAGAAAAGTGCTTGACACTCATGTGGCACTCAA -TTAGTATTCGTTTAATGAATGAATCAGAAAGAATATATTTAGAGCTCACGGAAAAAAAAA -TACCAGCAAATCTAGCAGCCCTTATGTAAGTGAATGCATGAAGAATTAATTGCCTCTTAC -CACATTATTGCCATGTTTATTACACCAGAAATAGGATTAAGTCTCTTTGTGAAATTATAT -TTCTTTGGAAAGAAATTGGTATTTAGCTCTGCAAAAGGATCAAACTAGAAACAGAGCATT -TCTCATCTTCCTTCCACTCTGGGAAAGCTGGGGCAGAGGAAAGCCTCCCAGAAATATGAG -ATCCTAGAGCTTGCAAGATCTGAAAACAGTCAGAGATGATTAGGATTTGTGTGGAGTGGT -GGAGGATTGGAAAGGAAGAGGGGGAGCACACTGGTCAGAGGGGTCTTGCGGAAGGCTGAC -AAGAGGAAGACACAGTAGAGTAGGGAGAAATGGCAAACACTCTTTCCAAAGGCTTAAGAT -TGTGAGGCAGTCAGATTTTTTTTTTCCAATGGCACATGTCTGTTAGGTAGAGTGACAACT -ATATTCTGCTTCTCTGTGTTGCTCTATGGTATTTGTGACAACTACTTGATCTCTCAGTTA -AAGATCTGCATTAACCTCCACTGTAACTTATGCATGTGTTCGGTTTGAGCAAGACCAGCA -AGGTACCTAGGAACCTTTCCCTGATCATCTTGTATTTCAGGCAGAGATTTAGCTGACAGG -AACCAGCCCATCATTTATAGATTGCAGAGGTGCTTCCTAATGACCAGCAGCTAAAGAGAA -AATGCCACAATCTGGTGGAAGGCTCTACGTGTTTAGGAATCATGAAAATTAATTTCCTGA -TTTTCTCCTGCAGGCAGAATGTGGCAAAGATTGCTATCCATGTTCCTATTATCTCAAATC -CTTCCATACTAATAGAAATCCCAATATTTAGCTGGGCACATTGTCACCCAGGAAAAAGAT -TAGGTTTCCCAGCTCCTCTTACAGCTAGGTATGGTCATCTGACTAATAATAATAATAATA -ATAATAATAATAATAATAATTATTATTATTATTATTATTATTATTTTTGAGACAGAGTTT -CACTCTTGTTGCCCAGGCTGGAGTGCAATAGCATGATCTTGACTCCCCGCAACCTCCACG -TCCCAGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCACCC -GCCACCATGCCTGGCTAATTCTTTGTATTTTTAGTAGAGACAGAGTTTCACCATATTGGC -CAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTGCT -GGGATTACAGGCGTGAGCCACCATGCCCGGCCCATCCAACTAAGTTCTGATTAAAGAAAT -ATAAGCAGAAGTGTCCTGTGACAGTTTCTAGGAGCACTTTGTCAGGGGACAAGAGGTGAG -GAGAGTAATGTGTAGAAAGAAAAGACATGATAATTATCACAAATAGAATACTTGTATTCA -TTGTTAGTCCAGACCTTAAGGTTTCAAATTTGAAGGTTTACCACCTAAGGGAGGAATAGA -AAACTGGGAGAGGATTTATGATGCAGGAAAGAAAAGAGATGTATGCCAGGTGCAGTGGCT -CACACCTGTAATCCCAGCATTTTGGGAGGCCAAGGCAGGAGGATTACTTGAGCCCAGGAG -GTTGAGGCTGCAGTGAGCCATGATCTCGCCACTGCCCTCCAGCCTGGATGACCATGTCTC -AAAAAAAATAGAAAGAAAAGAAAACGAATCTATAAGAAATGCTGAAGAGAGGCCTGGCGC -GATGGCTCACACCTGTAATCCCAGCATTTGGGAGGCCAAGGCGGGCAGATCACGAGATCA -GGAGATCAAGAGCATTCTGACTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAA -AGTAGCTGGGCGTGGTGGCAGGCGCCTGTGGTTCCAGCTACTCCAGAGGCTGAGGAAGGA -GAATCTCTTGAACCCGGGAGGTGGAGGTTGCAGTGAGCCAAGATCTGCATTCCAGCCTGG -GCAACTCTGTCTCCAAGGGGGAAAAAAAAAGAAAAGAAAAAGAAACGCTGAAGCTAGTGG -ACATTGCTGAGTGTAGCTAAACGTAAGCCCAGGAGCATAAAGTCTATGTGGGAATTAAAG -GTCAAGCAAGCAAGTGGGCACAACCTACTGACTCACCTGTGTAGAAAAGACCTGCTTTGG -CCAGTGCTGCAACTCCCACAGCTGATTCCCGGGGCCAGTCCTTAAAAGAGTCCAGCCGTA -GTTCTTCGTAAGCAAAGATGCTGTCATTGCAATAAGCTTGAATAAAAAGCACAAGGTGAG -ACCAGCAGGCTTTAGTCTTTTTTTTTTCTATATCTTTATTGCTGCTGCACAAATTAAAGA -GACCAGTAGGCTTTGATATTGCAAGTATCAGCGTTCAAGTTGTCCCTTCACAGTTACAGA -TGGAATGATGTCTAGAGTTTGCTTCAAAATAAACGGGGCGGGGCGGGGGGGACGACAAAA -AGAGATAGGGACAAAAAATCAAAAGAAGAAATAAACAAGCAAAGCCTTTGGAAAATGTTT -GAGTTTTTACCTGATGCCATAGGTAATTCTCTCTGGACCCAGGAATTCACAAAATGTTCT -CCCTGAGGGAAATTAAAATTCAAGTTGTTGATTATCTGACTTTTTTTTTTTTTTTTTTTT -TGAGGCAGAGTCTCACTCTGTTGCCCAGGCTGAAGTGCAGTGGCAGGTTCTCGTCTCACT -GCAACCTCCGCCTCCTGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCGAGTAGTACAGG -CATGTGCCACCACACCCGGCTAATTTTTTTTTTTTTTGTATTTTTAGTAGAGACAGACAC -GATGTTGGAGGTCTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGACAGAGTCTCGCTCTGT -CGTCCAGGCTGGAGCACAGTGGCACGACCTTGGCTCACTACAAGCTCTGCCTCCCAGGTT -CACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACCACAGGCGCCTGCCACCATG -CCGGGCTAATTTTTTTTTTTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTAGCC -AGGATGGTCTCTATCTCCTGACCTCATCATCCGTCCGTCTCGGCCTCCCAAAGTGCTGGG -ATTACAGACGTGAGCCACTGCACCCGGCCCATGTTGGAGGTCTTGAGGCTGGTCTCGAAC -ACCTGATCTCAAGTGATCTGCCCAGCTCGGCCTCCCAAAGGGCTGGGATTACAGGCATGA -GCTACTGCGCCCAGCCTGATTGTTTGACTTATGAAGTATATACCTATCTATGAACAAGAA -CTGAAGGAACTTTACCCCAGAATGAAGAGTTTCACTGGATGGAACGGCAGAGTCGGAGGA -GAATTATTCCTTTAATTTTTATTTCTGTTGATGTTGCAATTGTTTTTATGCAGTGCAAGC -AAACATACACACACACACACACACACACACACACACACACACACGCATGCAAGCTGTGAA -TGTTTATGCATACTCAGGAGGAAGCCTTCTCAGGGTCACTGTTTCCGGAAACTGACCTTG -AAAACAGACCTGCATTTAAATATCACAGATGTACTTTGACGAATGAGGAAGTAAGAGACA -TAGAATGGTAACTAAATTCATCAGGGTATTATATATTGAGCAACTGATTCTTCTGGGAAA -GCTGCACCCAGTTTCTTTTTGAGGAAACACCTCTCTTCCCCCACTGTCAGGCCATGTTCT -CTATAGAGTTCTGGTCTCCTGAGTCATGTTAATCAATAAATTCTCATTTTTGTTTAAGCC -AGTTTGGATTCGATTTCCCATCACTCTCCACTAGGAAATTTTTACTGATTCAGGATAGTT -AGCCAGCTAGGAAGAGCCAGCTCTGCAGCCCACTGTGGGTGACAGCGCCTAGGTCAGGAG -ATCTTAGCAAGCCTGCAGATAGGGGCAGCAGAGGGAAGCTGGGGCAAGTGGCTTCATTCA -TAAAGGGGAAGACTATCAGGAAGGCAAGCAGAGCCCGTCAGAAGCCAGCCCTGGAAAAAG -AAAAAGGCTCTAGGTCAGCAAGTGAATGTGATATTTTTCACTTTGAAGATGGGAGCCAGG -GGAATGAAAGGAGAAAGGAAGAAAGAAATCAAACCCATGACATAAAAAGAATGCCTATGC -CCTTCTGAGTCAGACACTTACAGGTAATCCAAAAACTTGAGAAAAAAATTGCTGTTATAC -ATTACCGTTATGTCAACAAATCCCTTGTAGCTTTGAATATACTGGGTAATTTCCTCTGAG -GATTTCTTACTCCGAAGAAATTCACATCTGTAATTAATAAATATAATTAAAATTTACCCC -AGTACTGTGATAGAGCTGTCCTATATCACAATGAACATTTATAAAGACGTATTGAATTGT -TGAATTTTATTATACTTCAATAAAATTGCCAAAAAATTTACCACAAAACTTAGGAGAATT -ACCATTATTCTCATATAATTATTTGTTATTTCTATTAGTGACAACATGTGTAGTTATTTA -AAATTAAATCTTCAGGTTAACTTTTTTCTTGAAATAAAACATGCAATACAATCAAAGAGA -CTGATTTACAGTAAATATAGGATGGAGCTTTTGTTTTTTGGAATTAAGCAGTGGTGACTA -AATCTAGTCGCTAGGGTTATATGAAAGCTACTGGCAGTAAAGAGAACTATATTTAAAATA -ATAGGCCAGACGCAGTGGCTCACATCCAGGAGTTCAAGACTAGCCTGGGCAACATGGCAA -AACCCCATCTCCACAAAAAATACAAAAATTAGCCGGGCATGGTGCCACACCTCTGTAGTC -CCAGCTACTCAGGAGGCTGAAGGGGGAGGATCACCTGAGCCCGGGGAGGTAGAGGCTGCA -CTGAGCCATGATCAGGCTGCTACACTCCAGCCTGGGCAACAGACTGAGACCCAGTCTCAA -AAGTAAATACAAAAAATCTTTTTAAGATAACAATATATTTATCTACTGAACAAAAAATTA -CCATGCATTAAAAAGTAATGGCTATTAGGCCAGGCGTGATGGCTCACGCCTGGAATCCCA -GCACTTTGGGAGGCCGAGACAGGTGGATCACGAGGTCAGGAGTTCGAGACCAGCCTGGCC -AAGATGGTGAAACCCTGTCTCTACTAAAAGTACAAAAATTAGCTGGGTGTGGTGGCGGGC -GCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGTG -GAGGTTGCAGTGAGCTGAAATCATGCCACTGCACTCTAGCCTGGGCAACAGAGCAAGACT -CAATCTCAAAAAAAACCAAAAACAAAAAAAGTAACGGATGTTAATGGATAATTTTTGATT -TTTTTAAAAAAGAGCACACTGAATACCATTTAAAAACATATTCCTTTCCCATAAAAGAGA -AGCAGTTTTAAAATTAACTTTTAAAATTTCCTCCAATTCAGCTGGGCATGGGGGATCATG -CCTGTAATCCCAGCACTTTTGGAGGTTGAGGCGGGTGGATCACTTGAGGCCTGGAGTTTG -AGACCAGCCTGGTCAACATGGTGAAACCCTGTCTCCATCAAAACTACAAAAATTAGCCTG -GCATGGTGGCATGCGCCTTGTAGTTCCAGCTGCTCTGGAGGCTGAGGCAGGAGAATTGCT -TGAACCCGAGAGGTGGAGGTTGCAGTGAGCCGAGATCACGCCACTGCACTCCAGCCTAGG -CAACGAGAGCGATACTTCGACTCAAAAAAAGAGAAGTTATCTCTAGGTAAGATCATGATG -GAAATTTTCATCTTACTTTATACCTTTCACTGTTGAAATTATTTTACAGTTGAAGTAAAG -GAAATTTTACAATATCCAACAAGAGCCGATGTCATTTATTTAATATCAAAATTAATATTG -GAAAAATGTCTATACTTTAGGCTACCACCCATCTGCCTGAATTAATCAGCATTAATACTT -AATTTTAAATATTACCTGTCAACGCAGGTCACTGAATGTGATCTCCTTTAAGGTATTATC -ATGTAATAAACTGCTACAAAAAGTCTAATTCTCTCAAGAGTTTTATAGTCATCCACTTCA -TTTTCAGGTCAACATTTTAACATATTTTCCCATATTTTTTTCTGAAGCTTTAATCTCTGC -AAAGCCCATCTTTAAATTTGAAGGAAAAGGTAGAAGAGTGAGGAGCAGCAGTAATTAACT -TGAATTTGGAACTTGGATATAACTAAAGACACATTTTGCTTCTTCATTTTTATGTCAGTT -TGCAAAGGAAACAGTTATGATTTTAGCTAAATACAGAAATTTTTTTTTTTCTTTTTAAAA -TTCTACTTGTATCCATTTCAAAACCTACTCTGAATTTTCACTCAGTTCCCACAATATTAC -CATAATTCTTTGAGCTGTTGGCAAAATGGATCCATTTTATAAAGTCATGCCTTTTGCATT -GAGCTTTTGCCTGTCTCATATATTTAGATAAATTTGAAAGCAAAAGGAATATCCACTGTG -TTGAATATCTTTAATAGCATGGTTGAAATTTATAATTTGAAATTCGTAAGTTCAAAGAAC -ATTTATCTACTGCTTGATTTTATGCTTGAAACTTCCTATGCTTCACAGCAGTTTTTTTTT -TTAATAGGTGGACAAAAATCCTTCCTCCTATCATTCATAACAATTTTCTTTATACTTAAG -TAAAATATACAGAAACTTTTAAAGGAACGCCAAAATCTTGTTCCCTCTTATTGTTGCCAC -ATTATTTTTATTATATTACCCAACCAGGTTTTTATGGTTATGCTTATACTCTCATACAAA -ACAGTTTTAGAACAAATATTGAAGGAAATGCAGGGTCACAAAATGAATAAGTTTACCTTA -ATAACATTAATATGAACAATGATATTATTTTATTGAACTTAAAGCACGCTCTTGAGCTTA -AGAGCAAAGATGTAGCCACAGTTGAACTTGTATTTATTTGGGCTATATTTGTGCTTTTAA -TTACTCATACTAATACACAGGAAGCTTATTTGCAACAGGATATTTATATAATTTAAAATA -TTTTCAGAGTTTTTGTGTGTGTTGAAATGTTAGGAAAACAGAACTCATTCTTAAGCAATG -ATTTGCAAAGAGCAGTGCTCATATGCAGATTTTTAAGGCATAGCCCAAATGGTTAGAAAT -GCTGCAAAAGTTTAATTTTCTTTTGGGTGATCTGTTGTCTGGAAAAAGCTGTTACATGTA -AAAATTTGGATGCTGAAATCAAATGGCTATACCCAAATGAGCAAGAATAGTTTAAAACAT -TTAAATCAGCATCTGCATAAAAATTAATATAAATATTATTTATGACTGTTATGTATATAT -GTATATATAATTATATTATGTATAAGAATATATTTATACAAATATATACTACTAGAAAAT -TGTATATGATGCACTATTTTATTTTATGTAATATTTTATGTATATATTTATTTACACATA -ATTTATATACTTTTAAGACTGTGTCCATTTTTCATTTATTCTTGGTCTCCGGTTTGAACA -ACGCTGCTTTATGGCATTACACTGATAATTCTCTCTACTCTTTAGTTCTCTTCCTTATCG -CTTATTCATGTGTATCTTATTCCATGCTATAATGTAATGTACCATACATGTGTTGAATTT -TAAAAAAAAATTAGCAGAATTTCAATGCTTTCCTATATTACTCAACATAAATATTCTCTA -TATAGAATGAATTGGAACAAGCTATTTTGAATCTGAAAGGATAATCAGTGATTCTACCAA -TCATAGTGGTAAACTCATTCAAACTCAGCCTGTTAAAATGAGACGCTCTGCCCTATATCA -CTGAAAACCTCTTGATTTGCCAGATTTTTCCCTTCTTTACAAATGAAAATGCTTAGTGTT -TTCTGAGTTCCTTTGCACTATCTCCCACTGGATTCAGGTCATTGATTTCATCTTCAGAAC -ACTTGGAAAGTTTATTTTGTGGTGTCTATGAGCTAATTTATTTTTATTGCAATGTTTATT -TAAAATGAAACAATAAGTACACTGAAGTTTTGTGCATTTCATTTTATGAAAATGTTATCC -CAAAGGGATACAGAAGAACTAAATACAAATTTTCAAAATTTATTGTTTTTTTTTGCCTGC -TGCTATATCTGAGGTTGTACTTTTGTTCTGATCTTTGTAACACCTCAAAAAAAAAATGGG -TTAAGAGAAGGATGAACAGAAGAATGGATATGAGACCTATCTGATAAGGCAAGCAGATTA -ATAGACGAATGGAGGAATGTTTGGATGTATAGGTATATATGTGTTCATTGCACACATATG -GAGGAATGTTTGGATGTATATGTATATGTGTTCATTGCACAGTTTTCAACTTTTTGTGTT -GAAATTTTTATAAAAAGAAGTTGGAGAAATAAAAAACAAGAAAACAGAACCATAAGATTT -TTATTTAACATTTTTGATTAAAGGAATTGTATTGCAAATTATGACTTTTTAATTTGGCAA -CATCCTTTTAATGGTGTTCTTTCTTTGTCCTTCTCTTTCTCTTCCTCTCTCTCTCCCTCT -CTTCCCTAAAGCTCCATTCCGACTTAGACAAGGGAGAGGGCACTGCGAAATACACCCTCT -CAGGAGATGGCGCTGGCACCGTTTTTACCATTGATGAAACCACAGGGGACATTCATGCAA -TAAGGAGCCTAGATAGAGAAGAAAAACCTTTCTACACTCTTCGTGCTCAGGCTGTGGACA -TAGAAACCAGAAAGCCCCTGGAGCCTGAATCAGAATTCATCATCAAAGTGCAGGATATTA -ATGATAATGAGCCAAAGTTTTGGGATGGACCTTATGTTGCTACTGTCCCAGAAATGTCTC -CTGTGGGTGAGTAGGCAAATCAAAATTCTGTGAGATACAATGAGACCTCTTCAACATTGA -CTTTTTGCAGGTTGATGTAAACATCTTATCTATCATCTAAAAGAATTATTTTTCAATTCT -AGAAAATACAGTTCTTTTCATTTATTTTTGTAACTTTTTTGTTTTTCTTTCTGCTTCATT -ATGAAGATAACTACAGGAATATATAACATTAGTTCCTGTTTTCCACCCTGTGAATTTACC -TGAATTCATAGAATCCTTGCGTGCTTTAAGCAAAAAATGTATTTTGTATTGAAATTGATT -CTTATCTCAATTCCAGACACCTATACAGTGCTGGAGACACCTACCCTACACCACGAAATG -CCAGACAGTAATTCCTAGATCAAAGTAAATGATCTAAAGCATGCATCACATCTGATCTGG -AAGTGGTCCAGAAACAGGTGTGTTGCATCTTCTGTAGCTGTAAATAGAGATTCTGGAAGG -GTGATACTGTTTCCTTTTCAGGGTAAATAACCCATACTTGTTATGCCATCAAGCCAAGCA -GCAAATGAATAATGTCATGAAAATATTATTAGAACAAATTAACAAATTACAATTACAATT -ATCAAATTAACAATTAGAATATAGTAGCACCATCATTCTAAAAATTTAAATTTGATATAA -ATATACATTTCCATATCAGCCTAAATTTACAAAGTCCTATAATATGTAGGATATAAGGTC -AATAAGTTAAGAATTCCAGCCTTAAGGACAATTTTAAATTATAATTTTTATTCCTCAGTC -ACCACTGCTAATCCTTCAATTTATTTCAAAGTAACTTCTGGTTTTTATTACATTTGGAAG -ATAAAGCAACTTATCACATGTAGGTTACAACTTAAAATTCGTGTATGAGCCATTGCTTAT -ATTTTCTAAATCTGACATGACCCAGGGGGTTTCTACTGCTCCTACCACCACCCAGGACAT -GCGATGAAGATTGTGCACGCTACCGTGAGGGCAGAAGCAGGTTAGTAGCTGTAGGAGCTG -TCACATGGATTTACTATAATGCACTTGAAATTGTGTATGTGACCTTATCAGGCATTTAAG -GACCATAATCTCTCCTTGACCTAAGAAATCAGCTTGAAGTAATTCACTTAGATTTCAAAT -TTTAATGTGGATACCCAAGGCTGCAAATCTGTTATTCAGTACCTGCTACACTTTTGGGGT -TGCCTCTTTTATGCACTGTTAGAATTGCTAGAAATTTAGAAGTCCAATTGGAAAGAAGCA -TATCTTGTTAGAAAGTATTCCCAGAAAATGAGGAAGGCTACATTTTAACTGTGTCTTGAT -TTTACAGGGAGAAAAATAAAGTTAATATTTTGAGGAAAAAATAAGGCTTTTAAGATGACA -TGCTATATAGTAGACAAATAGTTTAACTCGGTGCCTACTTCATGTACACTGGATGTGTTA -ACATGAATTTATGACCCTCAGTGACTTTTTATTACCAAAACAGCTTCCTTAAAGCAAACA -CACACACATGCCTCTACAGTATTGGAAAATTCCGTCTCCTTAGATAAAACAATTAGGATT -TTTCTTGGGCCAACTAGAATAATTAGGGCTGCAGAGTTGGAGCCTTTATATAAGGAGTTT -GCAGCTCATATCCGAAGAGAGAAATGTATTTGGAAAGTCAAAAGTGTAGGTAAGTGAGAA -AGCAGAGTAGTTTCAGCTTTTGCAGTTGGAGTGGGTATAATTTACTGTGTTGTCATAAGA -TACTGGAAAGATCTTTGGAAGAATAGGTTCTTAAAGTGTTTTCTCATGTGCCCTTACTGA -CATTTCCCATTGGGCCTTCAAGACAACTCCAGTAAATACTTAAATTGATTTTCAGTGCAC -TGCTTTCTTTCATTTTTATTTATTTATTTTGAGACTGGGTCTTGGTCTGTTGCCCTGGCT -GGAGTGCAATGGCCCAATCTTAGCTCACTGAAGCCTTAAATTCCTGGGCTGAAGAGATCC -CTCCACCTGAGCCTCCTTAATAGCCAGTCATGTGCCACCCTGCCTAGCTATTTTTTTTTT -TTTTTTTTTTTTTTTTTTACTTTTTGTAGAGAAGGGATCTAGCTATGTTGCCCAGGCTGT -TCTCAAGTAGTCCTGGCCTCAAATGATCCCTCCACCTTGGCTTCCCAGAGCACTGGGATT -ACAGTCATGAGCCACCCTCCTGGCTCCTTTTTTTTTTATTTTTAATAACAGAAGGGTATT -TCTTTTGAATGTGAAATTTTACCACATGGTATGAATTAGTCCAAGTGTTTTTATACTAAA -TTTACATAATATACACTTTTCAAGTAAGTACAAAGAGGTATAAACACTGCTTATGAATTG -AATGTTAAAAAATAAATCTCTATGCATTACTTTTGTCTTTCCCCATAATCTCACGTATAC -ACATAAAACAAAAAACAAGGAGACCCAGTTATAGTTGTGGTATCTGCTGTTTCTGCCTTG -AAATTTCCAGCTTACAGCTAAGCAACAACTACTGTGCATCCAGAACTTACATCTATGTTC -CTAGAGTACTTGAACCCCATTCTCAAGTGCACCCTTCTTACCAGGTGGAAATAGTTCACT -GCTGTAATAATCTAAGAAAACATTATGTTTCTCTCTACTTTTTTTTCTCTCATATAATCT -AGGCAATTCTCCCTCTGTATCATTTTCCTGAGAAAACTAAAATAATTTTTAATCAAGACC -AGATGGAACTTTGTATGGTATATTGACAGTATACCAATTGTTGTGACGAATCTTACTGCC -TGTTGTAGATATCAGTGTTTGAAGTATTCCCTATGAAATAACTTTTCTGTCCCAATAATT -GAGAGGGCTGTTTCATTTCCAAAAAAGGGAAGAATTAATCAATTAAAAATACATATAGTG -AAATAACCTGTTTTGTAACATAAAACATAAGATGAAGAAATATCGGAACATTGATATGAA -GTTTAACAGTAATGGATTATATATCCAGAAATATGAACAAATAAACCTGCAATGAAAATT -TACTAATGTTACTAATTTTCACTTGTGTAACGTGAACATTACAAAGAACATAGTGTACAA -AGGGAGAATGTTGGTGGGTAGGATGAGTCAAGATTTCAGAGGAAAATCAATATTTAAGAC -TTACAGCACTGTGGAATATATTTAATTTTCCTAAAGTTGAAGAAAATTTCAGTGAATCTA -TGAATTGTTTAAGACAAAGGTCACTCCGTTACTGACTTCTGCTACATCTAATTTTCCAGG -GAAGTAATATTTAGAGATAAAAAGCTTTTACTCTGACCTCCGGAAATTACTTAATGATCC -AGATACTCCCAAAGTCAAAGCAAATCCTTGGAGACAAGTTTGGACTTTATGAATGTGGAC -TTAATTCTTTAAGATCACTAGAGCAACAATAAATTACAGGAATGTACCCTCTTTATATCT -GATGATTATGCATAAGTGGGGTGTGCAGTTTTAAGTTACTTTTCCTACAGTGCTGACAGG -TTTAGAGTGTTAAATCCATACTCAACTTGTATTATCTTCCTCTGCTTGAGCTATGCCACC -TTGAGTCAGCTGATTTGACTATTTATAATTAGATACCTAACCTATGATATGATATAGTAG -ATGTCAATAGTGACTCATGATTTATGTAGTAAGTCTTACCATTTTCTAAGCAGTAGTCAG -GTGCCATGTGATCTAACTAAAGATTTGTATTTCTTATTTTACTTAACAATTACAGTAACC -CCAATGCAGTATTATTCACTGTTGGATTTTTTTTAATGTGAAAACTTAATAACCCCTGTG -GATAAGAAAGTAAGAATGATTCTTAGGTGCTTTAGGACCAAATTAATCAGAATTTAATAT -ACCACTTTGTCTAGGTGTCATGGCGGCTAAAATATCTTTGAGAAAGTTAAACTTAGCTTT -CAATCTCAGATGATCTACTTAAGAATTTGGAAAGTTTATATTATATTATTTGAGAATGGG -GATTCTTGCTTAAACGAAACCTGAAGGATGGGCATCTTTCATTCAATATCTTAAAAAGAA -AGTTTAGCTGACATTTAAATAAGAAAAGATACACCTAAAATAAAGTAGAACACTGGTTTA -ATAAAAATAGTGAACAGGTACTCCCTTGCCTTTCTATTTTTCTCTCAACTCTATTTTATT -TTACATGAAGTTTGGGGAGAAATGCTAAGATGAAATTTTTGGTGGAGTCTTTCAGAGGTT -ATTTAACCAGAGACTATTTTCTTTTTTCTTTTTTTTTTTTTTTTTGAGATGGAGTCTTGC -TCTATTGCCCAGGCTGGAGTGTAGTGGTGCGGTCTTGGCTCACTGCAAACTCTGCCTTCC -AGGTTCAAGTGATTCTCCTGCCTCAGCTTCTCGAGTAACTGGGATTACCGGTGTGCACCA -CCGCACTCAGCTAATTTTTGTATTTTTAGAAGAGATGGGGTTTTGCCACATTGGCCAGGC -TGGTCTCGAACTCCTGACCTCAAGTGATTTGCCTGCCTCGGCCTCCCAAAGTGTTGGTGT -TAAGGACATGAGCCACAGTGCCCAGCCAACCAGAGACTACTTGTTTCGTGGCCATATTTA -AACGGTCTAAGAAGGAAAAGTGAAGACTGTGTCTGTACTTTACATTAATGAACTATTACA -ATTTAGAAACATATATAAGTCTCCACACTTCCTTATTTTCACAAAAATGCCATAGAGAGA -CAAATTGAAACATAAAAAACTAGATATATTCTCTCATCCCATGAGCCAGCCATGGAAACA -GAGAGCAGCTCAATTAGTAGCAGAGGAACAGGTGAATTATCATCCACTTCTATCTATGCC -CTAAAAGCAGAGTTTTCTCAGAAGCTTGAAGACAGAATGTTGACTATTTATTTTCCACAC -ATAAAGACATTCTCCTTGTGCAATCAAACTACAATGTTTAAAATCAGGAAATTTGCATTA -ATGTATTATTATAATCTAATCCTTCAGCCCTATTCAAGCATTAGCACTTGTCTCAATAGT -GTCTTATATAACAAAAAGTTCAAGTTCAAAATCAAACATTGTATTAAAATGTTAGGTCTG -TTTAGTTTCCTTTAATCTGGAACAGGTTCATATTTTTTCTTGGTTTCCGTGACTTTAATA -TTTTTGAAGATTTCTGCCTAGTTATTTTTTAGAATGGCTCTCCCATCTTGAGTATGTGTG -ATGTTTCCTCATGTATGAATGAAGCATATACATCTTTGTCAGAAATATCCCAGAAGCAAT -TCTGTACTCTCCTCATTATGTTCTATTGGGTGGGCCATGGTTTTTGATTTGTCTCATTAC -TGATGATGGTTACTTTTATTATTTGATAAAGGTTGTATATAACTTATCTATTATGGCATA -ATACATTAGCTAAAACCTTAGCGGTGTAAAACAGCAGATACTTACGTTTCTCATAGGAAT -GGCTCTATTGAGTACCTCTGTCTCAAGGCTTCTCAAGAGTTTGTAGCTACCTTGTTGGCT -GGGGTTGCGGTCTGATCTAAAGGCTTAGTTAGGGGGTGGTAGAAATCTTCCATATGTTCT -TTGCTACGTGGACCTCACAGGCCTACATCATAACGTGGCAGCTGGCTTTCCTCAGAATGA -ACTACCCAAAAGAGAGCTAGACAGAGAGAAAACCCTCTGATTGAAGCCATAGTCTATTTA -TAACCTAATCTTGAAAGTGACATCACATCCCATCTGCCATATTATACAAGTAAGTGCAAC -GCGAATACAAGAAAGCCGGGATCATTGAGGGCTCTCCTACAGTCTACCTACCACTCTCTA -TACTCTGGCTCTCAATGATTCATGTTGCTCTCTCATGCAATATATCCTCATCCCCTTCTG -AGGACCCCAAAATTTTCAACCCACTATAGCATCAGCTCAAAGTCCAGAAGCTTTTCATCT -AAATCAAGTCCAGATGGGGAAGTGATTTTGGGTTTAATTCTTTTTTTTTTTTCTTTTAGA -TTTTTTTACTTTTAGTTTTGGAGTACCTGTGCAGGATGTGCAGGTTTGTTACATTGATAA -ACATGTGCCAGGGTGGTTTGCTGCACCTATCAACCCATCACATAGGTATTAAGCCCAGCA -TGCATTAGTTATTTTTTCTAATGCTCCCCATCCCTCCCCTCCACCCCCCATCAAGCCCCA -GTGTGTATTGTTCTCCACCCTGTGTCCATGTGTTCTTACCGTTCAGCTCCCACTTCTAAG -AGACAACATGTGGTGTTTGGTTTTCTGTTCCTGCGTTACTTTGCTAAGGATAATGGCTTC -CAGCTTCATCCATGTCCCTGCAGAGGACATGATCTCATTTCCTTTTTGTGGCGGCATAGT -ATTTCATGGTGTATATGTACCACATTTTCTTCATCCAGCTTTGTGATACTAAAGAGGCCA -GTTACTTACTACACATTCACCAAAAATACAGTGGCAAAACAGGAATAATGTCTCTAGACA -TTCCTGTTGAAAAAAATGGGAAAATGCACAGACTAAAAGAATGATTGGTCCACCACATTT -TAAAATCCCAGTGGTAAATGTTGCAAGTCATTTGATTATATTCAACGCCTGTGAATAATT -ATTCATGCCTCTCATCTCTGACCTCTAGGCTCTTCGTTCTGCCTTTTGAGTTATTCTTTT -TTTTTTTTCCATGAAATACAGCTGGTACTTGCAATAGTACTTGGGTGTTGAACTTGTTAA -GAGTGCATATCCTTTTTTTCAGATCCATCTATCATTTCCTAGTTGTATGTGGTTGTGTGG -GTTATTTCTCCTCTTTTACATTGATTTTCTCACCTGCAAGTGAATAATAGTAACACTTTA -TGAGCAGGGTTATTGCAAGTAGCAAGGAGAAAATATATATTTACCATTTGCCACAATCCC -TGGGGAAGTGCAGTCAATACATTGGAAAGGGTCCTCATAAGAGTTTGATGATCATTCTCA -GAAAGCTAGCCAGAGAAAGTCTAAATGGTAAAGGTTCCAGCTCATTATCTTCTTCCCTTT -TCTCAAGTTTTCTCTCCATCTGACATGTGAGCTCAGTATTTACCATTGCCCTTTCTACAA -ATTTAACCAAGTTTATTTAAAAACATAATGACCTTCTATCCCAATTTACATTTTCTTTGG -TGTAGAGGACGCCTTTACCTTGATGTGTGGAGACAAGCCGTTGATTTGTAAGAAACACCA -ATTATCAGCTTCCACTTGTGCTTCACAATCTGCTGAGTCGCTTTAACACTTTTGATGAAA -TTGAGCAAGGCCTTGTGATCTCTCCTGTGCCAGCCGTGAAGTGTCCACTGCACGCAGCTT -GGCAGAACTATTTTCAGGGCCATAGGATGTTATGGCTGTGTGGGCAGGGAGCATTTTATT -CGTCTGTTTGATTCCTATGTTTTTATTAGTGGTGCAATTGCAAAGGTAATGCTATTGACA -CTTTTTGTGTAGCCTTGAGAGAAGAGTATGAATTGTTTTAGTAGCAGCACAGCGTGTCCC -TAAATATAAATCATGCTGTACTGATAGTTACTTTAGCAGCCACTGATCAGCAATAAATGT -TAAAAATTAACAAGAAGTTTCTTTTTTTCGAAACCGCCAAATGACTCTAAGCATTAAATA -TATTTTAGCCGGAGTTGCTTCTCGGCCACAGAGTGGTTCACAACATTAAACATATTTTCA -AAGTATTACTCCTTCCCCAGCCTCCAAGTGGTTGTAAACATTAAATATGTCTTATAAAAA -CTGCTTTGCCAGCTACTGGCAAGACAGCTATGAACATCATTTTTCTTTAAAGTTGCCTTC -CAGCTGCGGGACTATTTTTCCTTATTTGCTCTATTCTATTTATATTTTGTACACAAAAGC -AGGCAAGAGGCTACATTGGCCCAATTGTCTCTGGCTTTATGATAAGTGATCGTGGGAGAG -CAGTTGCACCTCCGTAAAACCCTGCTGGCCACAGGAGCTTGCTGAAGTTCAATCACTGAT -ACTGAATATTTCATATAGATGTCAGCTGTGTCTTCCAAAATAATTTTTGTTTTTCATTGT -GCAATGTGTTGAGGCATAAAGATGGGCATGCATTAACATCAGCATTAAGAAAAATAACTT -GAAGCAACCAGACACTGATGAATTATACCCACTGATTCAGGTGAAAATATTCCGTGAAGA -GAACAGACTCAAATGGCAGGACTAGTATGTTAATGAGGTCTTTAACCCAAACATGATGAA -AGACTTGGAACCTCTGTCTGGAAATCATCCAGTCTGACAACTGCATGCGATTCAAAAAGA -GTGAAGAGTATGCTATTACATAAAGGTCTATCCAGGACTTAGAGCAGGAAATCTTTTCAT -TTTAACCAAATTCACAGTGAAAATAACGTGTGTCCCCTGAGTGAATTGAAAAATAAATTA -GCTCTATCATCTCAGGCACAGTAATTCATCATCAGGCCAAATAATTAATTACTCAGGAAG -GCTTTGATTTCTATGGGAGCCAAGTGTTCTCCAAATTGTGTAGTACTGTATCTTGCCAAA -TGTTTTATTTTAGGTGTATGTTCAACAGGTTCCAATATTCATCAATACCTACGTGACAGG -CACTATTTTAGGTACTGGAGCTAGAACTTGAGCAAAAAAGACAAAAAGTCCTCCTCTCAG -GAAGCTTTTACTCCAAAGGCCTCCTGCAGGGGCAGCAAGCTAAACTCTGTGAGCTAAATT -CAACATATCATCTGTTTTTATTGGAACAGTTACACTTATTCATTTCTGTGTTGTCTATGG -CTGCTTTCACACCACAATGGAAGAGCTGGCAACAGAGACCATATGGCCTGCAAAGACTAA -AATATTTACTATCTGCTCCTTCACAGAAAAGTATGTTGACCCATTACATAGTGGATTGAG -TTTGAGAGAGGAGATTAAGGTAAGGCCAACATTTTAAATCGACCTATGAGGAAAAAGTGT -TTTTTTTCTCCCTAAAAATTACCTCTTCCAAAAGAAAACAAAACACAAAAAAACATAAAA -ATAAAAAGAAAGAAAATGGCAAGACCCAAGGTAAAATGAAGGGTAAAAGTGAGCACCACA -CTAATACATATGTATCAGCGTGAAATCAGATGCTGCCATTTAGTTCCTGCTGAAAGTGTT -GGTTTGGCTTTATTAAAATAACTTAAACACAGTCTTTCAATGTTATAGACTCATGGTAAA -GGTTTCTTTTTCCTTTTGTGAATTTTTAAAAATTTCTCTGCAAAAATTATTCCTCACACA -ATTATGTAACTTTATATTTTCGATTAAAACTAAAACTAAAAATGTTGAAAGAACAACTCG -ATATTGGATTAAAATATTCATTTTCCATCTTCATTCTCAGGATTCATATTTGGTTGCCTC -ATTGCGATATAAGTATGTTGAAAAAAATGGAAAATTGCTGAAAGCAAAAATTTTAAAACT -CACCAGTATTAATAATTATCACCAAATAACTATTACAGAAAACTTCCTCAGAAAGTAAAA -TTAGAGTGAAGGTATCACAGGTTGGCACTATTTTCATTCCCGACCAAGAAACTGACACCT -GAAAATTAAAAAAAAAAAAAATCAGAGTCTACAGTTTTACAAATAATTAACAAAATGAAC -ATCAAAATAGGGTGCAATTTGTTTAATTGGCAAAGGCACACAACGAAAAAGAAATATGTC -AATTAAACTGTCAACCATGTTAATTTTGCCTCTGAGAAAAACATTGTAATGGGATAATTT -CACGAAATGCTTTGATGACAAGAAAATGCCAAAATATGATTAGCTACTTCAAAATTCAGC -TGAAACAAGAAAGCAATTGGTGGCAGCCAGAATAACCAAAGGTCTTTTTTTTAATATACC -TTTTTCTTTCCGTCTCTTTTTGCCTGTGTTTAATCAACAGTGTACATTTTGTTTGACAGT -GAAATGATTCATAATGAAAACACTGGCATCACAAAAACTTAGCAGAAACTTTGCTATAAG -ATTAGAGTATACACTTCTATTTTCCCCAAACTCTTTAAAAATATAATTACTATTTCTGAA -AGAATTTGTTATCTTTATGAAATAATGTATTTTTCTTACTAGCATTAAGTGCTTATTTAG -CTAAAAGCCAGAATTAGAAGCAATTCACTCATATGAGTATGTTTATATTTATATTGTCAA -ATATATTTACTTAGAATTTAAACCAAGATATATTTTATTTATTGTTCTCATCACTGCCTG -TTAGTCAGAATGGATATTTTAAATTTTACCAGGTCATGTAAATTTTACTACCTATATTTC -TTATTCCTGTTTGTTTAGAAAAATAATCTAGTCTATACCTAAGCTAACAAATAATCCTAA -ACATTGAAAATACACACATGATTATGAAGAATTCACTTTCTAATAAGTTTCAGCTTTTTA -CAAATGGCTGCTTAAATATAATGCATAAATATGACGTATTTTTAAAAATACATCTTGTTT -ATGCATTACTTGAACCATAACTAATCCCCATTTCCAGTCAAAAAGAACACTGTCTACATA -TGTTAATCTCTAATACAACAAAAGCAGGCTTAGCTTAATGGGAAACTTATTAGAGTAAAA -TAGTTCTTTTTATTTTGTATTTTATTTTATTATTACAATAAGAAGATTATAATTTGGTAA -CCAGCAGGAGAAGAAAGACTTCCAAAGATTATTTTATGGGTCCATGTATTTGCTTTTCCT -GAGGGTAACTAGTGCTAATTCTAGAAAGGCAGAATGCTGTAGGAAAAACAAAATAAGCCC -TGGAGTACAAAAGATTTGGATTCAAACTTTGAAGAAGGATGGGGAAGTGTTTAGGAAAGT -GCTTCCCTGTGTAAAACTTAGCTGGTAAGTACCGTATTTCACTGAAATTGCTGTTATAAT -AATTGAAACAGATTATTATTGGAAAAACAACATCAGCAAAACTAAACCCTAGATGAGTTT -AAATAAGTGCCAGCCATTTTTCTCCTGTGGGGATTTAGAGTTGAAGTGGGTATCTTCGAT -TACTTTTCTTTATCCTCATTCTTATTCTTATAATTTTTCCTAATAAGTCACCTAAGAAGG -GCATTTAAATAAGTGCTAGCCTTTTTTCTCCTATGGGGATTTAGAGTTGAAGTGGATGTC -TTCAATTACTTTTCTTCTCTATTCTCGTTCTTATTCTTATAATTTTTCCTAATAAGTCAC -CTAAGAAGGGGATTCTAGAAACATCTCTTTCCAAGAATACATATTGTTTTAAGAAATATC -AGTTACCCCAAATACTTGAATAGAAACTAGGAAAACAGGAGAGGGAGATTATAGGTGTAA -ACAAGGAATTACTGTGCAGTAATATGTGTAGTGGACGTGAAAAGTAATGACTTAACAGTT -TGAATACACAAAGATGGACCAAACAAATACATTTATTTCTCCTTTATTCCAAATTGCCAT -TGAAATAGGAAAATACAGTACTTATTAAAATAACTATGGAAAGTTGAGAAGAATTCTACC -AACAGACTAAAGAGTAAGCAATTTCTGAAGGACATAAGTCATATCAGAGGCACTTTAGTC -TTTGGCCATAATAGAATTATTGGAGGGACTTGCCTTTTCACTATAAACAATGATAAAACT -GGGCAACATATATGAGGAACCAGATTTTATGCATCAAACAAGAAGTACAAGTTTTTCATA -CTCAAGAGAGAAGAAAGCTGTGAGGTAAGTACCACATTTAACCAGAGAATGTGACTAGGG -GCACTTTTTCTCCCATTAAACAAGGAGGCAGACTCTAAAAATAATGAGTTTAGAGGAAGA -AATTAAAGCTTAAAGCTCTCATTTTGTAGGTTGTCTGCTTATTTGTTGATAATTTCTTTT -GCTATGCAGAAGCTCTTAGTTTAATTAGGTTCCAAAAAGAGGTAGGGGTAGAGGGACAAG -AGCCAAGGAACTTCCTGTTAGGTATTCAGTTCACTACCTGGGTGACAGGATCAGTGGAAG -CCCAAACAGTAGCAGCAAGCAATATAACTTTGTAACAAACCTGCACATATACCCCCTGAA -TCTAAAATTACAATTAAATCTCTATCTATATCATCTCTCTCTCTCTCTATCTATCTATCG -ATATTTGTAAATAGCTTACACTTGCTTAAGTGGTTGTTATTTAAGAGCTGCTGAAGTGGC -TGAAATTTGTATAGGAAAGGAAAAAGAGGCATGGGGTAGGCCTACAGAAGCCAGAATAAG -TATTTGGCACCAATTTCGTCAAAAGCTGCTCTATACCTGAGCAGGGTTAACATCTGCAAG -ACCTGCTGAAAGCAGATACTGTGGGAATAAGATATCAGTAGTAACAGGACAAGGAGATGT -TTGGGTTACTGACTAGCCACAGGGAAGATATTTTACGGAATATTCCAGGCATTAAGAACA -AGGACTATTTCTTTCAGTAAAGACTATGTCCTATCTTTAAGAACAAAAATGAAAGAGATT -TTCATTAACAAATAATGAAACCAGGCATGGCAGATCAAAAGGATCTGGAAGCCACTTAAC -AGCTTTCAAAGGACTATCTTAGGGCCTTTTACAAGAAGACAACGTCTAGATTTTCTACAG -TGTATTAGTCAAAAATACAAGTTACGCAAAGAAGCAGGAAAATTTAATAAATAATCAAGG -GGAAAAAAGCAGTCAATAGAACATACATCATAATGACCCAGGCATTGAAATCAACAGATA -TATATTATGAATATGTCCTTAGACTATGGAGATATTGTCACAATGATAGAACGAAAAAGG -TAATTTTGAGAAATGAACATTGTTGAAGTAACAAGTGCAAAATATACAATGAAAGAGTCA -CTGGCAGGGTATACTCTCAGATTTCAGCTATCTGAAAAGCAGACAAATAACCTTGAAAAC -CATTTTTGAATTTATCTAATCTCAAACAAAAGAGGAAAATGATTAAAATAAGAGATTGAA -TGACATGGGATATCAAGCCATCAAACCTATACACAAATGAAGTCTCAAAAGGATGAGTCA -GAAAACGGGAAAGAAAAAATATATTTGAAGAAATAATTGCCAGTGTTTTCAAATTTTGAT -GAAAAATTTCAATGCAGAGATCCAAGAACATCGAATCAACAAGAACACATGTATGAAGGG -TATAAAGCCAATAGCATAATTAAAATGAAATATTGAAAAATATCCATTTACTCTAAGGAA -GGTAACACTGAACCGGTGAAGAAAGAAATATCAACAACCAAGGGAAGAGAACAACACCCA -AATAAGAAAATAGTAGGCAAATAGTAACATGGTAGATTTAACAACACTGACATCACTGAT -TAAATGTAAATGGATTGAACATTTATATTAGTTTCCTAGGGTTATATAACAAAGTACCAT -AAACTAGGTGCCTCAAGCAACAAAAATGTATTATCTCAAGTCCTGAAGGCTAGAAGTCTG -AATTCAAGGTACTGGTTCCTTCTAAAGGCTGTGAAGAAGAATCTGTTCTAGACATCTCTC -CGTGGCTTTTAGATGGCTTTCTTCATGTTCATATGTTGTGCTTTCTGGTGGCATTTTTGT -GTCCAAATTTCCCCCTTTCATAAGGCCACCAGTCACATTGGATTACAGGCCACTCTAATG -ATCCCATTTTGACTTGATTACATCTATAAAGATCCTATCTTTCAATAAGGTTGCATCCTT -AATTACTGGGGTAAGGACTTCAACATATGTTTTTTTTTGGCAGACAATTCAACCCATAAC -ACACTTAAATTAAGAGCTAGCAATTTTGAGACTGGATTTAAAAAGGACAAAACTATATGC -TCTGTACAATGAATGCATTTTTTTTTGAGATGGAGTCATAAAATTGGTTGAAAATGAAAG -GAAGTAAAACTGATGTAGCTGGCTAATACCAGAGAAAGTAGGATTCAAAATAAGCTGAAA -TATCAGAGATGGAATGATAAAAGGTTAAGCTCATTAGCAATGCATTATAATTCTCAATGT -TTATACAACTGATAACAGAGTGTCAAAATGCATGAAGTAAAAACAGACAGAACTAAACAG -AGAAATAGACTTTCACTCATAGTTGAAGATTTTAATCATCGGCTCAAAATAAACAATAGA -TGAACAAATAAAAATTAGTAAGGATTTGGAAGATTCTAATAATACTATCAACTAATGTAG -TCTAATTTCAAGTTTTAGAACACTACACAAAAAATTTCATGAAAATATTTTTATCAAATT -TACATGGAATGTTTACGAAGGACTGTGTAGTCTCGGCCATTAAATAAGTCTCAATAATTT -GCAAAAGATTAAAAATTATCTTCAGAGGTGAAACAATGGTTTTTGCGTTGATTGAGATAT -TGATTAAATATGTACATTTGTGAAAACTTACTAAATTGAATGCTTAATTTCTGTGCATTT -TGCTGTATCTCAATTCTAATAGAAAGACAGGCAAACATATAAATACCTATTACAGACATA -CCTACACACACACACGTGTGTGTGTATGTGCGTGTATGTGTGTGTATAAGTCATTTGCAT -ATATACTGGCAGTGAATCCCAGTAAAGGTGAGGAATACATATTCTCACACTGGTAGAAAT -GATGACTTTTTACAGTCTGATTTTTTTTGTGTTTTAATCCAGAAATAATCCCAAAATTAG -AGATACAAGGGAATGGTAATTGGTCCATAGAATAAGAACTGTCCAAGAAATTTATAGCAC -ATGACTCCCTGTCCCACGCTCAATTGAGGTTTGTTTTATGTCTTTGCTCTGGGAAGAAAG -CCCCCAGTTTAGCTTTCTAAATGATGAGTTCAGAATCTACTCACAAGCATTAGAGATTAG -TAAAGTTCCTTGTATTCAGTAGAGTTCCTTGCAACAAGCATGAGCAGTAGAGATGCTGCC -ACTTTCAAAAGACTTACATATGGACCACTGAAAAGAAAGGCATTATGTGGTCAATTTTAG -AGCATGCGCCATGTGGAGATGAGAAGAATGTATATTCTGTTGTTTTTGGATGGAGAGTTC -TGTAGATGTCTATCAGACTCATTCGGTGGAATGTTGGGTTCAGATCCTGAATATCATTGT -TAATTTTCTACCTTGATGATCTGTCTATGACTATCAGTGGTGTGTTAAAGTCTCCTACTA -TTATTGTGTGGGAGCCTCTTTGTACTCTAAGAACTTACTTTATGAATCTGGGTGTTCCTG -TGTTGGGTGCATATATATTTAAGTTAGTAAGGTCTTCTTGTTGAGTAAAACCCTTCATCA -TTATGTAATGCCCTTCGTCTTTTTTTTTTTTTTTTTTTTTTGAGACGGAGCCTCACTCTG -TCGCCAGGCTATAGTGCAGTGGCATAATCTCAGCTCACTGCAATCTCCGCCTCCCGGGTT -CAAGTGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACACACCACCAAG -CCCAGCTTATTTTTTTTTTTTTTTTGGATTTTAGTAGAGATGGGGTTTCACAATGTTGGC -CAGGATGGTCTCGATCTCCTGACCTTGTAATCCGCCCACCTTGGCCTCCCAAAGTGCTGG -GATTACAGGTGTGAGCCACCACTCCCGGCCCCTTCTTTGTCTTTTTTGAACTTTGTTTGT -TTGACGTCTGTTTGGTCTGAAATTAGGATGCAACCCCTGCTTTTTTCTGTTTTCTACATG -CTTGGTAGATTTTCCTCCATTCCTTTATTTTGAGCCGATGGGTGTCATCACATGTGAGAT -AGGTCTCAAAGACAGCATACCATTGAGTCTTGCTTTTTTATTCAGCTTGCCCCCCTTTAC -CTTTTAAGTGGGGCATTTAGCCCACTTACATTCAAGGTTAGTATTTGATATGTGTAGATT -TGATGCCCTCCCTCATGACTCCTATTCAACATAGGAAATCCCAGCCAGAGTAATCAGGCA -AGAGAAAGAAATAAAGGGCATCCAAATAGGAAGAGAGGAAGTAAAATTATCCCTATTTGA -AGCTGACATGATTCTATATCTAGAAAACCCCATAGTCTCAGCCGCAAAGCTCCTTCTGCT -AATAAAAAAAACTTCAGCAAAGTTTTAGAAAGAAAATCAATGTACAAAAATCACTAGCAT -TTCTATAAACCAACAGTAACCAAGCTGGCAGTCAAATCAGGAAGGCAATCCCATTCACAA -TTTTCACAAAAAGAATAAAATATCTAGGGTAAAATACAACTAACCAGGGAGGTGAAAAAT -CTCTACAATGAGAATTACAAAACACTGCACAAAGAAATCAGAGAGGATACAAACAAAATG -GAAAACATATCATGCTCATGGATAGGAAGAAGCAATATCATTAAAAGGCCATACTGCACA -AAGCAATTTACAGATTAAATGCTATTTCTATCAAACTACCAATGACATTCTTCACAAAAC -TAGAAACAACTGTTTTAAAATTCATGTGGAACCAAAAAAGAGGCCAAATAGCCAAGGCAA -TCTTAAGAAAAAGAACAAAGCTGGAAGCATCAGGCTATCTGACCTCAAACTATACTACAG -GGCTATGTTAACCAAAACAGCATGGTGCTGGCACAAAAACAGGGACATAAACCAATAGAA -CAGAATAGAGAACCCAGAAATAAAGCCACACAGCTATGATTATCTGATCATCAATAAAAC -TGACAAAAACGAGCAATGGGGAAAAGACTGTCTTTTCAATAAATGGTACTGGGATAACTA -ACTAGCCATATGCAAAAGATTGAAACTGGACCCCTTCCATACACAATATACAAAAATCAA -CTCAAGATGGATTAAACACTTAAATGCAAAACACAAAATTAGAAAAACCCTGGAAGACAA -CCTAGGCAATACCATCCTGGACTTAGAACAGGCAAGGATTTCATGACTAAGATGTCAAAA -GCAATTGCAACAAAAGCAAAAATTCATAAATGGGAACTTCATTAGTCCATTTTCACACTG -CTATAAAGAACCACCAGAGACTGAGTAATTTATTAAGAAAAAGGTTTAATTGACTTGAAG -TTCAGCATGGCTGGGAAGGCCTCAGGAAACTTACAATTATAGGAGAAGGTGAAAGGGATG -CAATGCACCTTTTTCACAAGGCGGCAGGAAGGAGAAGTACCAGGCAAAGCTGGGAAGAGC -CCCTCATAAAACCATTAGATCTCGTGAGAACTCATTCACTATCACAAGAACAGCATGGGT -GAAACCACCCCCAGATTCAATTACCTCCACCTGGTCTCTCCATGTGGTGATTATGGAGAT -TATAATTCAAGATGAGATTTGGGTGGGGATACAAAGCCTAAACATATCAGGATCTCATTA -AACTTATTAAGAGCTTCTGCATAGCAAAAGAAACTATTAACAAAATAAACAGACAACCTA -CAGAGTGGGAGAAACTATTTGCAAACTATGCGTCTGACAAATGTCTAAAATCCAGCACCT -ATAAGGAATTTAAACAAATTTACAAGAGAAAAACAACCCCACTAAAATGTGGGCAAAGTA -CATGAACAGACACTTTTCAAAAGAAGACGTACATGTAGCCAACAAGCATGTGAAAAAAAA -AAAAAACTCAATATCACTGATCATTAGAGAAATGCAAATTAAAACCACAATGAAATATAA -TTTCATACCAGTTAAAATGGCTACTATTAAAATGTCAAAAAATAACAGATGCTGGCAGGT -TGTGGAGAAAAGGAAACACACACTGTTAGTGGGAGTGTAAGTTAGTTCAACTATTGTGGA -AAGCAGCATGGCAATTCCTCAAAGAGATAAAAGCAGAACTACCATTCCAACCAGCAATCG -CATTACTGCATATATACCCAGAAGAAAATATATCATTCTACCATAAAGACACATGCACAC -AAATGTTCATTGCAGCAATATGCACAATGGCAAAGACATAGAATCAACCTCAATGCTCAT -CAATAAGAGTTTAGATAAAGAAAATGTGGTATATAGACACCATGGAGCTATAAAAAAGAA -TGAGATCACGTTCTTTGCAGTAACATGGATGGAGCTGGAGGCTATTATACTGGGCAAATT -AACACAGGAACAAAAAACCAAATACTAGAAAATACTGCATGTTCTCACTTACAAGTGGGG -AACTAAATTATGAGAACACATGGACACAAAGAAGGGAACAGCAGACACTGGAGTCTACTT -GAGAGTGGAGGGTGACAGGAGACAGAGGAGCAGGAAAAATAATTGTTGAGTACTTGGTAC -CTAGGTGACAAAATGATCTGTACAACAAACCCTGATGACACGAGTTTACCTATATAACAA -ACTTTCACAGGTACTCCCAAACCTAAAATAAAAGTTAAAAAAAAAAGAAGAAAGCAAGCC -CAAACCCATGCTGTACCACATGGCAGCATGGCATTGCAATTTTCAATTACTTGGGGGAAA -TAAAGATTCTCATGTGATTAGAGGAAATAATCTTTATTTAATAACAATGCTTAATAAGAA -CACTTGACAATAGATGGCTGAGAAAACCAAGAATCACCAAATATTTAAATAAAAATAAAA -AATAAAGAATATCCTATCTATGAAATGATATAAACACTAAGAAAATAGAATTAACAGAAG -AAACTAAAGAAAATGTCAATAGCCTTGGTGTCCATACAGGAATAGGAATCACAAAAATAA -CCATAAAATATCTTTAAAAATACCTTTTACAAGCAATACTCCAGAATGTCAGAAATTAAA -AATATCATTAGCAAAATAAAATGTAGTAAACAGAATAAATAATATAAAAGATAAAACTGA -TGTCTAAAGATTACCCCAGAATCTGAACTGCATGAGGGGGAATAATAATTACATTACTGG -ATATGAGAACAACTATTTAATAGACACATCTACTTGGACGACAATGCTTTTTTTTAAATG -TCAAATTTCCAGTTTATTAATTTTAATACATTTCTGATCAAAATTACTCATTTCTTTTTG -TGCCCTTATTTTTTGCATTAATTTTCATTTTTTACATCTCCATGCTGAATTGAAAAAAAC -TTCTACTGACCTGTATTTCAGTATATTAATTATTCCTTCAATGGTATCTAATCTGGTGGT -AAATTATCTATTATATTTAATTTAAGTAATCACATTTGTATTTTAAATGTTTTGCTTATT -TCATCCACAATATAATACTAACACAGTTTTTTCTCTACAATTAGTTTAAGCTTTTAACTA -AATTTTTAAAGAGTAAAATATTTATTATCTTTCTCTGATATTGTCCAATGTTTGAAGTCT -TTGTGGGTCTAATTATGTTTCATGTTGTTTTTGTTGATTTTAACTTATTTTGCCTTGTTT -CACTGTGTTCTTGAGAATATTGTAAGATTTAGCTTCAGGGTATTTTTATAAAGCATTCAG -ATTTGCTTCTCCCTAACACCTGGGCATACGAGTAGGACCACCTTAAAAAAATGTTCAATT -TTTGAGATTACCTGAGTCACGCAGTCACACAAACCCAAGTAGTGGATCCAAGCTACCACT -GCTTTAGGTCTGTCTGGTTCACCTTATGCTGAGGGTATAAGATTTGGTCATCTCAATCTT -TTAAGGGAGGGCTCCTTAAGAAGACTTAACATATGCTAGTCTTCGGTTTTGGTTTCTTTC -TCATTCACCCTGAAACTGTCAAAATAAATATTAATATTTGTAGAGATCGGCAAATTGCAA -CTGGTAAGGTTATATCCTTACCTCTCTGGGTTCTTCTTTTCTCTTCAAATTTGGTTTGAT -CACTTATTAAAATTTTATAAGCTCTCAATGCTTTTTAAAAGGTGTTTTAAATGTGCAATC -ACTAGCATTTTTTAAAGTTTTTTCATTGGGAATTATAGTCTGAATAATATCCCACCATAA -CTAGAAAATGAAATACCTACCTATTACAAATGCAATACATTTTATATATTGAGTAATTTG -CCTTTTTCTAATTAATTTTATTTTAAAACTCATATTAAAATCTATATTTTATAAGAAAAA -AAAAAGCTTGACATTTGATTAAGACCAACTTAGCTTCACAAATCTCTACCCGAATGACCA -CAGGCAAGTGATCTGACATTTGCAAGTGCCTATTTTTTTTTTTTTCATTTATAAAATGGA -AATACATAACTGAGCTTAATGTGGATCAAAGAAATTAATATGTAAAAAAAATCAAACACA -GAACCTGAAAATGATAAGAAGTCAATTATTCAGCCTACCTATCCACTACTAAAATAATTC -TTGGTTATGAACTTCACAAGGGAACACACTGTATTACTACTCTACTCAATAACATGAATT -TCATTGTGTATCTTCTGGGGCAAGTGCTTCTTCCAAACAAATTAAACCATGCCACTCACA -CAATGTTCATCTATCTCATAAGTTTTTGCTATTCTCCTGTTGATTTTACATTTAATGAGG -GATAAATTCTATTTTCACAATCATAATCAAATAGTACCTAGAGAATAAAAGCTACTAAGA -CAAGACAACCCTCAATTCAATTAATAACTAATCCCTCTCTTTTTCAGAGGTACATCAGTT -GTTCCTCCAGGATTAAAACCCCTCCTGTTTTACAATAAAAAAGCAGACAGATGACAGGAT -CTCATTAAACATCTATTGTGCACCCACACAAAAAAAATTAAGTAGGCAGATGCTCTGATT -ATTTAAAGCAACACACAGAGTTTTAGGATGAAGTTTTATTTCTGAATTTTAACTTATGCA -CAACTTGTAAAATGTTACTTCATTTATAAATAAAGCAGCACTAACTTGGTAACTTGGAAT -CACTGTATTTTTCTTCATAAAGAATAAAGTCAGGATAACAGTTAGCTCTCTTTGTTTCTG -AATAATGATTCAACATTAATTTAGATATTAGAAGCTCTCACATGCAAGTGTGTGTGCATG -CACTCACACACACTACTCTGAAGAGGTAGAATATGAGGAAATAAGATACACATACACAGA -GAAGGAATATGTAACATTTAATTTTAACCTCAATGACAGGAGGAGGAGCACCAATAATCT -AAAAAAAAAAAGAAATCCCAAAAGTTTTGGATTAGCTGCCTCTCTAGCTAAGTATATATT -TTATTGCTAATCAATATGGCATGACCAAAATTATAGCAATGAAAATAGTCATAATTATCA -TCAAGTAATGAAAATAATATTTATGTGCTCAACAGTATTACTTATATATTCTGATTAAAT -CATGTGCTATTGAAAGCAGGATGAAGTAGAGCTTATTTTATATCTTAGAATTCTTTTCCA -TCAATGTCAGTTTAGGAAATGAAAGTTATTAGAATTTCAATGTAATTAATTCATTTGCAA -TTGGTGTCTATTTGGTTATGTATTTTTTCATAGATATAGTCTGAGAGACATTATCTGGTA -ACTTTTATCATGTCTCTGAGAAACCAATTTTCTGTAAGGCCTAATTTTTTTTCTTAAATA -AATATGTATCTGCTCATGCATATACCAACTTCATGGATTCCAAACAATAAAAAAGTTAAT -TGTAAGGATTGGACAGAATCTACATTTTAAATGCTATTTAAAACATTATCACACTGTAGT -GGAGGAATCATAAACTGCAGAAGTTTTTCAACCATGCCACCACAAACTACATCTCTACAA -AATGTTTTGTACTTTTACTTTAAAGAACTAAAATTGGAAACAAAGAGTGGAGAATTATTT -CTTCCCTTTCTCTTCCCTTCATCCTCATTCTAGCACCAGTCATACCTGACCATGATTTTT -AAAGAGTATAAATTACTCCTCTTCTCAGAGGTAGAAATACACAGAAAAATACACAAAAAC -CAAATTCTGTCAAAATATATTTAAAGAGGTTTATTCAGAGCCAGTATAAGTGACCAAGGC -CTGGGTTACACTATCTCAAGAGGTTCTGAAAGCGTGCCCAAGGCAACCGGGTTACACTTT -GGTTTTATACATTCCAAGGAGACAACCAACTGCAGGTAATTGCAGGTAGGTCAGGGTAGG -AGCTTGTACGTCATAAGGGGCTTTTAGGGATCCTTTAGTTGACAATTGGTTGAGAGAGTT -ATGCTATCGTCTAAAGTCTTGAAATCGATAGAAAGGAATGCCTGAGTTCAGATAAGAGTG -GGGGAAAGACCAAGGATCTTATTAAGTAGATGAAGCCTCATAGGTGGCCCTCAGAGAGAA -TAGATGGTAAATGTTTCTTTTCAGACCTTTAAAGGTATCAGACTCTCAATCACTCCTAGG -TCCTGGAAAGGCATAGAAAGGGGAAGCATGGCTGCATTAATGAAGATTCTCCATAGATGC -AAATTTCCTCTACCTCAGTTTGCTGGCCTTGCAACAGCCATTTCAAAAGACATCAAAGAA -ATATATTTTAGGGCAAAATATTTTTATATCCTTCAGGGTCTGCTGTCTGTTATGTGATGC -TGTACCAGAGTCAGGTTGGAAAGCAAGCCACATTATATAGGGTTAATAAAAAACCCATGT -AAAGAGATGTTATCATTCGTAGGGCTGACTCCCAGTTTCTTTAAATAGGAATTTGGGCAA -GATGAAAAAAAAAAATCAGAATTTAGTCCTCAACTCAAATATTTTATTCATTCAAACGCT -TATTCAAACTACCGAATCCTCCAATAACAGAAAGTATAGTGTCCATCCTGAAGACTTTCA -TCCCATCTCACAGCATGTTTTCTCCTAGTACACCCTGATTGTCCAAGGACTTCTGAGAAC -ACCATTCCAGAAGAGGTCATGATCTCAACAACTGTCACAGAAAGAAAGAATACAGGAAGA -CAAGATGCGAAAGTTATGTCAGTGGCTTTCATTCATCACACCACTACGTACTGGTTCTCT -AGTACTGTGCTGTTATGATCCTCCTGACTTTTACCCTGTGAATATCCTAGTGCTTTTATA -TCAGTCTCACATCCTCAACACGCTGGTTTCCATAAAAACGCAACCAAGTCAGATGGCTGT -GATCTGGTGGGATTCTAGTTCCATTTGCAGCCTCCAAAGCAGTCTTTTACCTAAGAACAC -TCAGGCCTCCAAGGTTAAGATAACAGTATACTCCAATGCAAAATTCTCTACCTCCCTACT -TCAGGTCCCAGGGACTCTCAACTGCCAGTCACTCTTTGAACAATAAGAACAGACACTTAA -ATGATGATAACTAATGGCAGCAGCACTAATGTAAGAACGCTGGAACTATTAGTACTCTCA -TCTTCCAGATGAGAGAACTCAATAACATGATTTACATAAACTGCCTGAAGATACAGTAGA -ATAATAATAATGAAATCCAGGCATTGTGGACCTGAATGCTGCTACCTAGCTACTCCTCTT -TGTGGCTTCTCCATTGTCTTTTCCTGCCAGTTCAGTTCTAACAGGAATAAACAGACGCTA -TTCAAAAGCTTTTCAAGTCTGAATGAAGATGTACCTAGGGTTGGTGTTGATGAACTTTCC -CTCACTCTTCCTTAAGGAAATTTGTTCTTTCCTGCCATTTGAGAGACATGTTCTTCCTCT -TCTTCTTCAAAGAGCTAAAATGCACCTTTATTTGCATAATGAAGATAGTGCAGTGAAGTA -CTTTCTTGCATTGTCAAGGAAATGAGCTCCAAAGGAACTTCCTTTTAGCATAGAGAAAGC -TGCTGTTTTTATTTACTTGCATTTTGCATATGAATATATTTTAATTTAGTTTCAACAGGT -AACTGAATTAGAAAGTGAAATTATAAAGTCATTTCTCCAAGAAAGAAGGTAGAGCTTATA -AATATTAGTAATCTTAGCTGGGCACGATGACTTACGCCTGTAATCCCAGCACTTTGGGAG -GCTGATGCGGGCCAATCACCTGAGGTCAGGAGTTTGTGACCAGCCTGGCCAATATGGTGA -AACCCTGTCTCTACTAAAAGTACAAAAATTAGTCTCTACTAAAAATACAAAAATTAGCCA -TATGTGGTGGCACGTGCCTGTAATCCCAATTACTCGGGAGGGTGAGGCAGAAATTGCAGT -GAGCTGAGACTGCACCACTGCACTCCAGCTTGGTGAACAGAGTGAGACTCCGCCTAAAAA -TAAATAAATAAATAAATAAATATTAATAATCTTTTAAAGAAATGATTGTGGCTATTTCTA -GGTCTAATGATACTTGCTTAATCGTATTGAAAACAATGTTATTTCTTTGAATGGCAATGG -AATGTAAAATATTTAAAAACGCAATTTGACTTTTTTTACTTTTTAAAATTTATGTAGCTG -GGCCGGGCACCCTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGACTGGAGGA -TCACAAGGTCAGGAGATCGAGACCATCCTAGCTAACAGGTGAAACCCCGTCTCTACTAAA -AACACAAAAAAATTAGCTGGGCGTGGTGGTGGGCACCTGTAGTACCAGCTACTCGGGAGG -CTGAGGCAGGAGAATGGCATGAACCTGGGAGGTGGAGCTTGCAGTGAGTTTGAGATTGCT -CCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCTGTCTCAAAATAATAATAATAATA -ATAATAATAATAATAAATAAAATAAAATTTATGTAGCTGATATATTACTATAACCTCACT -TGCATTTTTAAATTATTTTACTGGTTCTCTCTTTTTACTTTTATCTTACCTATGCTGTAT -TTGAAGTTAGTTTTATATAGGCAGAATTTTAAAAATTATTTATTTATGGGGTACAAATGC -AATTTGGACAATATTGTTGGCCATGTTTTTTCGTTTTTGATTTTTGTTTTTTTAACTACT -CTGCCAATCTATGTTTTTTAGTTGGTTTCTATAGGCCTTTTATGTTTAACAATTTGTATG -TTGTGGTTGAAGTCTACCACTTTGTTATTTGTTTCTGTTTCTTATTCCTCTGTCTTTTTC -TTGCTTTCCAATGGGTTACATAAACATGTTAAGTTTCCATCTTAATTTATTTATAGTGTT -TTAAATACAATGTTGCATCACTTAATGACAAAAATTACATTCTGAGAAATGCATTAGGCA -ATTTCTTCACTGTGTAACATCATATAGTGTATTCTATGTGTAAATAGAGATAGCATAACC -TACTACTCACCCTAGGTTATGTGGTACAGCCTATTGATCCTAGGCTACAAACCTATACAG -CATGTTACTGTGCTGAATACTGTAGGCATCTGCAACACAACGGTAAATATTTGTGTATCT -AAACATAAAAAAAAGGTACAGTAAAAAATCCAGTCAAAACATCAAAAATGGTATACCTGA -TTAGGGCACTTACTATAAATGAAGTTTGCAGGGCTAGGAGTTGCTCTGGGAGAGTTAGTG -AGTAAGTGGTGAGTGAGCGTAAATGCCTAGGGCATCATTGTACAAAACTGTAGACTTTAT -AAACACTGAATTTATAAAATTTATAAAGAAACTTATTTCTTTCAAAATACATTAAACTTA -TCCTACAGTAACTTTTTTACTTTATAAACTTTTTAACTTATTTTTAACTTTTTGACTCTT -TTGTAACAACACTTAGCTTAAAACGCATATTGTACACAGAAATACTTTATTTCCTTATAT -CCTTATTCTCTAAGATTTTTTTGTAATTTTACATCTTTTAATTTTTAATTTTTTTGTTGT -TGTTGTTAAAAACAAAGACACAAATGCACATACTAGCGTAGGCCTACACAGGGTCAGTAT -GATCAACATCACTGTTTTCTACCTCCAGATCTTGTCCCACTGGAAGCTCCTCTAGGCCAA -TAATGCATATGGATCTGTCGACATCTATGATAACAATGCCCTCTGAAATGCCTCGTGAAG -GACCACTGTGAGGCTGTTTTACAGTTGCCTATTACTTTTTTTTTTTTTTTTTTTTGAGAC -AGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGTGATCTCGGCTCACTGCAAGC -TCCGCCTCCTGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAATAGCTGGGAATATAG -GCGCCCCCCACCAGGCCTGGCTAATTTTTTGTATTTTTAGTAGAGATGGGGTTTCACCAT -GTTAGCCAGGATGGTCTCCATCTCCTGACCTCGTGATCTGCCCGCCTCGGGTCTATTTCT -TTTAATAAGTAGAAGGTGTACACTACAATAACAATAAAAAATATGGCGTAGTAAATACAC -AAAAATGTAATATATTTGTTTATTATTATTACTAAGTAAAATGTACTTGTATTAGTCCAT -TCTCACACTACTATAAAGACACTACCCGAGATTGGGTAATTCATAAAGGAAAGAGGTTTA -ATTGAGTCACAGTTCTGCATGGCTGAGGAGGCCTCATGGAACTTACAATCATGGTGAAAT -GGGAAGCAGTCATCTTCTTCACAAGACAACAGGAGAGAGAAGGATTGTGTGTAGGAGGAG -CTGTGAAACACTTAACAAAACCATCAGATCTCCTGAGAACTCACTCACTATCGTAAGAAC -AGTATGGCGGAAACCGCCCACATGATCCAATCACCTTCCACCAGATCCTGCCCTCAACAC -ATGGGTATTATGAAGATTACAATTCAAGATGAGATTTGGGTGGGGATATAGAGCCAAACC -ATATCATTCCACCCCTGGCCCCTCCCAGATCTCACATATTTTTTACATTTCCAACCCAAC -ATCATGCCTTCCTAACAGTCCCCCAGAGTCTTAAATCATTTCAGCAGTAACTCAACAGCC -CACAGTTCAAAGTCTCATCTGAGACAAGGCAAGACGTTTTGGCCTATAAGCCTGTAAAAT -CAAAAGCAAGTTAGTTACTTCCTAGATACCATGAGGGTACAAGAATTGGATAAATGCTCC -CATTCCAAATGGGAGAAATTAGTCAAAACAAAGGGGATACAGGCCCCATGAAAGTCTGAA -ACCCAGCAGGGCAGTCATTAAAACTTAAAGCTTTAAAATAATCTCCTTGTCTCCATGTTT -CACATCCAGGGCATGTTAATGCAAGGGGTGGGCTCCCATGGCCTTGGGCAGTTCCTTCAC -AGGCTGGCATTGAGTGTCTGTGGCTTTTCCAGGTGCACAGTACAAGCTGTTGGTGGATCT -TCCATTCAGGGGTCTGGAGAACAGTGGCCCTCTTCTCATAGCTTCACTAGGCAGTGCCCC -AGTGGGGACTCTGTGTGGGAGCTTCAACCCCACATTTCCCTTCTGCACTACCCTAGCAGA -GGTTCTCCATGATGGCTCCACCCCTGCAACCAATCTCGACCTGGACATCCAGGCATTTCC -ATACAACCTATGAAATCTAGGCAGAGGTTTCCACACCTGAATTCTTGACTTCTGTGTACC -CTCAGGCCCAACACCATATGGAATCCTCCAAGGCTTGGGGCTTGCACCCTCTGAATCAAC -AGATGAGCTGTACATTGGCTCCTTTTAGCCACGGCTGGAGCTGGAGTAGCAGCAGCTGGG -ACACAGGGCACCAAGTCCTGAGGTTGCCCAGAGCAACGGGGCCCTAGGCCCAGCCCATGA -AACCATTTTTCCCTCAGAGGCTGCTGGGTCTGTGATGAGAAGGGCTGCCATGGAAGTCTC -TGATACGCCCAAGAAAAATTTTGCCATTGTTTTGGCTACTGTAATAACATTTGGCTTCTT -GTTATTTAGGCAAATTTCTGTAGCCAGCTTGAATTCCTCCCCTGAAAAATGGGTTTTTCT -TTTCTACTGCATGGTCAGGCTGCAAATTTTCCAAACTTTTATGCCCTGCTTCCATTTTAA -ACATAAGTTCCAATTTGAGATAATGTTTCTCAAATTAAAAGTTCCACAGATCTCTAGGAC -AGGGGCAAAATGCTTCCAGTCTCTTTGCTAAGGCAGAGTAACAGTGATCTTTGGGCTCTA -GTTCCTAATGAGTTCTTGTCCATCCAAGACCACCTCAGCTTGGACTTCACTGTCTATATC -ACTATCAGCATTTTGGTCAAAACCATTCCACAGGTCCCTAGGAAGTTTCAAACTTTCTCA -CAACTTCCTGTCTTCTTCTGAGCCCTCCAAACACTTCCAACCTCTGCCCGATACCCAGTT -CTAAAGTCACTTCCTCATTTTCAGTATCTTTATAGCAGTGCCCCACTCCCAGTACCAATT -TACTCTATTAGTCTGTTCTCACACTGCTATAAAGATACTACCCAAGACAGGGTAATTTAT -AAAGGAAAGAGGATTAATTGACTCATAGTTCTGCATGGCTGGAGAGGCCTCAGAAAACTT -ACAATCATGGTGGAATAAAAGCAGTTGGCTTCTTCAAAAGGCAACAGGAGAGAATGAGTG -TGTCTAGGAGAAATTTTCAAACACTTTTAAAACCATCGGATCTCATGAAAACTTACTCAC -TATCATGAGAACAGCATGAGGGAAACTGCCTCCAGGATCCAATCACTTCCCACCGGGTCT -TGCCCTTGACACGGGAGGATCATGAGGATTACAATTCAAGATGAGATTTGGGTGGGGACA -CAGCCAAACAATATCAGTACTAGACAGAATTTTATGTGCTACACTTTTATATAACTGGCA -ATGAAGTAGGTTTGTTTACACCATCATTGCCACAAACAGGTGAGAAATATGTTAGACTAT -GATGTTAAGACAGCTCAGCTGCAATGTCACTAGGTAATATTCATCTCCATTATAATCTTA -TGGGACCACCATGATATATGCAGTCTATTGCTGAGCAAAACATCGTTATGCAGTGCATGA -TTGTACATGATTTTGTTTGGTTTTATTAATTGCACTGGTTAAAATAATATATGTGTAATG -TCAAGATCTACTGTTACTGATGTTTTACCTCTTTGAGTGAAGTGTAGAAAACTTGTTTCC -ATATGAGTCTCTTTACTATCACTACTTTTTAGATATAATTATCTTAAATACTTCCTCTAT -GTTCCTTGAGCATCTAACCAGATAGGTCATTAATTTTTGCTTCAACTATTAGAAATGGCT -TAAAAACTTAAGAGAAGTTTGATTATACATTATATTTATGCTTAATTTTACCCACTTAGA -TGGATGTTTGTAAAAGCTGCAAACCTTCTTCTTTTATCATTTCTTTTCTGTTTAGAGAAC -TTATTCTTTAAAGGTAAGTTTGTTAGCAACACATTACCTTAGCATTCTTTCATTAGACAA -TGTTTTCTATGTCACTTTAATGCCTGAAGAATATTATTGCTGGGTATTGAATTTACAGTT -CACAGTTCTGTTCTTTCAGTACTTCAAAAATATTATCTGTTTCCTTCCAACCTTCATGGT -TTCAGATGGAAAATCTGCTGTGGTTTACATTATTGTTTTTCTGTAAGAAATGTGCCATTT -TTCTCTGTTTGCTTTCAAGATTTTTAATTTTTTTTTTCATTTCCAGAAATTGAATGTATT -TGGGTGTATTCTATTTAGGGTTTGCTCTGTTTCTTGAATATTTAGGTTTATGTCTTCCAT -CTAATTTGGGAAGGTTGAAGTCATTATTTTTTAATAGTTTTTTAGTCCCACATTCTATCT -CCTCGTCTTCTATATCTCCTGTGATTATGAATGTTAGCTCTTTTATTATTGTTCTAAAGT -TTCCTGAAATGCTGTTAAATTTTTTTTTATCGGTTTTCTTCTTCTTGTTCAGACAGTAAA -TTCTGTTGGCATGTTTTCAGATTCACTGATTCCATCTTCTGCTCTGTGCTCTCTTCTTTT -GTGACCATCCTCTGCATCTATTATTTCCATTATTGTATTTTAATTTTATTTAGTTCATGT -TTACTTTTTATAACTTATAAGTCATTATTGAATTTATTTTTAATTTTAACTTTTATTTGT -TCCAAGAGAATACAATATATAATTGCTTATTTAAACACTTTTATGATGGTTTCTTCAAAA -TCCTTGTTAGATAATTCCAACCTCTTTTTTTTTTTTTTTTTTGTCTGAAACAGTCTCACT -CTGTTACCCAGGCTGGAGTGCAGCGGTATGATCTTGGCTCACTGCAACCTCCATCTCCCA -GGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGTGCCAA -TGCACCCAGCTAACTTTTTATATTTTTAGTAGAGGCGGGGTTTCACCATGTTGGACAGGC -TGGTCTTGAACTCCTGACCTCAAGTGATCCACCCATCTTGGCCTCCCACAGTGCTGGGAT -TACAGGTGTGAGCCACTGCGCCCAGCCCCAACCCCTTTTTTATGTCTGTGTTGGTGTCTG -TTGTCTTTCTCTCATTCAGGTTATGATTTCCTAGTTCTTTTGTCTTATAAGTGATTTTTA -TTGTGTCCTGAATTTTTTTTTATATTATGAGAATTTTTCTCTCTTATTATTTCGTAGATG -GTTCCCTATTGATGTGTAACCTGAGAGCTGGGTGGGTGTGCGTGTTTATCTTCCTGATGG -GACCTACTAATACCATCCTACCAAAAGTAGAGTACTAACTTATACTTCCTTCTTGCAGAC -TGGTTAGGTGGAAGTTTGTCTTCTCCCTCCACCCACTGGCAACCTCATGGCAAAAGTAGG -GTACTGAGTTACATATCTTTGTTTCCTCCAAGTGAAAAAATAACCTCACTTCCCTGATGT -GGTCCACTGACACCAGGGAGGGGGTGAGTAGGGGCCAACTCATACCACTTGGTTGCTTCC -AAGGAGTAGGAGTGGGGAGAAGCTGTGTAAGAACAGAACTGATCATTAAAGACCCTATTA -TAAATTCTTGCTTTTATAGTACTTATTCTCGTGGCTTAAATATTCCACACCAATTAATCA -CCACGGAATATATGAGACATTATATTAATTATTAAATTTTACATGGCAACACTAACAATA -TTAACTCCCATTTTCTATCTCTTTAATTGAGGTCATTCTTTAGAAGTTTAAGTATAATCC -ATTCTTCTGTCCTACACGACTGAGTGAAATAGCATGTAGCAGCAACATATAGTACCAGTA -TGTAATCCTAATGCAATGGCACACCCCAGATGACCCTTTTGTGTATTGGGATTGACCATG -GTTTTTCAGTTTAATCAAAGTTTAATTGATTAAACTCGGGGTTCACATTTTCTTTCTACA -CACCAATATTGAAGAGAAGTACTACAAAATACATAGGAGCACTAAACTGGTAGTAATAAG -AAAAACATAAAATATTTAAGATAATGTGTTGCTTTTATTTCTAATATTAGCTACTGATAC -AATCAAAGCACTGCGGCTCAGTTTTCTCTTTTGTAAGAGGACAAGTTTGTATTAGTTAAT -CTTTAAATGTCTCTACAATCTGATTCTGAGATTGTGAATAAATAGATCTAAATACTGTTA -CAGGTATATAAATACAGATATAAATGCAGGTATGGGTAAAATTATGGCTATGGGTCCCAG -CCTTAAATTCTTATTTTTTTATAACTAATTGATGTTCAGGGATAGATTTATGTGATCTGC -TCATTTGCAGGGGCCCATCACTTGGCTAATTCTTTGGTGTCATTCGTGTCAAGTTTTAAA -TAATTTTTGAGGAAGATGTCCTATATTTTAATTTTGCACTGAACACTGCAAATTTTGTAG -CTGTTCCTGTTAATTTGTTAATGAAATTTCTCTTTAGAAACTATTAAATTAATAAGACAA -TAATTAAGCAAATAATTATTAAATTTTACATGGTAACACTAGCAATATTAACTCTCATTT -TCTATCACTTTAATTCAGGCCATCCTTTAGAGGATTAAGTATAATCCATTCTTATGTCCT -ACACAACTGAGTGTGTAGTACCAGTACGTAATCCTAATGTAATGGCACACCCCAGATGAC -CTTTTTGTGTGTTGGCATGGTTTTTCAGTTTAAGCAAAAGTTTAATTGATTGAAACTTGA -AGTTCACATTTTTTTTTCTACACACCAATATTGAAGAGAAGTCCTACAAAATACTTATTC -TATGGAAAAAAACAAAGTCCACTTTAGATCTAAAATGGTTGCATAAAATTTTGAGAATAA -CTTAAATGTTATATGAATGTAAGTAAAAATTTTACTGTAAACTTTGTAAATGCTAACTAT -AAACAGAAAACAAGTATATCTTACATAAAGAAATGTGTAGACTCTCTGTATTAATCCATT -ATCACATTGCTATAAAGAATACCCGAGACAGGGTAATTTACAAAGGAAAGAGGCTTAATT -GACTCACAGTTCTGCACGGCTGGGGAGGCCTCAGGGAACTTACAATCATGGTGGAAGGGG -AAGCAGGCACGTCTTACATGGCAGCAGGGGAGAGAGAGCATGTGAAGGAGGCACTGTCAA -ACACTTGTAAAACCATCAGATCTCATGAGAACTCATTCACTATCATGAGAACATCTAGGG -GAAACTGTGCCCATGATCCAATCACCTCCCACCAGGCCCCTCAACACATGGGCATTATGG -GAATTACGATTCAAGATGAGATTTGGGTGACAGCACAGAGCCAAACCATATCACTCTCTG -TGCCTTGTATTTCTCATACTAGTCAATGCCAACTCATTAAAAGTTATCTGGAGATATAAC -ATTTTTGGGGGATTATATATTGCCACTACTGTTATATTTTTGTCAGGATAATGTAGCAAT -TCCCAAATAAATCATACTTAAACCTCCACAGAAAAGATAAATAGAAATGAGAATTCCATT -GCCAGCATTGCTTTGAAAAACGTTTCAGAACTTTGAATATATATACTCATAAACACACAC -ACATACACAGATATATATTTCTGCATGTGTATATACATATATGGATACATATGTTCTTAA -ACTTTCTTTTAACAGAAAAATTAAAATAGAAGGAAAATAATAGAGCACTAGTTATATAAC -TAGTATTATATAAATAATGTTATGTAACTAGTTATATAATTAAAAATATAAGAACCTAGG -TGAGAATGAGAAAGTCTAGATTTAGATTTACTACTTTTCCTAATACACACCCACACACAT -ATATGTATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATACTGTATTTGTCTC -ATAGTGGCTGGAGCACATTACTACCAACTTAGTGGATTAAAACACCACAAATCTACTATC -TTACAGTTCTGCAAGTAAAGACTAAAATAGAGTCACATAGGTGAAAATCAAGAGCTTCTA -CTCTCTTGAGGCTGCTTGGATTCTTGGCTATAACCCCTTTCTCCATCTTGAAATTCTGTC -ACTCCAACCTCTGCTTTCATAACACATCTCCCATCTCCTTCCTAACTCGAAGGCATCTTC -TCAGTCTTACAAGAACTCCACTTGGGTTCACATTTGGAACACCCAAATAACCCATCTTAA -TATTCTCATCTCTAGATCCTTAAATAATTGCTTCTACAATGTCCTTTTTGTCATGGAAGG -TGACATTTTCACAGGTTCCAGAAATTAGGACGTCGACATTTTAGTAAGCCATTATTCTGT -CTACCACAGTGGCCTTGAATAGAAATCCTTCAGTTTTCATATACAGAAACAAAACATTCT -TCCAAATGAGGAAGTTAGATTGGGAAAAAAAAAGTCAGCCACACAGGTCTCCCCATTAAA -ACTTAAAGATGGTTACATATTTTATTCTGAGTTAAAATGGAGAGTCTTCCTATATAACTC -TCTTTCCACTGTATGATTTATCACTTAATATCTCTCCCTTTCCGTTGTATGATTTGTCAC -TTAATATCTCTCTCTTACTTGAAGTCAGAGAACAGCAAACAGTTTGCAAAATATATAGTC -TATAAGTTGTGGTCCAAATAGTTCTTAGAATTCAGCTACATCTAAGTACACAATTACTTA -TTTAAGCTTGATTCAGTGAAACAGATTTTCAGTTGAGTTTCTTAATGGGAATGACCAGTC -ACATTTTTCAAATTTGGCTTTATGCATAAATTGACAAATTATTTATAATTTGAAACATCT -GGGCTCATTTTTTAAAAACAACTCATAAAATAAGAAATCATTTTATTGGTTCAAGCACGC -ATGCTATTAAATGACTCATCTTAATTTTTATCTTTTCATAAATGTTGGAAAATCTCAAGA -AACCGTTATGTTGTCCTAAAAATGGGCAAGCAAGGTTAATTTGTTTTTGATGTTTAGATA -GTTAGGTTATTTCTCATTTTTCTTTTAGGGATTACGTTGCTATCCTAAAGCTTGACATTG -GTTAGAGTCAGCCAATGTCAAATTTTTACCTATAGATAGTCTTAGAAAGATATTGCAAAT -TAGTGCTGCTTCTAGTAGCTAAAATAACCTGCAGCTAGCTGGCAGGCAGCAGTGAATATT -TTAAAAGGTGTACCAAATTACATTTAGATTTTCAGTTCCTTTTGAAGTGAAAAGAATATT -CAAAGCTTGGAAGTCTTTGACTCTCATAAGCCTAGTAGATTTCAAGTATAAACATAGATT -TAAAGATAGTTTACAGACAAATTGTGGATTTTATGTTATAATCTCAGATTAATGATTTTT -TCCTTTTTGTGATTTTTTTCCCAAAGTAATTGTAAAGGACTGTGTGTGTGTGTGAGAGAG -AGAGAGAAACAGAGAGAAAGAAACAGAGATAGAGACATTGTCCAAATGTACACGTAAGAT -TTTCTGTGACAATTACACATGGATTATCTAGTAACAAGTTGAACTACATGAAGCAATTGT -ATATCATCAGATATATACTATATGAAACAACTGTGTATAATCAGATTTATACTACATGAA -GCAACTATAATCAGATTTACACCACTTTCTGAAGAAAACTACAATTAATTGGATACTAAA -GTGGTTTCTGAAAGCCTAATTATTCTCAAGTCCATATGTGTGTTTGGTGGGATGGCAGTG -AGTTTTTGGAACAAGCTTAACTGTTTTTTGAAATAGTTGTGAACTTCACAAGTTAGATGA -AATACATGTCACAAGAAATTTTCCATGTGGAGACAATTTAGTAGTATTTCCACATTTATG -TTCAATAAGTGTAAATTATAAAATGATATTTTTGCTCAAAAATAAATGAAAATAACACAT -TTGAAACAAAAGTAGACTCACCCACCCTCCAGAATGATATCCAAAGACCAAAGGAAGTTT -GGCCTTTTTTAGGCATTTGAGACTTGCTTTCAGTAAATCATATCATCATATTTATTTCTA -TAACTTTTCTTTTTTTTTGAGATGGAGTCTCGCTTTGTCACCCAGGCTGGAGGGCAATGG -CGTGATCTTGGCTCAGTGCAGCTTCCGCCTCCTGGGCTCAAGTGATTTTCCCACCTCAGC -CTCCCCAGTAGCTGGGATTACAGACGTGAGCCACTATGCCTGTTTATTTATTTATTTATT -TATTTTTTAAGTAGAGATGGAGTTTCACCATGTTGGCCAGGCTGGTCTAAAACTCCTGAC -TTCAAGTGATCCACCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTG -GGCCCGACCTCTGTAACTTTTTATATTACTGTTTATCATGAGTACGCACAAAAAACCACT -TTTTGAATATCTTTTTCTCAGATATTTAATTATTAAGAACAAGGCAAAAATGTGTATACA -TTAGGTAAAAACTGCGAAAGCATGTCCAAAGAAGAAATAAAGTTCTGTAAATACTCAGAA -TCAACAGTTGAATAGGTATTCAGGTTTTCAGAGTTAAAAGCACAAATGCATATTTTAGTC -TCTTTTCATTTTATTTTAAATATGACTATCCATAGAATGACTATTGCACAGTTTCATATA -ACTCCCTTATTTGTGTATTTAGTAACTGAGATTCATTATTTTTTAATTAGTACCTAAATA -ATTACTAGTAATCTGAAACTGATTTTCTTTTATGAGGGCAAAATGCTTTTTCTTTTAATA -GGAAAAAGCAAAAAGGAAGTTAGTTTAAGTATTTTTTAAATAAAAGAAATTGACAAAATA -AAAAGCATGTTAAATTTAATTGTAACCCTGATGTATGTAAAACCTTTATTAAAAAGATTT -TTCCAAAGTTCTTATTAAGCTTTAATTAGCTATTTGTATATATTACCTTACTGGCATTTT -AAAAGTATAATTATTACATATTGTGTGGAAAGATTTAAACATTCAGAAAATATAAAACTC -AGTGCAACTATTTTTTCCTACTCTTCATGTATTATAATTCAGTCTTTTTGATAATTTTAT -ATATATAGAAGGTATGATATTTATATTACATTTAAAATTTAGTACAACTTTTATTAAAGT -GTACATATTATTTATACAATGCTTTTTTCTTTTCTATATCTATACAGCTTACCCATTTTT -GAGAAGGCTGCATTTTTTTCTATAACATGTATATGTTGTACTTTACTTGTCTATATCCCC -ATTAAGAGATATGTTGCTTCCAGGTTTCATTATTATAAATACTGTTTCAGTGAACAGCTT -TCAATATACATCTTTGTGCCCATGTGACAATAAAAGTATTTTTGTATGCATCTGCAAGTA -TAATAGTTATAGTGTCCATGCATTTTCAATTGGGTAGTTGTTTCAAAATAGCCCTACATT -TTTCAAATAATTCTAATTCAAAGGACTGCTCCCCTTTCAAACCTGGTAAATATTGAAGTC -CTTAATGCTTTTGCCAGAATTAGGCTGAATTGACAGAGCACTGATATGTAGAAGATATAT -ATATATATATATATATACACATGTATATTATATATATATTATAAATATAATATATATATA -TAAAATTCTTAAAATTTAGCAAGTAACTGAACAACAAAAATGACAGATAAAAGAAGCAGG -TCATGCATAGCCTATCTGCCTGAGATTTATTTAACCCTTCTTCAGAAGAGGCTTTGATTC -TTGTCTCTCTGTGTCTACCTATAGACAGCATTTCTCCCAGGAAGCTTGCCCTGACCTCCT -AGAATTGGTTATGTACCATTTATATATTTTTCATATTCCAATGTACATAACTCATAACAG -AAATTATCACAAGGGAACAGCTGTGGTTTTATTGTTATGTCTAGCTATCCAGCTATCATC -TTCACCAGACTGTAAGCTCCTTGAAGGCAGAAGCCAGATTTATCTTCATGACCTCTGTGT -TTTAGATGAGGTCTCAGAGATAGAGGTCAGAATTTTCCCTCTAGGTAAATTAATTAGATT -AACCTCAACTTATACAAAACAGTAGTCATTAAATTTATCCCCAGCCATCAGGAGCTTAAC -TACTCTGGAGAGAGAGCCAGGTGTTGGAGTAGGCAATTAAGACATCAAGACAAAAGGACA -TTAACAAACCTTTGAGGTTAAACTGGAAAAAGCCCTGACGGTCCAGTCCCCATCCTTTTT -TTCCTTCATGAAACAGCTCTATCAAGGATCACATGGGTCAGCACAGATGTGGAGTTGTCT -CACTATCAAAGGAACCCTGAACAACAGGCTCCTGCCTGCAGTTTTATGGAAGGTCAGGAA -AAGGCTGGGAGCGGAAAAGCACTGAGTATTGAATCAGAAGGAAGACAATTGTCTTCAAGA -CTCCTCCTCCTCTCCCCATGAAAAGGAGGTCTTGGGCAAACATGCCTGGGGAAGGTCTGC -CAAGGCCCCACAGTGGAGAGGCCTCCAGGGGAGGCACCAGTCAAGTGATGCTGATCTGTG -TGTGAGCATGGCCCTGCAGCCCTTACTGAAACTGCCATTAGAGGACTATGCACTAGTGTG -GGGAGGGCAGCTCTCCCTGTGGGACCCACTTGGTCAAGTCTTTGTCATTGTTTATGGATG -GGCCCAAAAATCACATATAGGATTGAGTCTGGGGCTGAACTCTTTACTGCTCTGTCTGTA -TTCCCTGTCTTGGTTGACACCTAATACATGACTAAGAAACTAAGAAATCATTTTAGACGT -CGTTTTTGTGTCTGTGTTTTATGCTTTGTTGGCTTGAAAACTTCATCCAATAAAACTTTA -GTTATTTTTATTACCATTTCTTCCTTTATGACCCCACAGCATCCTCCATGTGCCAGGGTC -CAAATCATCTTTAACCCGGACTATTCTATCAGTACCCAAATATATTCTTTCATCAAAATA -TATTCTTTCTCTTCTCTGGCTGTAATCTCATTCATTTCCAGGCTACTTCTTTTCAAACTA -AAAAGCAAATATAATCACACTATTCTCTTTCTTCAAACACTTTCCCAATTCCTAGAGTAA -AATCCCCCTTTTATAACATATAAGGCCTCAGTGACCTACCCTCAGGACTTCTTTAATTAT -TCTCCCATATTGTCCTATTATGTACCTGTTCCATCATCCTTACAACCCTCAGAACTCCCT -AATCTGTTTTAATGCCACATTGTTATGCTCTTAATTTCCTTTGATAAAATGATTCTTTCC -ATTCATAAACTAGTTGATTACCACAATTTCTTAAGTATTCTATTGAAGTTTATCTTATTA -GTCCATTTCACACTGCTATAAAGATACTACCTGAGAATGGATAATTTATAAACAAAAGAG -GCTTACTTGACTCACAATTCCACATGGCTGAACTTATAAGTGAACTTATAATTAAACTTA -TAATCATGGTGGAAGGTGAAGGGAAAGCAAGGCAAGTCTTACACTGTGGCAGGTAAGAGA -GAGAGCATGCAGGGGAAACTGCCACTTATAAAACCATCAGATCTCCTGAGAACCCCCTCA -CTATCACAAGAACAGCATAGAAAACCACCCTTTTGAGCTAATCACCTCCCACCAGGTCCC -TCCCCATGACACATGGGGATTACAATTCGAGTTGAGATTTGGGTGGGGACACAGAGCCAA -ATCATAACATGTATCTTCTTTGCCAAGATTTTCCTTACAACGCAAAGTAGATTGACATAT -TTTAATTTCTTCCATCCCACCCCACCATAATATTCTTACCTCTATGACAGTGCTCATCAA -AATTTGTAGCTATTATTTGTTTAAATGACTTCATTATGCTTCTTAAGAGGCATAAACTTT -CTGCTATATTCATCTTTGTATGCCTGTCACACATTACATTGGCTGAGACAAGGTAAATAT -TTAATACATATCTATTAAATAAGAAACTTAAAAAAATAAAAGAGTGAATGAATAACTGTA -TCTAGGAAGTATGAAAGTGACTTATCTTTTAGCATTTTTCATCAAGGAATTAAGAAATGT -GTGGGGAAAAAGTTAAGAGGCCCCAGTTAAAATGGCTTTTATCCCAAATTCAGGCAATAA -CAAAAGTTGACGAGGATATACAGAAAAAGGAACACTTGCACACTGTCAGTGGGAATGTAC -ATTAGTATGACCCTTAAGGAGAACAGTCTGGAGGTTTCTCAAAAAGCAAAAATTGAGCTA -TCATATGATCCAGCAATCCCACTCCTGGGTGTATAACCAAAAGAAAGGACATAGTATATC -AGAGAGATATCTGCACTCCATGTTTATTTCAGCACTACTCACAATAGCCAAAATTTGGAA -TCAACCTAAGTATCCATCAAGAGATGAATGAATAAAGAAAATATAGTACATATACAAAAT -GGAATACCATTCAGCCATAAAAAACAATGAGTTCCTGTCATTTGCTACAATGTGGATAGA -ACTGGAGGTCATTATGTTAAGTGAAACAAGCCAGGCGCAGAAAGACAAACTTCGGATGCT -CTCACTTATTTGTGGGAGCTAAGGCTTAAAACAATTGAACTCATGGAGATAAACAGTATA -AAGGTTAACAGGCTGAGAAGGGTAGTGAGGGTTTGGGAGGAAAGCGGGGCAGACTAATAG -GTACAAAAACATAGTTAGAAAGAATGAATAAGACCTAGCATTTGCTAGCACAACAGGGTG -ACTATAGTCAAAAATAATTTAATCGTACATTTAAAAATAACTAGAAGAGTATAATCAGAT -TGTTTGTAACACAAAGGATAACTGTGTGAAGTGATGGATATTCAATTTACCCTGATGTGA -TTATTATGCATGTATCAAAACATCTCATGTACTCCATAAATTTTCAAAAGAAGTATGTAT -AGCATAACGCTTAAAATAATATACTGTAATAGTCTACAACTTGGCAAGAAATTAAGCTTT -CGTTTATTTTTGTCACAACAGGTATACTACATGCAGATTAAAATATATATTTTATATATA -TATATATACCATCTATTTCTTAAGGGCATTTTTCATAACCTTGAAATATAAACAATAAAA -ATTATGAAGCAATAATTTATTTTTTTAAAAAAATCCACTTGCCAAACAAACAAGATACTC -CTTCCAGGATGTCAGTAATATCAAGATAAAATGCCAGAGAATTTTAGCTCAGTATAGGAA -ATCAAACAGCTAATTTTTAACTTTGATGGAGGAGAAAATTAGATAAATTCTGAAACTTCA -TCCATTTATTTTTCTCTCTCCATATATGTTAACAACAAGCAAACACATTAAATCCTTAAA -TTTGGTTACTAAGTTCACAAATGTCATGGACACAGTGAGGTAATATTTACAGACTGAATA -CTGATTTACTATCCTTTGAGTTTATTTCTACATTTCATAGAGTAGATGCTCTTTATAGAT -AAACTCTTTTGGAAAAAGTCCAGATAACTGGATAATTAACTATTAATATCACTTTTTGAA -AATTAAATATTAATATGTTTCTTCCAGGAAAAATTATTTAACAGGTTTTATTCAGACTAC -TAAGACATATGAAAACCTGACATTAACTCTTTGGTTTTCTAAAGATTCTCTTCAGGAAAA -TATGCCATCAGTCTTTCTAAATGTATTCAATTTTATCTCAGCAGTCTTGAACCTCCATCT -ACATGATGTTAATATACCTCCAAACAGCCATTCACATTTCTTAAGGTGATGAAATATGTA -TTACCTATAATATTCTTAACTACGTTATTGTAAGGCATATTTATCATGCCACATAATCCT -GGGCGAAAATTGGATATAATAACAATTCCCAAAAATGCCTCTTGACAAAGAGTTGGAATT -CAACTTTTTTCTTCTACTGCTAAGGATGTTTGCATCATGTCTTTGGAAAAATGTATCTCA -TAATTGCAAAGCCCTAATTCCTGATGCTCCATTAACAATATTACTCAAGATATTGTTGCT -GATGTGGCTGCTTATATTATTCCCAAGTGAATTTGTAGATATTTAGCAATAGCTGAGGCC -ATCTACAACGTAAATTCCTATCTATGTTACTCCTCTTGAGAATCTACTCTTGGTACATTT -AGCAAAGGAAGACACTTTTAGCAACACAGCAGTCCCCCCTTACCAGCGGTTTCACTTTCC -CCAGTTTTAGCTATCAGGGGTCAATCATGTTTTGAAAAATATTAAATAAAAGTTATAGAA -ATAATAACTTTGAAATTGTGTACCATAACGTGATAAATTCTCAGGTTGTCCATCCTACTC -CCTGTTTTCTTTATCACAAAAAGAAAAGTAGATAATAATAAGGTATTTCAAGAGAAAGAG -AGACCACATTCACATAACTTTTACTACAGTATATTGTTATAATAATTCTATTATTAGTTA -TGTTCATATAATTATTAGTAAAAGGAATATAATTATATTCCTATATAATTATAGAAGTAT -ACTACTCCTATACATAATATATTAGTATAACTAATATAATTCTATTATCATTAGTAGTGT -TAGTTTAACTAATAAACTAATAGAATTATTAAACTAATATAATTAGCCTATTAGTTTAAT -ACTACTCCCATAATGAGTATATTAGTTTAAACAAATACTACTACTATAATAATTAGTAGT -AGTTTAACTAATAAACTACTATAATTATTAGTTAAATTTCACTATGCCTAATTTTTAAAT -TACAGTCTATTATAAGTATGTATGTGTAGGAAAAATAGTACATGTTTACTATCCACTGTT -TCAGGCATCAACTGGGGGTCTTAATACATATCTTTTTACAGATAAGGGTGGAGCTACAGT -TACCTTCCAAAAGGTAGTATTGCCACTGCCATTGTTGCCTGCTAATGTAAATTCCAAAGA -AGTAGATGATTTTGGGAAGGGCAGAAAATAAAAATTTTCATCAATATTAATGATTTTGCC -ATTAATGCAGATTACCATTGCCATAGTCTGAGAAGTGACATGAAACTTGAGATTGCTAGA -GCTCATAGGACAGTAGAACAGTGAAGACATTGAGGTTTAAAGTGTTAATGTTATGAAGAT -AGCTAAAAACAGGACTTGATCAGTATAATCCTCTACCTAACACAACTTGGAATGGGGCAG -AGAGTAAAGAAATGGACCTCGGTAAGGATGTGCATGAGCACTAAGAGATAGTGCTTGATG -CAAGAGCAATAATAACAACAATAACCACAACAAAGGTATATTATTGGGAGCATTAGAGAA -GAACAACAGAAAACTAAAAATGTTGTGGATGTTACCTCCAAGCAAGAAAATGGGACTAGG -GACACAGGCCCTTTTTGTTTTTTGCTTGCACATTCTGTAATTAGAAAATGTTATTTTATA -GACACACATATCAAATAATAGGAGAAAAAAGATATTTCATGTAAGATTAGTTTATTCTCA -ATATTCTGAATTAGAATATTGGATTAAAACAATATTTAGAATGATAGAAGCTGGATTAAA -ACAACTTTAAAACAAGCATTTGTACTGCATTGACATCTCTATACAAGTTGCTGTCATACT -AGTTGAGAAGAGCTTGGGGACCTGGACTTGGGAGATTTAATGGACCGATAAAATTACTCA -GAGGGGTAACCTTATAAAGAATTCTGAAACCAAGAATTTGGAAGTCTAAGAAAAGAAGGG -ACAACAATGCAAGTTGTCCAACCAAAACCTCTCTTCCCTACTTTCTTGTTTCTGAATGCC -ATTTTTGTTTGAAAGAGCAATGTGTCCTGCCAAATAACTACAATGCCCAGTCTCCTTGCC -CTTGTGGCTGGTGACATGTTATAGTCCTGTAGAAACATACGTGGGCTGAAGTTCTTGAAA -AGGACACTTTGTTCTTCAACTTCTCCCTTCCTGCCCTGATATAAAAACATGATAACACAC -TACTTATTACTCTAAATATGCATAACTTTTATATGCACTAGGAAGTGTATATATCAATAC -TTAGAGATACAATTTCAGAAATGGTCAAATTAAAGAAAACAAGTTAAAAGTTTACAAGTT -CTTATAATAATTATAGAGGCAAGATAAATTACAGATTCAATTTTTTAAACTAGTAATTTA -ATTAATCACATCATGGATAATGTTTAGAGCTAAAATTTATTCTGTATTTACATAATCAAT -ATTGTAATTAAAAACCACTGAGTATTTTTTGTTGCAACTGAATGTTGTGTCCCCTCAAAA -TTCATGTGATGAAAACTTAACCTTCAGTATGATGGTAGCAGGACCTATAAGCTGCATATT -TATAATGAACTTTGGGAAGTAATTAGCTTATTATGGTGGAGGCTTCATAAATTAGATTGG -TGCCCTTAAAAATGACTCTTGAGAGCTCTTTCTGTCCACCATGTGAAGCTGCATTGAGAA -GGCAGCAGTCTGAAACCCAAGAGAGCTCTCTCACCAGAACCCAATTATGCTGGCACTCTG -CTGTTGGACTTCCAGCCTCCAGAACTGTGAGATGTGCATTCCGTTATTTAAAAGCCACTC -AGGTTATGGAACTTTATTAAAGCAGCCTGAACGGCTGAAGATGGAAATTGATCATGAGAA -GTGGGAGTGCTGTTATTATAAATACCTAAAACAAAGTGAAAATGGTTTTGGGCTCAGTGA -TTGGCAGAGATTGATGAGGTTTTATGCAAAATGCTAGATTACTGTGGAAGAAATTTAAAA -GCCAATTCTCGTGAGGGCTCGGAAAGAAATATAGAAGAAAACGCTGTCTTCTCAGAAAAT -AATTAAATAATCATGAACAGAATATTGATAAAATATGGACAGTAAAGGTCATTCTGTTGG -AGTCTCAAATGGAAATGAAAATGTTATTGGAAAATGGAGCAAAAGCAATCCATGTTGAAA -AGTGGAAACAAACTTCTTTGAATTGTATTCATGCTCTTGTGTTTTGCGGAAGGTGGAACT -TGTGTGCAGTGAAATTGGACATTTAACCCAGCAGATTTCTCAGCAACATGTAGAAGCAGC -AGCTTGGTTCCTTCTGAATCCGTAGAGTCAAATGTAGAAAAAGAAAAAGGTTTGAAGATG -GAATTGTTAAGGAAAAAGTAACCATAATTTAAGATCTGGGAAATTCTCAGCCTGTCCATA -TTGCAAAAAAAGTGAGAAAGTGTGTTCTGAAGAGAACATGAAGAGTGTTTCGGACCCTTA -CTGATTTGATTAATATGGGTGTGAACCACAGGCTTAATCAAACATCTCAACACAAACCAT -GAATAGAAATGGGATTATACCAGGAGAAACACTGCCAGTTGGGACTAAAGGAAACAGAGA -TAATGGGACGAAATAAAGGAAGACATTCGGAATGCTTAAGCCCTACAGGCCCGGACCCGA -GAGCTATTCAGTTGTGGATGTGTGCTATTCTCTCCTTCAAAATTACGGAAGAAGGGGCGC -AAAGGGGATTTGGAGACAATTACAGCTGCTGCTTTTACCAAAAATCCAGAGGGTATGGCA -AGGTGGGCCAAGGTTGCCTCCATTTTGATTTCAAAGGACAGAAATGATGCTCAGAGGAGC -TGTGTGGGAGGGCCATCCAGTGAAGCCCTGGGTGAGTGACCTCAGCCCTGACAAAAGACT -GTGCCATAAGTGGGTCCAGTGCATAGAGTCAGCAGCGAGCAGTGCCTCACTGAGCTGTCG -GGGACTGTCTGGAAGGTGAGTCATCAAGCCAAAGAGGATGCTTCTTGAACCTTAGGGTTT -GATGGAGTTTGCCCTGTTAGGTTTTAGATTTACTTGGGATCCAGCATTCATATATTTTAT -TTTTTTCGAATAGTGGTTCTTTTTGGAATGGGAATGTTTATCCTATGCCTGTCTCACCAT -TGTATTTTGAGAGTTCATGTTGTTTGATTCCACAGGTTCACAGATGAAGAGAAATTTTGT -GAGAATGAACTGTACCGTGAAGCTCACCTGCATCTGATTTAGGTAATATTTAAATAAGAC -CTTGGACTTTAGACTGGACTTGAGGCTGGAATGAGTTAAGATTTGTGGATTTGTTGGAAT -GGAATGACTGCATTTTGCATGTGAAGACATGAATTTTGGGGAACCTGGGGCAGAATGTTA -CGGACTGAATTTTTAAAGTGTACCCTCAAAATCTGTATATTGAAATCGTAACTGTCAATG -TAATGGTATTAGTAGTGGGACCTTCAGGAGGTAATTAGGTTGTCATAGTAGAGGCCTCAT -GAATAGGCTTAGTGTTCTTATAAAAGGGACCTAAGAGAGCTCTCACTTCTTTCCCCATGT -GCTTATACAAAAACCCAACAGTCTGCAACCGCAAAAGGGCCCTCCCCAAAACCAGAACAT -CCTGGCACTCTGACTTTGGACTTCCAACCCCTAGAACAGTAAGAAATACTTTTTTTGTTA -TTTGTAAGCCACTCAATCTATGGTATTTTGTATAGCAGCCCAAACTAAGACACTCCTCTA -CACTACAGTGTACACCACATTCTCCTGCCTCTTGGTATGCTTCAGTCACATTGAACTATA -TTTTGTTTACTAAACATGGCAAATTTATTACTGCACTATGGTTTTGCCATATAATTTTTC -CTGTCTTTTCAAACAGAAATTATTTCACAGCATACGCAGCTATAGGCAATTATCTAGCTT -ATGTATAAAATTACTTTCCTGATATTTGTCTCATTTTTTGTTTTTAAATGTTTTAAATAA -ATAATAATTGTTTTGAGGGGTACAGTATTATGTCTTCGTATATATTTATATTGTGGAATG -ATTAAATCAAGCTGCTTAACACATCTCTTATGTCACATATTTATCTTTTTGTTGTGAGAA -CACTGAAAATAGACTTTTTAGCAATCGAAAAAAGGCAAACCCTTAAAAGCAGAGAGTAGA -ATGCTAGTTAATGCTGGAGGCAAGGGATAGGGAATGGGGAGATGTTGTTCACAGGGTACA -GTTTCAGTTAGACACAAAGATTGAGTTTTAGAGATTGACTTTATAGCAGAGTGACTATAT -TTAAAAATGACTTCTTGTACATTTCAAATTTTCTATTTAAAATACACATAGTATCTGTAT -TCCCAGTTCTTAAAACATGACCTATTAAATAACAGATTTGCAAGACTGAATGATTTTTTT -TTACCTTGTCCTGATCTCTAGATTTTTTTTAAACTTTATCTCATTAATTGCTAATTCTCT -CTTTATGAATGTTAAATACTTAAAATAGCACCAAACATAACTAAACAAAACAAAAACAAA -ATTTGTTTTTTATTTCAAATGGCATGTTCCATTGCGTTCTAAGACTGGACTGTCTTGGAG -ATAAAGGAGACTTCATTATTTTATGTAATATTCCTTTGAGAAATCCTGAGCTTTGTGCAA -AAATGACATACTTCATTTTTTTTTTGCATAATTTTGTCACTCCATAGAGCCCAGATCGAG -TTCCTAAGTTCTCACTAAGGCACATGGGTCTAATTCCATTACATTTTTGGCCTGTGTCAA -TTAATTTTCTTTTCTCAGATAATAAATTTACACTCCTATTCCACTTCTGAGCAGTTAGAC -ATTTTCAGCATGGAACATTCTATTAAAATGAAATGGCAAATTGCCTTGTAGCACACAAAT -TTTCAAAGAATGTTTTGTCTGATAATGTAGTTCTTATGAGTTTCTTTCCAAAGTAATCCA -TTTACACAAAATACATTACTAATTTTGTGCATTTACAAACTACACATAATTTCAAAGTGA -TGTCATTGTTCCACAAATACATCTAGGTTTCAAAGTGATTGATTAAATTCTGTTTAAAAT -TTTGCACAGAAGTGCAGTATATAATCACATTTTATATCCCTCTGTGCAGGATATAGTTTT -CTAAATCATACTCCGTTGTCAGGAGCATGTTCATCTTCCATACAGAACACACTGACTGAA -AATAAGATCAAGTCTGCACATTTGGGATATAAATTTTTGAGATTACTGCTCTTGGTATAA -TATATCTCTCTTGTACTTGAGCCTCTAGATCTTGCTGTTTTTAACACTTTTGTGCTTGCA -AGATTTACGTTTCTGTATACTTTGTTTATATTCAGCAGCTAAGTATATTTTGTCCTGTTA -TCATGACTCAACCTGTGATTTGTTTCAAAAGTACAAGTAAGGTTTTAAACATAATTATAG -TGTAATCTCTTTAAATTTTGCTGCATTATCACTCAGAATGAGAAAAATATAAATTAAAAG -TGTGTGCACATGTATACACATTGTTCAATATACAAACTTAGGTATCATGTTAGATTCATC -ACGATTTGAAAATGAAATCTGATTATAATACTTGCAAGATGACTTACAAGTTAATATTTA -TATGTACACTTACAATTTTTATAATTTTTAAAACCATCCAATATGTTAGTTTTCTTTGAT -TTCTAATGATAACAGTAGAAATATCTGGCCCTGGTTATCACATCTGGTTGTTATGATCAA -TAGAAACGATATATGCAAAAGCAAATACATTTTAAAACTTTTTAATATTAATTTTTATTG -TTACATATTCTGAATAATGCAGTTTTATAGTTATTATTATAACATGACACAAATGGTAGA -GATTTTGATGCCTACATTTTTAATAAAAATGTTCAAAACCATATTTCACAAGATGTATCA -TGCTGTAAGGTTGCAACAGCCCTCTCAAATAGAGTTCTGCCTTTCTCTTGCCATTTAAAC -TAATGCTATCTGAGAGTGCAACAGAAGGCCCTCATTACATGCTGGTATCTTGATCATTGA -CTTCCTTGCCTACAGAACTATCAGAAAATAATTTTCTATTTTTTATAAATTACCCAGTCT -CAGGTATTTTGTTACAGCAGCACAAAACAGACTAAGACATAAAGTGTAAAATTATCCTTC -CATATTGCTGCAAGTGAAATGATTTTATTTTTTATAGCTGTGTAGTATTCATTGTGCGTG -TATGTGTGTGTATATATGTATATATGTATATCACATCATTTTCTTTATCCAGTTATTTGT -TGATGGACACATGTTGATTCAATATCTTTGGTATTATGAATATTGCTGCTATAAATAAAT -GAGTGCAAATATCTTTTTTGGTATAATGATATTTTTTCTTTTGGGTATACACCCAGTATA -GGATTGATAGAGTGAATTGTGGCTCTTTAATACTTGAAAAAATTTTCATACTGTTTTAAT -AGAGGTGGGACAAATTTACATTCCTTCTAATTATACTATAAGGCTATAGTAACAAAAACA -ATATGATACTGATATAAAAATGGACACAATACTCAATAGAGCAAAACAGAAATTCCAGGA -ATAAAGTGACAAAGGCACTTTGTCACTTTATTAGTGGATATTTATAAAGGGACCTACCTA -CAGTCAATGGATGTTTGACAACATTGAAAAAAACATACACTGGGAAAAGGATATCCCCTT -CAATAAATAGTGCAGGGAAAATTGGAAAGCCACATGCAGAGGAATAAAACTGGGCCCCTA -TCTGTTGCCATCCACAAAATTAACTCAGGATGAATTAAAGAATTAAATATAATACCTGAA -GATATAAAAGTACTCATAGAATACCTGGAAAAACTCTTCTAGACATTGGCCTTGGCAAAA -AATTTGTGACTAAGACCTCAAAAGCAAATTTAGCAAAAACAAAAGTAGACAAATGGGACT -TAATTAAACAAGAAAGTTTCTGCACAGCAAAAGAAATAACTGAGAAAACAGATAACCTGA -AGAATGTGAGAAAATATTTGCAAACTATGCATCCAACTAAGTACTAATATCCAGAATCGA -CAAGAAATTCAAACAACTCAACAACAACATAGATAACCCCATTAAAAAATGGACAAAGTA -CATAAACAGGCATTTCTCAAAAGAAGACATACTAGTGGACAGCAAACATATGAAATAATG -CTCAGTCTCATCATCAGAGAAATAAAAATTAAAAACACAGTGAAATGTCACCTTATACTA -GTCAGAATGGCTATTTTTTAAAAGTCACAACACATCAGGTATTGATGAGAATGCAGAGAG -AAGTGAGTGAATCCTTATATAGTATAGGTGACATTTTTATTTATAGAATATCAAAATAGT -TACTTAAAATTCATTTGAATTATAAAATATTAAAATGTAGATTTATGAATACTTTGTACT -TTCTAAAAGTTTAACCACAATAAAAATCCAAACTACCACTGTTGTGTCCATAATAATTCA -TAATTGTATGTGATGATGTTGAGAAATCTTCCTAAATATTAGGATGAGTCCCTCATTTAT -TTTAATGAAAATATCATTCTTAAAAGCATTTCAAGGAATATAGCTCAATAATTCAACGAA -TAACATTTGCAAATTGATAATCCATGGTTCAAAGATGTCAAGATGAACTCAAAGTCTACA -GGGATACCCTTTTGATTCAAGGAAATAATGTTACCCTAAATGAGAGAAGATAGGGAAGAC -CATGTCAAATGAATCACTTTTTGATGTGGTTTGGCTGTGTCCCCACCCAGATCTCATTTT -GAATTTTAGTTCTCATAATCCCCATGTGTCATGGGAGGCACCTGGTGGGAGGTAATTGAA -TCATGAGGGCGGTTACCCTCCATGCTGTTCTTGTGATAGTGAGTGAGTCTCACAAGATCT -GATGGTTTTATAAGGGGATTCCACATTTGCTCGGCTCTCATTCTTCTCCTTCCTGCTGCC -ATGTGAAGAAGGACTTGATTGCTTCCCCTTCCACCATGATTGTAAGTTTCCTGAATGTTC -CCCAGCCGTATGGAACTGTTAGTCCATTAAAGTTCTTTTCCTTATAAATTACCCAGTCTC -GGGTATTTCTTCATAGCAGCATGAGAACAGACTAATACACACTTCAATATTGATTTACAT -TTCTATGATCATCAGTGATCTTGAGTATTTTTTAATGTTTGTTGGCAACCTGCATGTCTT -CTTTTGATAAATGTCTGTTTATGTCATTTGCCTACTTTGTAATGACATAATGTGTTTATT -TATTTATTTATTGGGTTCCATGTAGATTCTGGATATTAGTACTTCGTTAGATACATAATT -TGTGAATATTTTCTCCTGTTCTGTAGGTTGCCTGTTTACTCTGTTGATTATTTCCTTTGC -TGTGCAGAAGATTTTTAGTTTACTTAGGTCCCATTTGCCTATTATTATTTTTGTTTCATT -TGCTTCTGATGACTTAGTCATAAATTCTTTGTCAAGGCTGATATTCAGTAAAGTTTTCCT -AGGTTTTCTTCTAGGAATTTTATAGGTTTTTACATTTGAGTATTTAATCAATCTTGAGTT -AATTTTTATATATGGTGAGATATAGGAATCCAGTTTTATTCTTGTGTATACGGATATCCA -ATTTTTCTAGTATAATTTATTGAAAAAGGTATCCTTTCCACATTGTTTATTTGTGCACGC -TTTGTTGAAGATTAGTTGGTTGTAGGTATGTGGCTTTATTTCTTGGTTCTCTATTTAATT -TTATTAAACTATGTATCTGTTTTTGTATTGGTACCATGCTGTTCTTGTTACTATAGGTTT -GTAGTATAGTTTGAAATGGCGTGAAGTGCTGACTCCAGCTTTGTTCTTTTTGCTTAGAAT -TGCTTTGGCTATCTGGGTCATTTTTTCAATTCAGGATTTCATATACACTTTGGGATTGTT -GTTTTCTAATTCTGTGAAAAATGACATTGGTAGTTTGATAGAAATTGCACTGAATCTTTA -GATTGCTTTGGACACCATGGTCATTTTTAATTTTTTTTAATCCATGAACATGGGATATTT -TTCCATTAGTTTGTTTTATTTCAGGTTTCTTCCACCCATCTTTTGTAGTTCTTATTGTAG -AAATATTTTACCTCCTTGGTTAAACGTATTTCTCAGTTATATGTGTGTGTGTGTGTGTGT -GTGTGTGTGTCTATTGTAAATGAGATTGAGTTCTTGATTTTGTTCTCAGCTTGAATATTA -TTGCTATATAGAAATACTACTGACTATTGGACATTGACTTTGTATTCTGAAACTTTATTG -CAGTCATTTGCCAAGTCTAGGAGTCTTTCAGAGTCTTTAGGGTTTTCTTTGTATAAGACC -ATGCCATCTCTTAATGTAACTTCTATCCCAACTTTTGTGGTAATTTTTTTCTTGCATTTT -TCATAATTTTATTTTCTAATTTGTTTATCCAGAAATACCATTTAGTTCTGTTTGGTATTA -GACTTGAAAACAGTGCAAATTTTCTTTCTTAATAAATAATGAAAATAAATTTCGTTTTTG -AAATTCATCCATGTTTTTGCTTAGTTCATTCATTTTTACTGATGTATTGAGTAGTCTTAA -GTTACCAATTTCATTTTTAATAGTTATGCTTTTTCCAAAAATTTTATAATAATTAGTAGT -TTTATCATGACTATTCCTATATACATATTCATATAAATGCATAACCAAGAGTGCAATTGC -TCATTTGGAGGGCATGCCATCTTTAGCACTATGACTATTAAATAAATTGTTTGCATTGAT -TAAGAGAGTGATATTCCACATCATCACCAGGACTGTGGATTGTCAGACATTTTAAGTAAC -TGTGGCTCCCCATACAGGATAACTTGGTTAAACTTGCTATTCACTTGGCATCCCTAGCAT -TATATTATTGTCATCATAAGAATGGAACCCAAAGTTCTGGAGTTTTTAATCAATTATTAA -ACAACAATTAATCGAATAGCGTGTAAGTTCCTAACCTTGTGCTAGCCGTTGTCCTACGTT -TGTGATAAGAATAGATAAAGTGCCCTTCTACCTAGCAGTTTAGACAATAAACAAGGAGAA -AAGAAATTAAATGTGTATTGTATGATAGCAGGTTAGTGATAATCACTAATAATCCAATGA -GTTTATATGATACCAAAATATTGATTAGATCAGGAAATCACTGCTAAAGAGGAAACATTA -TTTATCAGTATGTTTATAACAGATGGAGCAATGAGTATAAAAGTCATGTGTTAGGGTATG -CAAGATGATGATATAGAGCAGAGTAATCAAGGCAAAGGATAATGGTAGGTGGAGAAGCCA -GCTTATAGTTCTGAGCAAGATTATATCTTTAGCTTTTATTAGGCATGTAAATGAAATACT -TTTGAGATTTTCAGCTGGAAAAATGAAACTAATGTGAAATGCATTATATTGCCAGAGATG -ACTTCCATTGCCTGGTGGGGAAGAGACTGGAGGAAGGATGGATAATGGCAGCACACAGAT -GCTGCTGAGAAGCTATTTTAGTAGTTCATGTAAGAGACAATTGTCAAATTTGCAATAAAC -TTCCAGGTGAAATGGAATTGACTTTCTAATGGCTTGGAATGGTGTATATAAAACAGCTGG -TAATGACTCCATGTTTTGAAGCTTTAGAAAATAGATTAATAATGGCTCAAATTACTGAGT -GAAAAATTACTGGGAAAGAAGCAGGATTGGCAGGATGCAGATAAAGAATTCTCTTTTACT -TACTAAGGTTGAGATGCCTATTTGATATTCAAGGCAGTCGCAAATGAGATGAGTGTTCAG -GAATGAGATATCAGCTAAAGATATAAATTTTAGTCTTGTGCCCATATCTTAGCTGCAAGG -AAGGATTTTATGAACTGCTGGTGAGACTGTAAATCAGTGCAATCAACGTAGACAAAGAAA -TAAATGCATACACCATGTCATCTTGTAATAACTCCCAACCCCCAACACCTAAAGAAGAGA -CTGTGTGTTTAAGTTGGGAGTGTATTGTGGAGTAAGAATAAGAAACAGGATGGGTGCTAT -AGGGAGAGAGAGTAAAATCTAGAACACATTATTAAGTTAGGTGATGGAATACTATATGAC -CATTCACTGAAGTGAAACAACGGGGGAAATATGTATCCATCAGCTCTAGGAGTTCCATTT -TTCAAGCATAGCCTCATGGATGCTAACTTCTCGATGTGTCTAGACAGTGCATTTATGAGT -AACAAGCAAAATGCCTACAACAATCCACACAACATTGTCTGAGAAATCCCACAGCAGAAA -GTGAATTCTTGCTCTGGTCTGAAGCCACATACTATCACTTCCATCTTTATGAAACTGACC -AAAGTCTACATAAAAATAGTAACCAAGGCTCTGACTGGAACAAGAGGTAATGCTGAGAGG -GTCTGCAGTGATGTAAGATCCAATACACCCTATGATTTTTAAATTTTGTTCCTGGGTTTA -TATCCTAGAGAAACTCTGACATATATATGTGTATATATATATACATTCATATATATACAC -ATTTAATATATATATACACACATACACATTCATAGCAGCTTTTTGTTGAAATAGCAAAAA -ATGAGAAAAAAACTAAATGGCAATATAATGAACAAAAGGGAGCTGTGTTTTGATATTTTT -ATATAGTACAATGCTAAACAGCATTTAAAATAATTGATAAGAATTATATGGGCCAACATA -GATGGACATCATCAATGTCATATAAAACAAAATAAAGCAGAAGGTAGATAGACACTTTTT -TTTTGAGTCGGAGTTTTTGCTCTGTTGTCCAGGCTGGAGTGCAGTGGCGTGGTCTTGGCT -CACCGCAACCTCCGCCTCCCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCT -CGGATTACAGGCACCCTCCACCACGCCTGGCTAACTTTTGTATTTTTAGTAGAGACAGGG -TTTCACCATGTCGGCCAGGCTGGTCTTGAACGCCTGACCTCAGGTGATCCACCCACTTAG -GCCTCCCAAAGTGCTGGGATTACAGGCATGAACCACCACGCCCTGATGATAGACACGTTT -TTAACTTCTAAAAATATATGATCATGATTGTGTCTGTGGAGACTTGCACATATACTAAAT -TTTAAACAATTAGAGATATTTGTTCATTACCACATTTTGGGAGTCATTATTTCCTCTATG -AAGAGAGAAAGGAATTTGATACAAGTTCACAGGGGCTTCCAGTAGATTGAGACTTTTATT -TCTAGCTGAGCTGCTGATGTATGAATTTTTTTTGTTATTATGACTTTCATATGTATTAAA -AATAAAATGAAAAAACAAGGATTAGGTGAGGAACCTATACGTCTCTAATATGCAAAATAC -CACAGAAATAATGACTGTTGGGAAATTAGGCCTTAGCTCTGATGTTTGAACCATCCCCTC -AATGTTTCCCAGTGCTTCTTAGAGTATTTTGATCACCTCTGTGTTGGTGCTTTAGAACTA -GAGAAGAACGTTTTGTTAACTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGACAGAGTTTC -ACTCTTATTGCCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGCAACCTCTGCCT -TCTGGATTCAAGCGATTCTCCTGCCTTGGCCTCCAGAGGAGCTGGGATTACCTGCCACCA -CATCTAGCTAACTTTTTGTATTTAGTTGGTCGGGCTGGTCTTGAACTCCTGACCTCAGGT -GATCCACCCATGTCAGCCTCCCAAAGTGCTGGGATTACGTGCGTGAAACACTGCACCTGG -CCTTTTGTTAACTTTTAGTTTAAGTTCAGGAGTACACGTGCAGGTTTGTTATACAGGTAA -ACTCGTGTCATGGGGATTTGTTGTACAGGTTATGTTGTCACCCGGGTATTAAGCTTAGTA -CCCATTAGTTACTTTTCCTGAACCTCTCCCTTTTCCCACCCGCTACTCTCAGGTAGGTCC -GAGTGTGTGGTGTTCTCCTCTATGAGTCCATGTGTTCTTATCACTTGGCTCACATTTATA -AATAAGAACATGCTGCATTTGTTTTTCTGTTCCTGCGTTAGTGGGAGCTGAGGATGGGTG -GAGCTGAGGATAATGGTCTCCAGCTCCACCCATGTTCCTGCAAAGGACATGATCTTGTTC -TTTTGTATGGATGAATACTATAAAGTCTTCCAAACTGTTTTGGTTTTGGTTTGTTTTCTT -TCTTGAGAAAGGAAAGACAAAACAGAAATAAAAGAGTAGGCCGAGCGGGGTGGCTCACGC -CTGTAATCCCAGCACTTTAGGAGGCTGAGGCAGATGGATCACTAGGGGTCAGGAGTTTGA -GACCAGCCTGAACAACATGGTGAAATCCCGTCTCCACTGAAAATACAAAAAATCAGTCAG -GCATGGTGGCACATGCCTGTAATTCCAGCTACTAGGGAGGCTGAGGCAGGAGAATCGCTT -GAATCTGGGAGGCAATGGGTTGCAGGGTGTGCTGGGATGGCACCACAGCCTGGGTGAAAG -AGTGAGACTCTGTCTCAAAAAAAAAATAATAAAATAAAAAAGGGAGAGAGAAAGAGTACC -AATGTATGGCAGAAATCAAGAGAAGAGTTTGCTTTTTTGAATAACTACACCCTGGACATT -AGTTTCAAGAAACCGTCTGCTGGAAATATATGTTTAAGTTGATGGATCATTATTACACGT -AGCAGAAAGAAAGTCACTCCTTGCTAGAAAGCCCTGTGTAGGTCATTAGGCATCACAGTG -TGGAGTTATCTAAGCAAGCACCAAGGTAGGATATCTGAATAACTGATTTATTTCCATGTT -TACTGACAATATTCATTGCAACAAGTCAGTAGAGAAACAGTAAAGAGGGCAGGCATGGCT -ATGCTTCTATAGAATCTAGTGAAGAGGAGATAATTTCAAATAACCCAAGAAGGTAAATGA -GTAGTCAAATTTTCAAAAGGACTATTAACTCACAAACAGGAAACTATAATAGAAAATAGT -TGGTTGAAGGCAGAATGCCCAGTTCAGAAAAGATTCCTCTGAAAAGCAATATATAAGCAT -AGACTTCGAGGATGAAGAGTCACTCATTTTGAAAGAGCAGATGAAGAAAGTTTCAGGACA -AAGAAACAGCCATCTGCAAAGACCTCAACAAAGATATCACACAGAAAATGCTGTATTTAA -TCTGTTGCTAGACAAAAGTGAGCTACGGATCACATGGTCTTGGATGAGGGAGACAGATGA -TATAGTTTGGATGTCCCGCCCAAATCTCATGTTGAAACCAGATCCCCAGTGCTGAAGGTG -GAGCTTGGTGGGAAGTGTTTGGATCATGAGGTCGAATCCTTCGTGACTTGGTGCTGTCTC -CATGGTAGTGCACCTACCCCAACACACTCTCTCTCTCTTGTTCCTGCTTTCACCATGTGA -AGTGCCTGCTCCTGCTTTGCCTTCTGTCATGAGTAAAAGCTCCCTGAGGCCTCTCCAGAA -GAAGATGCCACTGTGCTTCCTGTACAGCCTGCAGGACTGTGAGTCAATTAAACATTTTTA -TAATATCCAGTCTCAGATATTTCTTCATAGCAATACAAGAACAGCCTAATATAACAGATA -AGCAGGGACTAAAGTCATCAAAATTAGAATTGTGCATTTAATTTTGATTGCATTTAATTT -TCATTGCATTTAATTTTGATTGCATTGAAAAGGCAGATGCTTTGAGGCTAGAATGAGGTA -ATAACTGTTTTGTTTTGTTTTGTTTTGTTTGCTCTTAACAAATTAGTGTGACTTCAGTCC -ATAGTAAATTGGAGAGGAACTGGTAGAACATAAAAGAACTGGTAAAAAGCCATTGTAAAC -ACTCAAGTTTCAAAAACAATTTTGTGGTAAGGGCAAATCCTCAGGTCAAGAAAAGTGTGT -TACTAAATTCCGTTAGTTTCCAGAAGAAAGATAAAGTCATATGACACAGATTTTACTCCT -TACGTTAGAGAGTGTGCTAAGGATACCACCCACATTTTCCAACATTTATTTCCATCATGT -TTTATGATCTTCATCTATATTCCATCCTCGTTATTTCTAGCAAGTCTATGAAATTTCTTA -CATTAATAGAAATAATGTATTAATATTCAGCAATGTGCTAAACATTGTTGAAACATTGTC -TCAATATTACTCTTGGAAGAGCTCTAAGGTAGACAATATTTCCAATATATGAGTCATGAG -GAAACTGAGGAATGGAAAGATTAAACAACTTGTTGAGGGTAGCATAATTGTAAATGGTGA -AGACATAATACAAATCCTAATATCTATGACTCTAATGCTTTAAAAAATCATTATATACAC -TACGCTGTCTCTGATGTGTGAATCTACCCACTTCTAATTCATTATAACAAGTATTTGTTG -CCAGGTAGCATTCCGGGCTTTGGGGATACATCTTTGGAAAAGCTTACAGAAATCTCTGTC -CTCAGGGAACTAATATTCTAGGGACTATACAATAAACAATAAGCAAAAATGTAACATGTA -TAGTGTGTTAGACTGTAGTAAGTACAATGGCAAAAAATTAAGAATGGAGAATGTCTAAGG -GGACAGATTGTTTGGAATTTGAATAAAGTGGCTATGGAAAACTTCACTGGGATAATGGCA -TCTGACCAAAAGCATGAGGAAGATATAGAACAAACCGTATCTGTTGCATGTTTAGTAATA -ACCAAGAAAATACTGTACGTGAAGCTGAGTGAGAAAATTACATAGTGGAAGGAGGGAAGT -CCATAGAAGAATTGGGGGCTTCATGTGGTGGAGCATCTATAAAGCATTGTGCAAGACTCT -GACTTTTACAATGAATGAAACGAGAGATCAGAGTTTTACACAGAAAAAGGTAATAACCTG -ATACATGTTTTAAAGTGATTATACAAATTGCTCTTTTGAGGATGTACTGAAGGGTGCCTT -ACGCAGAATCAGAAACACCTGTTTGCTGGCCATTTCAATAACCTGGGCAATAAATGATGG -TGGTTAGCACCAGGATGCTAGTGGTGAAAGTAGCAAAAATGATCAGAATTGAGCTGCATT -TTGAACATACAGTTAATAAGTTCTGTGGCATGACAGAAAAATGATTCCATAATATTTCTA -TTGAGCAACCTTGTTGATGTGATATTCAAACTAAGTCCTAATATTAAACAATGTAGGAAT -TTCAATGAAAATATGACCAAAGGGAGAAAATGGCTCCCAACCTACTGTAATTAAAGTTCC -CTTCTGTTGTTTTAAAATGCTACTATGGATAGAGAAAACAAGATATGGATTTGGAAGAAA -ATTACCCACAGTCTAATTGTCAGATTTATTGACTTAAAGATGTTACATGGCAATAAAGTT -GAAAGAAAAATAAGAAGAAATTCTAAAAGCCAGCATGTTAGATTTATTCCCTCCACAAAA -AGAATTGCTAATATTTATAAAGTGAGATAATATGCCAAATACTTGTACTTGGCATAATTG -CATTTTCTCAACAAATCCTATATAATCAACATTGTTTTATTTGTATATGTGTGAAAAATC -ATGACACCTGCAGTTTAAGTTACATTTGTATGAAACAGTCAATATGTGGCAGAGCCAGAA -TAAAGCCCACGTTTAGATTAAAGGAATTCTCTTTCCATTGCACCCACCCGTATTGCCTGT -GGAAACCCTAAACAAGCCATTTAAATTTGTTGGAATTCCGTTTTTCAACTTTTTAAAACC -TCTTAAGAGTCCAGTCCTAAATCAAGTTAATGTTTAAATTCTTCACTTATGCTTAGTATG -TACATTATTCCAAAATGTGAATTGGTCCCAAATATCATATTCATTCTAACAGAGGACATT -AAAAGTTTAAAAGATATCTGTTATATGGCTGAACTGATTATATTTTATCGGAAAGTGAAA -GGAACTGAAAGGAACTACTCAGCATATTTCTAATGAGTATACAGAGGAAAAGTACTTTGC -CTCATTTAATTTTGTAAAATCTCTCTCTGGCACATAATTCACAGTCTTTTTTTGCCCACA -GGAATAGTGAGGCAGATATGTATTAATTCATGTCATAATACAAGAATAGGTAGCGATAAA -ACACTGGCATTTTCCAAATTGCCAGTATAAAGAATTGCCAGAAGAACATGGGTATTAGCT -TCAGATTCTCCAAGGGGTTAACATTTATGTTATCTGTTAAATATGAATTATGTATTAACT -TCTCATATTTCATATATAAAACTTTATGCTTTGGTCCTGTTCCTCGGGATGACCTTGATG -TAATCAGAAATAATAGTGTTCTTTACCAAAGACATTAATCAATAACTTTATAATACGAAG -CACTATGAATATTTAATGGCTTCAGCTTGAATAATTCAAATCCCGTATTACTAGAAAATA -AAGTCAGTTGAGTGACTGAGAGTTCCATATTCCACAATTCCTACTCTGCTTACTACCTAT -TCTATTTACTATTCTCTTTACTATTTGAGAAGGGTATGGAGTTGTGTATGTTGCAAACAT -CACGTAACTTTTGTTCGACTTTCTTGAACACGTCATATTATTTTTTTTAGTTCATTTTCT -GAATATAAGTAATTTTTGGTGAATTAATACTTTAAACAAGAGTTCACCTGGAAAGCAGTA -GGCAAAATTTCATTAAAAATATTATTTTATTAACATACCTTAAAAATGTAATAGGACAAT -GCCTCAAAGAACAATTTCAAAATAAAAACACAGAAAACAAATGACCAGCAAAATTGCTCT -GAAGTCTTAAAAACAGAAATAAATACTTCAATAATCATAGGTAATATGGAAATCCAATGT -ATGACTTACCTATAGAAAACCCTTGTGGAATTTCATTTAAATCTAACGTCAATATGAGCT -ATGTAGGAAGTCCATTAATAAATAAGAATATTATATAGGTACACATGTATATATTAATTT -TAAGCCATATGCAGCCCTATTTGAAAATGTTAAAAAAAATCATCAGGATTAGTCCATACT -GATTATTAAAAAATAAAATTGTCACTGCTCATTGTAGAAGATAAATGTCAGCTGTGCAGC -AGATGTGTTTATAGCCACCCAGTAATCCTATCGCCTCAATAATGCATTTCCCTTTTTTTA -GTTAAAAACTTAATTACATTAAGAAACTGTATGTGTTTAGGAATATGAATATAGAAGGAG -TAATCATCGTTTAGTAAAATAAGTTTTACTTGCCTATTAAAATTTACTGTGTTTGTGAAG -ATGCTGATAGTACATTACATATGGAGATCCAAGTGCACATAGTCACTAATTCTTTAAACT -ATGTTTGATATTAGTAATAATTTACTTTACATATATATCGGAATTTAATTGAAAAATAGT -AAATGACTGCTAATATACATTATTCTTCTGAGTTGCATTTTTGCTTAATGAAATAGAATT -TTTAAAAGAATTGTTTATCTTTATTCCTACTAGATTATACATTTCATGAGAAAAGCATTA -TCTCTTTATTAGTATATTTGTTTACCTGTATTAGAACTTGACTTTGAAATAAACCAGATA -TAATACCATTGTTGTAGATGTATTTATTGTACTAAAAATAATATTCTGCATCTGAGTTTT -GAGCAAGGAGATTTTACAGTCTCCTTTCAGTTAGAAAACTACAAGACCCTCTTAACTGAT -GTTGGAAATGTAAGTAAAGAAGATAAAAATTAAAATGATAAAGAAAAGCATTTGGGGTAT -AGTAGCACTGTGCTTCCCAGGAGAATGAGTTGTTAAGTGCTCACTCCGCATTTTTCAGTA -ACATATACTTAAAGTAAGCACACAGGGGCTACAGACGCTATTTTTTGGTTCAACATGACC -TGAGCAGTTAATTAATTGTAAAGGGAAGAAGCAAGAATAGGCTCAGGGAGGGAGACAGAG -AAAGACTGGGTAGGGCGGGGAGGGAGGGAGAGTTTCACCTGTATCTAAAACAGATCAGAA -GCAATTTCTTCCTCCAACTCCTCACTTGTCTATTTCTACTAATAAAGAGCAAAACCAGAC -AAAATAGATTATTGTGTCATTTTTGTTTTCTTATTTTGTAATACACAGAAAAACTCAAGC -TGGAGACGGAAATGAACAGATGCACATGGCTGGAAAGACTCAGTGCTAATCTCTACAATG -TTGTTTTAATAGAATGGAGACAGGACCACATACTTTCTTACAATAATGAGGATCAATAAA -GACAAAACTGACACTTTGTAATGAATAATGATCTGAACACTCACCTGAGAAAGTATCTCT -TTGTTGCAGGTTTTTGGAAATGGGCTATATTTTTTGAATCATAACCGATATGTACTGCCA -TAAACAAGAGGATTTCAAGCCAGCTCCATCTGGTCGAAAATTATTTTATTTATTACTAAG -AGAAAAGTGTAAGACAAGTCCTGTGGTAAAAACAGATTTATTGCCTCTGCTATTCACCTG -TGTTATTTCTTCATATATTACCATTGACATATATTATCCATTCTTCACAGCAATGGCTTT -GCCGTGGCAAATTAAATATCTCATTGTCCTTCTCTGTCCATTTTACATTATAATGTTTCT -GAGCAGACTTTTATAGCTCTCTCACAGAATTATAGCAAGTCTTTAAATAAAAACAAAATT -GAAACAAAAATTTTAGACTCAACTTAAAATCCCTCTTTATTTTATAATTTGGATTTTTAA -GTAAAATATGCTATATCCTATTTAACGAGAACTTTCATATGTAATGTATCAATGGAATTA -TCTAAAGCTCATTTGGTTTTGCATAAAAACACAATTAGAGTAAAAACATTCTAAAATAGA -CACTGGAATAAAAACAATGAAAGCAAAACTATTAATTTTACATTTTTCATTCAAGTATTT -TGATTTTTACTATATTATATTATTATATTAGGTATCAGAGTAATCATTGATCGCTTTCAA -AACCCTGCTCCTTTCTAGGTGCAATGAAGAATTTTTATTTTATTGAAAAGTTATCTTAAG -ATGTAAGACTTGTGAATGATAGTAAAGATTTAGTAGACCCAATGTATTCTCAGATAAATG -TAAAATAAGCAAGATATGAATTAAAGGATAAATATAGAGTTTAACAGCATAGATCTTAAA -ATCCATTATCATAAGGTAGAAGGATGTATAATTTATCATGATTAAAATATACTAAATATT -CATATCACAGCATTCTGATTTCTGATATCTACAATTTAGGTGACATATATACATATGTGT -GTATATATATATAACTGTATTATTTGATATTTTAAAAGATAAAAGAGTTATATATTCAAA -TACCAGTATAGATGTTGCTCTGAAAGTATTGTTAGATGAGATTAACATGAAAATCAGTAA -TTTCTGAGAAAAGCAGATTATTTTCCAAAATATTGTAGGGCTCATCCAGTCAATTGATTA -TATTAAAAGACTGAGATCCCTCAAGGAAATAATTCTGCCTGCAGATTGCCTTTCGACTTG -AGACTGTAACATCAACTTTTCTCTAGGTCTCTTGCTTGCTAGCCTACCCTGCCAATTTTT -TATACATTAAAAAATATTTTTCTCAAATATTGTAGTTCTGCTATTTCCAGAAACGCTTGA -GGATTGTACTTCCAAGTTCTTTTAAAGTTGATCATCGTCATGAAACTCATCATCAAATGG -CATTTGAGCAAAATCTTTATTAATTAAACATGAGTGGAAGTTTAAAGGCCAAAGCACAAC -TCACTCAGGCATTGTGAACTATGTGTGCAGACAGATCACAACTCATCCTTGGTCTCTGGG -TGTGTCTGCGTCTATTACTGACTTACCCTGGATATGTAAAATGAGTAAGAAAAACTTTTT -GTGTTAGCCACTGGGATTTTTGGTTTGTGTATTACTTTAGCATATTATCTCATTTTGAAT -GCTATAGTTTAGGACACTAGTTTAAACTACTGAAGTTAAAATGTTCTCCTTATTTCAGAG -GAGAGAAGGATCTTACAGTGACAGACATCCATTAGTAAGAATTAATTTCTAGAGATAAAG -TGAATTCAGTAACCACAGTGTCAGTAGAGTCAGCATGGTCAAAATAGTCTACATGGGAAA -TGTTTGGTGGCTCTTAGTTGATCATGGAGTCTCTAGAACCAAAAGTTATGAATGCCAATT -AAGTTTCGATTTGGCTTATATGATCTCAAATCTTCAGGTTTACAAAACATATCTTGAGCC -ACCACCCAGCTCTGTCACCCAGGCTGGAGTGCAGTGGCACCATCTCAGCTCATTGCAGCC -TCCGCCTCCGAGGTTTAAGCGATTCTCATGCCTCAGCCTCCTGAGTAACTGGGACTACAG -GTGCTCACCACCATACAGGGATGTTTTTTCTATTTTTTTGGAGAGACACGGTTTCACCAT -GTTGGCCAGGCTGCTCTCGAACTCCTTACCTCATGATCCGCCCACCTCGGCCTCCCAAAG -TGCTGGGATTACAGGCGTGAGCCACGGCGCCCAGCCCATTTTTTCTTTTCACCCACCTCG -GCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCACTGAGCCTACAGCTCATTTC -TTAACACATAAAGCTTTGCACCTCTCCACAAAACTGCCATCAGGGATGTCCCCAGAAACC -ATTCATCCCAGGTGCCACGCAGAGAAGAGTTGCTTGTTCTCCTTTTCCCTTTACCTCTTC -CCTCTCACCTCATCATGTTCATTCATTCATCCCTTTTCCATTCTCACTTTTAAGCTTTAA -CCTTTCAAAAGCCTATCTTCCCCTATAAGTAATGTATTGTAACTCCCGCCATCACCATAT -CCTTCTCCAACCAACCAAACTGCCATCCTGAGTTTATGGAAAGTCCATAAACTAAGAAGA -AATGGGAAACATTCATTGCTAACTTGGCAGCCCCTCATCCACCCTACGTGAGAGCACAGA -TCTTATTGTCTTTGAAGACCCTTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGAAGCA -GTCTCACTGTCGCCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGCAAGCTCCAA -CTCCTGGGTTCATGCCATTCTCCTGCCTCAGCCTCCCGAGCAGCTGGGACTACAGGCACC -CGCCACCACGCCCGGCTGATTTTTTTTGTATTTTCAGTAGAGACAGGGTTTCACTGTTAG -CCAGGATGGTCTCGATCTCCTGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTG -GGATTACAGGCATGAGCCACCGTGCCCAGCTCCTTTTTTTTTTTTAAAGACAGGTCTCAC -TCTGCTGCCCAGGCTCAAGTGCAGTGGTGTAATCATGGCTTACTGCAGCCTCCAACTCCT -GTGCTCAGGCTATCCGCCTGCCTCAGCCTCCCAAGCAGCTAGGACTACAGGCACACACCA -CCACACCTAGCTAATCTGTTTAGTTTTTGTAGAGATGGGGGTCCTGCTATGCTGAACAGG -CTGGTCTCGAACTCCTGGCCTCAAGCAATCCTCCCACCTTGGCCTCCCAAAGTGCTGGGA -TGACAGGCATGAGCCACCATGCCTGGTCTGAAGACTTTTAAATGCTGCCATATTCAAGAC -GCGTTGAAACTCACCTGTATTCGATGAGCCTGCTTTTCGCAAATGAGTAACATAAAACAG -ACTGAAATACCTTAAGCTTCTCAGCCTTTTACCCTCCTCTGGAATAATGAGTGTATCCCA -AAAGTAAATCCATAATGAGGTCCAGTTTTTCCTTCATCCTTGGCTATGAAATAGACAAGA -AAAAGGCAAGCTAGCCATTTCCATCTCACTATAGCAGACTCTCATGTTTGCTTTTTGACC -GTACGTGGGAAGCGGGGGCCTGACTGCTTTCCTACTTCCTAAGCACAACTTACTTTTCCT -AGGAAATTCTCAACACAACCTACATGGATTAAACCAGGTTCCCCCCTTTGTTTCCAATAT -TCTTACAGCCAAAATGTCCAGAATGGGCAAGGCAACCTGAAAAAATGAGGACGGGTACAT -TATCCCATGCGCTAAACTGCCACTTACACTGGTTAGTCATGAAATCGGCAAAATTCCAGA -TGAGCTCTCCAACCACGTATTTTCTGCGTTTTTGATCCAGACCCAGATGGTACTGCTCTA -GCAGACTTTTCCGGTCCTCTTCACTGAACATCAGAGGTGGATCCTGGGATTCAAGGCAAA -GAGAATTAAGAGTAAGAACTGGCAGAATTGTAAATGTTAGATAAAAATAAAGATCCACTT -GATGGTGACCAAAATATCTGTCCTCACTGGGGGCTGTAGGGACTGCAGGACTCACTGATG -CTAGGGTAAAGACAGCCAGGGAGAAATTGGAAATCATCATTCTCAGTAAACTATCGCAAG -AACAAAAAAACAAACACCGCATATTCTCACTCATAGGTGGGAATTGAACGATGAGATCAC -ATAGACACAGGAAGGGGAACATCACACTCTGGGGACTGTTGTGGGGTGGGGGGAGGGGGG -AGGGATAGCATTGGGAGATATACCTAATGCTAGATGACGAGTTAGTGGGTGCAGCACACC -AGCATGGCACATGTATACGTATGTAACTAACCTGCACAATGTGCACATGTACGCTAAAAC -TTAAAGTATAATAATAATAAAAAAAAATACAAAAAAAGAAACGACAGCCAGGGAATGATG -TAACCCAGAATTAAAAAGGAGGTTTAAAAAAAAACCATCAATTAGCAACTGCTTTATTTA -TAAATATAACCTGATACTCAATTTTTCTTACTTTTCCGTCTCTGTCTGCTGATACAGTCT -TAAGGCTGAACTACACTAGAAGGAAAAATATGTCTTTAGGTCAGGCGCGCTGGCTCATGT -CTGTCATCCAAGCACTTTGGGAGACCGAGGTGGGAGGACTGCTTGAGCCTAGGAGTTCAA -GACTAGCCTACAAAAAGTACAAAAGTTAGCCAAGCATGGAGGCACACACCTGTGGTCCCA -GCTACTTGGGAGGCTGAGGTGGGAGGACTGCTTCAGTCCCGGAGGTCAAAGCTGTGGTTT -GCACCACTACACTCCAGCCTGGGTGACAGAACAAGACCCTATCTCATGAATGAATGAATG -AATGTAAAATGAAATTAAACTAAACCAGGCTGGGCATGGTAGCTCAGGTCTGTAATCCCA -GCACTTTGGGAGGTCGAGGCAGGAGGATCACTTGAGCTCAGGAGTTCAAGATCAGCCTAG -GCAACACAGTAAAACCCAGTCTCTATAAAAAGGCTAAATATTCGCTAGGTATAGTGGCGC -ATGACTGTGGCTCCAGCTACTTGGGGGGCCGAGGAGGAAGGATCACTTGAGCCCAGGAGG -TTGAGCAGTGAGCTGTGATTACGCCACTGCACTCCAGCCTGGGCAACAGAGTAAGGCTGT -CTCAAAAAAAAATTTTTTTTAATTAAACCAGATAAATTCAGTTATCCTAGTCATATATCA -AGACCTCAATAGCCACATGTAGCTAGTGGCTACCATTTCAGACAGTGCAGACATGGGGCA -TTTCCATCATTGCAAAGGTTCTTTTTTGAAACAAGGTCTCACTCTGTCACCCAGGTGGGA -GTACAGTGGTGCAATTATGGCGGACTGCAGCCTTGACCTACTGGGCTCAAACAGTCCTCC -TACCTCAGCCTCCCAAGTAGCTGGGACTAGAGGCAAGCACGACCATACCCAACTATTTTT -TTTTTTTTTTTTTTTTGAGACGGACTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCA -CAATCTCGGCTCACTGCAACCTCCACCTCCCCAGTTCAAGCGATTCTCCTGCTTTAGCCT -CCTGAGTAGCTGGGATTACAGGTGCATGCCACCACACCCAGCTAATTTCTGTGTTTTCTT -AGTAGAGACGGGGTTTCACCATCTTGGTCAGGCTGGACTTGAACTCTTGGCCTCGTGATC -CACCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTCAGCCACTGCACCCAGCCAC -AACTCATCTTAAATATTTTGTAGAGATGGGGTCCATGTTGTGCAGACTGGTCTCAAACTC -CTGGGCTCAAGAGATCCTCTGACCTCGGTCTCCCAAAGGGCTAGCATTCCAGGTGTGAGC -CAGCACACCCAGCACTGCAGAGGTTCTATCAATGCTCACCTAGACCCTCTCGAGTTTCTT -AAGAATTCAGAACTGGGGCTGGGTATGGTGGCTCATGCCTGTAATTCCAGCACTTTGGGA -GGCCAAGGCAGGTGGATCGCTTGAGGTCAAAAGTTCAAGACCAGCCTGACCAACGTGGTG -AAACCTCATCTCTACTAAAAAAAAAAAAAAAAAAAAAAAAATTAGGTGAGCATGGTGGTG -CATGCCTGTAATCCAAGCTACTTGGGAGGCTGGTGCAGGAGAATTGCTTGAACCTGGGAG -GCGGAGGTAGCAGTGAGTCAAGATTGCACCACTACACTCCAGCCTGGGCGACAAGTGAAA -CTCCTCCTAAAAGGAGAAAGAATTCAGAGCTGGTTACCTTTTCAAAGAGAATGAACAAGG -GTGCATATCCACAAATCACTTCCCCCTACTTGACTAGTTTGCAGAAGTGTCATTCTGTAA -GCACGATAAATTTAAGGGTGCAAACAGAACAGTGCAGTCCATTGTGGGTGGCTGTTCCCT -GTGTGTCAACGGGAGTCCCAGGAGCTGTGCAAAAGAGTGTGAGCTGGCTGGGGAGGGGAC -AAGGGGCTGGATGGGGTTCAGGAATCCACATGAAAAAAACCCCACAAGACAAAGCAACAT -ATCTTTGGTGAGAAGGACAAAAAATGAGATGGATAAACAAATGAGGACAGGCCAGGCATG -GTGGCTCAGGCCTGTAATCCCAGGATTTTGGGACGCGGAAGCAGGCAAATCACTTGACGT -CAGGAGCTCAAGACCAGCCTGGCCAACATGGCAAAACCCCACCTCTACAAAAATACAAAA -ATTAGCTGGGCATGGTGGCAGGTGCCTGTAATCCCAGCTGCTTGGGAGGTTGAGGCAGGA -CAATCGCTTGAGCCTAGGAAGTGGAGGTTGCAGTGAGCTGAGATCACACCATTGCACTTC -AGCCTGGGTGACAGAGTGAGACTCCATCTCAAAAAAAAAAAAAAGACAAAGTGAGTGATT -AAACATGGCTCTAAGATCTCACCCATGCCCTCAATAGGTATTATTTAGCATGTACTGTGT -CAGCTATTGCAGAGTACCTGGGAAACAACAATAAATAGGACTCCTGTCTCCTGAGCCCAC -AGTCCGATCAAAGAGAGAGCCAAAGAAATAACAACGGTGCCTGGCGAGAATGTTGGGGG +>chr5_70040526_70088546 +TTTCCATAAATTCTAAATAGCATCTCTTTGAAAACTAACTCCTTGGGCATTTATCCCAGT +GAAATGATAATAAACAGCCAGCATTTGAAATGTATATTTCAATGACTCGTATATGAATAC +TATGTTCAGATCACAATAAAGGACATTTCCAGCACCCTCTTAGATAATGCCTCCCATCTG +ATTATCACCATAGATTAGTTTTGCCTGTTTTTGAATTTTTTGTAAATGGGATCACACGGT +ATGTGCTGTAGTGAGCCTAGCTTGATTTGCTCAACGTTACGTCTAAAAGAATTCTCTATT +TTGCTGCTATGCAGCAGTTCAGCCTTTTTCACTGTTGTGTAGCATTCTGTTATATGAATA +TACCATAATTTAGTTACTCTACATTTCATGGACCTTTGAATTGCTCCCACTTTGGGGCTG +TTATGAATAATGCTACCGTAACATTTTTATATATCTTATGATGGTCTTAAACACGCTATC +TACTGAGTATATGCCAGGAATGGAATTGCTGGGTCATAGGGTGTGTGTCTATACACACAC +ATACACACATTTTTTAGTAGGTACCATCAAACAATTGCATTTTTGTTGTTTTTGTTTTAG +TTTGTTTTTAATGACCATACCATTTTACCACCAGCAATAAAGGAAGGTTCCATTGCTCCA +TACCAGCATTAGGTATTATCAGTCCTTTTATCTTTAACCATATTGGTGGGAGGTAGTGGT +ATCTCATTGTAGTCTTAATTTACTTCCATATACTTAATTTAAATACACTCAATTTATATA +CACACACGTATACTCTCTCAGCCTTCAGTAAGCCAGTCCCATTCCCCTGAACTAACCAAT +GTTAATAATGGTATACTTCTCTCAGTGCTCATGCAAACATGCATACACACACATACACAT +AAATAAGGGGTGTGGGAATGTATTTGATTCTGAGAGCTGCTGTAACAAATTACCACAAAC +TTAGTGGCTTAAAACAACAGAAATATTTCTGTTCACAGTTCTGGAGGCCAGAAATCTGAG +AGCAAGATGTCAGCTGGGCCACACCTCCCTCTGAAGGCTCCAAGGGAGAATCCTTGCTTT +CCTCTTCCAGCTGCTGGTGGCTCCAGGTATTACTTGGCTTATGGCAGCATAACTCCTATC +TCTGCCTTTGTCTTTGTGGTCTTCTTTTCTGTCTTCTTCCCTTCTTTTTATGAGGACTTT +TGCTGTTGGATTTAGGTTCCATTCTAACCTAGGATGATCTCATTTGGAAATCCTTAATTT +CATCTACAAAAACTGTTTTCCCAAATAGGTCACATTCACGCATATCAGATGGACAGATGT +ATCATTTTGGGGTCCACCATTCAACCCACTACAAGGAGTTTTTTAAACAAAAATAGGAAA +CTTAGATGTAACTTAGCACTTTTTTTTTTTTTTTTTGAGATGGAGTCTCACTCTGTCACC +AGACTGGAGTGCAGTGGCGCCATCTCAGCTCCATGCAACCTCTGCCTCCTGGGTTCAAGC +AGTTCTCTTGCCTCAGCCTCCTGGGTAGCTGGGATTACAGGCACGCGCTGCCACACCCAG +GTAATTTATTTATTTTTTTTTTGAGACAGAGTCTCGCACTGTTGCCCAGGCTGGACTGCA +GTGGCGTGATCTCTGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGATTCTCCAGCCT +CAGCTTCCTGAGTAGATGGGATTACAGGCGCCTGCCACCACGCCCAGCTAATTTTTTGTA +TTCTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCCATCTCCTGACCTCG +TGATTCACCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGTCACAGCCCCCG +GCCATAATTTAGCACTTTAAAAAATAATAGCCATGTTGGGCCAGGCGTGGTGGCTCATGC +CTGTAATCTGAGCACTTTGGGAGACCAAGGCGGGTAGATCCCTTGTGCCCAGGAGTTCAA +GACCAGCCTGGGCAACATGGCGAAACCCCATTTCTACTAAAAATACAAAAATTAGCTGGG +GCGAGGGGATAGGCCGAGTTCCGGGTGTAAGGGGGCCATTAGGGAGAGCAGAGCGAGGCA +GCTGATCTTCCGGATTGGGGGCCTTGCCCGGAAGCTGGACCTCACGGAGATGAAACGGAA +GATGCACGAGGATATGATCTCCATACAGAACTTTCTCATCTACGTGGCCCTGCTGCGAGT +CACTCCATTTATCTTAAAGAAATTGGACAGCATATGAAGATTGGACATCACATGTGAATG +CATGATATGAAGAGCCTGGTTACAGTTTCTACTGTTCTCTGCAAGTAAATAGGCCCAGAA +AGGTATAAGAGACTCTTTGAATGGACATAAAAATTCTGCTTGTTAAGAACAAGTTGAGCT +CTGGTAACTGATCTTAATAGCTAAAATATAAAAATATTTGGGAAGTCTGAAATGAGGTCT +CCTGGCCCTGGTGTGCCCTTAATGCCTGTGACAGTTGGCCTCTGTGAATATTGGTATAAT +TGTAAATAATGTCAAACTCCATTTTCTAGCAAGTATTAATAATTAAGGGAAGTATGTCTG +AAATGGCACTGTCTTGTCAGTCATTTCTGTTTACCCTTCTGTCTGGAGTGTATTTGTGAA +GAGTCCCTTATAACTTATGTTTTATGGACATCAGCACATAACCACAATGACATTGAAGCA +CAGGATCATTAGTCTATATTTTATTTTATTATTTTATTTATTTATTTATTTATTTTTGAG +ATGGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGCAA +GCTCTGCTTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTAC +AGGTGCCCACCACCACACCCGGCTAATTTTTTGTACTTTTAGTAGAGATGGGGTTTCACT +GTTTTAGCCAGGATGGTCTCGATCTCCTGACCTCATGATCCACCCGCCTTGGCCTCCCAA +AGTGCTGGGATTATAGGTGTAAGTCACCATGCCCAGCCCATTAGTCTATATTTTTAAGTA +AACATACCAATTAAGAAAGAAGCCAAAAACCAAAATTAGCCAGGTGTGGTGGCACGTGCC +TGTAGTCCCAGCTACTTGGGAGGCTGAGGTGTGAGGATCACTTGAACTCAGGAGGCAGAG +GTTGCAGTGACCCAAGATGGTGCCGCTGCACTCCAGCCTGGGTGACAGAGTGAGACCCTG +TTTCCACAAAAAGAAAAAAAAAATAGCCGTGCCTGTACTTCAGTACTTACAAATTTAACT +TTAGTATAGATGTACAGTAATTTATTCAATCATTTCCTTACTCATAGACAATTAGGATGT +TGCAACTTTTGCCACTACAAACAATTCTGCGATGTGGATTATCGTACTTATTCCCATTTA +TTGGTGCTTTCATTTCTATAAGAATGGATTTTTAAAGATAGAATTCCTTGGGAATAGTTA +TGTCAAAGCCAAATATAATATAGAGACAAATCTCTAAAAACATTTTATTTGGTAAGCAAG +AGCTGCAATTCATGGCATACACACAGACCGGGCTGATCATTGGTATGATCAGGAGAATAA +AGGGAAGGTTGCGGCCAGGTGTGATAGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCC +GAGGCGGGCAGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGACCAACATGGAGAAAC +TCCGTCTCTACTAAAAATACAAAATTAGTTGGGCATAGTGGCGCATGCCTGAATCCCAGC +TACTCAGGAGGCTGACACAGGAGAAATGCTTGAACCCGGGAGGTGAAGGTTGCAGTGAGC +CGAGATCGCGCCATTGCACTCCAGCCTGGGCAACAAGAGCAAAATTCCATCTCAAAAAAA +AAAAAGAGAAGGTTCCGGGTTTTATGAGAAAGAACAGTATTACATACTGTTTTGGAAGAA +AGCTCATTCACACTAGAGCTTGTGGGAGCTAGCAAGCTCTGATTGGTGAGCGATGGTGGT +AGGTAAAACCAGTCTTAGAGTCATGGCAGTTCATTTTAGCAGCTATTAGGTAAAACTGGT +CTTAGGGATACAGAAGGCTGGTTCAGCAGTTGGACTTGTGGAAAATTTAATTCTTGAAGC +AGATGCTGTGTGCCCCGAATGCTTCTTCCCCCTGGCCCTTCAACTCTGATTTAGTTGAGT +ATTTCAAGAATGACCCAATTTATGTAATCAACTTTCACAGGTATACATGTCTTAAACTTT +AAACAGATGTTTTGGGTTTTGTTGTTGTTGTTTTTGAGACGGAGTCTCACTCTGTTGTCC +AAGCTGGAGTGTAGTGGTGTGATCTCGGCTCACTGCAACCTCCGCCTCCAGGGTCAAGTG +ATTCTCCAGCCTCAGCCTCCTGAATAGCTGGGATTACAGGCGCCCGCCACCACGCCCAGC +TAATTTTTGTATTTTTAGTAGAGATGGGGTGGGGTTTCACCATGTTGGCCAGGCTGGTCT +TGAACTCCTGACATCAAGTGTTCTGCTCACCTCAGCCTCTGAAAGTGCTGGGATTACAGG +CGTGAGCCACTGCGCCCGGCAGTCTTTCCTTCTTTTTTTTTTTTTTTTTTTTTTTTTTAA +TGACATGGGGTCTTACTTTATTACTCAGGCTGGTCTCAAACTTCTGGCCTCAAGGAATCT +TCCCACCTTGGCCTCCCAAATTGCTGGGATTACAGGCATAAGTCATCATGCCTGGCTACA +AACAGATATTTTCAATAAGAGGATAAAAGTTCATTTCCCCATACTTTGCTAACATCAAAT +GTTATTAATTCCTAATAGTTTTGCCAAACTGAGAGGAAAATGGTATGTTAGTTTTTCTGG +GTTTTCTTTCTTTTTAATTTTTTTTCTTTTTTATTCATCGCAACACTATTCACGATTTTT +TTATTTTTTATTTTATTTATTTATTTATTTTTTTTTGAGACAAGGTCTCCCTATGTTGCC +CAGGCTGGTCTTGTACCCCTGGGCTCAAAGGATCCTCCTGCCTCAGCCTCCCAAAGTGCT +AGGATTACAGGCATGAGTCACCACGCCTGGTTCACAATTTCTTTTTGTTTTTACCAAAGG +CAGGTATATTCCTGAAATTTTTTGTTTTTTTGTTTTTTTTTTGAGATGAAGTCTCACCCT +GTCACTCAGACTGGAGTGCACTGGCACGATCTCAGCTCACTGCAACCTCCGCTTCCTAGG +TTCAAGCGATTCTCCTGCCTCAGTCTTCAAAGTAGCTAGGATTATAGGCGCCGCAACCAT +GCTCAGCTAATTTTTGTATTTTTAGTAGAGACAGGATTTCACCATGTTGGCCAGGGTGAT +CTCAAATCCTGACCTCAAGTGATCCGCCTGCCTCAGCCTCCCAAAGTGCTGGGATTACTG +GCATGAGCCACCGTGCCAGGCCCTGAAATGTTATCTTAGTTATTAATTTGCAATTCCTTG +GCTCTAGAGGTTGGGCATCTTCTCAGATCTCTAGTGGACATTTGGATTTTCTTTTTGGTG +AACTGTCCAGTTTTTCTCTCTGCTTTACAATCTTTATTATATGCAATCTTCACATGTAGG +TACTACCATTTTTTTAGTTTGTTTTTGAAACAGCATATTGCTCTGTTGCCCAGGCTGGAG +CACGGTGGCAAAAACATGGCTTACTGCAGCCTTTGACCTCCTTGGCTCAAGTACTCCTCC +TGTCTCAGCCTCCTGAGTAGCTGGTACCACAAGCCCATACCACCATGCCCAGCTAATTTA +TTTTTGTAGAGATGGGGCCTGACCATGTTACTTGGGCTCAAATGATCCTCTCCCACTCAG +CCTCCCAAAGTGCTAGGATTACAGGCATGAGCCACCATACTTGGCCCCTTTTTTTTTTTT +TTTTTTTTTTTTTTTTTTTTTTTTTTGAGACACAGTCTTGCTCTGTCTCACAGGCTGGTG +TGCAGTGGCACGATCTCAGCTCATTGCAACCTCCACCTCCCAGTTTCAAGTGATTTTTGT +GCCTCAGCCTCCCTAGTAGCTGAGATTACAGGCATGCACCACCATGCCTGGCTGACTTTC +ATATCTTTAGTGTTGCCATGTTGGCTAGGCTGGTCTCAAACTCCTGACCTCATGTGATCC +ACCTGCCTCGGACTCCCACAGTGCTGGGATTACAGGTGTTAGCCACCACCCCGACATTAT +TTGAAACTTTTATTTTATCATGAGAGAGTTCCAGGAGTCAACTGAAGAGAGATTTTTGGT +ATGAAAATTACATATGCAAAAAGACTGATTCCAGTACATGAAATTAAATTCAACATTTAC +ATTAAATGCCTTCAAATATGGTAAAATGGTTTCTTTTGGCAGTTTACCTCATTATGTTTT +GAATGATTTGTCTATCATATGAAATAACTTTTATAAATATAGTAACTCAGGCCTGGGCAC +AGCGGCTCAAGTGGGAGGACTGCTTAAGCAACCGAGTTTGAGACCAGCGTGGACAACATG +GGGAGACCCCTTCTCTCCCAAAAAATAGCTGAGCATGGCAGCGCACTGTTAAAGGAAACA +GAGTTTCTTTGGTGGGTGATTAAAATGTTCTGGAGTTAGATAGTAGTGATGGTTGCACAA +CCTTGTGAATATATTAAGGTTTCCGCTCTATCTACCATTCAATTGTACTCTCTAAAACGA +TTAATTCTATAGTATATCAATTATATCTCTAAATAATAAAAACAAAAAGAAATGGCTGGG +TGCGGTGGCTCATGCCTGTAATCGCAGCACTTTGGGAGGCTGAGGCGGGCGGATCACAAG +GTCAGGAGTTTGAGACCAACCTGACCAACATGGCAAAACCCAGTCTCTACTAAAAATACA +AAAATTAGCTGGGTGTGGTGGCACACGCCTGTAATCCCAGGTACTCGGGAAGCTGAGACA +GGAGAATCACTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCACTGCACTCCAGTCTGGG +TGGCAGAGCGAGACTCCGTCTCAAAAAGAAAAAAATTAAAAAACAAAAAGAAACCTGGTT +CTATATTTTGTTTAAATTTATTTTTTTAACCATCATGTAATATGTCCAGGTAATTTGTTT +AAATTTTGACATCAAATGCAATTGTGAGAATTTTTATGATTCAGAAAAATCTAAGCAAGC +TTTATAAAAACATACTTTTTTTTTTACTTTTTTTTTTTTTTCTGAGACACAGCCTCACTC +TGTCGCCCAGGCTGGAGGGCAGGTTTTCATGTTTATCTGTGAGATGTACCTTTGGCACAT +TACTTTCCTGACATGAGATTTAAATTTTTTTTTTTATCTTGTGACAATTTAACTTTTTTG +ACACATAAAAATTGTACATATTTATTTGTTTGAGATGGAGTCGCACTCTGTCACTCAGGC +TGGAGTGCAGTGGCGTGATCTTGGCTCACTGCAACCTCCGCCTCCCGAGTTCAAGTGATT +CTCCTGGCTCAGCCTCCCAAGCAGCTGTCATTACAGGCCTGCACCACCACACCCGGCTGA +TTTTTGTATTTTTAGGAGAAACAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAAGTCC +TGACCTCAAGTGATCCACCCACCTTGGCCTCCCAAAGTGCTGGGATTATAGGCATGAGCC +ACCGTACCAGACCCCTAAAAATTGTATATATTTAAGGTGTACCATTTGATGTTTAGATAT +ACATTGTGAAATGATTACATTCCACATATTACCTCTACAGAGTTACCATTTTTGTACACT +TGGTCAACATCATCCCATTCTCCCCTTCCTCCACAGATATTTCTTGTATACTATATAGAA +GCCAAGGGTATTTTGGGGGAAGAGCTCAAAGTTCCTTTCGTGGAGTTAAAAATATATATA +TACTATGTACATATAAGCCATTTAGCAACCCTAGATGCTTAATAAAGAATACTGGAGGCC +CGGTGTGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGTCGGATTAC +GAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTTTACTAAAAAT +ACAAAAATTAGCCGGGTGTGGTGGTGGGCGCCTGTAATCCCAGCTACTCGGGGGGCTGAG +GCAGAATTGCTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCTGAGATCACGCCACTGCAT +TCCAGCCTGGGTGACAGAGCAATACTCTGTCGCAAAAAAAAAAAAGAATACTGGAGGCTG +GGCGAGGTGGCTCACACCTGTAATCCCAGCATTTTGGGATGCCAGAGGCGGGCGGAATAT +CTTGAGCTCAGGAGTTCGAGACCAGCCTACACAATATGCTCCAAACGCCGCCTCTACAAA +ACATACAGAAACTAGCCGGGTGTGGTGGCGTGCCCCTGTGGTCCTAGCTACTTGGGAGGT +TGAGGCGGGAGGATCGCTTGAGCTCGGGAGGTCGAGGCTGCAATGAGCCGAGATGGTGCC +ACTGCACTCTGACGACAGAGCGAGACTCCGTCTCAAAACAAACAACAAATAAGGTTGGGG +GATCAAATATCTTCTAGTGTTTAAGGATCTGCCTTCCTTCCTGCCCCCATGTTTGTCTTT +CCTTGTTTGTCTTTATATAGATCAAGCAGGTTTTAAATTCCTAGTAGGAGCTTACATTTA +CTTTTCCAAGGGGGAGGGGGAATAAATATCTACACACACACACACACACACACACACACA +CACACACTGGAGTTCGAGACGAGGCCTAAGCAACATGCCGAAACCCCGTCTCTACTAAAT +ACAAAAAATAGCTGAGCTTGGTGGCGCACGCCTATAGTCCTAGCTACTGGGGAGGCTGAG +GTGGGAGGATCGCTTGAGCCCAAGAAGTCGAGGCTGCAGTGAGCCGAGATCGCGCCGCTG +CACTCCAGCCTGAGCGACAGGGCGAGGCTCTGTCTCAAAACAAACAAACAAAAAAAAAAG +GAAAGGAAATATAACACAGTGAAATGAAAGGATTGAGAGAAATGAAAAATATACACGCCA +CAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCT +CCCGGCCACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCGGCGGAAGTCGTCACTCTTAA +GAAGGGACGGGGCCCCACGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGG +CAGTGGTGGCGGCGTCCCGGAGCAGGAGGATTCCGTGCTGTTCCGGCGCGGCACAGGCCA +GGTGAGGTCGCAGCCAGTGCAGTCTCCCTATTAGCGCTCTCAGCACCCTTCTTCCGGCCC +AACTCTCCTTCCGCAGCCTCGGGACAGCATCAAGTCGATCCGCTCACTGGAGTTGTGGTC +CGCGTTTTTCTACGTCTTTTCCCACTCCGTTCCCTGCGAACCACATCCGCAAGCTCCTTC +CTCGAGCAGTTTGGGCTCCTTGATAGCGTTGAGTGGAGGCCCTGCCGCGACTTGGCAGTA +GCTTATTTTGTTCACTCCTCTCTGGCTGGTGTGGGGGAGGTGGGGGCATTAGGCCAGGGT +GAAGCAGGGGAACCACTTAGGAGTCTGTTAAGATGATCTGAACTTCAGAACAAGATGTTA +TTAACAGAGTGAAAGTATTTGGATTCTGGGTATATTTTGAAATCGGAGGCAACAGGTTTT +TCAGATAGATTCGATAACGGAGGTTATCCTGAATAGTTGAAAAGATAAAGTTGCCTTTTG +CTGAGGTGGGAAAGAGAAGATTGCCAGTAGAGCAGGTTTCTCAGGAGTTCAGTCTTGGGC +ATAGCATGGTAGGGGTGAATTTGGCTGGAGTGAGTTGGAGAGTAGGAGAAGAGAAATCCA +AGGCAACATTTGACCAGCCTGGGCAACATAGTGTGACTCCGAGTCTGCAAAAATTAGACG +GGTGTTGTGGTGCGCGTCTGTGGTCTCAGCTACCTGGAAGGTTCAGGCCTTGGAAGGCTC +AGGGAGGTGGAGGCTGCAGTGATCTGTGATTGCGCCTCTGCACTCCAGCCTGGGCGACAG +AGCCAGACCCTGTCTTAAAACAAAATAAACGGCCGGGCGCGGTGGCTCAAGCCTGTAATC +CCAGCACTTTGGGAGGCCGAGGCGGCCGGATCACAAGGTCAGGAGATCGAGACCATCCTG +GCTAACACGGTGAAACCCCGTCTCTACTACAAATACAAAAAATTAGCCGGGCGTGGTGAC +GGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGTCATGAAGCCGGGA +GGCGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGCGATAGAGCAA +GACTCCGTCTCAAATAAATAAATAAATAAATAAATAAATAATAAAAACATCGGTAGGCAT +ATTTCAAGGAATTCTATTTAAAAAAAATTTTTTTAGAGACAAGTTCGCTCTCTGTGGCCC +AGGCTGGAGTACAGTGGCATGATCCTAGCCCATGGCAGCGTTGATCTCTTGGCCTCAAGC +GACCCTCCTTTGGAGTCGCTGGGCCTAAAGGAGTGAGCCACCACGAAATTTTATTATAAA +TGGAGGGTAGAGAAATTGGGCAATAAATGGAGGGGGAAGTGAGTTAAGAGGAATTTTAAT +TATGTGTGTGTGGTTTTAAAAGAGGGGGGTCTTGCTCTGTTGCCCAGGCTGCTGGGGTGC +CAGTGGCGCAATCATGAATCACTACAGCCTTGGACTCCTGGCCTCAAGCTATCCTCCCAC +CTCTGCCTCCCAAAGTACTGGGATTACTAGTGTGAGCCACTGCACTAAGATAGGAGCAAC +ATGTTTCAGCATGTTTGTGGGTTGATAGGAAAGATGAGAATGGGAAAGTTGATGTCGGAA +AGAAGACAATGGCTAGAGCAATGTCCTAGAGTAGGTAAGAAGGGATGGATTTGGCCTTTG +TTGGAAACATTAGCGGTTCTTTTGGTGACAGCTATATAGTTAACACATCTATGATACGTG +AATGGGCAGATAGGATGGCAGGAGATTTTGAAAGTTCTCTTGATTCTTACTGTTCTCTTA +GTGAAAGAAGCAAGGTTATCAGCTAGAAGCTGGGATGGGAGAGGAAAGAGAAGATGGGAA +GTAGATAGTTCTTTAGAAGAGTGGGCAAGGGTTGGACTAGGGAAGTTTAGTGGAAATATT +GCTAGGCAACATAAAGAGCCTACTTGAGATTCGTGGTCATGAGTTGAAGGAGACCAGACA +GCAAGATTGTGTATGAGGGCACCCACAGAGTAAATGGAGAGTTGAAATTAATGCAGTTGT +GATTTTACCACGTGGATATGAAGAAGTGAGGGGGAGAAGTACAAAGGAGTTCTCTTAATG +ATTGACCATGGAATTTAAGCTGGCTAAGAAAGGAAGTGAGAGGCCGGGCGCGGTGGCTCA +CGCCTGTAATCCCAGCACTTTGGGAGACTGAGGTGGGTGGATTACCTGAGGTCAGGAGTT +TGAGACCAACCTGGCCGATATGGCGAAACCCCATCTCTAATAAAAATACAGAAAAATTAG +CCGGGAATGGTGGCAGGTGCCTGTAATCCCAGCTACTCAAGAGGCTGTGGCAGGAGTATC +CCTTGGACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCACGCCACTGTACTCCAGCCT +GGACGATATAGTGAGACTTCACCTCAAAAAAAAAAAAAAAGAAAGGAAGTGAGGATTTTA +AGACCCTGAGAGACAGTTTAAAAAGTGGGAGGATCGGCCGGGCGCTGTGGCTGACACCTG +TAATCCCAGCACTTTGGGAGGCCGAGTTGGGCAGATCACAAGGTCAGGAGTTCGAGACCA +GCCTGGCCAATATGGTGAAACCTTGTCTCTACTAAAAATACAAAAATTAGCCGGGCATGG +TGTCACGTGTCTATAATCCCAGCTACTCGGGAGGCTGAGGCAGAAAAATTGCTTGAACCT +GGGAGGCAGAGGTTGCAGACAGCTGAGATCACTCCATTGCACTCCAGCCTGGGCAACAAG +AGCAAAACTTTGTCTTTAAAAAAAAAAAAAAAAAAAGAATACAAAAATTAGCCGGGCGTG +GTGGCGCGTGCCTATAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCAGTTGAACA +CGGGAGGCGAGGTTTGCAGTGAGCCGAGATTGCGCCACTGCACTCCAGCCTGGGCGACAG +AGCAGGACTCCTCTTGGAAAAAAAAAATTAGCTGGGCATGGTGGCAGGTGCCTGTAGTCT +CAGCTACTAGGGAGGCTGAGGCAGGAAAATCACTTGAACCCGGGATGTGGAGTTTGCAGT +GACCCGAGATCGTGCCACTGTACTCCATCCTGGGCGACAAAATGAGACTCTGCCTCAAAA +AAAAAAAAAAAAAAAAGTGGGAGGATCAATGTACTGCCAGTCCTAATGAAGTGGAATGAT +TGTCCCCATCAAATCACTAGTAGGAGTAAGTTGCAGAGCCTAGAAGGTGATGGTTAAGAG +AGTGGGATTCTTGAAACTGCATTTATGGAGAGGTTGTGGTTATTGGTTATAATAAATAAA +TACAGTTGAAGTGAGTGAGTAGCTGAGATTTGGGGATGTATCAGTTCATTCTTACACTGC +TACAAAGACATACCTGAGACCAGGTATTTATAAAGATAAGAGGTTTAATCAGCTCACAGT +TCTGCTGCCTGTACAGGCTTCTCTTGTGGAGGCCTAAGGAAACTTACAGTCATGGTGGAA +GGTGAAGGGGAAACAAGCACAGTCTTCACATGGCCAGCAGGAGAGAGAGAGAAGGGGGAA +GTGCTACATACTTTAAAACAACCAGATCTTGTGAGAACGCTTATCAGGAAACAGCACTTG +GGGATGGTGCTAAATCATTAGAAATCACCCCCATGATCCAGTCGCCTCCTACCATGCCCA +CCTCCAACACTGGGGATCACAATTCAGCATGAGATTTGGGTAGGAACACAGAGCTGCACC +ACATCAGAGGATGTACAAGATTGTGGTGGAGAGGAGTTTAGAGACCTGCAAATATAGGGT +AATTGAAGGGATCATCTACATGGATATTTAAATCACCAAAAATTATGACAGGAGTAGTGT +TGGAGAGAGAACTGCGATGTAAACATTAAGGAATGAGGAAGAGTGACTCGGTAGGCTGTA +GGTGACTGCAATAGGAAACGATAATAGACTGTGAGTCTGGTGACAAGATTTTCCTTCTTT +CTTTTTTTCCCCCCCCCCGAGACAGGGCCTCTTTTTGTTGCCCAGGTGGGAGTGCAGTGG +CGCGATCACGGCTCACTACAACCTCCTCCCAAGCTCAAGGGATTCTCCCACTTCAGCCTC +TCAAGTAGCTGGAACTACAGGTGCTGACCACCATGCCTGGCTACTTTTTGTCAGGATTTT +CAAGGCTGGGAATTTTGAGAGGGGAATGGAGGAGAATAATCTGAAAGTGCAAGTAAGGAG +CAGGGAAGATTTCTTTTTTCTTTTTTTTTTTTTTTTTTGAGTCGGAGTCTGGCTCAGTCG +CCCAGGCTGGAGTGCAGTGGCGAGATCTCCGCTCACTGCAAGCTCCGCCTCCCGTGTTCA +CGCCATTCTCCTCCTTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCCGCCACCACGCC +CAGCTAATTGTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAGCCAGGATG +GTCTCAATCTCCTGACTTTGTGATCCGCCCACCCCGGCCTCCCAAAGCGCTTGGGATTAC +AGGCGTGAGCCACCGCGCCAGCCAGAGCAGGGAAGATTTCTTCCCCACATCTCCAGTAGG +TACAGTGATATGAAGTGTGTGGAGGAGAAAAGAGGAAACATCTATCATTTGAGATGGCTG +CGAAAGGAAAAGGCATCCTCAGGGAGCTAGATTTTACTTAGAGCAAGAAATGAAGGGATG +ATTCAGAGGTTAAAAGAGTGGATTTTATGAATTACTCAAGGGAGCACAGTGGAAGTTTCA +GGAAGTGGTAGGAGAAGGTAGAAGATGGCAGGGTGTTGGGAATAATTTGAGAAATCTGAG +CTACTGGAAATGACTGAGAATCAGATATAAAGGCAGTCCTGGTGGTCCGTTCTGGCTGCC +GTTGCTGTGTAACGAATCTGCCAAAACTTAGTGGCTTGAAACAACAAAGAACATTTTATT +ATCTCTCATTGTTTCTGTGGGTTAGGAATTTGTGAGAGCCGTGCTGGGCAGTTTTCGTGC +GGCTGTCTCGTGGTTGCACCTACATAGTTGCTAGAGCTACAGTAGCTGGGGACTGAGCAG +CTAGGGATTGGCAGGCTATCTCTTTTTTTCATGTAGTCTCATGAAGATTTCTTTATGTGG +TTTCAATGTGTGGGCTGGTTTGGATTTCCTTATAGCATGGTGGCCTCAGTTGGATTGCTG +TTTTGTGATCCTTTTCATCCCTCCTTGTCCTGTCCCCAGACAACCACTGATCTACTTTCT +GTCACCATAGATTAGCCTGCATTTTTAAGAATTTTTATAAACGTGGAATGATAGAGTACC +TTTTTTGTCACGTTTCTTTTATTTATCATAGCTATTTTGATTTTCATCCATTTTATTGCT +GAGTAGTATCCCATTGCATGTATATACTATACTGTATTCATTCGCTTGCTTGTGAACATT +TGGGCTTTTTCCAGTTTGGGACTGTTAACAAGTAGAGCCACTATGAATATTAGTGTATAA +GACTTCATATAGCCAAGGCTGGCAGATCGCTTGAGCCCAGGAGTTTGAGACCAGCCTGGG +AAACATGGTGAAACCTCTATTTTTATTTTAAAATCAAAAATTAAAAATTTTCTATAAAAA +ATTTTAAAGAAGACTTTGTATAGACATACGCTTTCATTTTTCTTGAGTGAATACTTAGGT +CTCAGGGTAGATGTATTTTAAGTCTTTAAGGAGCTGTCAAACTCTTCCTCAAAGTGGTGG +TTGTACCATGTTACTTTTTAATATAACAGAGATTAATTGAGCAAAGAAAAATTCAAAAGT +TGGACAGCCCCCACAACTAAATAGGTTCAGAACAGCTCCCCCATTTTGCATTTTGACCAG +CAATGTATGAAAGTTCCATTTGCTCAGTGTCCCTGCAAACACCTGGTATGGTCAGTCTTT +TTAATTTTAGGCATTATAATAGATATAGTGGCTTCTTGTGATTTTAATTAGCATTTCCTA +ATGACCAGTGCTGCTGTTGATCATTTCATGAGTGTATTTGCCATCCGTATATCTTTTTTG +GTGAAGTGTCTATTCAAATCATTTGGGTTTTTTTTTTTTTTGTTTTTTTTTTTTGGAGAC +AGTGTCTCACTCTGTCACCCAGGCTGTTGTGCAGTGGTGCAATCACACAGCCTACTGCAG +CCTCCACCTCCTGCGCTCAGTCTTCTTGTCTCAGCCTTCTGAGTAGCTGAAATTACGAGC +ACACGCCACAATGCCTGGCTAATTTTTTAAAATTTTGTAGAAACAAGGTCTCATTATGTT +GCCTGGGCTTGTCGTGAACTCCTGGGCTCAAGCAATCTTCCTGCCTCAGCCTCCCAAAGA +TTGGGATTGCAAGTATGAGCCACTGCACCCGGCCAACTTACCCATCTTTTAATTGAATTT +TTTTGTTGTTGAGGTTTGAGAGTTCTTCATGTTTGCTGGGTACAATATCTTTATCAGATA +GGTAACTTGCATGTATTTTCTCCCGGTTTACACTTTGGTTTTTCATTTTGTTAACAACGT +CTTTTTAAGAACAGAAAATCTTAATTTTGCTGAAATCTAATTTTTCAGTTTTTTCTTTGA +TGGTTTTGAGAGAGGAGGTAAAAAAAGACTAGGTAAGCCGATAGTTAGACAGAGTCCTCG +GTAGAACTTCCCTTCTAACAAAAAGCAGCCCAAGAAATCACTTCTCTTCTAACAAGGAGC +AGCCTGGAAGATCGGGCTGTAAACATGTATAAGGAAGCAGCTCTGGCACAGAGGGGGAGC +TTCCTGGGTAATCAGCAAGCTTCACATACGTAAGGTGGGTATGTGAAGTAAACACAGTAT +GTGAAGTAAACACAGTGGACCTTAGTACATACTCAGATAAGGAAGCTGGAAGCTTGCATG +TTGTGAGTTGTTGGGGTTGCCTGCAGCTGCACGGAGAGAAAGGGGTACCTGGGGCCAGGC +ATGTCCACCATGGTGGCTCCACCTCCCCTTATTTAGCACATGCACAATAGGAAAGAGATA +AGCAATGTGGAGTAGCTCAGGCCAAGGACCTGCCTGCATAATAAAAGGTTGGGGTGGGGG +ATGCCAGAGATTCACGCTCTGTGCAGATGGCAACACCTGGTCCTAACTGGTTTTTTGCTC +CCTATGTGTAGATAAGCTACCCCCTTCCCATTAGCTCATTTATAAAAATGCTTGCATTTC +ACTGTGGAATGGGAACTCTTTTCAGGACCTCTCTCTGCAGGAGAGAGCTAGTCTCTTTCT +TTTGCCTATTAAACTTCTGCTCTAGCCTCACACCCTTGGTGTGTCAGCGTCCTTGATTTC +CTCAGCGTGAGACCAAGAACCTCGGGTGCCACCCCAGGCAACAAGGCCATTTCAGTTTGT +TCTTTTGTTATAGGCAATCCATGATCACAGATTTTTCTCTCTTTTTTTTTTTTACACAGT +TTAGAGTTTTAGTTTTACACTTAGGTCTGTAATCCATTTTGTATTAATTCTTATATGTGG +CTCAGTGTAGGTGGAAATTTGGTTTGTTTTTGCATAAGGATTTCCAATAGTTTTACCACC +ATTTCTTGAAACTACTATGCTTTCTCTATTAAACCACATTTGTAACTTTAGTTAAAATCA +GTCACATATATCACAGGGCTATTTCTGACTCTCAATTCTGTTACATTGTCTATTAGTGTA +TATTGATGTCAGTACTACACTTTTAATTACTATTGCTTCAGGGTATGTCTTGTAAACCAA +AAATAAAATTATAGGCCCCCCCCGCCCCTGCACAACCAACTGAATGGACCCATCCTCTCA +GCCAAGGGCATTCCAAAATTAACCTGAAAAACTAGTTCAAGCCATGATGGGAAGGGGGAG +TTGGACATGTCTCATCACACCCTACTACCTTTTGGAATTACTGATAGAACAGACTCTTAA +AGTCTGAAAAGAAACATTTACAACCTACCCTCTCTGAAGCCTGCTACCTGGGAGCTTCAT +CTGCATGATAAAACCTTGGTCTCCACAACCCCTTATGGTAACCCAAACATTCCTTTCTGT +TGATAATAACTCTTTCAACTAGTTGCCAATTAGAAAATCTTTAAATCTTCCTATGACCTA +GAAACCTCCCTACCCCCACTTTGAGTTGTCCTGCCTTTCCTGACAGAACTCATGTACATC +TTACATATATTGATTGATGCCTCATGTCTCCCTAAAATGTATAAAACAAAGCTGTACCCC +ACCACCTTGGGGACATGTCATCAGGACCTCCTGTGGCTGTGTCATAGGAGCGTCTTTAAC +TTTGGCAAAATAAACTTTCTAAATTGATTGAAACCTGTCTTAGCTACTTCTGGTTTACAG +TCTTAAAGTTAGATAATGTAAATTGTCCAGCTTTGGTTTATTTTTGTCCTTAGTAGTTCC +ATATAAATTTTAGAATCAGCTTTTCAATTTAATACACTACTTTCCTCTTAGATCCACAAT +TAAATATATTTGATGCTAACAATTCTGTTTTATGTTTTTCGTTTTTTTTTTTTGAGACAA +GAGTTTCGCTCTTGTTGCCCAGGCTGGAGTGCAGTGGCGCGATCTTGGCTCACCACAACC +TCCACCTCCCAGGTTCAAGCAATTCTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAG +GCATGCGCCACCACGCCCGGCTAATTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGT +TGATCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAG +TGTTGGGATTACAGGCGTGAACCACCATGCCTGGCCAGTTCTGTTATTTTTAAAACCCAA +GTTTCCCTGGTCATATCTTGGTTGGATGAAGCGTATTTTCAATAGATTACCCTGGAAAGG +CTAGTGAGTACGGTATTCTTCTACATTTTAGACTTTTCTTAGTCTTGCTACTTCAAGGAC +AGCTAGGCTGCATATAAAATTCTTGGCTCATACTTTTTCCCCATAAATTTCTATGAGAAA +GTCTAATGATAACTGATTTTCTTTATTTTGTAACTTAGTCTTTTTGCTTAGAGGCTCTCT +GAGGATGGGAGGGGGTTCTTCCTCCCATCCCTAGGAATTTTTCTTTTTTTTAAATTCCTA +ATCACTAGACCACCAGGAAGATTGTTTGTTTTGTTTTGTTTTTATTCTTCAGGGACCCCA +TTTATACATACGTTAAATAAATACTGTTTGCCAATGTATCAACCATTTTGCTTCTTATTT +ATTTTTGTTCCTTTGGTTCTTTTTCATGGCTTTGCTTTGGTGCTCCTTAGATTTTCAGTC +AGATGTATTTGTCCTTGGGTACCTTGTAATCAGTATTACCTTTTCTTCTGTCGCTTTGTT +TTCTGTTCGTTTTGAAATTACTTGTTTCCTGGTCTGGCAATAACAGTTGAGATATGAGGA +GTTTGAGCTGCCATCTGTCTATGTATCTTGCTTTAAGACTGCACTCTTCTATTGATATCA +CTGGCCTTGATTTTGTGATTTCTTTATTTCTTCAGGACCACCCTTCATTTTCTACTGTTT +GCTTCCTTTTTTTTTGAGATGGAGTCTCACTCTGTCACTCAGGCTGGAGTGCAGTGATCT +TGGCTCATTGCAACCTCTGCCTCCCGGGTTCCAGCAATTCTCCTGCCTCAGCCTCCCAAG +TATCTGGGACTACAGGTGTGCACCACCATGCCCGGCTAAGTTTTGTATTTTTAATAGAGA +CGGGGTTTTGCCACATTGGCAGGCTGGTCTCAAACTCCTGATGTCAAGTGATCCACCCAC +CCCACCCACCTCTGCATCCCAAAGTGCTGGGATTACAGGAATGAGCTGCCGTGCCCAGCC +TCCCCCCTACCCCCCTTTTTTTCTTTCGAGACAGAGATTATAGGTGTGAGCCACTGGACC +CAGCCTGTTTTTATTCCTTTTACCAAATCTCCAAGGAATATCTTCCCTTCCAAGTGCGAA +TGTAACCTTAAGTCAGTTAACCTCTTTGTGATTACTTTTCTTATCTGCAAAGTGACTTAA +TGATCTTAAGTACTTTTTTTTTTTGAGACAGGGTCTCACTGTCACCCTGGCTGGAGTGCA +GTGGCACGATCTCTGATCTCCACTCACTGCAATCTCCTCTTCCCTGGTTCAAGCGGCCCT +CCCACCTTAGCCTTCTGGGTAGCTGGGACTACAGATGTGAACCACCACGCCCAGCTAATT +TTTGTACTTTTTGTAGAGATGGGGTTTTGCCATGTTGCCCAGGCTGGGATTATTAAGTAC +TTTTTATCATACAGCAAGATTGACATTTTATATTGGAATACATTTGTCTCTATATAACGG +AGATTAACAGGAAAATGACAAGCCTGGGTGCGGTGGCTCATGCCTGTAATCCCAGCACTT +TGGGAGGCTGAGGTGGGAGGATCACTTGAGGTCAGGAGTTCGAGACCAGTTTTGCCAAGA +TGATGAAAGCCCATGTCTACTAAAAATACAAAAATTAGCCCAGCTTGATGGTGGGCGCCT +ATAATCCCAGCTATTTGAGAGACTGAGGCAGGAGAATCACTTGAACCTGGGCAGCAGAGG +TTGCAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGGTGGCATAGCGAGACTCTTG +TCTCAAGAGAAAACAAAACAAAACAAAAAAAAAACAGGAAAATGACAAAAAGTAATATTA +CAACTCAGTGAATTTTATAACAAACTTTTTTGGAATTCATTGACTAATACTATACCAAAT +CCAAAATACTCTCTAGTATACCAAATCCAACTCTACCCTATAGTATAAATTGGATTCTAT +TTGGACTTGTCTCACTAATCCCTCATACAGTGTGTTTTATTTTTTATTGAAGTAAAAAAA +TTTGTCATTTTAACCATTTTTAAGTATATAGTTCAGTAATATTAAGTATGTTCATGTTGT +TGCGCAATAGATCTTCGGAAGTTTTTCGTCTTGCAACCTGAAACTCTACCCATTAGCAAA +TTCCCATTTCTCCTTACACTTAGCCCTTGGTAATCATCATTCTTTTTTTTTTTTTTTTGA +GATGGAGTTTTACTCTTGTTGCCCAGGCTGGAGTGCAATGGTGCAATCTCGACTCACCAC +AACCTCCGCCTCCCAGGTTCAAGCAATTCTACCTCAGCCTCCCGAGTAGCTGGGATTACA +GTCATGCACCACCACGCCCGGCTAATTTTGTATTTTTAGTAGAGAAGGGGTTTCTCCATG +TTGAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCTGCCCACCTCGGCCTCCCAAAGTG +CTGGGATTACAGGCGTGAGCCACTGCGCCTGGCCCATTCTTTCTAATTCTATAAATTTGA +CTACTTAGTTACCTTACATAAATAAATTCTTATAGTTAGTGTTATTTTTGCTTCCATGCC +TTTTTTGTTGTTGTTCATGCTCTTACTTGGAATGCGTTCTATTTTGTCTACCTATGCACA +TCCTGTTGGGTTTTTTTTTTTTTTGGGGGTTTTTTTTGTTTTTTTTTGTTTTTTTTTCCC +AGACAAGGTCTCAATTTGTTACCCAGGCTGGAGTGCAGCGGCGCCATCTCCACTCACTGC +ATCCTCAACTTCCTGGGCCCAGGTGATCCTCTCGCCTCAGCCCCTGCAGGTAGCTGGGAC +TATAGGCATGTGCCACCATGCCCAGCTAAATTTGGTTTTTTTGTTTGTTTGTTTTTGAGA +CAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGCACAATCTCAGCTCACTGCAAT +CTCTGCCGCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGCAGCTGGGATTACA +GGTGACTGCCACCACGCCAGCTAAGTTTTGTAGTTTTAGTAGAGATGGGGTTTCACCTTG +TTGGCCATGCTGGTCTCGAACTCCTGACCTCGTGATCTGCCTGCTTCTGCCTCCCAAAGT +GCTGGAATTACAGGCATGAGCCACCACGCCCGGCCAGAATTTTTGTATTTTTAGTAGACA +CAAGGTTCTTACCCTGTTGCCTAGGCTGGTCTGGAAGTCCTGGACTCAAGCAATTCACCT +GCCTTGGCCTCCCAAAATGCTGGGATTACAAGCCACCATGCCCGGCCTAAATCCTGTTGT +TTTGTTTTGTTTTATTTTGTTTTGTTTTGTTTTGTTTGTTTTTTGAGACAGAGTCTCGCT +ATGTCTCTCAGGCTGTAGTGCAGTGGCGCGATCTTGGCTCACTGCCACCTCTGCCTCCCA +GGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGTGCTAC +TATGTCCGGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCT +GGTCTCGAACTCCTGACCTCGTGATCCACCCACCTCGGCCACCCAAAGTGCTGGGATTAC +AGGCGTGAGTGGTTTTTATTTCTTAGGCCGGTTTCCTCCATATGATCTTGCAGTAGACAT +TAATTTCTTTCCTTTTTAATTAAAATACTGTTTGTATTTCACATTTTGATGTTTGTTAAG +ATTTGTTTTATATTGTTTTTTGTTTTGTCTTGTGTGATAGTCTTAAATCCCTAGTTAGAT +AATAACTGGAGAGTACCATGTTTCTATATATCTCTCAGTGACTTGCACAGTGCTAGCAGA +TAGTGCTAAAAAATTATTTATTATTATTATTATTTTGTTATTGTTGTTGTTGTTGTTAGA +CAGGGTCTTCCTCTGTCACCCAGGCTAGAGGGCAATGGGATGATCATAGCTTACTGCAGC +CTCCAACAACTGGGCTCATGTAATTCTCCTGCCTCAGCTTCCCAAGTAGCTGGGATTACA +GGCATGAGCCACCATGTCTGGACAAAAATATTTCCAGGTGCAGTGGCTCATGCCTGTAAT +TCCCACACTTGGGAGGCCGAGCGAGGCTGGAGGATCACTTGAGCCTAGGAGTTCAAGACC +AGCTTGGCTAAGATGGCGAGACCCCGTCCCTACAAAAAATTTTAAAAACTAGCCAGGCAT +GGTGGCATGCACCTATATTCCCAACTACTCAGTGGGCTGAGGTGGGAGGGTCATTTGAAC +ACAGGAATTTGAGGGGAGAAAAAAAGAAGAGAGAAAGAGAAGTGAAGGAAGGAAGAAAGG +AAGGAGGGAGGGAGAGAAGAAAGAAACGAAAGAAAGGAAAAGAAAAGGAAGGAAAGAAAA +TTGGTACCAGGAAAGCAGGAAAGGGAAATGGAAGTAAAAAAATAATAATAATAATAAAAT +GAAAATTGGTTAGTCACTATTAACAATTTGTATCCTTATAATCTGGAAACATTATAATTT +CAAAAGAAAAAATATTCTTTGGATCATAGGTTCTGAGGTCAGAACAGCATTCCCGTAGTC +TAGATGAAGTCAAGTTTTATCTGATCTTAATTGAAATAAATATAGCTGGCCTTGAACAAA +TCTACTCATGGTATGTGGATAGGAATTAAATTGTAGGGGCATTCACTTGATGGCATTCAT +TCTTAGAACATTTACCTATGTCTAGCTTTTGGAGTAAAGTCACATAACCTCTAACCAGGT +AAGTTTCCTGTGGCTTTATTTAGGATTTTAAATACTCATTTTCAGTGTAATTTTGTTATG +TGTGGATTAAGATGACTCTTGGTACTAACATACATTTTCTGATTAAACCTATCTGAACAT +GAGTTGTTTTTATTTCTTACCCTTTCCAGAGCGATGATTCTGACATTTGGGATGATACAG +CACTGATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGGTATGAAATGCTTGCTTAG +TCGTTTTCTTATTTTCTCGTTATTCATTTGGAAAGGAATTGATAACATACGATAAAGTGT +TAAAGTACATGTTATTCAGTTTTCATTTTGAAGATTAGATGGTAGTATGAGTTAGTTAAA +TCAGGTGATATCCTCCTTTAGAAGTTGATAGCCTATATATGTCATCCTTTGTGGAGGCAA +TTTAAATAAAATTTAAAACATTTATTCCTGGCTGGGTATGGTGGCTCACTCCTGTAATCC +CAGCACTTTGAGAGGCTGAGGCGGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCT +GGCCAACATGGTGAAACCCCGTCTTTACTAAAAATACAAAAATTAGCCAAGCATGGTGGC +ACGTGCCTGTAATCCCAGCTGCTTGGGACACTGAGGCAGGAGAATTGCTTGAACCTGGGG +GGCAGAGGTTGCAATGATTGCACCACTGCACTCCAGCCTGGGCGATAGAGTGAGACTCCA +TCTCAGAAAACGAACAAACAATGTATTCCTTTTAGTATTTTTACATTGTATCAAACTATG +GAAGTCCTCTAATTGAGATTAATAAGAAAAAGACAATCTGAATTATAATTTTAAACATTT +AACAAGCATGTAGTAAAATAATGATGAAGATAAATAGCATTAGTACAGCAATTAATATTT +GTAGCATGCTGACAGTGCTCTGTGTGCGTTTCATATATTAAATTACTCTAATCATCCCAA +ATCCTGTAAGTTGGGTATCAATTCAAGTGTTCCTATTGGGTAGGAATATACAGTTCTTTT +AGGAAATGTAGTATGGTTCTGTGTCTCAAACAGGACACTTACACAGTTGGCCAACATCAT +CACCTTCTCCATTCTCTGAGATGTTTAGTCTTACTGAGCACTAAATATGGGTCATCAATA +GTCCAGACTACCTTGAGCAAACAATAGTCCAGACTACCTTGAGCAAACAGAGCATATACT +CATACAGTGTATAAAGAGCACCAAGCATACAGATTTCATGTCTTTCTCATAGTTACTCTT +GTAACATGAGCTAAAGATCAGACCTCTATGTCACCTTTGTAACTGATTTCTAGATTTTTT +TTTTTTTTTGAGATGGGGTCTTGCCCTGTCACCCAGGCTGGAGTGTAGTGGCGTGATCAT +GCCTCATTGGAGCCTTCAACTCATGAGCTCAAACAATCCTCCTACCTCAGCTTCCTGAGT +AGTTGGGACCACAGGTGTGTGCCACCACACCCAGCTCATTTTTGTATTCTTTGTAGAGAT +GCAGTCTCACCCTGTTGCCCACGCTGGCCTGGAACTCCTGAGCTCAAAAGATCCCTCCGC +CTTGACCTTCCAAAGTGCTGGGATTACAAGCATGAACCACTGCACCCGGCCTAGATTTTT +AAATGTGCTTTCCAGTATACACTGAAACTAGAAGTCGACTAAAGAATTACCAAGAGAATT +CTATAAAATAGAGATTGAAATGGGGCTCGATGTGGGATGGGTTGGTGATATTGCAGGGAG +AAGTAATCTGAGTAAAGGAGGAAAAGAACTGATTTGGGAAAACGATAGTTTTAGTAGTGA +GTTTGAGTATGAATTAAGTTGAGATTGAATTTGAATTAAGTTGAGGTTGAATATGAATTA +AGTTGAGGTTGAGTTTGAGGTATGAATTAAGATGTGAAATTGATCATTGGAAATGTTAGA +TTGAGAAAAGTCACAGCTGGATTAATAGCTTCAGAAGTGTGTTTGCAGACAGTTGCAACT +AAAGTAATAAGAATAGATGGCCTTGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGTA +CTTTGGGAGGCTGAGGCGAGCAAATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCCCAC +ATGGTGAAACCCCGTCTTTATTAAAAATACAAAAATTAGCTGTGCACAGTGGTGCACGCC +TGTAATCCCAGCTACTCGGGAGGCTGAGACAGGAGAATCGCTTGAACCTGGGAGGTGGAG +GTTGCAGTGAGCTGAGATCAGTGTGACTGCACTCCAGCCCGGTGACAGAGTGAGACTCTG +TGTAAAAAAATAAAATAAATAAAATAATGGCCGTAAGCAAGTAAAGAAGGATGGCCAGCT +CTTATTGGGAATGCCTAAATCTAAGGCTTGATCAGAAGTAATGAAACCGTTGGGGCCCTA +CATTGCTATGACATCCAAAGGGCCATGAATATCAGGAAGAAAGATAATTAACAGGGTCTA +ATGTTACAGAGAGGTTGAGAGCAAGGAGATTTGATTAAAAGGGTCTTTAGAGCTGATGTC +AGGTGTATGATGCCTTTAAGAGCAGTTTTTATAGTGCAGGGGGTGGTCAAAAGAGAAAAT +AGGTGCTTTCTGAGGTGACGGAGCCTTGAGACTAGCTTATAGTAGTAACTGGGTTATGTC +GTGACTTTTATTCTGTGCACCACCCTGTAACATGTACATTTTTATTCCTATTTTCGTAGC +ATGCTCTAAAGAATGGTGACATTTGTGAAACTTCGGGTAAACCAAAAACCACACCTAAAA +GAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAAGAATACTGCAGCTTCCTTACAACAGG +TTATTTTAAAATGTTGAGATTTAACTTCAAAGGATGTCTCATTAGTCCTTATTTAATAGT +GTAAAATGTCTTTAACTTAAGTGATTAGTACAGTGTTTCTATTGACATATACTTATACAA +CTTCAAAAACAACTATTAAATTTTCTGTTATTTAGGAACATGCATATTAGTCATGAAAGT +ATAAAGAATTAGATGGGAATGATAAATGCTAAAATCAGGACATGTGTTCCATTTGTGAAT +GGAAGGCAGGGAGAAGGTGCCGTTTGGAAGGAGTACCCAAGAGCCGTAAGCTGAATTGGC +AGTGTTTTACATCTTAAGCTGAGAGATAGATTTTTTTTTCCCCTTTTTCTTTAAAAACTC +TAAAACTGTTAATTCCAAGGAACCCAGAAGTCTAGGTAGATTATTTCTGCTAGTTAAAAG +CAGTAGTCCTGAAAGCTGAATATTTTGGTGTCTTTTGAGCCAACTTTAGTTTCATCATTA +CCAAGGGGGAAGAGAGCTAACAGTTGATGAGCACTTGCTCTAGGCCAGTCCAGAGTGCTG +GGCACCATACGCATTTTATCTCCCTCCCGCTATTCACAACAAATATGGGAGGTAGTTTAT +ATTATAGCCATCTAATAAGATGGGGAAACTAAGACTCAAAGAGATTCAGAAACTTGTCCA +TGATTATAAATGTAAGAGAGTTGGAATTCAGATTTATGTATTTAGACCCCAAGCCTTTCT +CATTACATCATTTTGCCTTCCAAATCTCTACCCTCTATCCTTCACCTCCCCACTGATCAA +AACGAGATGATAGTTTGCCCTCTTCAAAAGAAATGTGTGCATGTATATATCTTTGATTTC +TTTTGTAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGTCAGAAGACGGTTGCATTT +ACCCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGGTTTACACTG +GATATGGAAATAGAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAG +CTAATAATATAGAACAAAATGCTCAAGAGGTAAGGATACAAAAAAAAAAAAATTCAATTT +CTGGAAGCAGAGACTAGATGAGAAACTGTTAAACAGTATACACAGTTGTCAGTTTGATCC +ACCGAGGCATTAATTTTTTCTTAATCACACCCTTATAACAAAAACCTGCATATTTTTTCT +TTTTAAAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCTC +CTGGAAATAAATCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCTCCCTC +CACCACCCCCCATGCCAGGGCCAAGACTGGGACCAGGAAAGGTAAACCTTCTATGAAAGT +TTTCCAGAAAATAGTTAATGTCGGGACATTTAACCTCTCTGTTAACTAATTTGTAGCTCT +CCCATGAAACTTTTGTAGCTTAAATACACAAGAATTTTTTGAAAAGGAAATAAGATAATG +ATGCAAAATAGTTAATTTTTTAAAAAAATGTTAGACACTGCAGTGGATGCAACAAAATAC +TTTATATGAAAGATTTATCCAGTTAACTTTTGTGGAGTATTAGGTATTAGACTAATAATT +AGCACACTTACTTAAGTTAGAAAGTATAATAATGCGCCGGACGCGGTAGCTCACGCCTGT +AATCCCAGCACTTTGGGAGGCCAAGGTGGGCGGATCACAAGGTCAGGAGATCGAGACCAT +CCTGGCTAACACGGTGAAACCCCATCTCTACTGAAAATACAAAAAAATTTGCCGGGCGTG +ATGGCGGGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGGATGGTGTGAACC +CCGGAGGCAGAGCTTGCAGTGAGTCAAGATCGTGCCACTGCACTCCAACCTGGGCGACAG +AATGAGACTCCATCTCAAACAAAAAAACAAAACAAAACAAAAAAAAGTGTAATAATAATT +TATCATTAGCTGGATGATATGCTGTTGTTTCCCATGTCACCTGTATAAGATATGTAAAAT +AAGAACACATTATTTACATCTAATATAGATAAAATCCTGAGGCGCTCTCAGATTGTTTTG +TAGAGTTCAAATGTAAATATTGTTTTCATTTATGGTCCTTTTGGTTATAAGTAACAGAAA +TCAACTCTAAAAAGATTTTTATTATAGGTTAGATTATGTCATGGAACCTTAAGGCTTGTC +CCTTTCTAGTTCTTTTGTGTAAAGCGGTGATTTCTTCCATGGAGGGAATGGTATTTAGGC +AATTTTTTTTTTTTTTTCGAGATGGAGTCTTGCTCTGTCGCTCAGGCTGGAGTGCAGTGG +CACCATTTCAGCTCACTGCAACTTCCACCTCCTGGGTTCAAGTGATTCTCCTGCTTCAGC +CTCCCAAGTAGCTGAGATTACAGGCACCCGCCACCACACCCGGCTTATTTTGTATTTTTA +GTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAAGTGAT +CTCCCCACCTTGGCCTTCCAAAGTGCTAGGATTACAGGCGCCTAGCCTAGGCAGTCATTT +TCAAAAAACAAGCATGACTCACCAAAAGTTTTAAGATTTTCTGTGATAATGTTCTTATTG +AGGCTTACATTATATTACAGTTTCTTGAATCTAAAATGATGTACCCTCTTAGAATATATA +CATCATGCTTCATTGGTCTCAGGGGGCTGATTTTTATAAGGAGAGATTTGCTAGTTTTCA +CAATATGTCCTCTAAGTTGGCATGTATAGCTAAACAGGCTTTCATAAAAATATACAATTT +AGTTAATGAAATTTGGGATATAGTCTTTTATGATTGAAATAATTTTGCTAAATAGACTGT +CTCTGATTTATTAGGTAATCACCACTCTTATTTTGTTTTACTTCCTTAATGTCTACATAG +AAAGGAAATGAGAAAAATCCAGAGGTTGTCATTTGACTTATGAGTCTGTTTGACTTCAGG +ATTTGGTACATGAAATTTCACTTAATCTTTTTGATATGTATAAAACAAATATTCTGGGTA +ATTATTTTTATCCTTTTGGTTTTGAGTCCTTTTTATTCCTATCATATTGAAATTGGTAAG +TTAATTTTCCTTTGAAATATTCCTTATAGCCAGGTCTAAAATTCAATGGCCCACCACCGC +CACCGCCACCACCACCACCCCACTTACTATCATGCTGGCTGCCTCCATTTCCTTCTGGAC +CACCAGTAAGTAAAAAAGAGTATAGGTTAGATTTTGCTTTCACATACAATTTGATAATTA +GCAGAATAGAGGATTGTAAAATGTCATTGTAGAACATCCCTTGGGCCAGATTCTAATGGG +TAGAAATTTGAACTAAACCTCTGGGTTTTGTTTGTTTTTAATGCCTTTCTGTTACCCAGA +TGCAGTGCTCTTGTAGTCCCAAGTCTAAGCTCTAGGTTGCCTTCTTTCCTGGCAGAAGTT +GGTGTCTATGCCATAAGGAGGTAGTTCCTGTTAGAAGGGATTTAATTATACCTTATATAA +GGAATTAGTGTTTGCCCTTCTAGGTATAGTTGGATGTTAGCTTCTGATGTAAACTGGATT +TCTTTTTCTTTCTCTCTCTTTTTTTTTTTTTGTTTTGGAGGCAGAGTTTTGCCCTTGTAC +CCCAGGCTGGAGTGCAGTGGTGTGATCTCAGCTCACAGCAACCTCCGCCTCCTGGGTTCA +AGCAATTCTGCCTCGGCCTCCCAAGTAGCTGGGATTACAGGCGACTGCCACCACACCCGG +CTAATTTTTGTTTTATTAGTAGAGATGGGGTTTCACCATGTTGGCCAGACTGATCTTGAA +CTCCTGACCTCAGGTGATCCACCCGCCTTGGCCTCCCAAAGCGCTGGGATTACAGGCGTG +AGCTGCCGCACCCAGCTGTAAACTGGATTTCTAATGGTAGATTTTTAGGTATTAACAATA +GATAAAAAGATACTTTTTGGCATACTGTGTATTGGGATGGGGTTAGAACAGGTGTTCTAC +CCAAGACATTTACTTAAAATCGCCCTCGAAATGCTATGTGAGCTGTGTGTGTGTGTGTGT +GTGTGTGTGTATTAAGGAAAAGCATGAAAGTATTTATGCTTGATTTTTTTTTTTTACTCA +TAGCTTCATAGTGGAACAGATACATAGTCTAAATCAAAATGTTTAAACTTTTTATGTCAC +TTGCTGTCTTTTCGTCCTCGTTAAATTTAATTTTGTTGGTCTTTTGTTGTTATTGGTTGG +TTTTCTCCAAATGCTAGCTATGTTAAGAAATTTAAGGCCAGGTACAGTGGCTCATGCCTG +TAATCCCGGCATTTTAGAAGGCTGAGGCAGGAGGATCACTTGAGCTCAGGAGTTTGAGAC +CAGTCTGGGCAACATAGCAAGACCTCGTCTTTGTTTAGGGGAAAAAAAAGAAATTTAAGT +AGGAGATTATATAAGCAAAAATACAATTAATTTCCAGCATTCACTATATAATATAAATCT +CCAGACTTTACTTTTTTGTTTACTGGATATAAACAATATCTTTTTCTGTCTCCAGATAAT +TCCCCCACCACCTCCCATATGTCCAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTAT +GTTAATTTCATGGTACATGAGTGGCTATCATACTGGCTATTATATGGTAAGTAATCACTC +AGCATCTTTTCCTGACAATTTTTTTGTAGTTATGTGACTTTGTTTTGTAAATTTATAAAA +TACTACTTGCTTCTCTCTTTATATTACTAAAAAATAAAAATAAAAAAATACAACTGTCTG +AGGCTTAAATTACTCTTGCATTGTCCCTAAGTATAATTTTAGTTAATTTTAAAAAGCTTT +CATGCTATTGTTAGATTATTTTGATTATACACTTTTGAATTGAAATTATACTTTTTCTAA +ATAATGTTTTAATCTCTGATTTGAAATTGATTGTAGGGAATGGAAAAGATGGGATAATTT +TTCATAAATGAAAAATGAAATTCTTTTTTTTTTTTTTTTTTTTTTGAGACGGAGTCTTGC +TCTGTTGCCCAGGCTGGAGTGCAATGGCGTGATCTTGGCTCACAGCAAGCTCTGCCTCCT +GGATTCACGCCATTCTCCTGCCTCAGCCTCAGAGGTAGCTGGGACTACAGGTGCCTGCCA +CCACGCCTGTCTAATTTTTTGTATTTTTTTGTAAAGACAGGGTTTCACTGTGTTAGCCAG +GATGGTCTCAATCTCCTGACCCCGTGATCCACCCGCCTCGGCCTTCCAAGAGAAATGAAA +TTTTTTTAATGCACAAAGATCTGGGGTAATGTGTACCACATTGAACCTTGGGGAGTATGG +CTTCAAACTTGTCACTTTATACGTTAGTCTCCTACGGACATGTTCTATTGTATTTTAGTC +AGAACATTTAAAATTATTTTATTTTATTTTATTTTTTTTTTTTTTTTGAGACGGAGTCTC +GCTCTGTCACCCAGGCTGGAGTACAGTGGCGCAGTCTCGGCTCACTGCAAGCTCCGCCTC +CCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTCCGAGTAGCTGGGACTACAGGCGCCCG +CCACCACGCCCGGCTAATTTTTTTTTATTTTTAGTAGAGACGGGGTTTCACCGTGGTCTC +GATCTCCTGACCTCGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAAGCGT +GAGCCACCGCGCCCGGCCTAAAATTATTTTTAAAAGTAAGCTCTTGTGCCCTGCTAAAAT +TATGATGTGATATTGTAGGCACTTGTATTTTTAGTAAATTAATATAGAAGAAACAACTGA +CTTAAAGGTGTATGTTTTTAAATGTATCATCTGTGTGTGCCCCCATTAATATTCTTATTT +AAAAGTTAAGGCCAGACATGGTGGCTTACAACTGTAATCCCAACAGTTTGTGAGGCCGAG +GCAGGCAGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGATGAAACCTT +GTCTCTACTAAAAATACCAAAAAAAATTTAGCCAGGCATGGTGGCACATGCCTGTAATCC +GAGCTACTTGGGAGGCTGTGGCAGGAAAATTGCTTTAATCTGGGAGGCAGAGGTTGCAGT +GAGTTGAGATTGTGCCACTGCACTCCACCCTTGGTGACAGAGTGAGATTCCATCTCAAAA +AAAGAAAAAGGCCTGGCACGGTGGCTCACACCTATAATCCCAGTACTTTGGGAGGTAGAG +GCAGGTGGATCACTTGAGGTTAGGAGTTCAGGACCAGCCTGGCCAACATGGTGACTACTC +CATTTCTACTAAATACACAAAACTTAGCCCAGTGGCGGGCAGTTGTAATCCCAGCTACTT +GAGAGGTTGAGGCAGGAGAATCACTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCCGAGA +TCACACCGCTGCACTCTAGCCTGGCCAACAGAGTGAGAATTTGCGGAGGGAAAAAAAAGT +CACGCTTCAGTTGTTGTAGTATAACCTTGGTATATTGTATGTATCATGAATTCCTCATTT +TAATGACCAAAAAGTAATAAATCAACAGCTTGTAATTTGTTTTGAGATCAGTTATCTGAC +TGTAACACTGTAGGCTTTTGTGTTTTTTAAATTATGAAATATTTGAAAAAAATACATAAT +GTATATATAAAGTATTGGTATAATTTATGTTCTAAATAACTTTCTTGAGAAATAATTCAC +ATGGTGTGCAGTTTACCTTTGAAAGTATACAAGTTGGCTGGGCACAATGGCTCACGCCTG +TAATCCCAGCACTTTGGGAGGCCAGGGCAGGTGGATCACGAGGTCAGGAGATCGAGACCA +TCCTGGCTAACATGGTGAAACCCCGTCTCTACTAAAAGTACAAAAACAAATTAGCCGGGC +ATGTTGGCGGGCACCTTTTGTCCCAGCTGCTCGGGAGGCTGAGGCAGGAGAGTGGCGTGA +ACCCAGGAGGTGGAGCTTGCAGTGAGCCGAGATTGTGCCAGTGCACTCCAGCCTGGGCGA +CAGAGCGAGACTCTGTCTCAAAAAATAAAATAAAAAAGAAAGTATACAAGTCAGTGGTTT +TGGTTTTCAGTTATGCAACCATCACTACAATTTAAGAACATTTTCATCACCCCAAAAAGA +AACCCTGTTACCTTCATTTTCCCCAGCCCTAGGCAGTCAGTACACTTTCTGTCTCTATGA +ATTTGTCTATTTTAGATATTATATATAAACGGAATTATACGATATGTGGTCTTTTGTGTC +TGGCTTCTTTCACTTAGCATGCTATTTTCAAGATTCATCCATGCTGTAGAATGCACCAGT +ACTGCATTCCTTCTTATTGCTGAATATTCTGTTGTTTGGTTATATCACATTTTATCCATT +CATCAGTTCATGGACATTTAGGTTGTTTTTATTTTTGGGCTATAATGAATAATGTTGCTA +TGAACATTCGTTTGTGTTCTTTTTGTTTTTTTGGTTTTTTGGGTTTTTTTTGTTTTGTTT +TTGTTTTTGAGACAGTCTTGCTCTGTCTCCTAAGCTGGAGTGCAGTGGCATGATCTTGGC +TTACTGCAAGCTCTGCCTCCCGGGTTCACACCATTCTCCTGCCTCAGCCCGACAAGTAGC +TGGGACTACAGGCGTGTGCCACCATGCACGGCTAATTTTTTGTATTTTTAGTAGAGATGG +GGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCTGCCTGCCTAG +GCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCTGGCCTTAAGTGTTTTTA +ATACGTCATTGCCTTAAGCTAACAATTCTTAACCTTTGTTCTACTGAAGCCACGTGGTTG +AGATAGGCTCTGAGTCTAGCTTTTAACCTCTATCTTTTTGTCTTAGAAATCTAAGCAGAA +TGCAAATGACTAAGAATAATGTTGTTGAAATAACATAAAATAGGTTATAACTTTGATACT +CATTAGTAACAAATCTTTCAATACATCTTACGGTCTGTTAGGTGTAGATTAGTAATGAAG +TGGGAAGCCACTGCAAGCTAGTATACATGTAGGGAAAGATAGAAAGCATTGAAGCCAGAA +GAGAGACAGAGGACATTTGGGCTAGATCTGACAAGAAAAACAAATGTTTTAGTATTAATT +TTTGACTTTAAATTTTTTTTTTATTTAGTGAATACTGGTGTTTAATGGTCTCATTTTAAT +AAGTATGACACAGGTAGTTTAAGGTCATATATTTTATTTGATGAAAATAAGGTATAGGCC +GGGCACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGATCAC +CTGAGGTCGGGAGTTAGAGACTAGCCTCAACATGGAGAAACCCCGTCTCTACTAAAAAAA +ATACAAAATTAGGCGGGCGTGGTGGTGCATGCCTGTAATCCCAGCTACTCAGGAGGCTGA +GGCAGGAGAATTGCTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCCGAGATCACCTCATT +GCACTCCAGCCTGGGCAACAAGAGCAAAACTCCATCTCAAAAAAAAAAAAATAAGGTATA +AGCGGGCTCAGGAACATCATTGGACATACTGAAAGAAGAAAAATCAGCTGGGCGCAGTGG +CTCACGCCGGTAATCCCAACACTTTGGGAGGCCAAGGCAGGCGAATCACCTGAAGTCGGG +AGTTCCAGATCAGCCTGACCAACATGGAGAAACCCTGTCTCTACTAAAAATACAAAACTA +GCCGGGCATGGTGGCGCATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAAT +TGCTTGAACCGAGAAGGCGGAGGTTGCGGTGAGCCAAGATTGCACCATTGCACTCCAGCC +TGGGCAACAAGAGCGAAACTCCGTCTCAAAAAAAAAAGGAAGAAAAATATTTTTTTAAAT +TAATTAGTTTATTTATTTTTTAAGATGGAGTTTTGCCCTGTCACCCAGGCTGGGGTGCAA +TGGTGCAATCTCGGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGTGATTCTCCTGCCTC +AGCTTCCCGAGTAGCTGTGATTACAGCCATATGCCACCACGCCCAGCCAGTTTTGTGTTT +TGTTTTGTTTTTTGTTTTTTTTTTTTGAGAGGGTGTCTTGCTCTGTCCCCCAAGCTGGAG +TGCAGCGGCGCGATCTTGGCTCACTGCAAGCTCTGCCTCCCAGGTTCACACCATTCTCTT +GCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGCCCGCCACCACACCCGGCTAATTTTT +TTGTGTTTTTAGTAGAGATGGGGTTTCACTGTGTTAGCCAGGATGGTCTCGATCTCCTGA +CCTTTTGATCCACCCGCCTCAGCCTCCCCAAGTGCTGGGATTATAGGCGTGAGCCACTGT +GCCCGGCCTAGTCTTGTATTTTTAGTAGAGTCGGGATTTCTCCATGTTGGTCAGGCTGTT +CTCCAAATCCGACCTCAGGTGATCCGCCCGCCTTGGCCTCCAAAAGTGCAAGGCAAGGCA +TTACAGGCATGAGCCACTGTGACCGGCAATGTTTTTAAATTTTTTACATTTAAATTTTAT +TTTTTAGAGACCAGGTCTCACTCTATTGCTCAGGCTGGAGTGCAAGGGCACATTCACAGC +TCACTGCAGCCTTGACCTCCAGGGCTCAAGCAGTCCTCTCACCTCAGTTTCCCGAGTAGC +TGGGACTACAGTGATAATGCCACTGCACCTGGCTAATTTTTATTTTTATTTATTTATTTT +TTTTTGAGACAGAGTCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGTAAATCTCAGC +TCACTGCAGCCTCCGCCTCCTGGGTTCAAGTGATTCTCCTGCCTCAACCTCCCAAGTAGC +TGGGATTAGAGGTCCCCACCACCATGCCTGGCTAATTTTTTGTACTTTCAGTAGAAACGG +GGTTTTGCCATGTTGGCCAGGCTGTTCTCGAACTCCTGAGCTCAGGTGATCCAACTGTCT +CGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGTGCCTAGCCTGAGCCACCAC +GCCGGCCTAATTTTTAAATTTTTTGTAGAGACAGGGTCTCATTATGTTGCCCAGGGTGGT +GTCAAGCTCCAGGTCTCAAGTGATCCCCCTACCTCCGCCTCCCAAAGTTGTGGGATTGTA +GGCATGAGCCACTGCAAGAAAACCTTAACTGCAGCCTAATAATTGTTTTCTTTGGGATAA +CTTTTAAAGTACATTAAAAGACTATCAACTTAATTTCTGATCATATTTTGTTGAATAAAA +TAAGTAAAATGTCTTGTGAAACAAAATGCTTTTTAACATCCATATAAAGCTATCTATATA +TAGCTATCTATATCTATATAGCTATTTTTTTTAACTTCCTTTATTTTCCTTACAGGGTTT +TAGACAAAATCAAAAAGAAGGAAGGTGCTCACATTCCTTAAATTAAGGAGTAAGTCTGCC +AGCATTATGAAAGTGAATCTTACTTTTGTAAAACTTTATGGTTTGTGGAAAACAAATGTT +TTTGAACATTTAAAAAGTTCAGATGTTAGAAAGTTGAAAGGTTAATGTAAAACAATCAAT +ATTAAAGAATTTTGATGCCAAAACTATTAGATAAAAGGTTAATCTACATCCCTACTAGAA +TTCTCATACTTAACTGGTTGGTTGTGTGGAAGAAACATACTTTCACAATAAAGAGCTTTA +GGATATGATGCCATTTTATATCACTAGTAGGCAGACCAGCAGACTTTTTTTTATTGTGAT +ATGGGATAACCTAGGCATACTGCACTGTACACTCTGACATATGAAGTGCTCTAGTCAAGT +TTAACTGGTGTCCACAGAGGACATGGTTTAACTGGAATTCGTCAAGCCTCTGGTTCTAAT +TTCTCATTTGCAGGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAACGA +TCAGACAGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCTCATTTCTTCAAAATA +TCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACTCTTCTTGATTAAAAGTTAT +GTAATAACCAAATGCAATGTGAAATATTTTACTGGACTCTATTTTGAAAAACCATCTGTA +AAAGACTGAGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAGTTGAGAAAATTTGAAT +GTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGAGCTGTGAGAAGG +GTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAGTG +TTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAATGTATGTGAGGCGTATGTGGCA +AAATGTTACAGAATCTAACTGGTGGACATGGCTGTTCATTGTACTGTTTTTTTCTATCTT +CTATATGTTTAAAAGTATATAATAAAAATATTTAATTTTTTTTTAAATTAGCTGTATCTG +TGATTGTATTTCTTTTTTGCATATTATTTTGCCCTTTGGCCCATATTTTGATATGGATGC +CACCATAGCATTTTGTGTATGTGCATGTGTATTCCCACTTAATGTCACATTTTTCATGTC +TTTACATATTCTTATTTTTGTTTGTTTTTGAGACAGAGTCTCGCTCTGCTGCCCACGCTG +GAGTGCAGTGGTGCAATCTCAGCTCACTGCAACCTCTGCTATCCGGGTTCAAGCAGTTCT +CGTGCCTCACCCACGTGAGTAGTTGGGATTACAGGCATGTGGCACCATGCCCCACTAAGT +TTTGTATTTTTAGTAGAGATGGAGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTG +CCCTCAAGTGATTCGACCACCCTGGCCTCCCAAAGTGCTGGGATTACAGCCGTGAGCCAC +CGCACACGGCCTCTCTATTTATTTCTATACATAGCTTTTCACATTATATTATGTTTATAT +ATTGTTTATATCTGTATTTCCTCTTTCATTAGAGAAAAGGTAGTACATCTTATTCTTCAT +GGTGTCTACAATATCTGGCAGTTTTTGGAAGTCAAGCGTGAGCTTAGAGCATAGACTGGT +GGGATTGTCAAAGAAGAGGGCAACTGGAAGAGAACTGTCAGTTATTTTTGGATCAGTCTT +TAATTCATCATGACGGGTTAGGCATTAGTTGTATTTCTTGCTAATTTTGAAGAAGACTTA +TTAACAAATCCTACATTAGGTAAATGGTTTTGAAAGTTGAGTTAATCATAATGGTGTTTG +ACCTAGGACTATTTTTAGGCCCTATTTATCTTAATATCGAATAATGAAGCAGCTTCCCCC +TTAGATATAGACAGAAAACATCAAAGCCACCACACTACCTGGCTGGATTTATCCTAGTAA +TAAAATCAAAACTGAGCTAGTTCTCTGGCTTTCATTGTAATAATTGTCCTTGTGGTTGTA +AGGAATCTAGATGAAAATTACATGGTCTGTTCTACAGCCACAGCTGTACCTACATTCAGA +AGACAGACAAAAGTTGCTGTGTTTGAAGAGATCCTTCATTAAGGGATCAGACAGAGATTA +CTTTGAGACATATTCTAAGTTTAACTTTTCTGCAGGGTTGCCATTAACAGAAATAAACTA +CAGAGTTAATTTCTTTTTGTTTTTGATACAGTCTAACTCTCACCCAAGCTGGAGTGCAGT +GGCGCAATTTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCTCA +GCCTCCCGAGCAGCTGGGACTACAGGCATGTGCCACTATGCCTGGCTAATTTTTGTATTT +TTAGTAGTAGAGACGTGGTTTCGCCACGTTGGCCAGGCTGGTCTGGAACTCCTGACCCCA +GGTAATCCACCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAAGCTTGAGCCACTACGCC +TGACCCAGAGTTAACTTTTTAAAAAAGTTTTTATGAACTTAAGTCTTGTGATGTTTGAAA +TAATGGATTCAATTTAGACATCAAATTCCAGAAGTTACTAAGAGCAGCTGGGCGCGGCAG +CTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTGAGATCAGG +AGTTCCAGACCAGCCTGGCCAACATAGTAAAACCCTGTCTCTACTAAAAATACAAAAATT +AGCCCGGCATGGTGGCACGCCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAAT +TGCTTGAACCCGGGAGGTGGAGGTTGTGGTGAGCCGAGATTGTGCCACTGTACTCAAGCC +TGGGCTAAAAAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAACACGTTACTAAGAG +CAACTCTGGGCCAGGCACGGTGGCTTACACCTGTAATCCCAGCATTTTGGGAGGACGAGA +CAGGCGGATCACTTGAGCCCAGGAGTTCAAGACCAGCATAAGCAACAACGCAAAACCCCT +GACTCTACAAAACATGAAAGAATTAGCAAGGCATGGTGGTGCATGCCTGTAGTCCCAGCT +ACTGGAGAGGCTGAGGCAAAAGGATCACTTGAGTACAGGAGGTTGAGGCTGTGTAATGAG +CCGTGTTCACACCATTGCACTTCAGCCTGGGCAACAGACTGAGACCCTGTCTCAAAAAAA +AAAACCAAACCAAAGCAACAAACAAAAAACAAGAGCAACTCTGCTTCTGTACACTTTTTT +TTTTTTTTTTGGTAGTGACATGATCTATGTTGCCCAAGCTGGTCTCGAGTTCCTGGGTTC +AAGCCATTCTCCCACCTCGGGCTCCCAAAGTGCTAGGATTACAGGCATGAATCACCATGC +CCAGCCCTTCTGTACACTTTTCACAGTGTACCCTTTTGTGTTTTTTAAAATGTTTGTGTA +TACATTTATTGTGAATTTTTAAAAAACATGTAATTAAGGCCAGGCATGGTGGCTCATACC +TGTAATCCTAGCACTTTGAGAGGCTGAGGTGGGTGGATCACCTGAGGTCGGTAGTTCGAG +ACCAGCCTGGCCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAAAAAAATTAG +CTCGGCATGGTGGTGGGCGCCTGTGATCCCAGCTACTGGAGAGGCTGAAGCATGAGAATC +ACTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCCAAGATCGTGCCACTACACTCCAGCCT +GGGTGACTCAGTGACTGTCTCAAAAAGAAAAAAAGTAATTAAGTTCTGTCATGATATATC +ATCATTACCCTTTTTGAACTTTTAAAATTTTTTATCTTTAGAGGTAATTCATATAATGTT +CTTCAATAGATAAGTGCTTTTCTGTCAATATATCTTGGAGAACACACCATATCAGTATTT +AAAACTCTCATTCTTCCTATTTCTCCACATCCTCTCCAGCACCCGTTGTTTCCTGACTTT +TTAATGATTGCCATTCTAACTGGTGTGAGATGGTATCTTATTGTGGTTTTGATTTGCATT +TCTCTGATGGCCAGTGATGGTGAGCATTTTTTCATGTGTTTTTTAGCTGCATAAATGTCT +TCTTTTGAGAAGTGTCTGTTCATGTCCTTCGCCCACTTTTTGATGGGGTTGTCTGTTTTT +TTCTTGTAAATTTGTTTGAGTTCATTGTAGATTCTGGATATTAGCCCTTTGTCAGATGAG +TAGATTGCAAAAATTTTCTCCCATTTTGTAGGTTGCCTGTTCACTCTGATGGTAGTTTCT +TTTGCTGTGCAGAAGCTCTTTAGTTTAATTAGATCCCAGTTTTGGCTTTTGTTGCCGTTG +CTTTTGGTGTTTTAGACATGAAGTCCTTGCCCATGCCTATGTCCTGAATGGTAATGCCTA +GGTTTTCTTCTAGGGTTTTTATGGTTTTAGGTCTAACATTTAAGTCTTTAATCCATCTTG +AATTAATTTTTGTATAAGGTGTAAGGAAGGGATCCAGTTTCAGCTTTCTACATATGGCTA +GCCAGTTTCCCAGCACCATTTATTAAATAGGGAATCCTTTCCCCATTGCTTGTTTTTCTC +AGGTTTGTCAAAGATCAGATAGTTGTAGATATGCGGCGTTATTTCTGAGGGCTCTGTTCT +GTTCCATTGGTCTATATCTCTGTTTTGGTACCAGTACCATGCTGTTTTGGTTACTGTAGC +CTTGTAGTATAGTTTGAAGTCAGGTAGCGTGATGCCTCCAGCTTTGTTCTTTTGGCTTAG +GATTGACTTGGCGATGCGGGCTCTTTTTTGGTTCCATATGAACTTTAAAGTAGTTTTTTC +CAATTCTGTGAAGAAAGTCATTGGTAGCTTGATGGGGATGGCATTGAATCTGTAAATTAC +CTTGGGCAGTATGGCCATTTTCACGATATTGATTCTTCCTACCCATGAGCATGGAATGTT +CTTCCATTTGTTTGTATCCTCTTTTATTTCATTGAGCAGTGGTTTGTAGTTCTCCTTGAA +GAGGTCCTTCACATCCCTTGTAAGTTGGATTCCTAGGTATTTTATTCTCTTTGAAGCAAT +TGTGAATGGGAGTTCACTCATGATTTGGCTCTCTGTTTGTCTGTTATTGCTGTATAAGAA +TGCTTGTGATTTTTGTACATTGATTTTGTATCCTGAGACTTTGCTGAAGTTGCTTATCAG +CTTAAGGAGATTTTGGGCTGAGACAATGGGGTTTTCTAGATATACAATCATGTAATCTGC +AAACAGGGACAATTTGGCTTCCTCTTTTCCTAATTGAATACCCGTTATTTCTTTCTCCTG +CCTAATTGCCCTGGCCAGAACTTCCAACACTATGTTGAATAGGAGTGGTGAGAGAGGGCA +TCCCTGTCTTGTGCGTGTTTTCAAAGGGAATGCTTCCAGTTTTTGCCCATTCAGTATGAT +ATTGGCTGTGGGTTTGTCATAGATAGCTCTTATTATTTTGAGATACGTCCCATCAGTACC +TCATTTATTGAGAGTTTTTAGCATGAGGGATTGTTGAATTTTGTCAAAGGCCTTTTCTGC +ATCTATTGAGATAATCATGTGGTTTTTGTCTTTGGTTCTGTTTATATGCTGGATTACATT +TATTGATTTGCGTATGTTGAACCAGCCTTGCATCCCAGGGATGAAGCCCACTTGATCATG +CTGGATAAGCTTTTTGATGTGCTGCTGGATTCGGTTTGCCAGTATTTTATTGATGATTTT +TGCATCAATGTTCATCAAGGATATTAGTCTAAAATTCTCTTTTTTGGTTGTGTCTCTGCC +TGGCTTTGGTATCAGGATGATGCTGGCCTCATAAAATGAGTTAGGGAGGATTCCCTCTTT +TTCTATTGATTGGAATAATTTCAGAAGGAATGGTACCAGTTCCTCCTTGTACCTCTGGTA +GAATTCGGCTGTGAATCCATCTGGTCGTGGACTCTTTTTGGTTGGTAAGCTATTGATTAT +TGCCACAATTTCAGAGCCTGTTATTGGTCCATTTAGAGATTCAACTTCTTCCTGGTTTAG +TCTTGGGAGGGTGTATGTGTCGAGGAATTTATCCATTTCTTCTAGATTTTCTAGTTTATT +TGCGTAGAAGTGTTTATAGTATTCTCTGATGGTAGTTTGTATTTCTGTGGGATCGGTGGT +GATATCCCCTTTATCATTTTTTATTGCGTCTATTTGATTCTTCTCTCTTTTCTTCTTTAT +TAGTCTTGCTAGCGGTCTATCAATTTTGTTGATCCTTTCCAAAAACCAGCTCCTGGATTA +ATTTTTTGAAGGGTTTTTTGTGTCTCTATTTCCTTCAGCTCTGCTCTGATTTTAGTTATT +TCTTGCCTTCTGCTAGCTTTTGAATGTGTTTGCTCTTGCTTTTCTAGTTCTTTTAATTGT +GATGTTAGGGTGTCAATTTTGGATCTTTCCTGCTTTCTCTTGTGGGCATTTAGTGCTATA +AATTTCCCTCTACACACTGCTTTGAATGTGTCCCAGAAATTCTGGTATGTTGTGTCTTTG +TTCTCGTTGGTTTCAAAGAACATCTTTATTTCTGCCTTCATTTCGTTATGTACCCAGTAG +TCATTCAGGAGCAGGTTGTTCAGTTTCCATGTAGTTGAGTGGTTTTGAGTGAGTTTCTTA +TTCCTGAGTTCTAGTTTGATTGCACTGTGGTCTGAGAGACAGTTTGTTATAATTTCTGTT +CTTTTACGTTTGCTGAGGAGAGCTTTACTTCCAACTATGTGGTCAATTTTGGAATAGGTG +TGGTGTGGTGCTGAAAAAAATGTATATTCTGTTGATGTGGGGTGGAGAGTTCTGTAAACT +GCTTCAACCACTGTGGAAGTCAGTGTGGCGATTCCTCAGGGATCTAGAACTAGAAATACC +ATTTGACCCAGCCATCCCATTACTGGGTATATACCCAAAGGACTATAAATCATGCTGCTA +TGAAGACACATGCACACGTATGTTTATTGCGGCACTATTCACAATAGCAAAGACTTGGAA +CCAACCCAAACGTCCAACAATGATAGACTGGATTAAGAAAATATGGCACATATACACCAT +GGAATACTATGCAACCATAAGAAATGATGAGTTCATGTCCTTTGTAGGGACATGGATGAA +ATTGGAAATCATCATTCTCAGTAAACTATCGCAAGGACAAAAAACCAAACACCGCATGTT +CTCACTCATAGGTGGGAATTGAACAATGAGAACACATGGACACAGGAAGGGGAACATCAC +ACTCTTGGGACTGTTGTGGGGTGGGGGGAGTGGGGAGGGATAGCATTAGGAGATATACCT +AATGCTAAATGAGGAGTTAATTGGTGCAGCACACCAGCATGGCACATGTATACATATGTA +ACTAACCTGCACATTGTGCACATGTACCCTAAAACTTAAAGTATAATAATAAAATAAAAT +AAGAAAAATGCAAAAATTAAAAATTTAAAAAAAAGCTCTCATTCTTTTAAGCACTTACAG +GATATTCTTACAGATGTGTACCACGCTTAATGAATTGAGCTCTTGTGGATGAGAGTTTAA +TTTGTTTCTAATCATTTGTTATTTAATAGTACAGTCAGCATCTTTAGGATTAAGTATCTA +GAATTAGAACTACTGTGTTGAAGAGGCTATTGCATTTAAATTGTTTTTTTTTTTTTTTTG +ATACGGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAATGGCGTGATCTCAGCTCACCGC +AACCTCCGCCTCCCAGGTTCAAGCAGTGCTCCTGCCTCAGCCTCCTGAGTAGCTAGGATT +ACAGGCACACGCCACCATGCCCGGCTAATTTTTGTATTTTTTTAGTAGAGACGGGGTTTC +ACCATGTTGGCCAGGCTGATCTTGAACTCCTGACCTTGTGATCTGCTCGCCTTGGCCTCC +CAAAGTGCTGGGATTACAGGCATGAGCCACCGTGCCCGACCTACATTTAAATTTTAAACA +AAAGTTTGCTAAATTGTTTTCAGTAGAGGTTATATTAATCTATATTTATACCAACATGGA +GAGTTTGTTTCCTGCAAAATAGCCAATAATTTATCAAACCTTTGAATCTTTGTCAATTGA +ATAGTTAAAAATGATTATCTCATATTTGTACATTTTTATCTTATTGTGAAGTTCAGCACC +TTTTCATGTGTTTAAGAACTTTTAATTTTCTGTTGTTTATATGGTCTTCCCATTACCATT +TTAACTATTTGTTTTTATTTTCAGAGTTTTTGCTTATAAAATTTTATTTACAGTCAATTC +TCTTTATTTGTAGAATCTGTATTTGTAAAGGCACCTACTTGCTAAAATTTATTTGTAACC +TAACATCAATACTCATGGCAGTTTCATGGTTTTTCATGGACATACACAGAGGTGAAAAAT +TTGAGAACCTTACCCAGATATTCCCAGCTGGGGTTGAACAGTGCTCAGTTTTTTGTGTAG +CTTTCTTACTATAAACAAGTGTCCTTTTTGAAAGCAGTTTATATAGTTCTACATTTTTCA +CATTTTTGTGCCTTCTGTTTGTGATTTTACTGTTTAAAGTGATTCCCAAGCATTGTGCTG +AAGTGCTATATAGTGGTATTCCAAGGTGCATGCGGGCTGTGAGGTGCCTTAGAGAATACA +TGTGTTAGATAACCTTTGTTTAGTCATGAGTTATAGTGCTGTTGAGTGGGAGTTAGATGA +TGATGAGTCATCACTATTTATTATTATATTTTTTGAGATGGAGTCTCACTCTGTCACCCA +GGCTGGAGTGCAATGGCATAATCTCGGCTCACTGCAACCTCCTCCTCCCGGGTTCAAACG +ATTCTCCTGCCTCAGTCTTCCCAGTAGCTGGGATTATAGGCACCCGCCACTGCACCCAGC +GTAATTTTTGTATTTTTAGTAGAGGTGGGGTTTTACCATGTTGGGTAGGCCAGTCTTGAA +CTCCTGACCTCAAGTGATCCACCCACCTTGGCATCCCAGAGTGCTGGGATTACAGATATG +AGCCACCATGCCAGGTCTATATTTATTAAATAATGTGTCTTTAAACAGAAACAGATATAA +AACAAGCTTACGTATTTATAAGTTGGTGAAAATGTGACCAAAGGCTTACAAGAACCTAAC +CCTGTATTTCTGTTAGGAGCAATGGCTCAGTATTCACTAATTTGCGTGTTTGTGGCAACT +TCATAGAACATAACTACCTCAAGTAATGAGAATTGACTGCATTCTTTTTCAAGTCATTTT +ATAAACAATTTACAGAAGAATAAAGGGATGGTGAAAATTAACTTTGTTAGCAATTTTAAT +GAGAATCCAAATATAGGAGACCCACATTTTTTCCCATATTTTCCCAGTTTTGAATGTTTA +TGTATACCTAAAAGGCATTACATCCTTTGAAAGCAGCTGTCATTATGCATGAATCTGGAA +CATACCTACCTTTAAATACGGATTTTGGATTTCAAATGCATCTCTACTATGTTCTACCTT +ATTATTTGTATTCTTCATGAACTCACTTTGTCAAAATGCAATACTTTTTGTTTTTTAATT +TATTTTTATTTTTTGTAGAAATAGGGTCTCACTGTGTTGCCCAGGCTGGCCTTGAACACC +TGGCCTCAAGTGATCTTCCTGCCTTCCAAAGTGCTGGGGACGGTAGGCATGAGCCACCAC +ACCTGTCCAAACTGCAATACTTCTGAAAACTTTAGGGCTCATAGTTTTGTTGAAGTGATA +GATGATGGCTATATTCTTTGTTACATAACAGCAAAACATTTTTGTTTTTACATTTATAAA +TACCAATTAGAATGACTTTCAGTGGATTGGTTTTCATTTTTCACATCATCTTTACCTTCC +TGTTACTTTGTGTACATATCTGTCTTTCATACTTGTCCACTTACAAACTTTTTCAAGTAA +ATTCTGGTGTTACAAGCATAAAAGATGAAAGAACGTTGTCACATGGTCACTTGTCCTTTT +AGCAATTATGCGATGATTCAACTGTTCTAGGTACAACTAGAGGGAGAGTATCCCAGGCAA +GGGAGATAACAAATAGAAAGGCCCTAAGACACAAGTGTATTTAACATGTTTGGGGAACAA +CAAGGAGTTAATCGTGGCTGGAGTGGAAGTAAGGAGGAGAGATTAAGGAGATGGAGCTAA +GAGAGGTAGTCAAGGGCCAGGCCATATGTCAGCGATAGTAAGGTCTTCAGCATTTACTTT +TTTAAGCTGGGAGTCCATGGAAAGGTTTTGAACCCAAGGTATAGCATGATCTGACTTACA +GAAAGAGACTTCTGATTGCTGTGTTGAAAATACACCATAGGTTTGAAGGGAGGAAACAGG +CTGACTAGTTAGAGCCAGTGTGGGTAGTGGTGGTTGGATCTGAGTATATTTTCCAAGTGG +AGCCACCAGGATTTTTCAGTAGATTGATTACATGTGGTGTATGAAAGAGGAGTGTCAAGT +GTAACTCCGAGATTTTTGGCTTATGCAACTGGAAAAATAAAGTTAGAATTTAATGAGATG +GAGGTCTGCATAAGGAGTACTTTTGTGGCAGGAAAGAAATTGGGTTTTGAACATGTGAAA +ATTGAGATGCCCATTAGTAGAAGTTGGATGTGAATAAAGAGTCCAGGCCAGGTGCAATGC +CTCATGCCTGTAATCTCAGCACTTTGGGAGGCCAAGGCAGGAGAATCATGTGAGCCCAGG +AGTTCAAGACCAGACTGGGCAACAAAGTGAGACCCCGTCTATATTATAAAATAAAAAAAT +AGTTCAGAGGAGAGGTCTGGGCTAGAGATGGAAATGTAGAAGTTAGTAAATTTAAAGCTG +TTGAACTAGAGGAGATAGCTGAGGAAGTGCATTCAAATAGAGAAGATGTCAGAGGAGAAC +TTTGGGGTTCTCTCAGTGGTTAGAGATAGGATATGAGGAAAAACAGTGCAGGAGACTAAG +GAGGAGCTCTCATTGAGTTAGGAAAATCAAGAGGGATGCCCTGGAAGCCAAATGAAGGCA +GTGTTTTGAGGAAGAGGGGTGATGGGCCATGTGAAAGCCAATAGGTCACATGCTGCTAAT +GGGTCAACTAAAGTGAGGACTGAGAAGTATTCACCAATTTAGCAATGTGGAGCTCATTGG +TGACCCTCATAAGAGCTGTGTTGGTGGAATGGAGGAGGTAAAATCCTGGAGGGAGAGAAC +ATAAGAATGAGAGAACAGTTGACAGTGCATGTAAACAACTCTTTCACGGAACTTTGTATT +TCTGAATTTTTGTTTATTTGGCTATTAATAAAATCATATCTGATATAGCTTTATTTTAGT +AAGGTTTGTTTTTGTGGGACTTCAGTTGTGTATACACATATAATATGTGTGTGTATGTAT +GTGCGTATGGTGTTTTGATGTAAAATTTATTATTGTGGGTCATGGTTAAAAAAAAAGCTT +GAGAATGAGGAGTTAGATCAAGAAATAGAAGGAAAGTTGACATAAGAAGTTGTGGATGTA +GGAGATTCTACCATGTAGACACAGTGGAAGGATTTAGGGAGTTGGAGCAGGTTGGGATAT +GTGATCAGAAAGCGGGAGTTTAGCTCTCTCACTTGCCCCTGCTTTTACCATGTGATGTGT +CTGCTACCCCTTCACCTTCCACCATGACTGTAAGCTTCCTGAGGTCTCCCTAGAAGCCAA +GCAGATACCAGCACCATGCTTCCTGTAAAGCCTGCAGAACCATGAGCCAATTAAACCTCT +TTGTAAATTACCCAGTTTGAGGTATTTCTTTATAGCAGTGCAAGAATGCCCCAATACAGA +AAATTGGTACCGAGAAGTTGGGCATTGCTATAAAGATACCTGAAAATGTGGAAACAGCTT +TGGAACTGGGTAATGAGTAGTGGTTGGAAGAGTTTACAGGGCTCAGAAGAAGACAGGAAA +ATGAGGGTAAGTTTCAAACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGACGGAGT +CTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGTGATCTTGGCTCACTGCAACCTCTGC +CTCCCGGGTTCAAGTGATTTT diff --git a/paraphase/data/smn1/ref_smn2.fa.fai b/paraphase/data/smn1/ref_smn2.fa.fai index 9d9d476..3b1feda 100644 --- a/paraphase/data/smn1/ref_smn2.fa.fai +++ b/paraphase/data/smn1/ref_smn2.fa.fai @@ -1 +1 @@ -chr5_70026528_70200526 173999 24 60 61 +chr5_70040526_70088546 48021 24 60 61 diff --git a/paraphase/data/smn1/config.yaml b/paraphase/data/smn1/smn1_config.yaml similarity index 85% rename from paraphase/data/smn1/config.yaml rename to paraphase/data/smn1/smn1_config.yaml index be77ae9..e9aba4f 100755 --- a/paraphase/data/smn1/config.yaml +++ b/paraphase/data/smn1/smn1_config.yaml @@ -1,8 +1,3 @@ -# thirdparty tools -tools: - samtools: path_to_samtools - minimap2: path_to_minimap2 - gene: smn1 # data files @@ -19,11 +14,10 @@ coordinates: hg38: nchr: "chr5" nchr_old: "chr5_70901946_81501946" - nchr_old_smn2: "chr5_70026528_70200526" + nchr_old_smn2: "chr5_70040526_70088546" nchr_length: 181538259 extract_region1: "chr5:70916946-70963000" extract_region2: "chr5:70041526-70087546" - vcf_region: "1-80000" pivot_site: 70951946 diff --git a/paraphase/data/strc/homopolymer_sites.txt b/paraphase/data/strc/homopolymer_sites.txt new file mode 100644 index 0000000..649a4a9 --- /dev/null +++ b/paraphase/data/strc/homopolymer_sites.txt @@ -0,0 +1,1130 @@ +chr15 43599110 A +chr15 43599111 G,T,C,A +chr15 43599112 0 +chr15 43599113 0 +chr15 43599114 0 +chr15 43599115 0 +chr15 43599116 1,A +chr15 43599356 C +chr15 43599357 G,T,C,A +chr15 43599358 0 +chr15 43599359 0 +chr15 43599360 0 +chr15 43599361 0 +chr15 43599362 1,C +chr15 43600438 C +chr15 43600439 G,T,C,A +chr15 43600440 0 +chr15 43600441 0 +chr15 43600442 0 +chr15 43600443 0 +chr15 43600444 1,C +chr15 43600567 C +chr15 43600568 G,T,C,A +chr15 43600569 0 +chr15 43600570 0 +chr15 43600571 0 +chr15 43600572 0 +chr15 43600573 1,C +chr15 43601021 G +chr15 43601022 G,T,C,A +chr15 43601023 0 +chr15 43601024 0 +chr15 43601025 0 +chr15 43601026 0 +chr15 43601027 0 +chr15 43601028 1,G +chr15 43601165 A +chr15 43601166 G,T,C,A +chr15 43601167 0 +chr15 43601168 0 +chr15 43601169 0 +chr15 43601170 0 +chr15 43601171 1,A +chr15 43601306 A +chr15 43601307 G,T,C,A +chr15 43601308 0 +chr15 43601309 0 +chr15 43601310 0 +chr15 43601311 0 +chr15 43601312 0 +chr15 43601313 0 +chr15 43601314 1,A +chr15 43601535 G +chr15 43601536 G,T,C,A +chr15 43601537 0 +chr15 43601538 0 +chr15 43601539 0 +chr15 43601540 0 +chr15 43601541 0 +chr15 43601542 0 +chr15 43601543 1,G +chr15 43601965 A +chr15 43601966 G,T,C,A +chr15 43601967 0 +chr15 43601968 0 +chr15 43601969 0 +chr15 43601970 0 +chr15 43601971 1,A +chr15 43602121 A +chr15 43602122 G,T,C,A +chr15 43602123 0 +chr15 43602124 0 +chr15 43602125 0 +chr15 43602126 0 +chr15 43602127 0 +chr15 43602128 0 +chr15 43602129 0 +chr15 43602130 0 +chr15 43602131 0 +chr15 43602132 0 +chr15 43602133 0 +chr15 43602134 0 +chr15 43602135 0 +chr15 43602136 0 +chr15 43602137 0 +chr15 43602138 0 +chr15 43602139 0 +chr15 43602140 0 +chr15 43602141 0 +chr15 43602142 0 +chr15 43602143 0 +chr15 43602144 0 +chr15 43602145 0 +chr15 43602146 0 +chr15 43602147 1,A +chr15 43602171 C +chr15 43602172 G,T,C,A +chr15 43602173 0 +chr15 43602174 0 +chr15 43602175 0 +chr15 43602176 0 +chr15 43602177 1,C +chr15 43602185 T +chr15 43602186 G,T,C,A +chr15 43602187 0 +chr15 43602188 0 +chr15 43602189 0 +chr15 43602190 0 +chr15 43602191 0 +chr15 43602192 T,1 +chr15 43602349 T +chr15 43602350 G,T,C,A +chr15 43602351 0 +chr15 43602352 0 +chr15 43602353 0 +chr15 43602354 0 +chr15 43602355 0 +chr15 43602356 T,1 +chr15 43602358 T +chr15 43602359 G,T,C,A +chr15 43602360 0 +chr15 43602361 0 +chr15 43602362 0 +chr15 43602363 0 +chr15 43602364 T,1 +chr15 43602493 T +chr15 43602494 G,T,C,A +chr15 43602495 0 +chr15 43602496 0 +chr15 43602497 0 +chr15 43602498 0 +chr15 43602499 T,1 +chr15 43602568 A +chr15 43602569 G,T,C,A +chr15 43602570 0 +chr15 43602571 0 +chr15 43602572 0 +chr15 43602573 0 +chr15 43602574 0 +chr15 43602575 1,A +chr15 43602640 T +chr15 43602641 G,T,C,A +chr15 43602642 0 +chr15 43602643 0 +chr15 43602644 0 +chr15 43602645 0 +chr15 43602646 0 +chr15 43602647 0 +chr15 43602648 0 +chr15 43602649 0 +chr15 43602650 0 +chr15 43602651 0 +chr15 43602652 0 +chr15 43602653 0 +chr15 43602654 0 +chr15 43602655 0 +chr15 43602656 0 +chr15 43602657 0 +chr15 43602658 0 +chr15 43602659 0 +chr15 43602660 0 +chr15 43602661 0 +chr15 43602662 0 +chr15 43602663 T,1 +chr15 43602728 C +chr15 43602729 G,T,C,A +chr15 43602730 0 +chr15 43602731 0 +chr15 43602732 0 +chr15 43602733 0 +chr15 43602734 1,C +chr15 43602812 T +chr15 43602813 G,T,C,A +chr15 43602814 0 +chr15 43602815 0 +chr15 43602816 0 +chr15 43602817 0 +chr15 43602818 0 +chr15 43602819 T,1 +chr15 43602821 T +chr15 43602822 G,T,C,A +chr15 43602823 0 +chr15 43602824 0 +chr15 43602825 0 +chr15 43602826 0 +chr15 43602827 T,1 +chr15 43602958 T +chr15 43602959 G,T,C,A +chr15 43602960 0 +chr15 43602961 0 +chr15 43602962 0 +chr15 43602963 0 +chr15 43602964 0 +chr15 43602965 0 +chr15 43602966 T,1 +chr15 43602972 A +chr15 43602973 G,T,C,A +chr15 43602974 0 +chr15 43602975 0 +chr15 43602976 0 +chr15 43602977 0 +chr15 43602978 0 +chr15 43602979 0 +chr15 43602980 0 +chr15 43602981 0 +chr15 43602982 1,A +chr15 43602986 A +chr15 43602987 G,T,C,A +chr15 43602988 0 +chr15 43602989 0 +chr15 43602990 0 +chr15 43602991 0 +chr15 43602992 0 +chr15 43602993 0 +chr15 43602994 1,A +chr15 43603144 C +chr15 43603145 G,T,C,A +chr15 43603146 0 +chr15 43603147 0 +chr15 43603148 0 +chr15 43603149 0 +chr15 43603150 1,C +chr15 43603416 G +chr15 43603417 G,T,C,A +chr15 43603418 0 +chr15 43603419 0 +chr15 43603420 0 +chr15 43603421 0 +chr15 43603422 1,G +chr15 43604249 C +chr15 43604250 G,T,C,A +chr15 43604251 0 +chr15 43604252 0 +chr15 43604253 0 +chr15 43604254 0 +chr15 43604255 0 +chr15 43604256 1,C +chr15 43604471 G +chr15 43604472 G,T,C,A +chr15 43604473 0 +chr15 43604474 0 +chr15 43604475 0 +chr15 43604476 0 +chr15 43604477 1,G +chr15 43604752 G +chr15 43604753 G,T,C,A +chr15 43604754 0 +chr15 43604755 0 +chr15 43604756 0 +chr15 43604757 0 +chr15 43604758 1,G +chr15 43604920 C +chr15 43604921 G,T,C,A +chr15 43604922 0 +chr15 43604923 0 +chr15 43604924 0 +chr15 43604925 0 +chr15 43604926 1,C +chr15 43605182 G +chr15 43605183 G,T,C,A +chr15 43605184 0 +chr15 43605185 0 +chr15 43605186 0 +chr15 43605187 0 +chr15 43605188 1,G +chr15 43605304 G +chr15 43605305 G,T,C,A +chr15 43605306 0 +chr15 43605307 0 +chr15 43605308 0 +chr15 43605309 0 +chr15 43605310 0 +chr15 43605311 1,G +chr15 43605573 G +chr15 43605574 G,T,C,A +chr15 43605575 0 +chr15 43605576 0 +chr15 43605577 0 +chr15 43605578 0 +chr15 43605579 1,G +chr15 43605580 A +chr15 43605581 G,T,C,A +chr15 43605582 0 +chr15 43605583 0 +chr15 43605584 0 +chr15 43605585 0 +chr15 43605586 1,A +chr15 43605611 A +chr15 43605612 G,T,C,A +chr15 43605613 0 +chr15 43605614 0 +chr15 43605615 0 +chr15 43605616 0 +chr15 43605617 0 +chr15 43605618 0 +chr15 43605619 0 +chr15 43605620 0 +chr15 43605621 0 +chr15 43605622 0 +chr15 43605623 0 +chr15 43605624 1,A +chr15 43605704 A +chr15 43605705 G,T,C,A +chr15 43605706 0 +chr15 43605707 0 +chr15 43605708 0 +chr15 43605709 0 +chr15 43605710 0 +chr15 43605711 1,A +chr15 43605759 T +chr15 43605760 G,T,C,A +chr15 43605761 0 +chr15 43605762 0 +chr15 43605763 0 +chr15 43605764 0 +chr15 43605765 0 +chr15 43605766 0 +chr15 43605767 T,1 +chr15 43606031 T +chr15 43606032 G,T,C,A +chr15 43606033 0 +chr15 43606034 0 +chr15 43606035 0 +chr15 43606036 0 +chr15 43606037 T,1 +chr15 43606202 T +chr15 43606203 G,T,C,A +chr15 43606204 0 +chr15 43606205 0 +chr15 43606206 0 +chr15 43606207 0 +chr15 43606208 0 +chr15 43606209 T,1 +chr15 43606302 A +chr15 43606303 G,T,C,A +chr15 43606304 0 +chr15 43606305 0 +chr15 43606306 0 +chr15 43606307 0 +chr15 43606308 0 +chr15 43606309 0 +chr15 43606310 0 +chr15 43606311 1,A +chr15 43606312 A +chr15 43606313 G,T,C,A +chr15 43606314 0 +chr15 43606315 0 +chr15 43606316 0 +chr15 43606317 0 +chr15 43606318 1,A +chr15 43606437 A +chr15 43606438 G,T,C,A +chr15 43606439 0 +chr15 43606440 0 +chr15 43606441 0 +chr15 43606442 0 +chr15 43606443 1,A +chr15 43606445 A +chr15 43606446 G,T,C,A +chr15 43606447 0 +chr15 43606448 0 +chr15 43606449 0 +chr15 43606450 0 +chr15 43606451 1,A +chr15 43606599 A +chr15 43606600 G,T,C,A +chr15 43606601 0 +chr15 43606602 0 +chr15 43606603 0 +chr15 43606604 0 +chr15 43606605 1,A +chr15 43606740 A +chr15 43606741 G,T,C,A +chr15 43606742 0 +chr15 43606743 0 +chr15 43606744 0 +chr15 43606745 0 +chr15 43606746 1,A +chr15 43606845 A +chr15 43606846 G,T,C,A +chr15 43606847 0 +chr15 43606848 0 +chr15 43606849 0 +chr15 43606850 0 +chr15 43606851 1,A +chr15 43606999 A +chr15 43607000 G,T,C,A +chr15 43607001 0 +chr15 43607002 0 +chr15 43607003 0 +chr15 43607004 0 +chr15 43607005 0 +chr15 43607006 0 +chr15 43607007 0 +chr15 43607008 0 +chr15 43607009 0 +chr15 43607010 0 +chr15 43607011 0 +chr15 43607012 0 +chr15 43607013 0 +chr15 43607014 0 +chr15 43607015 1,A +chr15 43607016 G,T,C,A +chr15 43607017 0 +chr15 43607018 0 +chr15 43607019 0 +chr15 43607020 0 +chr15 43607021 0 +chr15 43607022 0 +chr15 43607023 0 +chr15 43607024 1,A +chr15 43607025 G,T,C,A +chr15 43607026 0 +chr15 43607027 0 +chr15 43607028 0 +chr15 43607029 0 +chr15 43607030 0 +chr15 43607031 1,A +chr15 43607336 T +chr15 43607337 G,T,C,A +chr15 43607338 0 +chr15 43607339 0 +chr15 43607340 0 +chr15 43607341 0 +chr15 43607342 T,1 +chr15 43607353 A +chr15 43607354 G,T,C,A +chr15 43607355 0 +chr15 43607356 0 +chr15 43607357 0 +chr15 43607358 0 +chr15 43607359 1,A +chr15 43607466 T +chr15 43607467 G,T,C,A +chr15 43607468 0 +chr15 43607469 0 +chr15 43607470 0 +chr15 43607471 0 +chr15 43607472 T,1 +chr15 43607477 T +chr15 43607478 G,T,C,A +chr15 43607479 0 +chr15 43607480 0 +chr15 43607481 0 +chr15 43607482 0 +chr15 43607483 T,1 +chr15 43607769 A +chr15 43607770 G,T,C,A +chr15 43607771 0 +chr15 43607772 0 +chr15 43607773 0 +chr15 43607774 0 +chr15 43607775 1,A +chr15 43608068 C +chr15 43608069 G,T,C,A +chr15 43608070 0 +chr15 43608071 0 +chr15 43608072 0 +chr15 43608073 0 +chr15 43608074 1,C +chr15 43608326 T +chr15 43608327 G,T,C,A +chr15 43608328 0 +chr15 43608329 0 +chr15 43608330 0 +chr15 43608331 0 +chr15 43608332 T,1 +chr15 43608563 A +chr15 43608564 G,T,C,A +chr15 43608565 0 +chr15 43608566 0 +chr15 43608567 0 +chr15 43608568 0 +chr15 43608569 1,A +chr15 43608571 A +chr15 43608572 G,T,C,A +chr15 43608573 0 +chr15 43608574 0 +chr15 43608575 0 +chr15 43608576 0 +chr15 43608577 1,A +chr15 43608724 A +chr15 43608725 G,T,C,A +chr15 43608726 0 +chr15 43608727 0 +chr15 43608728 0 +chr15 43608729 0 +chr15 43608730 0 +chr15 43608731 0 +chr15 43608732 0 +chr15 43608733 0 +chr15 43608734 0 +chr15 43608735 0 +chr15 43608736 0 +chr15 43608737 0 +chr15 43608738 0 +chr15 43608739 0 +chr15 43608740 0 +chr15 43608741 0 +chr15 43608742 0 +chr15 43608743 0 +chr15 43608744 1,A +chr15 43609182 C +chr15 43609183 G,T,C,A +chr15 43609184 0 +chr15 43609185 0 +chr15 43609186 0 +chr15 43609187 0 +chr15 43609188 1,C +chr15 43609226 A +chr15 43609227 G,T,C,A +chr15 43609228 0 +chr15 43609229 0 +chr15 43609230 0 +chr15 43609231 0 +chr15 43609232 0 +chr15 43609233 0 +chr15 43609234 1,A +chr15 43609388 C +chr15 43609389 G,T,C,A +chr15 43609390 0 +chr15 43609391 0 +chr15 43609392 0 +chr15 43609393 0 +chr15 43609394 1,C +chr15 43609529 A +chr15 43609530 G,T,C,A +chr15 43609531 0 +chr15 43609532 0 +chr15 43609533 0 +chr15 43609534 0 +chr15 43609535 1,A +chr15 43609564 G +chr15 43609565 G,T,C,A +chr15 43609566 0 +chr15 43609567 0 +chr15 43609568 0 +chr15 43609569 0 +chr15 43609570 1,G +chr15 43609703 T +chr15 43609704 G,T,C,A +chr15 43609705 0 +chr15 43609706 0 +chr15 43609707 0 +chr15 43609708 0 +chr15 43609709 T,1 +chr15 43609725 A +chr15 43609726 G,T,C,A +chr15 43609727 0 +chr15 43609728 0 +chr15 43609729 0 +chr15 43609730 0 +chr15 43609731 0 +chr15 43609732 1,A +chr15 43609734 A +chr15 43609735 G,T,C,A +chr15 43609736 0 +chr15 43609737 0 +chr15 43609738 0 +chr15 43609739 0 +chr15 43609740 1,A +chr15 43609864 A +chr15 43609865 G,T,C,A +chr15 43609866 0 +chr15 43609867 0 +chr15 43609868 0 +chr15 43609869 0 +chr15 43609870 0 +chr15 43609871 0 +chr15 43609872 0 +chr15 43609873 0 +chr15 43609874 0 +chr15 43609875 1,A +chr15 43610023 A +chr15 43610024 G,T,C,A +chr15 43610025 0 +chr15 43610026 0 +chr15 43610027 0 +chr15 43610028 0 +chr15 43610029 0 +chr15 43610030 1,A +chr15 43610031 G,T,C,A +chr15 43610032 0 +chr15 43610033 0 +chr15 43610034 0 +chr15 43610035 0 +chr15 43610036 0 +chr15 43610037 0 +chr15 43610038 0 +chr15 43610039 0 +chr15 43610040 0 +chr15 43610041 1,A +chr15 43610043 A +chr15 43610044 G,T,C,A +chr15 43610045 0 +chr15 43610046 0 +chr15 43610047 0 +chr15 43610048 0 +chr15 43610049 1,A +chr15 43610514 G +chr15 43610515 G,T,C,A +chr15 43610516 0 +chr15 43610517 0 +chr15 43610518 0 +chr15 43610519 0 +chr15 43610520 1,G +chr15 43610964 T +chr15 43610965 G,T,C,A +chr15 43610966 0 +chr15 43610967 0 +chr15 43610968 0 +chr15 43610969 0 +chr15 43610970 T,1 +chr15 43611260 G +chr15 43611261 G,T,C,A +chr15 43611262 0 +chr15 43611263 0 +chr15 43611264 0 +chr15 43611265 0 +chr15 43611266 1,G +chr15 43611966 G +chr15 43611967 G,T,C,A +chr15 43611968 0 +chr15 43611969 0 +chr15 43611970 0 +chr15 43611971 0 +chr15 43611972 1,G +chr15 43612275 G +chr15 43612276 G,T,C,A +chr15 43612277 0 +chr15 43612278 0 +chr15 43612279 0 +chr15 43612280 0 +chr15 43612281 1,G +chr15 43612421 C +chr15 43612422 G,T,C,A +chr15 43612423 0 +chr15 43612424 0 +chr15 43612425 0 +chr15 43612426 0 +chr15 43612427 1,C +chr15 43613063 A +chr15 43613064 G,T,C,A +chr15 43613065 0 +chr15 43613066 0 +chr15 43613067 0 +chr15 43613068 0 +chr15 43613069 1,A +chr15 43613085 G +chr15 43613086 G,T,C,A +chr15 43613087 0 +chr15 43613088 0 +chr15 43613089 0 +chr15 43613090 0 +chr15 43613091 1,G +chr15 43613331 T +chr15 43613332 G,T,C,A +chr15 43613333 0 +chr15 43613334 0 +chr15 43613335 0 +chr15 43613336 0 +chr15 43613337 0 +chr15 43613338 0 +chr15 43613339 T,1 +chr15 43613349 T +chr15 43613350 G,T,C,A +chr15 43613351 0 +chr15 43613352 0 +chr15 43613353 0 +chr15 43613354 0 +chr15 43613355 0 +chr15 43613356 T,1 +chr15 43613357 G,T,C,A +chr15 43613358 0 +chr15 43613359 0 +chr15 43613360 0 +chr15 43613361 0 +chr15 43613362 0 +chr15 43613363 0 +chr15 43613364 0 +chr15 43613365 T,1 +chr15 43613513 T +chr15 43613514 G,T,C,A +chr15 43613515 0 +chr15 43613516 0 +chr15 43613517 0 +chr15 43613518 0 +chr15 43613519 T,1 +chr15 43613791 A +chr15 43613792 G,T,C,A +chr15 43613793 0 +chr15 43613794 0 +chr15 43613795 0 +chr15 43613796 0 +chr15 43613797 0 +chr15 43613798 0 +chr15 43613799 1,A +chr15 43614292 G +chr15 43614293 G,T,C,A +chr15 43614294 0 +chr15 43614295 0 +chr15 43614296 0 +chr15 43614297 0 +chr15 43614298 1,G +chr15 43614534 C +chr15 43614535 G,T,C,A +chr15 43614536 0 +chr15 43614537 0 +chr15 43614538 0 +chr15 43614539 0 +chr15 43614540 1,C +chr15 43614748 A +chr15 43614749 G,T,C,A +chr15 43614750 0 +chr15 43614751 0 +chr15 43614752 0 +chr15 43614753 0 +chr15 43614754 1,A +chr15 43614883 A +chr15 43614884 G,T,C,A +chr15 43614885 0 +chr15 43614886 0 +chr15 43614887 0 +chr15 43614888 0 +chr15 43614889 1,A +chr15 43614899 A +chr15 43614900 G,T,C,A +chr15 43614901 0 +chr15 43614902 0 +chr15 43614903 0 +chr15 43614904 0 +chr15 43614905 1,A +chr15 43615055 A +chr15 43615056 G,T,C,A +chr15 43615057 0 +chr15 43615058 0 +chr15 43615059 0 +chr15 43615060 0 +chr15 43615061 1,A +chr15 43615083 T +chr15 43615084 G,T,C,A +chr15 43615085 0 +chr15 43615086 0 +chr15 43615087 0 +chr15 43615088 0 +chr15 43615089 T,1 +chr15 43615238 A +chr15 43615239 G,T,C,A +chr15 43615240 0 +chr15 43615241 0 +chr15 43615242 0 +chr15 43615243 0 +chr15 43615244 1,A +chr15 43615655 C +chr15 43615656 G,T,C,A +chr15 43615657 0 +chr15 43615658 0 +chr15 43615659 0 +chr15 43615660 0 +chr15 43615661 1,C +chr15 43615836 G +chr15 43615837 G,T,C,A +chr15 43615838 0 +chr15 43615839 0 +chr15 43615840 0 +chr15 43615841 0 +chr15 43615842 1,G +chr15 43615874 C +chr15 43615875 G,T,C,A +chr15 43615876 0 +chr15 43615877 0 +chr15 43615878 0 +chr15 43615879 0 +chr15 43615880 1,C +chr15 43615973 C +chr15 43615974 G,T,C,A +chr15 43615975 0 +chr15 43615976 0 +chr15 43615977 0 +chr15 43615978 0 +chr15 43615979 1,C +chr15 43616080 G +chr15 43616081 G,T,C,A +chr15 43616082 0 +chr15 43616083 0 +chr15 43616084 0 +chr15 43616085 0 +chr15 43616086 0 +chr15 43616087 1,G +chr15 43616183 G +chr15 43616184 G,T,C,A +chr15 43616185 0 +chr15 43616186 0 +chr15 43616187 0 +chr15 43616188 0 +chr15 43616189 1,G +chr15 43616210 G +chr15 43616211 G,T,C,A +chr15 43616212 0 +chr15 43616213 0 +chr15 43616214 0 +chr15 43616215 0 +chr15 43616216 1,G +chr15 43616303 A +chr15 43616304 G,T,C,A +chr15 43616305 0 +chr15 43616306 0 +chr15 43616307 0 +chr15 43616308 0 +chr15 43616309 1,A +chr15 43616618 G +chr15 43616619 G,T,C,A +chr15 43616620 0 +chr15 43616621 0 +chr15 43616622 0 +chr15 43616623 0 +chr15 43616624 1,G +chr15 43616625 G,T,C,A +chr15 43616626 0 +chr15 43616627 0 +chr15 43616628 0 +chr15 43616629 0 +chr15 43616630 1,G +chr15 43616857 A +chr15 43616858 G,T,C,A +chr15 43616859 0 +chr15 43616860 0 +chr15 43616861 0 +chr15 43616862 0 +chr15 43616863 0 +chr15 43616864 1,A +chr15 43616935 C +chr15 43616936 G,T,C,A +chr15 43616937 0 +chr15 43616938 0 +chr15 43616939 0 +chr15 43616940 0 +chr15 43616941 1,C +chr15 43617069 G +chr15 43617070 G,T,C,A +chr15 43617071 0 +chr15 43617072 0 +chr15 43617073 0 +chr15 43617074 0 +chr15 43617075 1,G +chr15 43617681 G +chr15 43617682 G,T,C,A +chr15 43617683 0 +chr15 43617684 0 +chr15 43617685 0 +chr15 43617686 0 +chr15 43617687 C +chr15 43617688 G,T,C,A +chr15 43617689 0 +chr15 43617690 0 +chr15 43617691 0 +chr15 43617692 0 +chr15 43617693 1,C +chr15 43617773 A +chr15 43617774 G,T,C,A +chr15 43617775 0 +chr15 43617776 0 +chr15 43617777 0 +chr15 43617778 0 +chr15 43617779 1,A +chr15 43617941 G +chr15 43617942 G,T,C,A +chr15 43617943 0 +chr15 43617944 0 +chr15 43617945 0 +chr15 43617946 0 +chr15 43617947 1,G +chr15 43617959 G +chr15 43617960 G,T,C,A +chr15 43617961 0 +chr15 43617962 0 +chr15 43617963 0 +chr15 43617964 0 +chr15 43617965 C +chr15 43617966 G,T,C,A +chr15 43617967 0 +chr15 43617968 0 +chr15 43617969 0 +chr15 43617970 0 +chr15 43617971 1,C +chr15 43618169 G +chr15 43618170 G,T,C,A +chr15 43618171 0 +chr15 43618172 0 +chr15 43618173 0 +chr15 43618174 0 +chr15 43618175 1,G +chr15 43618191 C +chr15 43618192 G,T,C,A +chr15 43618193 0 +chr15 43618194 0 +chr15 43618195 0 +chr15 43618196 0 +chr15 43618197 1,C +chr15 43618527 T +chr15 43618528 G,T,C,A +chr15 43618529 0 +chr15 43618530 0 +chr15 43618531 0 +chr15 43618532 0 +chr15 43618533 T,1 +chr15 43618605 T +chr15 43618606 G,T,C,A +chr15 43618607 0 +chr15 43618608 0 +chr15 43618609 0 +chr15 43618610 0 +chr15 43618611 T,1 +chr15 43619312 T +chr15 43619313 G,T,C,A +chr15 43619314 0 +chr15 43619315 0 +chr15 43619316 0 +chr15 43619317 0 +chr15 43619318 T,1 +chr15 43619393 C +chr15 43619394 G,T,C,A +chr15 43619395 0 +chr15 43619396 0 +chr15 43619397 0 +chr15 43619398 0 +chr15 43619399 1,C +chr15 43619453 A +chr15 43619454 G,T,C,A +chr15 43619455 0 +chr15 43619456 0 +chr15 43619457 0 +chr15 43619458 0 +chr15 43619459 1,A +chr15 43619591 T +chr15 43619592 G,T,C,A +chr15 43619593 0 +chr15 43619594 0 +chr15 43619595 0 +chr15 43619596 0 +chr15 43619597 0 +chr15 43619598 T,1 +chr15 43619602 T +chr15 43619603 G,T,C,A +chr15 43619604 0 +chr15 43619605 0 +chr15 43619606 0 +chr15 43619607 0 +chr15 43619608 T,1 +chr15 43619609 G,T,C,A +chr15 43619610 0 +chr15 43619611 0 +chr15 43619612 0 +chr15 43619613 0 +chr15 43619614 0 +chr15 43619615 T,1 +chr15 43619624 T +chr15 43619625 G,T,C,A +chr15 43619626 0 +chr15 43619627 0 +chr15 43619628 0 +chr15 43619629 0 +chr15 43619630 0 +chr15 43619631 T,1 +chr15 43619638 T +chr15 43619639 G,T,C,A +chr15 43619640 0 +chr15 43619641 0 +chr15 43619642 0 +chr15 43619643 0 +chr15 43619644 0 +chr15 43619645 0 +chr15 43619646 T,1 +chr15 43619792 T +chr15 43619793 G,T,C,A +chr15 43619794 0 +chr15 43619795 0 +chr15 43619796 0 +chr15 43619797 0 +chr15 43619798 T,1 +chr15 43619800 T +chr15 43619801 G,T,C,A +chr15 43619802 0 +chr15 43619803 0 +chr15 43619804 0 +chr15 43619805 0 +chr15 43619806 T,1 +chr15 43619921 T +chr15 43619922 G,T,C,A +chr15 43619923 0 +chr15 43619924 0 +chr15 43619925 0 +chr15 43619926 0 +chr15 43619927 0 +chr15 43619928 0 +chr15 43619929 0 +chr15 43619930 0 +chr15 43619931 0 +chr15 43619932 0 +chr15 43619933 0 +chr15 43619934 0 +chr15 43619935 0 +chr15 43619936 0 +chr15 43619937 0 +chr15 43619938 0 +chr15 43619939 T,1 +chr15 43620088 T +chr15 43620089 G,T,C,A +chr15 43620090 0 +chr15 43620091 0 +chr15 43620092 0 +chr15 43620093 0 +chr15 43620094 T,1 +chr15 43620096 T +chr15 43620097 G,T,C,A +chr15 43620098 0 +chr15 43620099 0 +chr15 43620100 0 +chr15 43620101 0 +chr15 43620102 0 +chr15 43620103 T,1 +chr15 43620442 T +chr15 43620443 G,T,C,A +chr15 43620444 0 +chr15 43620445 0 +chr15 43620446 0 +chr15 43620447 0 +chr15 43620448 T,1 +chr15 43620796 A +chr15 43620797 G,T,C,A +chr15 43620798 0 +chr15 43620799 0 +chr15 43620800 0 +chr15 43620801 0 +chr15 43620802 1,A +chr15 43620948 A +chr15 43620949 G,T,C,A +chr15 43620950 0 +chr15 43620951 0 +chr15 43620952 0 +chr15 43620953 0 +chr15 43620954 0 +chr15 43620955 1,A +chr15 43621028 T +chr15 43621029 G,T,C,A +chr15 43621030 0 +chr15 43621031 0 +chr15 43621032 0 +chr15 43621033 0 +chr15 43621034 T,1 +chr15 43621035 T +chr15 43621036 G,T,C,A +chr15 43621037 0 +chr15 43621038 0 +chr15 43621039 0 +chr15 43621040 0 +chr15 43621041 0 +chr15 43621042 T,1 +chr15 43621044 T +chr15 43621045 G,T,C,A +chr15 43621046 0 +chr15 43621047 0 +chr15 43621048 0 +chr15 43621049 0 +chr15 43621050 T,1 +chr15 43621077 T +chr15 43621078 G,T,C,A +chr15 43621079 0 +chr15 43621080 0 +chr15 43621081 0 +chr15 43621082 0 +chr15 43621083 T,1 +chr15 43621087 A +chr15 43621088 G,T,C,A +chr15 43621089 0 +chr15 43621090 0 +chr15 43621091 0 +chr15 43621092 0 +chr15 43621093 1,A +chr15 43621094 A +chr15 43621095 G,T,C,A +chr15 43621096 0 +chr15 43621097 0 +chr15 43621098 0 +chr15 43621099 0 +chr15 43621100 0 +chr15 43621101 1,A +chr15 43621113 T +chr15 43621114 G,T,C,A +chr15 43621115 0 +chr15 43621116 0 +chr15 43621117 0 +chr15 43621118 0 +chr15 43621119 T,1 +chr15 43621273 T +chr15 43621274 G,T,C,A +chr15 43621275 0 +chr15 43621276 0 +chr15 43621277 0 +chr15 43621278 0 +chr15 43621279 T,1 +chr15 43621864 T +chr15 43621865 G,T,C,A +chr15 43621866 0 +chr15 43621867 0 +chr15 43621868 0 +chr15 43621869 0 +chr15 43621870 0 +chr15 43621871 0 +chr15 43621872 0 +chr15 43621873 0 +chr15 43621874 0 +chr15 43621875 0 +chr15 43621876 T,1 +chr15 43622028 T +chr15 43622029 G,T,C,A +chr15 43622030 0 +chr15 43622031 0 +chr15 43622032 0 +chr15 43622033 0 +chr15 43622034 0 +chr15 43622035 T,1 +chr15 43622037 T +chr15 43622038 G,T,C,A +chr15 43622039 0 +chr15 43622040 0 +chr15 43622041 0 +chr15 43622042 0 +chr15 43622043 0 +chr15 43622044 T,1 diff --git a/paraphase/data/strc/ref.fa b/paraphase/data/strc/ref.fa new file mode 100644 index 0000000..0e963f4 --- /dev/null +++ b/paraphase/data/strc/ref.fa @@ -0,0 +1,352 @@ +>chr15_43599000_43620000 +GCCGAAATATGGCAGTGAGTGAGCCTCCGGGATGTAAGATAATCTGAAATGAAATTCAGG +TTGAGTGGGGAGGCAATTGGAAATGAGCAGGCAAGTCAGTCAGTGATAAAGAAAAACTCA +GACTGTAGGAAGCAGATCAAAGATTAGTGTCCCTTAGGTGGAGCTGGTGCAACTGGTCAT +CGATGGAGTAAACTATTTGATTGATTGTGAACGGCGTCTGGAGAGAGGCCAGGATATCCG +CATCCCCACACCTGTCATCCACACCAAGCATTAACTCCCCATCGCCAGCTGATGACTCAA +GATTCCCAGGAGTTCTGCTCATTCTAATGATGGCCCATTCTACTTGCTCTGGACCTGCCC +CCGCATCCCCTGCCTCCATCCTAGTAAAGACTCCTTGCTATGCTGCAGCTGTCTGTGTTA +CTTCTAATGGTGGGGTGAGGAGGGAGCAGCCTTCAGGAAATGAAAAGAGGCAGTGGGATT +ATTTATGATGGAAAGAGACTCCAGATATGGCAACCCAGGAACACTGATTCTCAGGTGGGT +GGAAAGCATTAACATTTTACCCATATTCCTCATCAGCTTCTGAAAATAATCAGGATGCAC +TTCTGTTTGCACTTTATTCATTATGACTTAAGATTTCTCTCCCCACAATCTCCTTCTACT +GTAGAGACAGGCTCATAGCAGGTGGCCAAGGAAGCTGATAGTCAATACCAGGGACCAGGA +AGGTCGTGACCAGTCCTGGAGGCCCCAGGCTGTACTTCGACCTATAATAGACAGGGAATG +GGAGTAATATCACAACTCAGCTCTCCAGGAGCATTGATACTTGGAAATTAGCGCTCTGCC +TGTAGACTCCTTCACTCCAGGGATCTCCCTGGGTGCACTCTAAGAGCCAGACAGCACCAA +ATTAGGGGTTTGATTCTGGGTCAGGAGATGGAGGATCAAGCTGTGCAGCTGGGAACTCAC +CTTGCTGTTCTGGGCTCTCCTTTCCCTCATGTTGGGCCCATGCAACTGCTCGTCGCTGCT +CAGGACTCAGAAAGGCCATTTGCTCAGGAGTGACAGCCACAGCCTGAGCACTGGTGAGAC +TAGATAGTTGGATGGGACTAAACACCACCTGAGGGCAGGGGTAGGAATCAGTGCATGCAT +GTAGTCCCCATTGGGCCCTGGCTCTCCTGTGGTCACCCCAGTCCATTAATACTTACAGCA +AATTTAGGAGGAGGGATGACAGAAATGGCAAGAGGAGTAACGCCCTGGATCTGTCCCCGC +AGCAGTGCTGAAAGAGCCAGGTCTGGGATCCCAGCTGTTGAAGCAAGTGGCATCCAAACA +TTGTCTTAGACTGACCTTCCCTCTCTTCAAACCTATAGACCTTCTCTAACTACTCCCAAA +GTGCCCTATCATAGACCTTCCCCAATATGTCTCTAGCCCCTTATTTAAACACCCTCAGGC +CCCCACCTTAAGAATTGCAGGGCAGTCTTCCATCCAGTCCACCCATGGTATAGAAACCAA +ACCAACTTGCACCAGCAGTGGCCCAGCTCCCCACCTGCTATGGTGCCAATTTCAGTGAAG +ATCTCAGGCCCCCAGTTACTGATTGGGCCAAACCCACCAGGCAGTACAAGTAGGTGGGCC +AGAACCTCCAGTTGTTCCTCAGAGCACTGGAGATGCAGGGTGCCGAGGAAGAGAGCTGCT +TGGCTGTAGAACAGTAGGAAGGAAGGAAGAAGAATTCGGCTTCAGTGAAAGGGGCTGTGG +TCATGAGACAAAGGAAGAGATGGCTTCAAATGAGTTCCCTTCCTCCATGGGACCAGACCT +TCATGATCCTTCTTTCCCCAGTAAGTCCACCTTTACCTCAGCACCACCACCCTCAGCCCC +TTCACAAATGACCTGAACTCCCAACTGCTGATGTGCTGGAGCTCCTCTGGCCGCAGTCCA +CAGAGAGTATAACCCAGCGCTGTCAGATGAACGAAGTCCAGGTGGCTCACATGCCGACCA +CTCTGCCGTAGGAAACTGGAGACCACAATGCGGAGCTGGGGTGGGGGGTGGGAGAAGAGG +GGAAGGAGGAAAGTTATGGAGAATTAATGGACAGGGAAGTGATAGGTGTTACTGGGTTAT +ATTCTGTTACTATTAAGACCTAAGGAGTCATGGGGAAGGCTGAGGACTCAGAAAAGAAAA +GGAAAGAAAAAGAGGAAGCCTCCAGGAAAAGAGGTAGGAGACAGTATTATGTGTCCAGGG +CCTCAGAGTGAATAAATCAGAGTCCTGAAGGTCACTAGTATGGGGTATCAACAAAAGATA +GAAAGAAGGACCAGGTAGGGTCACAGGAAAAAAATTCCTTGGGCTTTAGATGATCTATAG +GGCTGGGTCTGTGGGATGGGTGTTTGGGAAGCCGTAGGGAGGAGGAAAAGTGTTACCTGA +GTGGTGCTCCAGCCATCTATCTGCCCCAGGGTGCTCAGCACTCCCCAGTCCACTAGGATC +AGCTCCTGTAGTTCCCGATCTCCTAGACCTATTAAGAGCCTACCAAGCTGCAGGATCTGC +TCAGGACGAAATCCCCGGGGGGGACCCCACAACTAGGAGAAAGACAGGAACAATGTGAGT +GGAAAAGCAGTGGATTGGGAGTCATACTGCTGGGTTTTAAGTCTTGCCTCTGCCCTTAGC +TGTATAACTCTAGGTAAATCATTTGCCTTTTCTTTAGTTTCCTCCACTATAAAATGAGAC +CGTTGCCTTACAGTTCCTCTAAGGTGTTTTGAAAGACATTACAATCAGTGGAAAAGAAGA +GCAAACTAACTCTAGTCCATGAGTTCCACACTTTTATGTGCATCAGACACAGAGACGCTT +ATTAAAACAGATGCCTAGGGCCCAGCACAGTGGCTAACAGCTGTAATCCCAGCACTTTGG +GAGGCTGAGGTGGGAGGATTGCTTGAGCCCAGAAGTTCGAGACCATCCTGGGCAACATAG +TGAAACCCCATCTCTACCAAAGATACAAAAATTAGCCAGGTGTGGTGCACCTGTAGTCCC +AGCTACTTGAGAGGCTGAGGCAGGAGGATCGTTTGAACCCAGGATGTAGAGGTTACAGTG +AGCCAAGATCACACCACTGCACTCCAGCCTGGATGCTGGGTGACAGAGTGAGACTCTGTC +TCAAAAAAAAAAAAAAAAAAAAAAAAAGGAGATGCAGATGTCTAGACTCTTTCCCCCAGA +GGCTTATTTTTTAATTTATGAGACAGGGTCTTGCTCTGACACCCAGGCTGGAGGGCAGTG +GTGCGATCTTGGCACATTGCACCCTCCACCTCCTGCACTCATGGGATTCTCCTACCCCAG +CCTCCTGAGTAGCTGGGATTACAGGCATGTGCCGCCACACCCAGCTAATGTTTTTTCTAT +TTTTAGTAGAGACAGGGTTTCACCATGTTGCCCAGTCTGGTCTCTAACTCCTGGGCTCAA +GCAATCCACCTGCCTCCGCCTCCCAAAGTGCTGTGATTACAGGCATGAGTCACTGCGCCC +AGCCTCCCCAAAGGTTTTTATTAAGTCAGTGAAGCATGGTAATCTGGCACTGGCACTGTT +CCCATGCTCAGTTCAGCAGCACATACCCCAAAAAATTGGAATGATACAGAGATTTGCATG +GCGCCTGTGCAAGGATAACACGCAAATTCATGAAGCATGCCTTTTTTTTTTTTTTTTTTT +TTTGAGATGGAGTCTTGCTCTGTCACCCAGGATGGAGTGCATTGGCTTGACTTCGGCTCA +CTGCAAGCTCCCCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGG +GACAACAAGTGTGCACCACCATGCCCGGCTAACTTTTTTGTATTTTTAGTAGAGACCGTG +TTTCGCCGTGTTAGACAGGATGGTCTTGATCTCCTGACCTCGTGATCTGCCCGCCTTGGC +CTCCCAAAGTTCTGAGATTATAGGCGTGAGCCGCTGCACCCAGCCGAAGCATGCCATTAT +TTTTTTAAGAAAGAAAAAAAAATAAAGAAAAAAATCTGTTCCCTTGATGATCGTGATATG +CATTAAAATTTGGTAACCACTGCTTTCGTCCCTCTATTCATTATACTAGCTCTCTTTCCT +CAATATTTATCTCCCTTCACCTCCTACTTGTGACCCAAATCTTCTAACTCTTCTATCTCT +TGCTTCCCCCACCTCTCATCCAACTCTCCATTCTCTTTGTGTCCTACATCACACCCAAAT +AGCTTCCTCATGGCCAACCCCAGTTGGCTCTCCATCCCTAACCTGTTTTGCTTTGCCCAT +GGCTGCCCGCAGTTCCTCAGGCCCAAGTCCTGGGTCTCCTGCAAATAATGTCAGGCAGTC +CTCAAAGTCTGAGAGCTCCATCTCTGCAATCTGGGTTGCAGACCAGGCTGCTGGGAATGT +CCCTCGTACATCTGCACAATTTGGCACAGGTTCTGAAGGGGGAAGGCAGGGCCAGGAGGT +CAGCGCAGTAATAAAATATGCCCAGAGAGATATCTGTAGATAGAGTGAGTCTTCCAACCT +TTGGAGGATAGAGCACTGTGAGAAATAGGGATCAAAGGAGTATTACAGAGTAATATGTAT +AGGGCTTAGGAGATAATAGAACAAGAAGTGATTGGAGATGCCAAGACTTTTATAGATGGA +AGACTGAGGATGTTATTCAGAGATTTAATGGTAGCACTGAATTTGAACCCAGTTCTCCAG +CCTCACAATGTCTCCTTCAAAGCTTCTAGGGATCTCCCCAGCAGAGAGCTCCACACTTAG +CCGAGCTAGAGCTGAGTCTAGAGCAGACACGAGAGAGCAGGAAAATGGAAGTTCTGGGCT +AAAAGAAAACTTAATAGTTTACAAACTCCCAGAACTACAGAATTCTAGAACTACAAGAGG +CTTGAAGATCATCCCTCTAAACTCTTTGCTTTATAAATTAGAAAACCCAGTCCCAGGGAA +GAGCACATGTAGAACCCAGACCGTTTGATACTCCCTGTACATCCTGCTGGACATATAAGT +ATTTGGGTAGTTTCACCTGGAAGATCCTCAGCAGCTGGTCGCACCACCCCTGCTACCAGG +GCTGCTTTCTTGGCAGCAAGCTGTGGCTCCCTACACAGCTGTCCAACTCTGCTCTGCTCC +CAGCTCTGCTGCTTTTCTAGAAGCCGCTCCAGGGTCTCTGGACCCAAGGCCTCCTGCATG +AGAAGGTAGAAGGAGAGTGGGGAGATCTGGACAGATCAGGACCTGCTGCTATAGCTCTAA +GTCCAAAGTCCTGTCTCTGTTTTGCAGTCTCCCCCCAGATCTAGGCTTCATATCCTTTGC +CCTCTTTGGCCACGGGTCCCCATAACCAGCTCACCAGTCCCCTCTATGCATTTACTCCCT +TCCCTTCTTCCTTCATCTCACCCTGGGGATCAAGGAAATTGCCTCAGTAGACAGAGTGAA +TACTAGGCGTCCAGCTTGCTCTACTTCATCCTGGCTCCACAACTCTGGTTTCCTGGGGAC +AGGAAGAAAATCGGGGGCTGGGGAGCTGAGGGAACTGTGAGGAAAAGGAAGGGGAAAAGA +GGACATGCTAGGATTTCGGACACAGGGCTCCAGGGGACCTTAAGAATATGGAACAGGCTG +CCACTGATGATGGTGGCTGAGGGACTGGGTATAGAATGAGTTAGAATCTGAAGTTCTCGA +AGGTCCATACCCAAGAACAGACTCCTGCAATAGCAGCCATCCCAGCTCTGTGGCAAATGT +CTCTCCTAGGCAGAAGCCTTGCAGCTGACTGAGATGGGACAGCAGGATCTGTAGGGGGAT +CTGTCGTGTGCTCTCTGTCCCCAGGAATCCAACCAAAGGGCCTAAGGTCTCCAGCACTTC +CCCTGAGACTGGAGTCTCCTTTGGAGCCTGGAGAAGAGCATCAGAACTTGGACAATGCAC +TTCTGACTCAGAATACCAGACACCTTGATCAAGACACCAAGCCCCCTAACCTTCCCTCCC +TCCTGCTCCCAGCTCAGCACCCTATGATGCTCACTACCTTTGTTCACTTCCTTAAGCAAT +GAGCCCAGATAGGAGCAGGCAGAAAGACTGAGAGGTGGACTCCAAGAGATTTCTCGGACT +CCAAGAGGTATGGACAAGTAACGTGAAGCATATTATCAAGGAACAGAACCCAGAGGCAAA +ACAGGGGCCAGGCCTTGTAGACAGGAATCTGTTTGGCAGCAAGAAAGCAAGAAGTAGAGG +GTTGGGGGACAGTCAGAGAGCATTGTCTGAGGAGCAAGAATTGCCCAGGGCAGCAAATCT +GAGTCTGGTAGGGTGGACTCTTACCAGGTTTTGTAGTGCCCTCTCTGCCAGGGCTGCCTG +GTGGAGGGGGGTCAGTGCCAGCAGCTGCTCTGGAGCTCTTTGCACCAGCTCCACCACCAA +CATAATGGATTCATTGGATAGTCTGTCCATCAACTGGATCCTATTACAGCAATTTGACAA +CAACAGGATTCAGGTGGAGCTGGGCCAAGTCGAGAAGGGACCACAAAACCCCACAGTCCG +CAGCTAAGATGTGACCCCAGACCAAATTTAGTGAAGCTGGACAGGAACTGATAGTAAACA +GCTCCAGGGCCAATTCCCACCATCAAGCTTGAGTGGGGGAGAAAAAGGGAGCAACATATT +TCTAGTCTTCCTAAAAAAAAAAAATAACGAACAATCCTACCTGATCTAATGGACTCAAAA +GCTCCAAATATTAAGTACCATTAACATTTCCCCATTTTATAGATTAAAAAACTGAGGTTC +ACAGAGATCAAATGTTGAGGTCTCTCACTCCCAATTCCCGTTTTTTTCCACAGGACCACC +CTTCCTCTGCTTGTGAAGAGGTCCCTTCTTGTTTGTACGTGCTATACAATTTACAAAGTT +CTTTCAGGTGTTATCTCATTTGATCCTACAACAAGACCTGGCCTCACTCCATCACTCAGG +CTGGAGCACAGTGGTGCTGCGATCTCAGTTCACTGCAACCCGCACCTCCCAGGTTCAAGC +AATTCTCGTGCCTCAGCCTCCCGAATAGCTGGAATTACACGCACGTGCCACCACGCCCAG +CTAACTTTTGTATTTTTAGTAGAGATGGTGTTTTGCCATGTTGGCCAGTCTGGTCTTGAA +CACCTGACCTCCGTGATCCACCCACCTTGGCCTCCCCAAATGCTGGGATTATAGGCATGA +GCCACTGCACCCAGACAAAATAGGTGTTTCTCTTATCCTTCTTTCACAAATGAGAAACTC +AAGTTTTTTGATGCATGGTCTAGGATCTTTCACCTCATCTGTAACCTTGGGATTCTAAAT +TATCTCACAGAACCCACATATTTAAACAGATCTGAATGGCATTAAAAAAAAGTAAAAACA +GGCCGGACGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCAGA +TCACAGGGTCAGAAATTCGAGACCACCCTGAGCAACATGGTGAAACCCCGTCTCTACTAA +AAATGCAAAAATTAGCCGGGTGTGGTGGCACGCGCCTGTAATCCCAGCTACTCAGCAGGC +TGAGGCAGGAGAATTGCTTGAACCCCAGAGGGAGAGGTTGCAGTGAGCCGAGATGGCACC +ATTGCACTCCAGCCTGGGTGACAGAGCGAGACTCTGTCTCAAAAATATAAATAAATAAAC +AAATAAATAAATAAATCCCTTTTACCCGAAATCAGAGGTGATAACCTGTACCCTACCTAG +GATTACCAGTTCTGGAACTGGGCTAAGTCATACAAGAGCTGAAATCTGTGGAAAGGCCTA +TAAAAATATAAGAATGTTGGGAAGCCGAGGTGGGCAGATCACTTGAAGCCAGGAGTTCAA +GACCAGCCTGGCCAACATGGTGAAAGCTCGTCCCTACAACAAATACAAAAATTAGCCTGG +CATGGTGGTGCACACCTACAGTCCCAGCTACTCGGGAGACTGAGGCAGGAGAATTGCTTG +AACCTGGAAGATGGAGGTTGCAGTGAGCCGAGATCACGTCACTGCACTCCAGCCTTGGTG +ACACAGCAAAACTCTGTCTCAAAAAAAAAAAAAAAGAAAAAAAAGAAAAAAGTAGGAATG +AAGTCAACTGCTTTTACTCCACTTCAGCTCCATATTCCCCAGGAAGACTGTAACACCAGC +ATTTTCCTATCCTGACTTACAGTGCGGATGGCAACGGGGTGGTGGTAGGGGTAAAATGAG +AAAAGCAGCATATGAGTAACATACTAATAAATTTCTCCATGGGTAGGGAGGAGCCTTGGG +TTCCTATAGCCTTTTCTTCCCACAACCTGTATATGAACGTGCTCTACAACATTCCTGCCA +TGGGGCCATCTGGCTTAGGCTCGAACAGCTCCAAGGACCAGGAACTCACTACCTTAGTTT +TTCTTATACTATGTAAAAATCTGATCTCCGTTAACTACTCTTAGTCTTGCCCAATGGGGC +CACTCATAAATATGACAATCTAATTTCTACTTGATAGCCCCTCAGGTATTTGAAGGTATT +TATCACATTTTTCATATATTTTTCTCTCGATCTACTCCTCTGAATGCACTAAGAACACAG +TGCCCTGAGCAAACTACAATCCTCTGGGGCAGATTTGGAGCATTGCCTCCCTCACTTCAG +ATGCTCTGTCTCAATCTCAATTAATGTGGACTAGGTTATTGTAAGATTACTTGGCAGGCT +TCAATACAGTGTTGCCACCCTCAGCTGCCCTGTGCTGGAGCATATTCTGGGAACCAGTCA +ATACATAGAGCAACTTTGAATAAAATCCTCTGCTTTGGGCTATAAATTATGTCAGGGAAC +TACCAGACAGAAAAACTACCTCCTCCAGGGCATTCCCTAAGGGCAGATCTTCACCCCAGA +TTGAGCTTTCTGAGGGGCCAATATCTCTAGCTGCAGTACTTACGGTAAGTCCAGGAGTAG +CTTGCTATCCAGCCCGTTCAGTTCTGGCCCTACAGTTGTCCATTCTGGTTCTGGCATTGC +CATCCTCCGCTGTAGCTCTGCCCAGATACAGGCCCTCTGTAGGGAAGCAGTGTGAGGCCA +GAGCAGAACATAGGAGCTGGGTTCTATACCTAGGGTCCCAGCCTCCCTGCTCCCACTAAA +GTCCAGGCACCCCCTCTCACCAGGCTCCCTCGAACTCTAGTGGGCAGCTGATAGATCATG +TGCACCACTTCAAGGAAGTCTACCATGGAGTTGATCTGCTGCAGAAACTCACAGGACATG +CCTCCTGCCAGGGTGCCCAGAGCCCTGTGGGTGTGTGAGTGGGGAGAGGCTCATTCAACA +CATACAACAGCCTGTTTGCTCCTTGATATCAGTACTCTCAACACAAGTACAGATGTTCAT +CCACCGTAATTTATGGTTTGAGAAATGTTTTTACATAGATATCTCATCAGCTCCCCATAA +CATCACTGCTGTTTATCAGAGGAGAAAATAAGGATTTATGGAAGTCAGGTGACATTCCCA +ACATCATTCTGCTAGAAATGTGGTAAACTGAAACTAGAGCCATGTCTTAGATTCTGAAGC +CCTTGGGAGGCTGAGGCGGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCA +AAATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGGTATGGTGGCACATT +CCTGTAATCCCAGCTACTCCAGAGGCTGAGACAGGAGAATCGCTTGAACCTGGGAGGCAG +AGGTTGCAGTGAGCCGAAATCGCGCCACTGCACTCCAGCCTGGGCAACAGAGCAAGACTG +TCTCGAAAAAAAAAAAAAAAAAAAGATTCTGAAGCCCATTGTTCTTCTAATGTGGGTGTA +AGTAAAGTGATACCTTCTTATTCCTATTTATTCTTGCTTGGCCAGTGCCTCTGAAGTCAT +CCTGAAGTAAACTAACAGTAGCAAATGACTACTGGGACTTTACAATGGACCAAACACTGT +GTTAAATATCTTACATGACCTACCTCATTTAATCTTCACAGTAACTACCTATGAGGCTGG +TGCCATTGTAATTCTCATTTAAAGAAGGGAAAACTGACACAGGGACGTTAAATGACTTGC +CCAAGAGGACAAGTAGGTGACAGGACTGGAACCTGAATCCAGGTTTTCTGAGTCCTTTGG +CTCTAGTCAGGATAGGCATGGCACTGTGGAAGGAAAGGAGGGCAAAGGGAGGTGCTAGGA +GAACGTCCACGAGGCAGGGACTATGCATCATTCATCCCTATATCCCCCATTCCATATCAC +ATGGTGCCTGGCACATAGTAAGCACTCAAAAAAATGTTGGTCGAATTCAGCGCACTGCTC +AACACAAGTTACTCACTGCAGATTCCTGAGGGTAAGGTTGGTGGGTACTTGCATCTTCTT +CCAGAGAAACTGTGCCTACAAGAGAAAGAAAGACGAGCCCCTTCCCAGAAGAGACACTGT +CCAAGGATACCCCCTAGAGTGGAGAGGCAGGACTGCCTTGGACCCAGTCCTGCCTCCTCA +CTCTGTGGGACCCACCAGCCTCCCACTCTCCCAGATGCCCACTCTTCTCTCGAGAGTGAG +AAAGAAATGAAAGAAAAGGGAAGGAGAAAGAAAAAGAAAAGCAAGTCACATTATAAGAGA +TACTAGGGGGCTGGGTGCAGTGGCTCAGGTCTGTAATCCCAGCACTTTGGGAGGCCAAGG +CAGGAGGATTGCTTGAGGCCAGGAGTTCAAGACCAATCTGGACAGCATAGTACCTCAACA +TAGTGTCTCCACATAGTGTCCACATTTTTGTATTGTCTGTCTCTACAAAAAATTTAAAAA +TTGGCCGGGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGAGAGGCAGAGGCAGGCA +GATCACAAGGTCAGCAGTTCGAGACCAGCCTGGCCAATATGGTGAAACCCTGTCTCTACT +AAAACAAAAAAAAAATTAGCCAGGCATTGTGGGGAGTGCCTGTAGTCCCAGCTACTCGGG +AGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGACAGAGGTTGCAGTGAGCCAAGCTTG +TGCCACTGTACTCAAGCCTGGGTGACAGAGCAAGACTTCATCTCAAAAAACAAAAAAAAA +ATTTAAAAATTAGCCAGGCATGGTGGTAACCACCTGTCGTCCTAGCTATTCAGGAGGCTG +AGGCAGAAAGATTGCCTGAGCTCAGGAGTTTGAGGTTACAGCGAACTCTGATCATGCCAC +TGCACTGCAGCCTGGGCAACAGAGTGAGACTGTATCTCAAACAATCAAATAAAGGATAGT +AGGGATAAGCTGGCCAAATAGGGCTAAGGGATAGGTAAAGAAGGTAGACTACATTGAGGG +AGGAAGTTCTGAAATTAAGTGGGGAGAATTACCTGCTGCTCAGACCAGGGCAGCTCCCAG +TAGCGTCTTCGATTTGCCAGAAGAGCCAAGGAATCCAGTTGTAGCAGCTTTAATGGGAGC +AGGGGAAGCAGGCAGTCTGGCCAGGTGGTGGGAATAGGCTGCTGGCGGTGATGGGTATAG +CAGTGTAGTGATAGGTGAGAGAGTCTGTACTTAGCACTCAGAGAAAAGGAACCATGGGGG +TGGAAAGGGACCAAAGCTAGGACAGAAGTGGAATGGCATTTGCCACTGTAGCAATAAACG +CCTAAGAGCCTCACACTGCAGGCTGCTGTGGTCAAGTGTTGCTGCTAGTTAGGCACCAGG +CAAACAAAGTCCCTTAGCTTCTCCCAGCCTTGGTCTTTTCATCTGAAAGCAGGGTGGGTG +GCCTGGAAGATTTCTCAGGCCCTTCCAGCTTTGACATTAGTCTAGGACAATGTGAGTGGA +TCATGCTGTCCTGCATCTGGCACTGGGGAGGGCTGCTAATGGGCACAGAGCTAACCAGAA +CTCTCAGGGCCACTGCAGGGTAGTGTGGGAGGTAGCACCAGCCATATGAAACTGCCCCAG +AATGGGGAGAGGAGCTCAGTTGGGAGATCTCACACTTACCTGACCAGGGATACACACAGC +TGGACCAGCACCTGTTGTACCCATATTTTTAACACCATCTTTAACCTGCATGAGAATTGG +TTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAAGTGGGGAGGTATGAAAGG +CCTTCCCTGGCCTCAGGGAAGATGCCTTCCTCCCAACCAGAAAGAGCCGGTATCCCTGAT +TATCTCATCCTCCTTGCCACTCTCATACCCGAAAGGTCTGCAGCAGTGCTGCGGTCTGGC +AGGCTGAGAGGCGTGACAGTTCAGGGGCCAGGCAGGAACAAGCCCCGACCAGGACTCGCC +TAGGGATGGCCTGGAGTGTCTGGGGGCTGAGGCCTGGTAGCAGAAGGTGCAAGGAGCAGA +GTTCCTCCAGGGATAGCTAAAGAGCAAGAGAGACAAGAGGCCTGAAGAAGAAGGGAAGGG +TGTTGGGGAAGACAACAATCACAATGCAGCAAGGCAGTAAGTATGAGGGGAGCGGAGAGA +GAGGAGGGGCAGATGCAGGAGCCAGGTTAAAAGAGGATAGAGACCTCTGGCGGGAGGCTG +AGAAGTTGCTGCTACTCACATCGTGCCTAGGAAGGAGCCGTCCAAGCAGCAGGACAGCCT +GTGTTGGGATGGGACCGAAAACAACAGCTCTCAGGACCCAGCAGCCAGGTCCTAAGTGGG +TTCCTAAGGCAAACCCTCCAAAACTTCCCGCTGCCTCTTAACCATCCCTAGTAACCATCC +TCAGTGACTCAGGTCCCTTCTTGATCTGAGGAAAGGGAATGAAGGCGTCCTCCTGCAGAC +AGAGTTTACCTCTACTGTTCACCCAGGGAAAGGAGGGGCTGCTTTTATCCCTGAATGGGA +TTCTCTTGACTGGGGCTCAAGCGGGGATGGAGGGTACATGCCAGGGAGTGAGACAGGCAA +ACCTGAGCAGGTGTAACACTAGTTCCCTGCAGGACAGGAAGCATGGCTCTAAGCTGGGGC +TCTGACAGCTCCTGAAGGAAGCGGAGGCCCAGCTGAGAGAAGAGAGGGGCCCAGACCCGC +AGCTCCCGGGGGCTCAGCACCGCTCCTCCAGATGAGCGCAACACACCCAGAAGTTCATAT +GCCACCTGTAACCAAAGAAGGTTTAGGACTGGAGAGAAAAGACAGATGAACCTTGAACTT +CCTCTCTAATCCCACATCTGAATTTCCCAGGGCCTGGGCTAGGGTTTGGAACTAGGGATT +GGACTGGAGTAGATTAGGTTCAAAATGGCAAAGGGTATGTTTGGGGTGCTACAGACCAGA +TAGAGAAGGGCCAGGAAGAGGTGGTAACAGGTAAGAATGGCCATGGAGTAAGGAATGGAG +GTTGCAAAAGTGGGATGGGGGCTGGCACCACTGTGCTCAAGTTGACACAATGGGAAAGCC +ATCCACACAGACCCAGGGGAAGCCTAAAGTCTTGTAGGCTTGTGCTGAGGGGCTCACCCG +TCCTTCTGGGCTGAGGGTCCCATTGGCTGCACATTCCAGCAGCCCCCGTTCTACTGGCCC +CGTTGGATCACTCAGGGGCACTAGGCTCTGCAGCTCCTCAGGGCTCAGGAAACAGGCGAG +GGGCCCAAGCCTGAGGAGGGCATGAAGCTGACCTTTGTGTGGTCCCCAAGACTAGCACTA +AGGCAGGCCTGGCAGCAGATGGTATGGGTACATACACTGGGCACTCTGACACTACACCAC +TACACAGGGGCACCATCTATAGACCTCTCTCCTGGAACTCTATGCTGCCTGAGGCCACTG +CCTACTCTCTTGCCCCTTTCTTCCTAGAACACCGACCCAGCGTGACAGCTGTGCCTTCCT +CTCTCTACCTCTTCCACTCTTGCCTTGGTTACTGATCCCACAACAACTCACCTGCGTACC +TGCTCCTCACCATCCTGGACACTCTGGTTTACCAGGCTGCGCAGCACATGGCGGGCATGC +CCTGGCCCCAGACCTGCTGCTGGCCAGCTATCCTCCAGGGCCTGGATCTGAGATGGGGAA +GGTTAGCTCGCACTGGGGTCACTATAGGGTTCTTGCAGGGAGGGGCCACTGAGAGACATG +CCATTGACTCTTTGTCCCCACTAGCAGAAGATGGGCCAGTCAAGTAAAGAAGATGATTCT +CATACCTGGTGAGGGCTGAGCTGCAAAAAGTTCTCCAGAGGGAGGTGGGGGAGCAGGGGC +AGCACTGCCCACAGCAGCTCCTGGGGCCAGGCAGGCACTTCCCCAAACAGTTCAGGGGCC +AGCAGTCGCTTTGCCAGAGCCTCCTTCTGTTCAGGCCTCATTCCTGGGCTGTAATCCCGG +ATGGCAGCCAGGCTGCGAGGAGTCATGGGTCTTAACCCTTTGTCTTCCTCCAAACTGTTT +TCCTCTGACCCTCTGTAGCCCTGCTGTCTAGATAGGAAGGAGACTCTCCCTGTTTTTTTG +TTTTGTTTTGTTTTTTGTTTTTTTTAAGACGGAGTCTCGCTCTGTCACCCAGGCTGGAGT +GCAGTGGCACGATCTTGGTTCACTGCAACCTCTGCCTCCCAGGTTCAAGCTATCCTCCTG +CCTCAGCCTCCAGAGTAACTGGGACTACAAGCGCCCACCACCATGCCTGGCTAATTTTTG +TAGTTTAATAGAGACGGGGTTTCACCATATTGGCCAGGCTGGTCTCGAACTCACTAGGCG +GACCTAGTGATCCACCTGCCTCGGGCTCCCAACGTGCTGGGATTACAGGCATGAGCCACC +ACGCCCATCCTCTCCCTGGTTTTAATAGAGAGGAGCTCTCTGATCCTTTTGTAGTTCATT +AGCATAATGATTGGGTTTTCACACTCAGGCGTGAGATGTGCCTCTCTCAAACCTTGCTAC +GATGTTGGCACATTGCCTATCTGGCATGAAAGAAAAAAACAGAGAGGAGCTCCGGAAGGG +GCTGCTCCAGGTGGATGCGGTTGGCCTGACCGTGACAGGAGCAGGAGGAGTAGTGCAAGA +ACCTTACACACTGTCGTTGGCTAACAGGGATGGTGGGAAGCGCATCAGGTCAGAGACAGC +CAAGAAAGGGATGAGTGGTGACAGGTCAATGAAGAGCTGGGAGGTGAGGCGTGGGTACCG +CTGGAGCAGCAGGGCTGTCAGGCGACCCAGGGCCTGCTCCTCGGATGGTGGTACCTGTAG +GGGCACAGGAATGGATTCTGCCTCTGGAGATGCTCAGTATCCCTGCCATTTCCTCTCTGG +AGTTCCTCCTCCCACTAAAGCAAGCCCTCTGAACACTAGGCTCCAGGTGCCCTTGCTCCC +CTTTCTCAGTGCTCACCTGCAGCTTCCCCTGCAGCTTCCCCTGCAGCATGAGTGTCAGGA +AGCCCTGTGCAGCCTCCCTCTCTGCTGAAAGCACCAGCTGCTGGAGGTTTTCTGGGGGCA +TATGCAGGTACGCCTGCGAATAGGAGGTCATGGTGGGTATGGCCTCACCTCACGGGGCAG +GGACCTCCAGCCATGGCTGTCACCCAGCCCACCCATGTGCCCTCCACCTGTTACCTGCAC +TAGAATCTGCAGGGCCCAAACACTCTTTTCCCTCTGGAGCAGATCCCACAGCACAGGCTG +GTGGGAAGAAAGACAATATGCCAACAGAGCTGTCTTCTCTGTCTCTCCTGCAGCTCCCCC +AGGCATTCCAGCTCCACTTGCTGGGATGCTTGTCCATCATCTCAAGTGTGATTCTTCAGA +TATTAAGCTCTTTTAGGTCCGGGATCATATCTCTAATTTCTCTTGTACCCTCTCCCATGC +CAGGTCCATAGTTGGCTCAATTAATACTTGTAGACCTGAACCAAATACTATGAGACTATC +TAATTCCATTTTACTGGGCCCAGGGTTTGAAAAAGGTGGCCCCAACATGGAGTAAGAAAT +CTTTATATATCCTACAGAACCTAGCAAAGTGCCTTGCATATAATAGGCACTCAGTGAATT +AATTATGGTGTTTAACATATTCTCACCCCTCCATTCTACAGACTAAAAACTGAAGCCCAG +AAAAACTAAATGAAATTGCTTGCCTAAGATCACACTAATTTGTGGCACAGTTGAACCTAG +AACCAAAGTCTTCTGATTGGTAGTGTCCCTGAATCTTTCTCATTCCTACCTTATCTTCCT +GTATTCTGCCAGCCTCCAATTCTCACTCTCTAGGGGAAAAATTTAGAGTCCTCTAACTTG +ATCCTTTTTCTTTCTGGTGCAGAAAGTGAGAGAGTCAAAGAGCCCAAAAGAATAACAAGG +TAATATTTATTGCATGTTTGTGTATCATGCACTGTGCTAAGCCCACAGCATGTATTATCT +CATATACCATTATCATCCCTAGTTTATAAATGAGACGAGAAAAATATTGCTCAGAGAAGT +TAAATAACTCATTCAAGGCTGGTCACAGAACCAAGATATGAATCCAAGCCTGTCTGACTC +TAGAGCTGTGTGCTTCAACTACTACCCCACACTACTGCCTCTGAGTATTCGAGGAATTCC +CCTGGAGGGCCTTCTCACTGTTCTAGGAGCTTTCCCTCTGGCAGAGCACTCACACTAAAG +CAGGCCAGCAGCTCCATCTTGCTTATACCAGAGCTGGGGTTGACAGTGGGTTCAAAGCCT +GATGGGGTTGGCTGCTCCTCATCTTGTTCCAGGTATTCCCCAATCGTCCGTAGCACACTG +CGCCGGCCCTCTGGGCGAAAGGCATCCCAGAAGGAGCAGTTGTCTGGGAGACTGGAGAGC +AGTAGGGCCCGACCCAGCATCCCCTGGGCCTCAGCGCCCCCAGCCCCTGGACCCAAGAGG +AAGGTCAGCAGGCGGAGGAGATAGTGTAGGCGTGTGGGGTGAGCAAGAGCTGGGACAGAG +GACTGACAGCGTGGCAACTGGGATAGCATGCCAAGGAGGAAGGGGCCAGGGGTCAGACAG +AGTGGGGATGGTCCCAGTGGTGGCAGAGAGGGCAGAAGGGGGCCCAGCAGCCCTTCTAGG +AAGCAAGTGTCATTGCCCCCACGACTTATCCTGCATTGGCCTGCTAGCCAAGGCCAGAAG +GGCACCAGGACCTCATACATGGTGTCATTGGCACAGACCATCACCAGGAAGCTGCCCCCA +TCCGGGCAGCGTTCCCCACAGGGTCCAATGTGGCATGGTGGGGAGGCAGTGACATCTGGG +GTGGGGCCCTGGCACACATGCTGGACCCAAGCCTGGTTGCTGGGGGGCACAGCCTGTAGA +CTTGCCTCCCCACACAGTCGCTCAGCCCACAGAGTCTCATTCTCCAAGAAGCAGCCCCAA +AAGATGTCTGGGGTGAGGGGAACAGGGGGCAGGCCTTCAGGGCAGCTGGTAGGGGGTGGG +AGCAGGCCAGCACAGAGCCGCTTCACCAGGCGGCGGTTGGAGCCAGACAGGGCTGAAAAG +GAGAGGTTACTGCAGATCGCATCCAAAAACTCATCAGGAAACTGGTCGTGGCAGGCCTGT +AGCCAGCCTTGGGCACCTGGTGCCCAGGACACTGCATACCACACAGCTGTCTGGCAGATG +GCAGTGGTGCTGGGATGGGGCTGTGGAGTGGCAGGCTTGGTGTGCTGGCAGAGCAAGTGG +ATGGAGAAGTTGGAAATGCTGTAGGGTGGTGCTGGGCCTAAGTGGTTCTCACAGAGGGCC +TCCACAGTGATGGCTCGCCGTTGGCGTGGGCTGATGTGGGCTGACGGCTGAGAAGCTCTG +GGCAGAGGCACGCCCGTGCTCAGGCAGTGAAGGAGGGCAGGGGGTGGGGGTGGTGATCCA +GACAGAAAGCCCAGCGCCTGGACATCCCAGGAAAGGTTGTGCCGGACGCCCCTGGGTGCA +GAGGGAGGAGCAGGCAGGATCTGGTCAGTTCACATTTCCTCCAGCTGAAGATTCTGACCC +TGACTCAGAGCCCTCAAGGGGCATGGCAAGCATAGGCTTTTCTACAAGCGAGGTCCCTGA +GAGTTTAGGCGTTACTGGTTGAAATGCTGCAACGGGGCAAAAAAGAGAAAGCTGCCCTGC +CAAGGTCCAAAATCACCTAGGAGAAGCCCCAGTTGGAATTCTTTCTGTCCACCTGGCCCC +CTTCTGTCTCCTTTCTTCGTATGCTTTTAGATAGCTTAAACACAACCCTCAGCGGGCCTC +ATGTTAGGAAATCTGCTCAGCAACTCAATATGTCTAACTACTGTCTGGCATTGTTGTACT +GAAGTAGTAAGGGGGTAGAGGAATCCAGCTGTGTCCTTATTTTCCAAATCCCATCTTTAC +TAGCTAACATTATTAAGCGCTATGTGCCAGCACGGTTTTAAGTCGTTTGCATACACAGTA +TCAACTCATTTAACCGGGTGCAACAACTCTGTGGTAACTAATATTATCATTCTCATTTTA +TAAATGGGGAAACTGAGGTATAGAAAGACTATGTAACCTGCTCAAGGTCATATGGCTAGG +AAGAGGTGGAGCTGGGACACCAACCCAGACAGTCACACCCCAAAACCACTAAACTACACT +CCTCCCATCAGTTTTCCTCTCTAGAGGCTGAGGTCTTCAGCTCTTCACCTCCAGATTTTC +CAGGCCCAATCCCTCAGAACTGGTCTCCTGTTACTTACCATAAGAGCAGCTGTTGAAGGT +TACCTAGGAGGGAAAGAGTAAGGAAGAAAACTACCCCAATTTCACTTTCCTTGCCCTGCC +ACATCCCAGTCCCAGCCCTGGCCACACTCACCTCCCTGGCACTGCCCATTGGTATCAGGC +TCTGGCTGCCCCAAAATGGAGAAGACCTCATCCTGCAGGGAGTGAGTGACACGGAGCAGC +CCCTCCTGAAAGGCAGCATAGAGGGGGGCCCCCACTGTGCGTAGCAGACCGCCCCAAAGA +GCCTCCTTGGAGCCTAGCTCCCCTGTTGGGGTAAGCAAACCCAACAGACCCTGCAAAAAG +TGAGGAGCTGCCTCCCTCCCATCGAGGCCTGTGGCATTGGTGGGGTCCACACTGGGCTGC +ACCTGCACCAGAGCTTGCCAGCGTGTGCCCTCTAACAACAGCAGCAGAGAAGGCAACCAG +TCAGCAGCCAGGACACAGTCAGACGGCCCATCACGGGTGCATGGGGGCCGAGTTGGGGTA +GGGGGGCCCCCAGGAACTAAGGCTCCCAGCAGCACCTCCACAAGTCCACCCAGCACCCCT +GCCTGGTGCACCAGGAAATCTCGGGGAGTCTGCTCCTGTCCCAGCAGTGCCAGCATATCC +CCTAGCAGCCCTAGCATTGGCTCCCAGTCGGGGCTACCTCTCAGTGTCACTAGAAAATCA +TGGAGCCGCAGAGCAGGCGGCTGGAGAGGTGGGGGCTCTCCTACTGGTCCTTCCCCCATT +CTCCCAGGCTCAAAGGAAGAAGAAATGTTGGCCAGGAATGTAAAGAACCGTGAGCGGCTC +AGGGAGCCCTGGGGAGCCTGGTCCAGAGTGGAGAGCAATGACTTCAGGAAGGAGAGACCA +GGGTCCAGGGAATGAGGCCCAGTAGGGGCCAGAGTCACTGTAAGGAGAGAAACCAGAGGT +CACTGAAGGAGAGGAACACAGAGCCCAAACTCAACCCTGCCCCTGAGCACAGTTCCCTAC +CTGGATCTAAAGATGAGGTGAGGGGATGGTGAGCTGAACCCCTTTCACTGGCCTACAGTA +AGTCTTGGCCTGCTTTTCTTAACCCTCATTTTTCCCTTTTGCACAAGTCTCTGCCTCTCT +CCTTTGTCCTTCTATCTAACAGGTTCACCTCCATATTCTTAAAGTCTTTTTCCAGCCACC +CTCCTCCTCATCCCCAGTTCTGCTCACTGTTCTTCTTACCTGCAAAGGACAGCAGCAGCA +GCAGCAGCAGCAGCAGCAGCAGGGGCCAGAGGCTGAGAGCCATGTTTCCAAGTGAACACT +GGTACCGGAGGTGAGTTCTGGTTATTCTCACCTGTGTGGGATAGCCAGGTGAGGGCAGGG +CACTGCAGGCAACCTGAGGCTCCACTGACACTGTAGTGTGGTCTTTCAGGAAACAATATG +TGTAACCTGACAGCAGGTTTGTTACCTGAGCCGCTGAATATCTGATCCTTTAGCTCTGGA +GAAGGGTATGCTTCTGTGAGAGGGACTAATGATAGGGGATCCCAGGGGCCCCTCAGAGAA +ACCCAAGGAGATTGCTTGAGTTAGGAGCCAGTGATGCAGAAAGGAACAGGCAGACATGGA +GATGGAGAACTGGAGACCAGGGGTGTAATCTGGGGTTCTGCAGAAGCAAAGAAAACAAAG +AGGATATGAAGGAGCTAAGAAGCAAGTAGGATTTGAGGAAATTGAAGAAGAAATGACGCC +CAGGAGAAACAAACTGCACAGCAAGGTATAAACAGACGAGAGGTCCAGTCCAAGGATGGA +GAATTCAGGAAGAGGAGAGCTGCTTTCAACTACTACCCAGAACCACAGAGTGTCAAGAGC +TGAGCAGTACCCCAGAGATCACATACACCAAACTTTTTCACTTATGGATAAGGAAACAGA +CCCAGAGAAATGTGATTTGCTCATTAGGCCATTCAAATGTTTGGGTCTTTCCTTCCCCCT +CCGGGATCCCCTATCATTTGGTTCTTTCTCTGTGGCCTAGGTGAAAGCCCTATCAAAAAC +TCTGAGCTGAGAAGACAGAAACCAGATTAAATCACTATGCACAGGGACTGATCTCCCTAC +ACACAGAACTGCTCATGGAACAGAGAAAGAGATTTCATTCAAAATAATCTGGTCTGGTCA +GGGTTAGAAGTCTTTTTTCGTCATTTTTGTTTTTTGTTTTATTGGTTTTTTGGTTTTGGT +TTTTTTGAGATGGAATCTCTCTCTGTTGCCCAGGCTGGAGTTCAGTGGGGCTATCTGACC +ACTGCAACCCCTGCCTCCTGGGTTCAAGCGATTTTCCTGCCTCAGCCTCCCAAGTAGCTG +GGATTACAGGCACGTGCCACCACACCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGT +TTCACCATGTTGGCCAGGCAGGTCTTGAACTCCTGACCTCAAGTGATCTGCCCACCTCAG +CCTCCCAAAGTGCTGGGGTTACAGGTGTGAGCCACCGCGGCATTTTTTTTTTTTTTTTTG +AGATGGAGTCTCACTCTGTTACCCAGGCTGGAGTGCAGTGGCACTATCTTGGCTCACTGC +A diff --git a/paraphase/data/strc/ref.fa.fai b/paraphase/data/strc/ref.fa.fai new file mode 100644 index 0000000..b728694 --- /dev/null +++ b/paraphase/data/strc/ref.fa.fai @@ -0,0 +1 @@ +chr15_43599000_43620000 21001 25 60 61 diff --git a/paraphase/data/strc/strc_config.yaml b/paraphase/data/strc/strc_config.yaml new file mode 100755 index 0000000..c6ae844 --- /dev/null +++ b/paraphase/data/strc/strc_config.yaml @@ -0,0 +1,30 @@ +gene: strc + +# data files +data: + reference: ref.fa + homopolymer: homopolymer_sites.txt + depth_region: genome_region.bed + +# coordinates +coordinates: + hg38: + nchr: "chr15" + nchr_old: "chr15_43599000_43620000" + nchr_length: 101991189 + extract_region1: "chr15:43599000-43620000" + extract_region2: "chr15:43698814-43719466" + + noisy_region: [[43602500, 43603700], [43602165, 43602176]] + + depth_region: [[43610000, 43660000]] + + left_boundary: 43599500 + right_boundary: 43619600 + + # deletion in pseudogene + deletion1_size: 314 + del1_3p_pos1: 43602628 + del1_3p_pos2: 43602631 + del1_5p_pos1: 43602944 + del1_5p_pos2: 43602948 diff --git a/paraphase/genes/__init__.py b/paraphase/genes/__init__.py new file mode 100755 index 0000000..e69de29 diff --git a/paraphase/genes/cfc1_phaser.py b/paraphase/genes/cfc1_phaser.py new file mode 100644 index 0000000..0e05349 --- /dev/null +++ b/paraphase/genes/cfc1_phaser.py @@ -0,0 +1,96 @@ +# paraphase +# Author: Xiao Chen + + +from collections import namedtuple +from ..phaser import Phaser + + +class Cfc1Phaser(Phaser): + GeneCall = namedtuple( + "GeneCall", + "total_cn final_haplotypes two_copy_haplotypes \ + highest_total_cn assembled_haplotypes sites_for_phasing \ + unique_supporting_reads het_sites_not_used_in_phasing homozygous_sites \ + haplotype_details variant_genotypes nonunique_supporting_reads \ + read_details genome_depth", + ) + + def __init__(self, sample_id, outdir, wgs_depth=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth) + + def call(self): + if self.check_coverage_before_analysis() is False: + return None + self.get_homopolymer() + self.get_candidate_pos() + # add pivot site + if "130593061_A_G" not in self.candidate_pos: + self.candidate_pos.add("130593061_A_G") + self.het_sites = sorted(list(self.candidate_pos)) + self.remove_noisy_sites() + + raw_read_haps = self.get_haplotypes_from_reads(self.het_sites) + + ( + ass_haps, + original_haps, + hcn, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + raw_read_haps, + read_counts, + ) = self.phase_haps(raw_read_haps) + + tmp = {} + for i, hap in enumerate(ass_haps): + tmp.setdefault(hap, f"hap{i+1}") + ass_haps = tmp + + haplotypes = None + dvar = None + if self.het_sites != []: + haplotypes, dvar = self.output_variants_in_haplotypes( + ass_haps, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + ) + + two_cp_haps = [] + if len(ass_haps) == 2: + two_cp_haps = list(ass_haps.values()) + elif len(ass_haps) == 3: + two_cp_haps = self.compare_depth(haplotypes, loose=True) + if two_cp_haps == [] and read_counts is not None: + # check if one smn1 haplotype has more reads than others + haps = list(read_counts.keys()) + counts = list(read_counts.values()) + max_count = max(counts) + cp2_hap = haps[counts.index(max_count)] + others_max = sorted(counts, reverse=True)[1] + probs = self.depth_prob(max_count, others_max) + if probs[0] < 0.15 and others_max >= 10: + two_cp_haps.append(ass_haps[cp2_hap]) + + total_cn = len(ass_haps) + len(two_cp_haps) + if total_cn < 4: + total_cn = None + + self.close_handle() + + return self.GeneCall( + total_cn, + ass_haps, + two_cp_haps, + hcn, + original_haps, + self.het_sites, + uniquely_supporting_reads, + self.het_no_phasing, + self.homo_sites, + haplotypes, + dvar, + nonuniquely_supporting_reads, + raw_read_haps, + self.mdepth, + ) diff --git a/paraphase/genes/f8_phaser.py b/paraphase/genes/f8_phaser.py new file mode 100644 index 0000000..608a7fe --- /dev/null +++ b/paraphase/genes/f8_phaser.py @@ -0,0 +1,173 @@ +# paraphase +# Author: Xiao Chen + + +from collections import namedtuple +import pysam +from ..phaser import Phaser + + +class F8Phaser(Phaser): + GeneCall = namedtuple( + "GeneCall", + "total_cn final_haplotypes two_copy_haplotypes flanking_summary sv_called \ + highest_total_cn assembled_haplotypes sites_for_phasing \ + unique_supporting_reads het_sites_not_used_in_phasing homozygous_sites \ + haplotype_details variant_genotypes nonunique_supporting_reads \ + read_details genome_depth", + ) + + def __init__(self, sample_id, outdir, wgs_depth=None, genome_bam=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth, genome_bam) + + def set_parameter(self, config): + super().set_parameter(config) + self.extract_region1 = config["coordinates"]["hg38"]["extract_region1"] + self.extract_region2, self.extract_region3 = config["coordinates"]["hg38"][ + "extract_region2" + ].split() + + def get_read_positions(self, min_extension=5000): + """Get mapped region of the part of reads not overlapping repeat""" + dpos5 = {} + dpos3 = {} + genome_bamh = pysam.AlignmentFile(self.genome_bam, "rb") + for i, extract_region in enumerate( + [self.extract_region2, self.extract_region1, self.extract_region3] + ): + pos1, pos2 = extract_region.split(":")[1].split("-") + pos1 = int(pos1) + pos2 = int(pos2) + pos_name = f"region{i+1}" + for pileupcolumn in genome_bamh.pileup( + self.nchr, + pos1 - 1, + pos1, + truncate=True, + ): + for read in pileupcolumn.pileups: + if read.alignment.reference_start < pos1 - min_extension: + read_name = read.alignment.query_name + if i in [0, 1]: + dpos5.setdefault(read_name, []).append(pos_name) + else: + dpos3.setdefault(read_name, []).append(pos_name) + for pileupcolumn in genome_bamh.pileup( + self.nchr, + pos2 - 1, + pos2, + truncate=True, + ): + for read in pileupcolumn.pileups: + if read.alignment.reference_end > pos2 + min_extension: + read_name = read.alignment.query_name + if i in [0, 1]: + dpos3.setdefault(read_name, []).append(pos_name) + else: + dpos5.setdefault(read_name, []).append(pos_name) + genome_bamh.close() + return dpos5, dpos3 + + def call(self): + if self.check_coverage_before_analysis() is False: + return None + dpos5, dpos3 = self.get_read_positions() + self.get_homopolymer() + self.get_candidate_pos() + + var_found = False + for var in self.candidate_pos: + pos = int(var.split("_")[0]) + if self.clip_3p_positions[0] < pos < self.clip_3p_positions[1]: + var_found = True + break + if var_found is False: + self.candidate_pos.add("155386300_A_C") + + var_found = False + for var in self.candidate_pos: + pos = int(var.split("_")[0]) + if pos > self.clip_3p_positions[1]: + var_found = True + break + if var_found is False: + self.candidate_pos.add("155386860_C_G") + + self.het_sites = sorted(list(self.candidate_pos)) + self.remove_noisy_sites() + + raw_read_haps = self.get_haplotypes_from_reads(self.het_sites, check_clip=True) + + ( + ass_haps, + original_haps, + hcn, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + raw_read_haps, + read_counts, + ) = self.phase_haps(raw_read_haps) + + total_cn = len(ass_haps) + tmp = {} + for i, hap in enumerate(ass_haps): + tmp.setdefault(hap, f"hap{i+1}") + ass_haps = tmp + + haplotypes = None + dvar = None + if self.het_sites != []: + haplotypes, dvar = self.output_variants_in_haplotypes( + ass_haps, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + ) + + # check flanking region, call sv + haplotype_flanking_regions = {} + # to-do: nonuniquely supporting reads + for hap, reads in uniquely_supporting_reads.items(): + haplotype_flanking_regions.setdefault(hap, [[], []]) + for read in reads: + if read in dpos5 and len(dpos5[read]) == 1: + haplotype_flanking_regions[hap][0].append(dpos5[read][0]) + if read in dpos3 and len(dpos3[read]) == 1: + haplotype_flanking_regions[hap][1].append(dpos3[read][0]) + flanking_sum = {} + sv_hap = {} + for hap, regions in haplotype_flanking_regions.items(): + hap_name = ass_haps[hap] + p5region = list(set(regions[0])) + p3region = list(set(regions[1])) + flanking_sum.setdefault( + hap_name, "-".join(["/".join(p5region), "/".join(p3region)]) + ) + for hap, links in flanking_sum.items(): + if "region1" in links and links != "region1-region1": + if links == "region1-region2": + sv_hap.setdefault(hap_name, "deletion") + elif links == "region2-region1": + sv_hap.setdefault(hap_name, "duplication") + elif "region3" in links: + sv_hap.setdefault(hap_name, "inversion") + + self.close_handle() + + return self.GeneCall( + total_cn, + ass_haps, + [], + flanking_sum, + sv_hap, + hcn, + original_haps, + self.het_sites, + uniquely_supporting_reads, + self.het_no_phasing, + self.homo_sites, + haplotypes, + dvar, + nonuniquely_supporting_reads, + raw_read_haps, + self.mdepth, + ) diff --git a/paraphase/genes/ikbkg_phaser.py b/paraphase/genes/ikbkg_phaser.py new file mode 100644 index 0000000..59ab7b6 --- /dev/null +++ b/paraphase/genes/ikbkg_phaser.py @@ -0,0 +1,186 @@ +# paraphase +# Author: Xiao Chen + + +from collections import namedtuple +from .rccx_phaser import RccxPhaser +from ..phaser import Phaser + + +class IkbkgPhaser(Phaser): + GeneCall = namedtuple( + "GeneCall", + "total_cn gene_cn final_haplotypes deletion_haplotypes two_copy_haplotypes \ + alleles_raw alleles_final del_read_number highest_total_cn assembled_haplotypes \ + sites_for_phasing unique_supporting_reads het_sites_not_used_in_phasing \ + homozygous_sites haplotype_details variant_genotypes nonunique_supporting_reads \ + read_details genome_depth", + ) + + def __init__(self, sample_id, outdir, wgs_depth=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth) + self.del1_reads = set() + self.del1_reads_partial = set() + + def set_parameter(self, config): + super().set_parameter(config) + self.deletion1_size = config["coordinates"]["hg38"]["deletion1_size"] + self.del1_3p_pos1 = config["coordinates"]["hg38"]["del1_3p_pos1"] + self.del1_3p_pos2 = config["coordinates"]["hg38"]["del1_3p_pos2"] + self.del1_5p_pos1 = config["coordinates"]["hg38"]["del1_5p_pos1"] + self.del1_5p_pos2 = config["coordinates"]["hg38"]["del1_5p_pos2"] + + def call(self): + if self.check_coverage_before_analysis() is False: + return None + self.get_homopolymer() + + ## get deletion ## + self.del1_reads, self.del1_reads_partial = self.get_long_del_reads( + self.del1_3p_pos1, + self.del1_3p_pos2, + self.del1_5p_pos1, + self.del1_5p_pos2, + self.deletion1_size, + ) + + self.get_candidate_pos(min_vaf=0.095) + + var_found = False + for var in self.candidate_pos: + pos = int(var.split("_")[0]) + if pos > self.clip_3p_positions[0]: + var_found = True + break + if var_found is False: + self.candidate_pos.add("154569800_T_G") + + var_found = False + for var in self.candidate_pos: + pos = int(var.split("_")[0]) + if pos < self.clip_5p_positions[0]: + var_found = True + break + if var_found is False: + self.candidate_pos.add("154555882_C_G") + + self.het_sites = sorted(list(self.candidate_pos)) + self.remove_noisy_sites() + + raw_read_haps = self.get_haplotypes_from_reads( + self.het_sites, + check_clip=True, + partial_deletion_reads=self.del1_reads_partial, + ) + het_sites = self.het_sites + if self.del1_reads_partial != set(): + raw_read_haps, het_sites = self.update_reads_for_deletions( + raw_read_haps, + het_sites, + self.del1_3p_pos1, + self.del1_5p_pos2, + self.del1_reads_partial, + "3", + "154558014_del10806", + ) + self.het_sites = het_sites + + ( + ass_haps, + original_haps, + hcn, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + raw_read_haps, + read_counts, + ) = self.phase_haps(raw_read_haps) + + total_cn = len(ass_haps) + tmp = {} + gene_counter = 0 + pseudo_counter = 0 + dup_counter = 0 + deletion_haplotypes = [] + for i, hap in enumerate(ass_haps): + nsite = min(len(hap), 10) + start_seq = hap[:nsite] + if start_seq.startswith("0") is False: + if start_seq.count("1") >= start_seq.count("2"): + gene_counter += 1 + hap_name = f"ikbkg_hap{gene_counter}" + tmp.setdefault(hap, hap_name) + if "3" in hap: + deletion_haplotypes.append(hap_name) + else: + pseudo_counter += 1 + hap_name = f"pseudo_hap{pseudo_counter}" + tmp.setdefault(hap, hap_name) + if "3" in hap: + deletion_haplotypes.append(hap_name) + else: + dup_counter += 1 + tmp.setdefault(hap, f"dup_hap{dup_counter}") + ass_haps = tmp + + haplotypes = None + dvar = None + if self.het_sites != []: + haplotypes, dvar = self.output_variants_in_haplotypes( + ass_haps, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + ) + + # this is on chrX, males have one copy of gene and one copy of pseudogene + two_cp_haps = [] + if gene_counter == 1 and pseudo_counter > 1: + two_cp_haps = self.compare_depth(haplotypes, loose=True) + if two_cp_haps == [] and read_counts is not None: + # check if one haplotype has more reads than others + haps = list(read_counts.keys()) + counts = list(read_counts.values()) + max_count = max(counts) + cp2_hap = haps[counts.index(max_count)] + others_max = sorted(counts, reverse=True)[1] + probs = self.depth_prob(max_count, others_max) + if probs[0] < 0.15 and others_max >= 10: + two_cp_haps.append(ass_haps[cp2_hap]) + for hap in two_cp_haps: + total_cn += 1 + if "ikbkg" in hap: + gene_counter += 1 + if gene_counter == 1 and pseudo_counter != 1: + gene_counter = None + total_cn = None + + alleles, links = RccxPhaser.get_alleles(uniquely_supporting_reads) + new_alleles = [] + for allele in alleles: + new_allele = [] + for hap in allele: + new_allele.append(ass_haps[hap]) + new_alleles.append(new_allele) + + self.close_handle() + + return self.GeneCall( + total_cn, + gene_counter, + ass_haps, + deletion_haplotypes, + two_cp_haps, + alleles, + new_alleles, + len(self.del1_reads_partial), + hcn, + original_haps, + self.het_sites, + uniquely_supporting_reads, + self.het_no_phasing, + self.homo_sites, + haplotypes, + dvar, + nonuniquely_supporting_reads, + raw_read_haps, + self.mdepth, + ) diff --git a/paraphase/genes/ncf1_phaser.py b/paraphase/genes/ncf1_phaser.py new file mode 100644 index 0000000..f28fe9f --- /dev/null +++ b/paraphase/genes/ncf1_phaser.py @@ -0,0 +1,138 @@ +# paraphase +# Author: Xiao Chen + + +from collections import namedtuple +from ..phaser import Phaser + + +class Ncf1Phaser(Phaser): + GeneCall = namedtuple( + "GeneCall", + "total_cn gene_cn final_haplotypes two_copy_haplotypes gene_reads pseudo_reads \ + highest_total_cn assembled_haplotypes sites_for_phasing \ + unique_supporting_reads het_sites_not_used_in_phasing homozygous_sites \ + haplotype_details variant_genotypes nonunique_supporting_reads \ + read_details genome_depth", + ) + + def __init__(self, sample_id, outdir, wgs_depth=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth) + + def set_parameter(self, config): + super().set_parameter(config) + + def call(self): + if self.check_coverage_before_analysis() is False: + return None + pivot_site = self.pivot_site + for pileupcolumn in self._bamh.pileup( + self.nchr, pivot_site - 1, pivot_site, truncate=True + ): + bases = [ + a.upper() for a in pileupcolumn.get_query_sequences(add_indels=True) + ] + gene_reads = bases.count("G") + pseudo_reads = bases.count("G-2NN") + + self.get_homopolymer() + self.get_candidate_pos() + # add pivot site + if "74777266_G_A" not in self.candidate_pos: + self.candidate_pos.add("74777266_G_A") + self.het_sites = sorted(list(self.candidate_pos)) + self.remove_noisy_sites() + + raw_read_haps = self.get_haplotypes_from_reads(self.het_sites) + + ( + ass_haps, + original_haps, + hcn, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + raw_read_haps, + read_counts, + ) = self.phase_haps(raw_read_haps) + + total_cn = len(ass_haps) + tmp = {} + for i, hap in enumerate(ass_haps): + tmp.setdefault(hap, f"hap{i+1}") + ass_haps = tmp + + haplotypes = None + dvar = None + if self.het_sites != []: + haplotypes, dvar = self.output_variants_in_haplotypes( + ass_haps, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + ) + + hap_rename = {} + counter_gene = 0 + counter_pseudo = 0 + # main variant is 74777266_GGT_G + pivot_var = "74777266_GGT_G" + for hap in haplotypes: + var = haplotypes[hap]["variants"] + if pivot_var not in var: + counter_gene += 1 + hap_rename.setdefault(hap, f"ncf1_hap{counter_gene}") + else: + counter_pseudo += 1 + hap_rename.setdefault(hap, f"pseudo_hap{counter_pseudo}") + + tmp = {} + for hap, hap_name in ass_haps.items(): + tmp.setdefault(hap, hap_rename[hap_name]) + ass_haps = tmp + + tmp = {} + for hap_name, hap_info in haplotypes.items(): + tmp.setdefault(hap_rename[hap_name], hap_info) + haplotypes = tmp + + two_cp_haps = [] + if counter_gene == 1: + two_cp_hap_candidate = self.compare_depth(haplotypes) + if "ncf1_hap1" in two_cp_hap_candidate: + two_cp_haps = two_cp_hap_candidate + counter_gene += 1 + total_cn += 1 + + if self.mdepth is not None: + prob = self.depth_prob(gene_reads, self.mdepth / 2) + if prob[0] < 0.9 and counter_gene == 1: + counter_gene = None + total_cn = None + if prob[0] > 0.95 and counter_gene > 1 and two_cp_haps != []: + counter_gene = None + total_cn = None + # scenario where only three haplotypes are found, possibly each at CN2 + if counter_gene == 1 and counter_pseudo == 2 and total_cn == 3: + counter_gene = None + total_cn = None + + self.close_handle() + + return self.GeneCall( + total_cn, + counter_gene, + ass_haps, + two_cp_haps, + gene_reads, + pseudo_reads, + hcn, + original_haps, + self.het_sites, + uniquely_supporting_reads, + self.het_no_phasing, + self.homo_sites, + haplotypes, + dvar, + nonuniquely_supporting_reads, + raw_read_haps, + self.mdepth, + ) diff --git a/paraphase/genes/neb_phaser.py b/paraphase/genes/neb_phaser.py new file mode 100644 index 0000000..b8b50e1 --- /dev/null +++ b/paraphase/genes/neb_phaser.py @@ -0,0 +1,150 @@ +# paraphase +# Author: Xiao Chen + + +from collections import namedtuple +from .rccx_phaser import RccxPhaser +from ..phaser import Phaser + + +class NebPhaser(Phaser): + GeneCall = namedtuple( + "GeneCall", + "total_cn final_haplotypes two_copy_haplotypes alleles_raw alleles_final \ + repeat_name highest_total_cn assembled_haplotypes sites_for_phasing \ + unique_supporting_reads het_sites_not_used_in_phasing homozygous_sites \ + haplotype_details variant_genotypes nonunique_supporting_reads \ + read_details genome_depth", + ) + + def __init__(self, sample_id, outdir, wgs_depth=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth) + + def set_parameter(self, config): + super().set_parameter(config) + + def call(self): + if self.check_coverage_before_analysis() is False: + return None + self.get_homopolymer() + self.get_candidate_pos() + self.het_sites = sorted(list(self.candidate_pos)) + self.remove_noisy_sites() + + raw_read_haps = self.get_haplotypes_from_reads(self.het_sites) + + ( + ass_haps, + original_haps, + hcn, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + raw_read_haps, + read_counts, + ) = self.phase_haps(raw_read_haps) + + tmp = {} + for i, hap in enumerate(ass_haps): + tmp.setdefault(hap, f"hap{i+1}") + ass_haps = tmp + + haplotypes = None + dvar = None + if self.het_sites != []: + haplotypes, dvar = self.output_variants_in_haplotypes( + ass_haps, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + ) + + # assign tri 1 2 3 + tri1 = [] + tri2 = [] + tri3 = [] + for hap, hap_name in ass_haps.items(): + nsite = min(int(len(hap) / 2), 10) + start_seq = hap[:nsite] + end_seq = hap[0 - nsite :] + if start_seq.count("1") >= start_seq.count("2") and end_seq.count( + "1" + ) >= end_seq.count("2"): + tri1.append(hap_name) + elif start_seq.count("1") < start_seq.count("2") and end_seq.count( + "1" + ) < end_seq.count("2"): + tri3.append(hap_name) + else: + tri2.append(hap_name) + + # find two copy haplotypes + two_cp_haps = [] + if len(ass_haps) == 3 and len(tri1) == 1 and len(tri2) == 1 and len(tri3) == 1: + two_cp_haps = list(ass_haps.values()) + elif len(ass_haps) < 6: + two_cp_haps = self.compare_depth(haplotypes, loose=True) + if two_cp_haps == [] and read_counts is not None: + # check if one haplotype has more reads than others + haps = list(read_counts.keys()) + counts = list(read_counts.values()) + max_count = max(counts) + cp2_hap = haps[counts.index(max_count)] + others_max = sorted(counts, reverse=True)[1] + probs = self.depth_prob(max_count, others_max) + if probs[0] < 0.15 and others_max >= 10: + two_cp_haps.append(ass_haps[cp2_hap]) + + for hap in two_cp_haps: + if hap in tri1: + tri1.append(hap) + elif hap in tri2: + tri2.append(hap) + elif hap in tri3: + tri3.append(hap) + if len(tri1) == 1: + two_cp_haps.append(tri1[0]) + tri1.append(tri1[0]) + if len(tri3) == 1: + two_cp_haps.append(tri3[0]) + tri3.append(tri3[0]) + + alleles = [] + new_alleles = [] + if two_cp_haps == []: + alleles, links = RccxPhaser.get_alleles(uniquely_supporting_reads) + for allele in alleles: + new_allele = [] + for hap in allele: + new_allele.append(ass_haps[hap]) + new_alleles.append(new_allele) + total_cn = len(ass_haps) + len(two_cp_haps) + # incorract phasing suggests haplotypes with cn > 1 + if len(new_alleles) == 1 and sorted(new_alleles[0]) == sorted( + ass_haps.values() + ): + new_alleles = [] + total_cn = None + elif len(tri1) > 2 or len(tri3) > 2: + total_cn = None + new_alleles = [] + + self.close_handle() + + return self.GeneCall( + total_cn, + ass_haps, + two_cp_haps, + alleles, + new_alleles, + {"tri1": tri1, "tri2": tri2, "tri3": tri3}, + hcn, + original_haps, + self.het_sites, + uniquely_supporting_reads, + self.het_no_phasing, + self.homo_sites, + haplotypes, + dvar, + nonuniquely_supporting_reads, + raw_read_haps, + self.mdepth, + ) diff --git a/paraphase/genes/pms2_phaser.py b/paraphase/genes/pms2_phaser.py new file mode 100755 index 0000000..2330195 --- /dev/null +++ b/paraphase/genes/pms2_phaser.py @@ -0,0 +1,98 @@ +# paraphase +# Author: Xiao Chen + + +from collections import namedtuple +from ..phaser import Phaser + + +class Pms2Phaser(Phaser): + GeneCall = namedtuple( + "GeneCall", + "total_cn gene_cn final_haplotypes two_copy_haplotypes \ + highest_total_cn assembled_haplotypes sites_for_phasing \ + unique_supporting_reads het_sites_not_used_in_phasing homozygous_sites \ + haplotype_details variant_genotypes nonunique_supporting_reads \ + read_details genome_depth", + ) + + def __init__(self, sample_id, outdir, wgs_depth=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth) + + def call(self): + if self.check_coverage_before_analysis() is False: + return None + self.get_homopolymer() + self.get_candidate_pos() + self.het_sites = sorted(list(self.candidate_pos)) + self.remove_noisy_sites() + # for distinguishing pms2 from pms2cl + self.het_sites.append("5989137_G_A") + + raw_read_haps = self.get_haplotypes_from_reads(self.het_sites, check_clip=True) + + ( + ass_haps, + original_haps, + hcn, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + raw_read_haps, + read_counts, + ) = self.phase_haps(raw_read_haps) + + tmp = {} + counter_gene = 0 + counter_pseudo = 0 + for hap in ass_haps: + if hap[-1] in ["0", "x"]: + counter_pseudo += 1 + tmp.setdefault(hap, f"pms2clhap{counter_pseudo}") + else: + counter_gene += 1 + tmp.setdefault(hap, f"pms2hap{counter_gene}") + ass_haps = tmp + + haplotypes = None + dvar = None + if self.het_sites != []: + haplotypes, dvar = self.output_variants_in_haplotypes( + ass_haps, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + ) + + # two-cp haplotypes + two_cp_haps = [] + if len(ass_haps) < 4: + if counter_gene == 1 and counter_pseudo == 1: + two_cp_haps = list(ass_haps.values()) + elif len(ass_haps) == 3 and 2 in [counter_gene, counter_pseudo]: + two_cp_haps = self.compare_depth(haplotypes, loose=True) + + total_cn = len(ass_haps) + len(two_cp_haps) + pms2_cn = len([a for a in ass_haps.values() if "cl" not in a]) + len( + [a for a in two_cp_haps if "cl" not in a] + ) + # bigger cnvs are not handled here yet + if pms2_cn != 2: + pms2_cn = None + self.close_handle() + + return self.GeneCall( + total_cn, + pms2_cn, + ass_haps, + two_cp_haps, + hcn, + original_haps, + self.het_sites, + uniquely_supporting_reads, + self.het_no_phasing, + self.homo_sites, + haplotypes, + dvar, + nonuniquely_supporting_reads, + raw_read_haps, + self.mdepth, + ) diff --git a/paraphase/genes/rccx_phaser.py b/paraphase/genes/rccx_phaser.py new file mode 100644 index 0000000..4b99b19 --- /dev/null +++ b/paraphase/genes/rccx_phaser.py @@ -0,0 +1,655 @@ +# paraphase +# Author: Xiao Chen + + +from collections import namedtuple +import copy +from ..phaser import Phaser + + +class RccxPhaser(Phaser): + GeneCall = namedtuple( + "GeneCall", + "total_cn final_haplotypes two_copy_haplotypes starting_hap ending_hap deletion_hap \ + phasing_success alleles_final annotated_alleles hap_variants alleles_raw hap_links \ + highest_total_cn assembled_haplotypes sites_for_phasing \ + unique_supporting_reads het_sites_not_used_in_phasing homozygous_sites \ + haplotype_details variant_genotypes nonunique_supporting_reads \ + read_details genome_depth", + ) + + def __init__(self, sample_id, outdir, wgs_depth=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth) + self.has_gene1 = False + self.has_gene2 = False + self.gene1_reads = set() + self.gene2_reads = set() + self.del1_reads = set() + self.del1_reads_partial = set() + self.del2_reads = set() + self.del2_reads_partial = set() + + def set_parameter(self, config): + super().set_parameter(config) + self.variant_def = config["data"]["snp_file"] + self.known_variants = {} + with open(self.variant_def) as f: + for line in f: + split_line = line.split() + self.known_variants.setdefault( + "_".join([split_line[1], split_line[2], split_line[3]]), + split_line[-1], + ) + self.deletion1_size = config["coordinates"]["hg38"]["deletion1_size"] + self.deletion2_size = config["coordinates"]["hg38"]["deletion2_size"] + self.del2_3p_pos1 = config["coordinates"]["hg38"]["del2_3p_pos1"] + self.del2_3p_pos2 = config["coordinates"]["hg38"]["del2_3p_pos2"] + self.del2_5p_pos1 = config["coordinates"]["hg38"]["del2_5p_pos1"] + self.del2_5p_pos2 = config["coordinates"]["hg38"]["del2_5p_pos2"] + self.del1_3p_pos1 = config["coordinates"]["hg38"]["del1_3p_pos1"] + self.del1_3p_pos2 = config["coordinates"]["hg38"]["del1_3p_pos2"] + self.del1_5p_pos1 = config["coordinates"]["hg38"]["del1_5p_pos1"] + self.del1_5p_pos2 = config["coordinates"]["hg38"]["del1_5p_pos2"] + + def allow_del_bases(self, pos): + """ + During variant calling, allow "bases in deletions" for positions in + two long deletions + """ + if ( + self.del2_reads_partial != set() + and self.del2_3p_pos1 <= pos <= self.del2_5p_pos2 + ): + return True + if ( + self.del1_reads_partial != set() + and self.del1_3p_pos1 <= pos <= self.del1_5p_pos2 + ): + return True + return False + + @staticmethod + def get_alleles(reads): + """ + Phase haplotypes into alleles using read evidence + """ + new_reads = {} + for hap in reads: + hap_reads = set() + for read in reads[hap]: + if "sup" not in read: + hap_reads.add(read) + else: + hap_reads.add(read.split("_sup")[0]) + new_reads.setdefault(hap, hap_reads) + links = {} + checked = set() + for hap1 in new_reads: + r1 = new_reads[hap1] + for hap2 in new_reads: + hap_pair = "_".join(sorted([hap1, hap2])) + if hap_pair not in checked and hap1 != hap2: + checked.add(hap_pair) + r2 = new_reads[hap2] + read_overlap = r1.intersection(r2) + if len(read_overlap) >= 2: + links.setdefault(hap1, []).append(hap2) + links.setdefault(hap2, []).append(hap1) + links = dict(sorted(links.items(), key=lambda item: len(item[1]), reverse=True)) + alleles = [] + if links != {}: + alleles = [[list(links.keys())[0]] + list(links.values())[0]] + for hap1 in links: + for hap2 in links[hap1]: + hap1_in = sum([hap1 in a for a in alleles]) + hap2_in = sum([hap2 in a for a in alleles]) + if hap1_in == 0 and hap2_in == 0: + alleles.append([hap1, hap2]) + elif hap1_in == 0: + for a in alleles: + if hap2 in a: + a_index = alleles.index(a) + if hap1 not in alleles[a_index]: + alleles[a_index].append(hap1) + if hap2 not in alleles[a_index]: + alleles[a_index].append(hap2) + else: + for a in alleles: + if hap1 in a: + a_index = alleles.index(a) + if hap2 not in alleles[a_index]: + alleles[a_index].append(hap2) + if hap1 not in alleles[a_index]: + alleles[a_index].append(hap1) + return alleles, links + + def output_variants_in_haplotypes(self, haps, reads, nonunique, two_cp_haps=[]): + """ + Summarize all variants in each haplotype. + Output all variants and their genotypes. + Haplotypes are different length, so a range (boundary) is reported + """ + het_sites = self.het_sites + haplotype_variants = {} + haplotype_info = {} + dvar = {} + var_no_phasing = copy.deepcopy(self.het_no_phasing) + for hap, hap_name in haps.items(): + haplotype_variants.setdefault(hap_name, []) + # het sites not used in phasing + if reads != {}: + for var in var_no_phasing: + genotypes = [] + var_reads = self.check_variants_in_haplotypes(var) + haps_with_variant = [] + for hap, hap_name in haps.items(): + hap_reads = reads[hap] + hap_reads_nonunique = [a for a in nonunique if hap in nonunique[a]] + genotype = self.get_genotype_in_hap( + var_reads, hap_reads, hap_reads_nonunique + ) + genotypes.append(genotype) + if genotype == "1": + haps_with_variant.append(hap_name) + if haps_with_variant == []: + self.het_no_phasing.remove(var) + else: + for hap_name in haps_with_variant: + haplotype_variants[hap_name].append(var) + dvar.setdefault(var, genotypes) + # het sites and homo sites + for hap, hap_name in haps.items(): + for i in range(len(hap)): + if hap[i] == "2": + haplotype_variants[hap_name].append(het_sites[i]) + elif ( + hap[i] == "3" + and "32043718_del120" not in haplotype_variants[hap_name] + ): + haplotype_variants[hap_name].append("32043718_del120") + elif ( + hap[i] == "4" + and "32017431_del6367" not in haplotype_variants[hap_name] + ): + haplotype_variants[hap_name].append("32017431_del6367") + if "32017431_del6367" in haplotype_variants[hap_name]: + pos1 = self.del1_3p_pos1 + pos2 = self.del1_5p_pos2 + for var in self.homo_sites: + pos = int(var.split("_")[0]) + if pos < pos1 or pos > pos2: + haplotype_variants[hap_name].append(var) + elif "32043718_del120" in haplotype_variants[hap_name]: + pos1 = self.del2_3p_pos1 + pos2 = self.del2_5p_pos2 + for var in self.homo_sites: + pos = int(var.split("_")[0]) + if pos < pos1 or pos > pos2: + haplotype_variants[hap_name].append(var) + else: + haplotype_variants[hap_name] += self.homo_sites + + var_nstart, var_nend = self.get_hap_variant_ranges(hap) + var_tmp = haplotype_variants[hap_name] + var_tmp1 = [ + a for a in var_tmp if var_nstart <= int(a.split("_")[0]) <= var_nend + ] + var_tmp1 = list(set(var_tmp1)) + var_tmp2 = sorted(var_tmp1, key=lambda x: int(x.split("_")[0])) + haplotype_info.setdefault( + hap_name, {"variants": var_tmp2, "boundary": [var_nstart, var_nend]} + ) + + # summary per variant + all_haps = haps + nhap = len(all_haps) + for var in self.homo_sites: + dvar.setdefault(var, ["1"] * nhap) + for i, var in enumerate(het_sites): + dvar.setdefault(var, []) + for hap, hap_name in haps.items(): + base_call = "." + if hap[i] == "2": + base_call = "1" + elif hap[i] == "1": + base_call = "0" + dvar[var].append(base_call) + if hap_name in two_cp_haps: + dvar[var].append(base_call) + + return haplotype_info, { + var: "|".join(dvar[var]) for var in dict(sorted(dvar.items())) + } + + def annotate_var(self, allele_var): + """annotate an allele with variants""" + annotated_allele = None + if len(allele_var) == 2: + if [] in allele_var: + annotated_allele = "WT" + else: + tmp = sorted(allele_var, key=lambda x: len(x)) + annotated_allele = ",".join(tmp[0]) + elif len(allele_var) == 1: + if allele_var == [[]]: + annotated_allele = "pseudogene_deletion" + else: + annotated_allele = "deletion_" + ",".join(allele_var[0]) + elif len(allele_var) == 3: + tmp = sorted(allele_var, key=lambda x: len(x)) + if tmp[0] == []: + if tmp[1] == []: + annotated_allele = "gene_duplication" + elif abs(len(tmp[1]) - len(tmp[2])) <= 1 and len(tmp[2]) >= 6: + annotated_allele = "pseudogene_duplication" + else: + annotated_allele = "duplicaton_WT_plus_" + ",".join(tmp[1]) + else: + if abs(len(tmp[1]) - len(tmp[2])) <= 1 and len(tmp[2]) >= 6: + annotated_allele = ",".join(tmp[0]) + "_pseudogene_duplication" + else: + annotated_allele = ( + "duplication_" + ",".join(tmp[0]) + "_plus_" + ",".join(tmp[1]) + ) + + return annotated_allele + + def annotate_alleles( + self, + successful_phasing, + new_alleles, + hap_variants, + ending_copies, + ass_haps, + two_cp_haplotypes, + ): + """annotate the allele type""" + annotated_alleles = [] + if successful_phasing: + allele1 = new_alleles[0] + allele2 = new_alleles[1] + allele1_var = [hap_variants[a] for a in allele1] + allele2_var = [hap_variants[a] for a in allele2] + for allele_var in [allele1_var, allele2_var]: + annotated_allele = self.annotate_var(allele_var) + annotated_alleles.append(annotated_allele) + else: + if ( + len(ending_copies) == 2 + and len(ass_haps) == 4 + and two_cp_haplotypes == [] + ): + for allele_var in [hap_variants[a] for a in ending_copies]: + annotated_allele = None + if allele_var == []: + annotated_allele = "WT" + else: + annotated_allele = ",".join(allele_var) + annotated_alleles.append(annotated_allele) + return annotated_alleles + + def update_alleles( + self, + new_alleles, + haplotypes, + final_haps, + single_copies, + starting_copies, + ending_copies, + ): + """Update phased alleles""" + two_cp_haplotypes = [] + successful_phasing = False + # the deletion haplotype will be reported as an allele + if len(single_copies) == 1 and len(final_haps) < 5: + if ( + len(new_alleles) == 1 + and len(new_alleles[0]) == len(final_haps) - 1 + and single_copies not in new_alleles + ): + new_alleles.append(single_copies) + elif ( + len(new_alleles) == 1 + and len(new_alleles[0]) < len(final_haps) - 1 + and single_copies not in new_alleles + and len(starting_copies) == 1 + and len(ending_copies) == 1 + ): + new_alleles = [] + if new_alleles == []: + new_alleles.append(single_copies) + remaining_hap = [ + a for a in final_haps.values() if a not in single_copies + ] + if len(remaining_hap) == len(final_haps) - 1: + new_alleles.append(remaining_hap) + # deletions on each allele + elif ( + len(single_copies) == 2 + and len(new_alleles) == 0 + and len(final_haps) == 2 + and len(starting_copies) == 0 + and len(ending_copies) == 0 + ): + new_alleles = [[single_copies[0]], [single_copies[1]]] + successful_phasing = True + elif single_copies == []: + # homozygous, each haplotype has cn 2 + if ( + len(final_haps) == 2 + and len(ending_copies) == 1 + and len(starting_copies) == 1 + and len(new_alleles) == 1 + ): + two_cp_haplotypes = list(final_haps.values()) + new_alleles.append(new_alleles[0]) + successful_phasing = True + # depth-based adjustment when found 3 haplotypes or <2 ending haplotypes + if haplotypes is not None: + two_cp_hap_candidate = self.compare_depth(haplotypes, loose=True) + if len(ending_copies) == 1 and len(starting_copies) == 2: + if two_cp_hap_candidate == ending_copies: + two_cp_haplotypes = two_cp_hap_candidate + if len(final_haps) == 3: + new_alleles = [ + [starting_copies[0], ending_copies[0]], + [starting_copies[1], ending_copies[0]], + ] + successful_phasing = True + elif len(starting_copies) == 1 and len(ending_copies) == 2: + if two_cp_hap_candidate == starting_copies: + two_cp_haplotypes = two_cp_hap_candidate + if len(final_haps) == 3: + new_alleles = [ + [starting_copies[0], ending_copies[0]], + [starting_copies[0], ending_copies[1]], + ] + successful_phasing = True + + # add missing links when there is no two-cp haplotypes + if two_cp_haplotypes == []: + # add the missing link in cn=4 + if ( + len(final_haps) in [3, 4] + and len(new_alleles) == 1 + and len(new_alleles[0]) == 2 + ): + remaining_hap = [ + a for a in final_haps.values() if a not in new_alleles[0] + ] + if len(remaining_hap) == len(final_haps) - 2: + new_alleles.append(remaining_hap) + # add the missing link in cn=5 + if len(final_haps) == 5: + if ( + len(new_alleles) == 1 + and len(new_alleles[0]) == 2 + and ( + ( + new_alleles[0][0] in starting_copies + and new_alleles[0][1] in ending_copies + ) + or ( + new_alleles[0][1] in starting_copies + and new_alleles[0][0] in ending_copies + ) + ) + ): + remaining_hap = [ + a for a in final_haps.values() if a not in new_alleles[0] + ] + if len(remaining_hap) == 3: + new_alleles.append(remaining_hap) + if len(new_alleles) == 2: + allele1 = ( + new_alleles[0][0] in starting_copies + and new_alleles[0][1] in ending_copies + ) or ( + new_alleles[0][1] in starting_copies + and new_alleles[0][0] in ending_copies + ) + allele2 = ( + new_alleles[1][0] in starting_copies + and new_alleles[1][1] in ending_copies + ) or ( + new_alleles[1][1] in starting_copies + and new_alleles[1][0] in ending_copies + ) + if allele1 is True and allele2 is False: + remaining_hap = [ + a + for a in final_haps.values() + if a not in new_alleles[0] + ] + if len(remaining_hap) == 3: + new_alleles = [new_alleles[0], remaining_hap] + elif allele1 is False and allele2 is True: + remaining_hap = [ + a + for a in final_haps.values() + if a not in new_alleles[1] + ] + if len(remaining_hap) == 3: + new_alleles = [new_alleles[1], remaining_hap] + + # check wrong phasing + wrong_allele = False + for allele in new_alleles: + hp_set = set(allele) + for hp in hp_set: + if ( + hp in ending_copies + and allele.count(hp) > 1 + and hp not in two_cp_haplotypes + ): + wrong_allele = True + break + if wrong_allele: + new_alleles = [] + + if len(new_alleles) == 2: + if sorted(new_alleles[0] + new_alleles[1]) == sorted( + list(final_haps.values()) + ): + successful_phasing = True + + return successful_phasing, new_alleles, two_cp_haplotypes + + def call(self): + if self.check_coverage_before_analysis() is False: + return None + self.get_homopolymer() + self.del2_reads, self.del2_reads_partial = self.get_long_del_reads( + self.del2_3p_pos1, + self.del2_3p_pos2, + self.del2_5p_pos1, + self.del2_5p_pos2, + self.deletion2_size, + ) + self.del1_reads, self.del1_reads_partial = self.get_long_del_reads( + self.del1_3p_pos1, + self.del1_3p_pos2, + self.del1_5p_pos1, + self.del1_5p_pos2, + self.deletion1_size, + ) + + # scan for polymorphic sites + regions_to_check = [] + if self.del2_reads_partial != set(): + regions_to_check += [ + [self.del2_3p_pos1, self.del2_3p_pos2], + [self.del2_5p_pos1, self.del2_5p_pos2], + ] + if self.del1_reads_partial != set(): + regions_to_check += [ + [self.del1_3p_pos1, self.del1_3p_pos2], + [self.del1_5p_pos1, self.del1_5p_pos2], + ] + self.get_candidate_pos(regions_to_check=regions_to_check) + + # add last snp outside of repeat + var_found = False + for var in self.candidate_pos: + pos = int(var.split("_")[0]) + if pos > self.clip_3p_positions[0]: + var_found = True + break + if var_found is False and self.candidate_pos != set(): + self.candidate_pos.add("32046300_G_A") + # add last snp outside of repeat, 5prime + var_found = False + for var in self.candidate_pos: + pos = int(var.split("_")[0]) + if pos < self.clip_5p_positions[0]: + var_found = True + break + if var_found is False and self.candidate_pos != set(): + self.candidate_pos.add("32013265_A_T") + + self.het_sites = sorted(list(self.candidate_pos)) + self.remove_noisy_sites() + het_sites = self.het_sites + + raw_read_haps = self.get_haplotypes_from_reads( + het_sites, check_clip=True, partial_deletion_reads=self.del1_reads_partial + ) + if self.del2_reads_partial != set(): + raw_read_haps, het_sites = self.update_reads_for_deletions( + raw_read_haps, + het_sites, + self.del2_3p_pos1, + self.del2_5p_pos2, + self.del2_reads_partial, + "3", + "32043718_del120", + ) + if self.del1_reads_partial != set(): + raw_read_haps, het_sites = self.update_reads_for_deletions( + raw_read_haps, + het_sites, + self.del1_3p_pos1, + self.del1_5p_pos2, + self.del1_reads_partial, + "4", + "32017431_del6367", + ) + self.het_sites = het_sites + + # assemble haplotypes + ( + ass_haps, + original_haps, + hcn, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + raw_read_haps, + read_counts, + ) = self.phase_haps(raw_read_haps) + + tmp1 = {} + tmp2 = {} + for i, hap in enumerate(ass_haps): + hap_name = f"hap{i+1}" + tmp1.setdefault(hap, hap_name) + tmp2.setdefault(hap_name, hap) + final_haps = tmp1 + # get haps that extend into tnxb + ending_copies = [ + final_haps[a] + for a in ass_haps + if a[0] not in ["0", "x"] and a[-1] not in ["0", "x"] + ] + starting_copies = [ + final_haps[a] for a in ass_haps if a[0] == "0" and a[-1] == "0" + ] + single_copies = [ + final_haps[a] for a in ass_haps if a[0] == "0" and a[-1] not in ["0", "x"] + ] + + haplotypes = None + dvar = None + if self.het_sites != []: + haplotypes, dvar = self.output_variants_in_haplotypes( + final_haps, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + ) + + # phase haplotypes into alleles + alleles, links = self.get_alleles(uniquely_supporting_reads) + # switch to hap name + new_alleles = [] + for pair in alleles: + new_pair = [] + for hap1 in pair: + new_pair.append(final_haps[hap1]) + new_alleles.append(new_pair) + new_links = {} + for hap in links: + hap_links = [final_haps[a] for a in links[hap]] + new_links.setdefault(final_haps[hap], []).append(hap_links) + links = new_links + + successful_phasing, new_alleles, two_cp_haplotypes = self.update_alleles( + new_alleles, + haplotypes, + final_haps, + single_copies, + starting_copies, + ending_copies, + ) + + # annotate haplotypes by checking the diff sites + # output variants carried by each haplotype + hap_variants = {} + if haplotypes is not None: + for hap, hap_info in haplotypes.items(): + hap_variants.setdefault(hap, []) + for var in hap_info["variants"]: + if var in self.known_variants: + hap_variants[hap].append(self.known_variants[var]) + + total_cn = len(ass_haps) + len(two_cp_haplotypes) + if ass_haps == [] and self.het_sites == []: + # homozygous, feed all reads to call variants + total_cn = 2 + if total_cn < 2: + total_cn = None + + annotated_alleles = self.annotate_alleles( + successful_phasing, + new_alleles, + hap_variants, + ending_copies, + ass_haps, + two_cp_haplotypes, + ) + + self.close_handle() + + return self.GeneCall( + total_cn, + final_haps, + two_cp_haplotypes, + starting_copies, + ending_copies, + single_copies, + successful_phasing, + new_alleles, + annotated_alleles, + hap_variants, + alleles, + new_links, + hcn, + original_haps, + self.het_sites, + uniquely_supporting_reads, + self.het_no_phasing, + self.homo_sites, + haplotypes, + dvar, + nonuniquely_supporting_reads, + raw_read_haps, + self.mdepth, + ) diff --git a/paraphase/smn_phaser.py b/paraphase/genes/smn1_phaser.py similarity index 99% rename from paraphase/smn_phaser.py rename to paraphase/genes/smn1_phaser.py index b80f39f..4b0608e 100755 --- a/paraphase/smn_phaser.py +++ b/paraphase/genes/smn1_phaser.py @@ -5,10 +5,10 @@ from collections import namedtuple import copy import json -from .phaser import Phaser +from ..phaser import Phaser -class SmnPhaser(Phaser): +class Smn1Phaser(Phaser): SmnCall = namedtuple( "SmnCall", "smn1_cn smn2_cn smn2_del78_cn smn1_read_number smn2_read_number \ @@ -20,8 +20,8 @@ class SmnPhaser(Phaser): ) HaplotypeInfo = namedtuple("HaplotypeInfo", "variants boundary haplogroup") - def __init__(self, sample_id, outdir, config, wgs_depth=None): - Phaser.__init__(self, sample_id, outdir, config, wgs_depth) + def __init__(self, sample_id, outdir, wgs_depth=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth) self.has_smn1 = False self.has_smn2 = False self.smn1_reads = set() @@ -32,6 +32,9 @@ def __init__(self, sample_id, outdir, config, wgs_depth=None): self.smn1_del_reads_partial = set() self.smn2_del_reads = set() self.smn2_del_reads_partial = set() + + def set_parameter(self, config): + super().set_parameter(config) self.deletion1_size = config["coordinates"]["hg38"]["deletion1_size"] self.deletion2_size = config["coordinates"]["hg38"]["deletion2_size"] self.del2_3p_pos1 = config["coordinates"]["hg38"]["del2_3p_pos1"] diff --git a/paraphase/genes/strc_phaser.py b/paraphase/genes/strc_phaser.py new file mode 100644 index 0000000..1224ab8 --- /dev/null +++ b/paraphase/genes/strc_phaser.py @@ -0,0 +1,139 @@ +# paraphase +# Author: Xiao Chen + + +from collections import namedtuple +import pysam +from ..phaser import Phaser + + +class StrcPhaser(Phaser): + GeneCall = namedtuple( + "GeneCall", + "total_cn gene_cn final_haplotypes two_copy_haplotypes intergenic_depth \ + highest_total_cn assembled_haplotypes sites_for_phasing \ + unique_supporting_reads het_sites_not_used_in_phasing homozygous_sites \ + haplotype_details variant_genotypes nonunique_supporting_reads \ + read_details genome_depth", + ) + + def __init__(self, sample_id, outdir, wgs_depth=None, genome_bam=None): + Phaser.__init__(self, sample_id, outdir, wgs_depth, genome_bam) + self.del1_reads = set() + self.del1_reads_partial = set() + + def set_parameter(self, config): + super().set_parameter(config) + self.deletion1_size = config["coordinates"]["hg38"]["deletion1_size"] + self.del1_3p_pos1 = config["coordinates"]["hg38"]["del1_3p_pos1"] + self.del1_3p_pos2 = config["coordinates"]["hg38"]["del1_3p_pos2"] + self.del1_5p_pos1 = config["coordinates"]["hg38"]["del1_5p_pos1"] + self.del1_5p_pos2 = config["coordinates"]["hg38"]["del1_5p_pos2"] + self.intergenic = config["coordinates"]["hg38"]["depth_region"] + self.depth_region = config["coordinates"]["hg38"]["depth_region"] + + def call(self): + if self.check_coverage_before_analysis() is False: + return None + genome_bamh = pysam.AlignmentFile(self.genome_bam, "rb") + intergenic_depth = self.get_regional_depth(genome_bamh, self.depth_region)[0] + genome_bamh.close() + self.get_homopolymer() + self.del1_reads, self.del1_reads_partial = self.get_long_del_reads( + self.del1_3p_pos1, + self.del1_3p_pos2, + self.del1_5p_pos1, + self.del1_5p_pos2, + self.deletion1_size, + ) + self.get_candidate_pos() + self.het_sites = sorted(list(self.candidate_pos)) + self.remove_noisy_sites() + + raw_read_haps = self.get_haplotypes_from_reads(self.het_sites) + het_sites = self.het_sites + if self.del1_reads_partial != set(): + raw_read_haps, het_sites = self.update_reads_for_deletions( + raw_read_haps, + het_sites, + self.del1_3p_pos1, + self.del1_5p_pos2, + self.del1_reads_partial, + "3", + "43602630_del314", + ) + self.het_sites = het_sites + ( + ass_haps, + original_haps, + hcn, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + raw_read_haps, + read_counts, + ) = self.phase_haps(raw_read_haps) + + tmp = {} + counter_gene = 0 + counter_pseudo = 0 + for hap in ass_haps: + if "3" in hap: + counter_pseudo += 1 + tmp.setdefault(hap, f"strcp1_hap{counter_pseudo}") + else: + counter_gene += 1 + tmp.setdefault(hap, f"strc_hap{counter_gene}") + ass_haps = tmp + + haplotypes = None + dvar = None + if self.het_sites != []: + haplotypes, dvar = self.output_variants_in_haplotypes( + ass_haps, + uniquely_supporting_reads, + nonuniquely_supporting_reads, + ) + + two_cp_haps = [] + if intergenic_depth > 5 and len(ass_haps) == 2: + two_cp_haps = list(ass_haps.values()) + elif counter_gene == 1 or counter_pseudo == 1: + two_cp_haps = self.compare_depth(haplotypes) + for hap in two_cp_haps: + if "strcp1" not in hap: + counter_gene += 1 + else: + counter_pseudo += 1 + + total_cn = len(ass_haps) + len(two_cp_haps) + + # check depth between STRC and pseudogene + if self.mdepth is not None: + prob = self.depth_prob(int(intergenic_depth), self.mdepth / 2) + if prob[0] < 0.9 and counter_gene == 1: + counter_gene = None + total_cn = None + if prob[0] > 0.95 and counter_gene > 1 and two_cp_haps != []: + counter_gene = None + total_cn = None + + self.close_handle() + + return self.GeneCall( + total_cn, + counter_gene, + ass_haps, + two_cp_haps, + intergenic_depth, + hcn, + original_haps, + self.het_sites, + uniquely_supporting_reads, + self.het_no_phasing, + self.homo_sites, + haplotypes, + dvar, + nonuniquely_supporting_reads, + raw_read_haps, + self.mdepth, + ) diff --git a/paraphase/genome_depth.py b/paraphase/genome_depth.py index fff4bd9..0989fbd 100755 --- a/paraphase/genome_depth.py +++ b/paraphase/genome_depth.py @@ -7,16 +7,16 @@ class GenomeDepth: - def __init__(self, bam, config): + def __init__(self, bam, genome_depth_region_file): self.bam = bam - self.config = config + self.genome_depth_region_file = genome_depth_region_file self._bamh = pysam.AlignmentFile(bam, "rb") self.mdepth = None self.mad = None def get_genome_depth(self): depth = [] - with open(self.config["data"]["depth_region"]) as f: + with open(self.genome_depth_region_file) as f: for line in f: at = line.split() nchr = at[0] diff --git a/paraphase/haplotype_assembler.py b/paraphase/haplotype_assembler.py index c13ccd6..d62b6ef 100755 --- a/paraphase/haplotype_assembler.py +++ b/paraphase/haplotype_assembler.py @@ -148,7 +148,7 @@ def get_next_pos(self, pos): def get_previous_pos(self, pos): """Get previous position and its haplotype blocks in the graph""" positions = sorted(list(set([int(a.split("-")[1]) for a in self.nodes]))) - if pos <= 0: + if pos <= positions[0]: return None, None for position in positions[::-1]: if position < pos: diff --git a/paraphase/paraphase.py b/paraphase/paraphase.py index d34f1f8..19f0a81 100755 --- a/paraphase/paraphase.py +++ b/paraphase/paraphase.py @@ -9,66 +9,214 @@ import logging import datetime import shutil +import pysam import multiprocessing as mp from functools import partial from paraphase.genome_depth import GenomeDepth from paraphase.prepare_bam_and_vcf import ( BamRealigner, BamTagger, + VcfGenerater, TwoGeneVcfGenerater, ) -from paraphase.smn_phaser import SmnPhaser +from paraphase.genes.smn1_phaser import Smn1Phaser +from paraphase.genes.pms2_phaser import Pms2Phaser +from paraphase.genes.rccx_phaser import RccxPhaser +from paraphase.genes.strc_phaser import StrcPhaser +from paraphase.genes.ncf1_phaser import Ncf1Phaser +from paraphase.genes.cfc1_phaser import Cfc1Phaser +from paraphase.genes.neb_phaser import NebPhaser +from paraphase.genes.ikbkg_phaser import IkbkgPhaser +from paraphase.genes.f8_phaser import F8Phaser +ACCEPTED_GENES = [ + "smn1", + "rccx", + "pms2", + "strc", + "ncf1", + "cfc1", + "neb", + "ikbkg", + "f8", +] +GENOME_DEPTH_GENES = ["smn1", "strc", "ncf1"] +NO_VCF_GENES = ["f8"] -def process_sample(bamlist, outdir, config, dcov={}, vcf=False): + +def process_sample(bamlist, outdir, configs, dcov={}, novcf=False): + """Main workflow""" for bam in bamlist: sample_id = bam.split("/")[-1].split(".")[0] logging.info(f"Processing sample {sample_id} at {datetime.datetime.now()}...") - - logging.info(f"Getting genome depth at {datetime.datetime.now()}...") + sample_out = {} gdepth = None - if sample_id in dcov: - gdepth = dcov[sample_id] - if gdepth is None: - depth = GenomeDepth(bam, config) - gdepth, gmad = depth.call() - if gdepth < 10 or gmad > 0.25: - logging.warning( - "Due to low or highly variable genome coverage, genome coverage is not used for depth correction." + query_genes = set(configs.keys()) + if query_genes.intersection(set(GENOME_DEPTH_GENES)) != set(): + logging.info(f"Getting genome depth at {datetime.datetime.now()}...") + if sample_id in dcov: + gdepth = dcov[sample_id] + if gdepth is None: + depth = GenomeDepth( + bam, + os.path.join( + os.path.dirname(__file__), "data", "genome_region.bed" + ), ) - gdepth = None + gdepth, gmad = depth.call() + if gdepth < 10 or gmad > 0.25: + logging.warning( + "Due to low or highly variable genome coverage, genome coverage is not used for depth correction." + ) + gdepth = None - logging.info(f"Realigning reads at {datetime.datetime.now()}...") - bam_realigner = BamRealigner(bam, outdir, config) - bam_realigner.write_realign_bam() + for gene in configs: + config = configs[gene] + logging.info(f"Running analysis for {gene} at {datetime.datetime.now()}...") + logging.info(f"Realigning reads for {gene} at {datetime.datetime.now()}...") + bam_realigner = BamRealigner(bam, outdir, config) + bam_realigner.write_realign_bam() - logging.info(f"Phasing haplotypes at {datetime.datetime.now()}...") - smn_phaser = SmnPhaser(sample_id, outdir, config, gdepth) - smn_phaser_call = smn_phaser.call()._asdict() + logging.info( + f"Phasing haplotypes for {gene} at {datetime.datetime.now()}..." + ) - logging.info(f"Tagging reads at {datetime.datetime.now()}...") - bam_tagger = BamTagger(sample_id, outdir, config, smn_phaser_call) - bam_tagger.write_bam(random_assign=True) + phasers = { + "smn1": Smn1Phaser(sample_id, outdir, gdepth), + "rccx": RccxPhaser(sample_id, outdir), + "pms2": Pms2Phaser(sample_id, outdir), + "strc": StrcPhaser(sample_id, outdir, gdepth, genome_bam=bam), + "ncf1": Ncf1Phaser(sample_id, outdir, gdepth), + "cfc1": Cfc1Phaser(sample_id, outdir), + "neb": NebPhaser(sample_id, outdir), + "ikbkg": IkbkgPhaser(sample_id, outdir), + "f8": F8Phaser(sample_id, outdir, genome_bam=bam), + } + phaser = phasers.get(gene) + phaser.set_parameter(config) - if vcf: - logging.info(f"Generating VCFs at {datetime.datetime.now()}...") - vcf_generater = TwoGeneVcfGenerater( - sample_id, outdir, config, smn_phaser_call - ) - vcf_generater.run() + phaser_call = phaser.call() + if phaser_call is not None: + phaser_call = phaser_call._asdict() + + logging.info( + f"Tagging reads for {gene} at {datetime.datetime.now()}..." + ) + bam_tagger = BamTagger(sample_id, outdir, config, phaser_call) + bam_tagger.write_bam(random_assign=True) + + if novcf is False and gene not in NO_VCF_GENES: + logging.info( + f"Generating VCFs for {gene} at {datetime.datetime.now()}..." + ) + vcf_dir = os.path.join(outdir, f"{sample_id}_vcfs") + if os.path.exists(vcf_dir) is False: + os.makedirs(vcf_dir) + if gene == "smn1": + vcf_generater = TwoGeneVcfGenerater( + sample_id, outdir, config, phaser_call + ) + vcf_generater.run() + else: + vcf_generater = VcfGenerater( + sample_id, outdir, config, phaser_call + ) + vcf_generater.run_without_realign() + + sample_out.setdefault(gene, phaser_call) + + logging.info(f"Merging all bams at {datetime.datetime.now()}...") + merge_bams(query_genes, outdir, sample_id) - sample_out = smn_phaser_call logging.info(f"Writing to json at {datetime.datetime.now()}...") out_json = os.path.join(outdir, sample_id + ".json") with open(out_json, "w") as json_output: json.dump(sample_out, json_output, indent=4) +def merge_bams(query_genes, outdir, sample_id): + """Merge realigned tagged bams for each gene into one bam""" + bams = [] + for gene in query_genes: + gene_bam = os.path.join(outdir, f"{sample_id}_{gene}_realigned_tagged.bam") + if os.path.exists(gene_bam) is False: + gene_bam = os.path.join(outdir, f"{sample_id}_{gene}_realigned.bam") + if os.path.exists(gene_bam): + bams.append(gene_bam) + bam_list_file = os.path.join(outdir, f"{sample_id}_bam_list.txt") + with open(bam_list_file, "w") as fout: + for bam in bams: + fout.write(bam + "\n") + merged_bam = os.path.join(outdir, f"{sample_id}_realigned_tagged.bam") + tmp_bam = os.path.join(outdir, f"{sample_id}_merged.bam") + pysam.merge("-f", "-o", tmp_bam, "-b", bam_list_file) + pysam.sort("-o", merged_bam, tmp_bam) + pysam.index(merged_bam) + os.remove(tmp_bam) + os.remove(bam_list_file) + for bam in bams: + os.remove(bam) + os.remove(bam + ".bai") + + +def update_config(gene_list, args): + """Get config info for each gene""" + data_path = os.path.join(os.path.dirname(__file__), "data") + configs = {} + for gene in gene_list: + config_file = os.path.join(data_path, gene, f"{gene}_config.yaml") + # parse config file + with open(config_file, "r") as f: + try: + config = yaml.safe_load(f) + except yaml.YAMLError as yaml_error: + raise Exception(f"Error reading {config_file}: \n\n{yaml_error}") + configs.setdefault(gene, config) + + # check third-party tools + configs[gene].setdefault("tools", {}) + samtools_check = [ + a for a in [args.samtools, shutil.which("samtools")] if a is not None + ] + samtools_check2 = [a for a in samtools_check if os.path.exists(a)] + if samtools_check2 == []: + raise Exception("samtools is not found") + else: + configs[gene]["tools"].setdefault("samtools", samtools_check2[0]) + + minimap2_check = [ + a for a in [args.minimap2, shutil.which("minimap2")] if a is not None + ] + minimap2_check2 = [a for a in minimap2_check if os.path.exists(a)] + if minimap2_check2 == []: + raise Exception("minimap2 is not found") + else: + configs[gene]["tools"].setdefault("minimap2", minimap2_check2[0]) + + # update paths + data_paths = configs[gene].get("data") + for data_entry in data_paths: + old_data_file = data_paths[data_entry] + if data_entry != "depth_region": + new_data_file = os.path.join(data_path, gene, old_data_file) + else: + new_data_file = os.path.join(data_path, old_data_file) + data_paths[data_entry] = new_data_file + + for data_file in list(data_paths.values()): + if os.path.exists(data_file) is False: + raise Exception(f"File {data_file} not found.") + return configs + + def main(): parser = argparse.ArgumentParser( - description="paraphase: HiFi-based SMN1/SMN2 variant caller." + description="paraphase: HiFi-based caller for highly homologous genes" ) - input_group = parser.add_mutually_exclusive_group(required=True) + all_genes_joined = ",".join(ACCEPTED_GENES) + inputp = parser.add_argument_group("Input Options") + outputp = parser.add_argument_group("Output Options") + input_group = inputp.add_mutually_exclusive_group(required=True) input_group.add_argument( "-b", "--bam", @@ -79,23 +227,16 @@ def main(): "--list", help="File listing absolute paths to multiple input BAM files, one per line", ) - parser.add_argument( + outputp.add_argument( "-o", "--out", help="Output directory", required=True, ) parser.add_argument( - "-v", - "--vcf", - help="Optional. If specified, paraphase will produce vcf for each haplotype", - required=False, - action="store_true", - ) - parser.add_argument( - "-c", - "--config", - help="Optional path to config yaml file", + "-g", + "--gene", + help=f"Optionally specify which gene(s) to run (separated by comma). Will run all genes if not specified. Currently supported genes are {all_genes_joined}", required=False, ) parser.add_argument( @@ -104,6 +245,12 @@ def main(): help="Optional path to a file listing average depth for each sample", required=False, ) + parser.add_argument( + "--novcf", + help="Optional. If specified, paraphase will not write vcfs", + required=False, + action="store_true", + ) parser.add_argument( "-t", "--threads", @@ -125,61 +272,22 @@ def main(): args = parser.parse_args() outdir = args.out - config_file = args.config logging.basicConfig(level=logging.DEBUG) if os.path.exists(outdir) is False: os.makedirs(outdir) - data_path = os.path.join(os.path.dirname(__file__), "data") - if config_file is None: - config_file = os.path.join(data_path, "smn1", "config.yaml") - - # parse config file - with open(config_file, "r") as f: - try: - config = yaml.safe_load(f) - except yaml.YAMLError as yaml_error: - raise Exception(f"Error reading {config_file}: \n\n{yaml_error}") - - # check third-party tools - tools = config.get("tools") - samtools_check = [ - a - for a in [tools.get("samtools"), args.samtools, shutil.which("samtools")] - if a is not None - ] - samtools_check2 = [a for a in samtools_check if os.path.exists(a)] - if samtools_check2 == []: - raise Exception("samtools is not found") - else: - config["tools"]["samtools"] = samtools_check2[0] - - minimap2_check = [ - a - for a in [tools.get("minimap2"), args.minimap2, shutil.which("minimap2")] - if a is not None - ] - minimap2_check2 = [a for a in minimap2_check if os.path.exists(a)] - if minimap2_check2 == []: - raise Exception("minimap2 is not found") + gene_list = args.gene + if gene_list is None: + gene_list = ACCEPTED_GENES else: - config["tools"]["minimap2"] = minimap2_check2[0] - - # update paths - gene = config.get("gene") - data_paths = config.get("data") - for data_entry in data_paths: - old_data_file = data_paths[data_entry] - if data_entry != "depth_region": - new_data_file = os.path.join(data_path, gene, old_data_file) - else: - new_data_file = os.path.join(data_path, old_data_file) - data_paths[data_entry] = new_data_file + gene_list = [a for a in gene_list.split(",") if a in ACCEPTED_GENES] + if gene_list == []: + raise Exception( + f"Gene names not recognized. Currently accepted genes are {all_genes_joined}" + ) - for data_file in list(data_paths.values()): - if os.path.exists(data_file) is False: - raise Exception(f"File {data_file} not found.") + configs = update_config(gene_list, args) # parse depth file dcov = {} @@ -197,9 +305,9 @@ def main(): process_sample( bamlist, outdir, - config, + configs, dcov, - args.vcf, + args.novcf, ) else: print(f"{args.bam} bam or bai file doesn't exist") @@ -216,9 +324,9 @@ def main(): process_sample_partial = partial( process_sample, outdir=outdir, - config=config, + configs=configs, dcov=dcov, - vcf=args.vcf, + novcf=args.novcf, ) bam_groups = [bamlist[i::nCores] for i in range(nCores)] pool = mp.Pool(nCores) diff --git a/paraphase/phaser.py b/paraphase/phaser.py index d25472b..bad990e 100755 --- a/paraphase/phaser.py +++ b/paraphase/phaser.py @@ -3,12 +3,12 @@ import pysam -from pprint import pprint import os -import numpy as np import copy +import numpy as np from collections import Counter import re +import logging from scipy.stats import poisson from .haplotype_assembler import VariantGraph @@ -19,28 +19,75 @@ class Phaser: clip_3p = r"\d+S$|\d+H$" deletion = r"\d+D" - def __init__(self, sample_id, outdir, config, wgs_depth=None): + def __init__(self, sample_id, outdir, wgs_depth=None, genome_bam=None): self.outdir = outdir self.sample_id = sample_id - self.bam = os.path.join(outdir, self.sample_id + "_realigned.bam") - if os.path.exists(self.bam) is False: - raise Exception(f"File {self.bam} not found.") - self._bamh = pysam.AlignmentFile(self.bam, "rb") - self.homopolymer_file = config["data"]["homopolymer"] self.homopolymer_sites = {} self.het_sites = [] # for phasing self.het_no_phasing = [] self.homo_sites = [] self.candidate_pos = set() self.mdepth = wgs_depth + self.genome_bam = genome_bam + + def set_parameter(self, config): + self.gene = config["gene"] + self.bam = os.path.join( + self.outdir, self.sample_id + f"_{self.gene}_realigned.bam" + ) + if os.path.exists(self.bam) is False: + raise Exception(f"File {self.bam} not found.") + self._bamh = pysam.AlignmentFile(self.bam, "rb") + self.homopolymer_file = config["data"]["homopolymer"] self.nchr = config["coordinates"]["hg38"]["nchr"] self.ref = config["data"]["reference"] self._refh = pysam.FastaFile(self.ref) self.left_boundary = config["coordinates"]["hg38"]["left_boundary"] self.right_boundary = config["coordinates"]["hg38"]["right_boundary"] - self.pivot_site = config["coordinates"]["hg38"]["pivot_site"] + self.pivot_site = None + if "pivot_site" in config["coordinates"]["hg38"]: + self.pivot_site = config["coordinates"]["hg38"]["pivot_site"] self.nchr_old = config["coordinates"]["hg38"]["nchr_old"] self.offset = int(self.nchr_old.split("_")[1]) - 1 + self.use_supplementary = False + if "use_supplementary" in config: + self.use_supplementary = config["use_supplementary"] + self.clip_3p_positions = [] + self.clip_5p_positions = [] + if "clip_3p_positions" in config["coordinates"]["hg38"]: + self.clip_3p_positions = config["coordinates"]["hg38"]["clip_3p_positions"] + if "clip_5p_positions" in config["coordinates"]["hg38"]: + self.clip_5p_positions = config["coordinates"]["hg38"]["clip_5p_positions"] + self.noisy_region = [] + if "noisy_region" in config["coordinates"]["hg38"]: + self.noisy_region = config["coordinates"]["hg38"]["noisy_region"] + + def get_regional_depth(self, bam_handle, query_region, ninterval=100): + """Get depth of the query regions""" + region_depth = [] + for region in query_region: + depth = [] + nstep = max(1, int((region[1] - region[0]) / ninterval)) + for pos in range(region[0], region[1], nstep): + for pileupcolumn in bam_handle.pileup( + self.nchr, pos - 1, pos, truncate=True + ): + site_depth = pileupcolumn.get_num_aligned() + depth.append(site_depth) + region_depth.append(np.median(depth)) + return region_depth + + def check_coverage_before_analysis(self): + """check low coverage regions for enrichment data""" + region_depth = self.get_regional_depth( + self._bamh, [[self.left_boundary, self.right_boundary]] + )[0] + if np.isnan(region_depth) or region_depth < 10: + logging.warning( + "This region does not appear to have coverage. Will not attempt to phase haplotypes." + ) + return False + return True def get_homopolymer(self): """Parse the homopolymer site file""" @@ -97,29 +144,31 @@ def get_long_del_reads( pos2 = p3_pos2 reference_start_cutoff = pos1 - min_extend for read in bamh.fetch(self.nchr, pos1, pos2): + read_name = self.get_read_name(read) find_clip_3p = re.findall(self.clip_3p, read.cigarstring) if find_clip_3p != [] and pos1 < read.reference_end < pos2: if ( int(find_clip_3p[0][:-1]) >= min_clip_len and read.reference_start < reference_start_cutoff ): - p3_reads.add(read.query_name) + p3_reads.add(read_name) if self.check_del(read, del_size): - del_reads.add(read.query_name) + del_reads.add(read_name) # 5 prime clip pos1 = p5_pos1 pos2 = p5_pos2 reference_end_cutoff = pos2 + min_extend for read in bamh.fetch(self.nchr, pos1, pos2): + read_name = self.get_read_name(read) find_clip_5p = re.findall(self.clip_5p, read.cigarstring) if find_clip_5p != [] and pos1 < read.reference_start < pos2: if ( int(find_clip_5p[0][:-1]) >= min_clip_len and read.reference_end > reference_end_cutoff ): - p5_reads.add(read.query_name) + p5_reads.add(read_name) if self.check_del(read, del_size): - del_reads.add(read.query_name) + del_reads.add(read_name) if del_reads != set() or (p3_reads != set() and p5_reads != set()): return ( del_reads.union(p3_reads.intersection(p5_reads)), @@ -134,7 +183,39 @@ def get_pivot_site_index(self): return positions.index(self.pivot_site), True return -1, False - def get_haplotypes_from_reads(self, het_sites, exclude_reads=[], min_mapq=5): + def get_read_name(self, read): + """Rename reads when supplementary""" + read_name = read.query_name + if read.is_supplementary and self.use_supplementary: + read_name = ( + read_name + f"_sup_{read.reference_start}_{read.reference_length}" + ) + return read_name + + def get_read_names(self, read, partial_deletion_reads): + """Add read names for supplementary alignments""" + read_names = [read.query_name] + if read.is_supplementary and self.use_supplementary: + sup_name = ( + read.query_name + f"_sup_{read.reference_start}_{read.reference_length}" + ) + read_names = [sup_name] + if ( + sup_name in partial_deletion_reads + and read.query_name in partial_deletion_reads + ): + read_names.append(read.query_name) + return read_names + + def get_haplotypes_from_reads( + self, + het_sites, + exclude_reads=[], + min_mapq=5, + min_clip_len=50, + check_clip=False, + partial_deletion_reads=[], + ): """ Go through reads and get bases at sites of interest Returns: @@ -151,28 +232,69 @@ def get_haplotypes_from_reads(self, het_sites, exclude_reads=[], min_mapq=5): snp_position - 1, snp_position, truncate=True, - min_base_quality=30, + min_base_quality=29, ): for read in pileupcolumn.pileups: - read_name = read.alignment.query_name - if ( - not read.is_del - and not read.is_refskip - and not read.alignment.is_secondary - and read.alignment.mapping_quality >= min_mapq - and read_name not in exclude_reads - ): - read_seq = read.alignment.query_sequence - start_pos = read.query_position - end_pos = start_pos + 1 - if end_pos < len(read_seq): - hap = read_seq[start_pos:end_pos] - if read_name not in read_haps: - read_haps.setdefault(read_name, ["x"] * nvar) - if hap.upper() == allele1.upper(): - read_haps[read_name][dsnp_index] = "1" - elif hap.upper() == allele2.upper(): - read_haps[read_name][dsnp_index] = "2" + read_names = self.get_read_names( + read.alignment, partial_deletion_reads + ) + for read_name in read_names: + if ( + not read.is_del + and not read.is_refskip + and not read.alignment.is_secondary + and read.alignment.mapping_quality >= min_mapq + and read_name not in exclude_reads + ): + read_seq = read.alignment.query_sequence + start_pos = read.query_position + end_pos = start_pos + 1 + if end_pos < len(read_seq): + hap = read_seq[start_pos:end_pos] + if read_name not in read_haps: + read_haps.setdefault(read_name, ["x"] * nvar) + if hap.upper() == allele1.upper(): + read_haps[read_name][dsnp_index] = "1" + elif hap.upper() == allele2.upper(): + read_haps[read_name][dsnp_index] = "2" + + # for softclips starting at a predefined position, mark sites as 0 instead of x + if check_clip: + for dsnp_index, allele_site in enumerate(het_sites): + snp_position_gene1, allele1, allele2, *at = allele_site.split("_") + snp_position = int(snp_position_gene1) + for clip_position in sorted(self.clip_3p_positions): + if snp_position > clip_position: + for read in self._bamh.fetch( + self.nchr, clip_position - 10, clip_position + 10 + ): + read_name = self.get_read_name(read) + if read_name in read_haps: + if abs(read.reference_end - clip_position) < 20: + find_clip_3p = re.findall( + self.clip_3p, read.cigarstring + ) + if ( + find_clip_3p != [] + and int(find_clip_3p[0][:-1]) >= min_clip_len + ): + read_haps[read_name][dsnp_index] = "0" + for clip_position in sorted(self.clip_5p_positions, reverse=True): + if snp_position < clip_position: + for read in self._bamh.fetch( + self.nchr, clip_position - 10, clip_position + 10 + ): + read_name = self.get_read_name(read) + if read_name in read_haps: + if abs(read.reference_start - clip_position) < 20: + find_clip_5p = re.findall( + self.clip_5p, read.cigarstring + ) + if ( + find_clip_5p != [] + and int(find_clip_5p[0][:-1]) >= min_clip_len + ): + read_haps[read_name][dsnp_index] = "0" return read_haps def allow_del_bases(self, pos): @@ -309,6 +431,16 @@ def get_candidate_pos(self, regions_to_check=[], min_read_support=5, min_vaf=0.1 ref_seq, var_seq = variants_no_phasing[pos] self.het_no_phasing.append(f"{pos}_{ref_seq}_{var_seq}") + def remove_noisy_sites(self): + """remove variants in predefined noisy sites""" + problematic_sites = [] + for site in self.het_sites: + for region in self.noisy_region: + if region[0] <= int(site.split("_")[0]) <= region[1]: + problematic_sites.append(site) + for site in problematic_sites: + self.het_sites.remove(site) + @staticmethod def simplify_read_haps(read_haps): """Simplify read haplotypes for output""" @@ -381,7 +513,7 @@ def get_hap_variant_ranges(self, hap): nend_next_pos = int(self.het_sites[nend_next].split("_")[0]) - 1 return nstart_previous_pos, nend_next_pos - def output_variants_in_haplotypes(self, haps, reads, nonunique): + def output_variants_in_haplotypes(self, haps, reads, nonunique, two_cp_haps=[]): """ Summarize all variants in each haplotype. Output all variants and their genotypes. @@ -389,10 +521,10 @@ def output_variants_in_haplotypes(self, haps, reads, nonunique): """ het_sites = self.het_sites haplotype_variants = {} + haplotype_info = {} dvar = {} var_no_phasing = copy.deepcopy(self.het_no_phasing) - for hap_index, hap in enumerate(haps): - hap_name = f"hap{hap_index}" + for hap, hap_name in haps.items(): haplotype_variants.setdefault(hap_name, []) # het sites not used in phasing if reads != {}: @@ -400,8 +532,7 @@ def output_variants_in_haplotypes(self, haps, reads, nonunique): genotypes = [] var_reads = self.check_variants_in_haplotypes(var) haps_with_variant = [] - for hap_index, hap in enumerate(haps): - hap_name = f"hap{hap_index}" + for hap, hap_name in haps.items(): hap_reads = reads[hap] hap_reads_nonunique = [a for a in nonunique if hap in nonunique[a]] genotype = self.get_genotype_in_hap( @@ -417,8 +548,7 @@ def output_variants_in_haplotypes(self, haps, reads, nonunique): haplotype_variants[hap_name].append(var) dvar.setdefault(var, genotypes) # het sites and homo sites - for hap_index, hap in enumerate(haps): - hap_name = f"hap{hap_index}" + for hap, hap_name in haps.items(): for i in range(len(hap)): if hap[i] == "2": haplotype_variants[hap_name].append(het_sites[i]) @@ -431,11 +561,10 @@ def output_variants_in_haplotypes(self, haps, reads, nonunique): a for a in var_tmp if var_nstart <= int(a.split("_")[0]) <= var_nend ] var_tmp1 = list(set(var_tmp1)) - haplotype_variants[hap_name] = sorted( - var_tmp1, - key=lambda x: int(x.split("_")[0]), + var_tmp2 = sorted(var_tmp1, key=lambda x: int(x.split("_")[0])) + haplotype_info.setdefault( + hap_name, {"variants": var_tmp2, "boundary": [var_nstart, var_nend]} ) - haplotype_variants[hap_name].append((var_nstart, var_nend)) # summary per variant all_haps = haps @@ -443,15 +572,18 @@ def output_variants_in_haplotypes(self, haps, reads, nonunique): for var in self.homo_sites: dvar.setdefault(var, ["1"] * nhap) for i, var in enumerate(het_sites): - dvar.setdefault(var, ["."] * nhap) - for hap_index in range(len(all_haps)): - hap = all_haps[hap_index] + dvar.setdefault(var, []) + for hap, hap_name in haps.items(): + base_call = "." if hap[i] == "2": - dvar[var][hap_index] = "1" + base_call = "1" elif hap[i] == "1": - dvar[var][hap_index] = "0" + base_call = "0" + dvar[var].append(base_call) + if hap_name in two_cp_haps: + dvar[var].append(base_call) - return haplotype_variants, { + return haplotype_info, { var: "|".join(dvar[var]) for var in dict(sorted(dvar.items())) } @@ -509,10 +641,19 @@ def update_reads_for_deletions( if read in del_reads_partial: hap[pos1] = base elif ( - "x" not in hap[(pos1 - 2) : pos1] - and "x" not in hap[(pos1 + 1) : (pos1 + 3)] + hap[pos1 - 1] == "0" + and pos1 - 1 >= 0 + and hap[pos1 + 1] == "0" + and pos1 + 1 < len(hap) ): - hap[pos1] = "1" + hap[pos1] = "0" + else: + flanking_left = hap[min(0, pos1 - 2) : pos1] + flanking_right = hap[ + max(pos1 + 1, len(hap)) : max(pos1 + 3, len(hap)) + ] + if "x" not in flanking_left and "x" not in flanking_right: + hap[pos1] = "1" raw_read_haps[read] = "".join(hap) return raw_read_haps, het_sites @@ -587,9 +728,10 @@ def phase_haps(self, raw_read_haps, debug=False): read_counts = self.get_read_counts(uniquely_supporting_haps) uniquely_supporting_reads = {} + for hap in ass_haps: + uniquely_supporting_reads.setdefault(hap, []) for hap in uniquely_supporting_haps: for read_hap in uniquely_supporting_haps[hap]: - uniquely_supporting_reads.setdefault(hap, []) uniquely_supporting_reads[hap] += haplotypes_to_reads[read_hap] for hap in uniquely_supporting_haps: uniquely_supporting_reads[hap] = list(set(uniquely_supporting_reads[hap])) @@ -611,6 +753,68 @@ def phase_haps(self, raw_read_haps, debug=False): read_counts, ) + def compare_depth(self, haplotypes, loose=False): + """ + For each haplotype, identify the variants where it's different + from other haplotypes. Check depth at those variant sites and + see if the depth suggests twice coverage. + """ + if haplotypes is None or len(haplotypes) == 1: + return [] + two_cp_haps = [] + bamh = self._bamh + boundaries = [haplotypes[a]["boundary"] for a in haplotypes] + nstart = max([a[0] for a in boundaries]) + nend = min(a[1] for a in boundaries) + variants = set() + for hap in haplotypes: + vars = haplotypes[hap]["variants"] + for var in vars: + if len(var.split("_")) == 3: + pos, ref, alt = var.split("_") + pos = int(pos) + if nstart < pos < nend and var in self.het_sites: + variants.add(var) + + for hap in haplotypes: + sites = {} + other_haps = [a for a in haplotypes.keys() if a != hap] + other_cn = len(other_haps) + this_hap_var = haplotypes[hap]["variants"] + other_haps_var = [] + for a in other_haps: + other_haps_var += haplotypes[a]["variants"] + for var in variants: + pos, ref, alt = var.split("_") + pos = int(pos) + if var in this_hap_var and var not in other_haps_var: + sites.setdefault(pos, alt) + elif var not in this_hap_var and other_haps_var.count(var) == other_cn: + sites.setdefault(pos, ref) + + counts = [] + for pos in sites: + hap_base = sites[pos] + for pileupcolumn in bamh.pileup( + self.nchr, pos - 1, pos, truncate=True, min_base_quality=29 + ): + bases = [a.upper() for a in pileupcolumn.get_query_sequences()] + base_num = bases.count(hap_base) + counts.append([base_num, len(bases) - base_num]) + + probs = [] + nsites = len(sites) + for n1, n2 in counts: + probs.append(self.depth_prob(n1, n2 / other_cn)) + probs_fil = [a for a in probs if a[0] < 0.25] + if len(probs_fil) >= nsites * 0.6 and nsites >= 5: + two_cp_haps.append(hap) + elif loose is True: + if len(probs_fil) >= nsites * 0.5 and nsites >= 5: + two_cp_haps.append(hap) + + return two_cp_haps + def close_handle(self): self._bamh.close() self._refh.close() diff --git a/paraphase/prepare_bam_and_vcf.py b/paraphase/prepare_bam_and_vcf.py index ccfae3e..32e2ef0 100755 --- a/paraphase/prepare_bam_and_vcf.py +++ b/paraphase/prepare_bam_and_vcf.py @@ -8,6 +8,7 @@ import random import re from collections import Counter +from .haplotype_assembler import VariantGraph class BamRealigner: @@ -16,11 +17,12 @@ class BamRealigner: """ min_mapq = 50 - min_aln = 400 + min_aln = 800 def __init__(self, bam, outdir, config): self.bam = bam self.outdir = outdir + self.gene = config["gene"] self.nchr = config["coordinates"]["hg38"]["nchr"] self.ref = config["data"]["reference"] self.nchr_old = config["coordinates"]["hg38"]["nchr_old"] @@ -30,13 +32,16 @@ def __init__(self, bam, outdir, config): self.extract_region2 = config["coordinates"]["hg38"]["extract_region2"] self.samtools = config["tools"]["samtools"] self.minimap2 = config["tools"]["minimap2"] + self.max_mismatch = 1 + if "check_nm" in config: + self.max_mismatch = config["check_nm"] self._bamh = pysam.AlignmentFile(bam, "rb") self.sample_id = bam.split("/")[-1].split(".")[0] self.realign_bam = os.path.join( - self.outdir, self.sample_id + "_realigned_old.bam" + self.outdir, self.sample_id + f"_{self.gene}_realigned_old.bam" ) self.realign_out_bam = os.path.join( - self.outdir, self.sample_id + "_realigned.bam" + self.outdir, self.sample_id + f"_{self.gene}_realigned.bam" ) def write_realign_bam(self): @@ -65,6 +70,7 @@ def write_realign_bam(self): if ( read.mapping_quality >= self.min_mapq and read.query_alignment_length >= self.min_aln + and (read.get_tag("NM") < read.reference_length * self.max_mismatch) ): read.reference_start += self.offset ltags = read.tags @@ -103,18 +109,26 @@ class BamTagger: """ read_color = "166,206,227" + read_color_allele1 = "178,223,138" + read_color_allele2 = "177,156,217" def __init__(self, sample_id, outdir, config, call_sum): self.sample_id = sample_id self.outdir = outdir self.call_sum = call_sum - self.bam = os.path.join(outdir, self.sample_id + "_realigned.bam") + self.gene = config["gene"] + self.bam = os.path.join(outdir, self.sample_id + f"_{self.gene}_realigned.bam") self.nchr = config["coordinates"]["hg38"]["nchr"] self._bamh = pysam.AlignmentFile(self.bam, "rb") - self.tmp_bam = os.path.join(self.outdir, self.sample_id + "_tmp.bam") + self.tmp_bam = os.path.join( + self.outdir, self.sample_id + f"_{self.gene}_tmp.bam" + ) self.tagged_realigned_bam = os.path.join( - self.outdir, self.sample_id + "_realigned_tagged.bam" + self.outdir, self.sample_id + f"_{self.gene}_realigned_tagged.bam" ) + self.use_supplementary = False + if "use_supplementary" in config: + self.use_supplementary = config["use_supplementary"] random.seed(0) def add_tag_to_read( @@ -123,6 +137,8 @@ def add_tag_to_read( hp_keys, reads_to_tag, nonunique, + read_details, + alleles=[], random_assign=False, ): """ @@ -132,18 +148,48 @@ def add_tag_to_read( while unique reads are in blue. """ hp_found = False + read_name = read.qname + if read.is_supplementary and self.use_supplementary: + read_name = ( + read_name + f"_sup_{read.reference_start}_{read.reference_length}" + ) for hap, hap_name in hp_keys.items(): - if read.qname in reads_to_tag[hap]: + if read_name in reads_to_tag[hap]: read.set_tag("HP", hap_name, "Z") - if random_assign: - read.set_tag("YC", self.read_color, "Z") + read.set_tag("YC", self.read_color, "Z") + if alleles != []: + if hap_name in alleles[0]: + read.set_tag("YC", self.read_color_allele1, "Z") + elif len(alleles) > 1 and hap_name in alleles[1]: + read.set_tag("YC", self.read_color_allele2, "Z") hp_found = True + if hp_found is False: + # find closest match + if read_name in read_details: + read_seq = read_details[read_name] + keys = [] + mismatches = [] + for ass_hap in hp_keys.keys(): + match, mismatch, extend = VariantGraph.compare_two_haps( + read_seq, ass_hap + ) + keys.append(ass_hap) + mismatches.append(mismatch) + mismatches_sorted = sorted(mismatches) + if ( + len(mismatches_sorted) > 1 + and 0 < mismatches_sorted[0] <= 2 + and mismatches_sorted[1] >= mismatches_sorted[0] + 2 + ): + best_match = keys[mismatches.index(mismatches_sorted[0])] + read.set_tag("HP", hp_keys[best_match], "Z") + hp_found = True if hp_found is False: if random_assign is False: read.set_tag("HP", "Unassigned", "Z") else: - if read.qname in nonunique: - possible_haps = nonunique[read.qname] + if read_name in nonunique: + possible_haps = nonunique[read_name] random_hap = possible_haps[ random.randint(0, len(possible_haps) - 1) ] @@ -164,14 +210,20 @@ def write_bam(self, random_assign=False): if nonunique_reads is None: nonunique_reads = {} hp_keys = call_sum.get("final_haplotypes") + read_details = call_sum.get("read_details") + alleles = call_sum.get("alleles_final") + if alleles is None: + alleles = [] for read in self._bamh.fetch(self.nchr): - if read.is_secondary is False: # and read.is_supplementary is False: + if read.is_secondary is False: read = self.add_tag_to_read( read, hp_keys, unique_reads, nonunique_reads, + read_details, + alleles, random_assign=random_assign, ) out_bamh.write(read) @@ -195,7 +247,11 @@ def __init__(self, sample_id, outdir, config, call_sum): self.sample_id = sample_id self.outdir = outdir self.call_sum = call_sum - self.bam = os.path.join(outdir, self.sample_id + "_realigned_tagged.bam") + self.gene = config["gene"] + self.bam = os.path.join( + outdir, self.sample_id + f"_{self.gene}_realigned_tagged.bam" + ) + self.vcf_dir = os.path.join(self.outdir, f"{self.sample_id}_vcfs") self.nchr = config["coordinates"]["hg38"]["nchr"] self.nchr_old = config["coordinates"]["hg38"]["nchr_old"] self.offset = int(self.nchr_old.split("_")[1]) - 1 @@ -203,6 +259,9 @@ def __init__(self, sample_id, outdir, config, call_sum): self.ref = config["data"]["reference"] self.samtools = config["tools"]["samtools"] self.minimap2 = config["tools"]["minimap2"] + self.use_supplementary = False + if "use_supplementary" in config: + self.use_supplementary = config["use_supplementary"] self.match = {} def get_range_in_other_gene(self, pos): @@ -245,7 +304,9 @@ def merge_vcf(self, vars_list): """ Merge vcfs from multiple haplotypes. """ - merged_vcf = os.path.join(self.outdir, self.sample_id + f"_variants.vcf") + merged_vcf = os.path.join( + self.vcf_dir, self.sample_id + f"_{self.gene}_variants.vcf" + ) with open(merged_vcf, "w") as fout: self.write_header(fout) for vars in vars_list: @@ -337,7 +398,9 @@ def get_var(all_bases, ref_seq): gt = "1" return [var_seq, dp, ad, gt] - def read_pileup(self, hap_bam, hap_vcf_out, hap_bound, offset, ref, uniq_reads): + def call_variants_from_hp_bam( + self, hap_bam, hap_vcf_out, hap_bound, offset, ref, uniq_reads + ): """ Call variants from bam. Take the most supported base at each position. """ @@ -348,11 +411,11 @@ def read_pileup(self, hap_bam, hap_vcf_out, hap_bound, offset, ref, uniq_reads): self.write_header(vcf_out) pileups_raw = {} read_names = {} - variants = [] for pileupcolumn in bamh.pileup( ref_name, truncate=True, min_base_quality=30, + min_mapping_quality=59, ): pos = pileupcolumn.pos + 1 pileups_raw.setdefault( @@ -363,47 +426,15 @@ def read_pileup(self, hap_bam, hap_vcf_out, hap_bound, offset, ref, uniq_reads): pos, pileupcolumn.get_query_names(), ) - for pos in pileups_raw: - all_bases = pileups_raw[pos] - true_pos = pos + offset - ref_seq = refh.fetch(ref_name, pos - 1, pos) - alt_all_reads = self.get_var(all_bases, ref_seq) - if None not in hap_bound and hap_bound[0] < true_pos < hap_bound[1]: - # use only unique reads for positions at the edge - if true_pos < hap_bound[2] or true_pos > hap_bound[3]: - bases_uniq_reads = [] - for i, read_base in enumerate(all_bases): - if read_names[pos][i] in uniq_reads: - bases_uniq_reads.append(read_base) - alt_uniq_reads = self.get_var(bases_uniq_reads, ref_seq) - if alt_uniq_reads[-1] != ".": - var_seq, dp, ad, gt = alt_uniq_reads - else: - var_seq, dp, ad, gt = alt_all_reads - gt = "." - else: - var_seq, dp, ad, gt = alt_all_reads - - ref_seq, var_seq = self.refine_indels( - ref_seq, var_seq, pos, refh, ref_name - ) - var = f"{true_pos}_{ref_seq}_{var_seq}" - qual = "." - variants.append([true_pos, var, dp, ad, qual, gt]) - if gt == "1": - vcf_out_line = [ - self.nchr, - str(true_pos), - ".", - ref_seq, - var_seq, - str(qual), - "PASS", - ".", - "GT:DP:AD", - f"1:{dp}:{ad}", - ] - vcf_out.write("\t".join(vcf_out_line) + "\n") + variants = self.pileup_to_variant( + pileups_raw, + read_names, + uniq_reads, + refh, + offset, + hap_bound, + vcf_out, + ) vcf_out.close() bamh.close() refh.close() @@ -412,7 +443,6 @@ def read_pileup(self, hap_bam, hap_vcf_out, hap_bound, offset, ref, uniq_reads): def run_step( self, final_haps, - haplotype_details, ref_seq, offset, match_range=False, @@ -430,18 +460,7 @@ def run_step( ) i = 0 for hap_name in final_haps.values(): - hap_bound = list(haplotype_details[hap_name]["boundary"]) - # find the positions next to the existing boundaries - for var in self.call_sum["sites_for_phasing"]: - confident_position = int(var.split("_")[0]) - if confident_position > hap_bound[0]: - break - hap_bound.append(confident_position) - for var in reversed(self.call_sum["sites_for_phasing"]): - confident_position = int(var.split("_")[0]) - if confident_position < hap_bound[1]: - break - hap_bound.append(confident_position) + hap_bound = self.get_hap_bound(hap_name) # convert to positions in the other gene if match_range: hap_bound = [ @@ -450,7 +469,10 @@ def run_step( self.get_range_in_other_gene(hap_bound[2]), self.get_range_in_other_gene(hap_bound[3]), ] - hap_bam = os.path.join(self.outdir, self.sample_id + f"_{hap_name}.bam") + hap_bam = os.path.join( + self.outdir, self.sample_id + f"_{self.gene}_{hap_name}.bam" + ) + realign_cmd = ( f"{self.samtools} view -d HP:{hap_name} {self.bam} |" + f'awk \'BEGIN {{FS="\\t"}} {{print "@" $1 "\\n" $10 "\\n+\\n" $11}}\'' @@ -463,8 +485,10 @@ def run_step( pysam.index(hap_bam) # call variants - hap_vcf_out = os.path.join(self.outdir, self.sample_id + f"_{hap_name}.vcf") - variants_called = self.read_pileup( + hap_vcf_out = os.path.join( + self.vcf_dir, self.sample_id + f"_{self.gene}_{hap_name}.vcf" + ) + variants_called = self.call_variants_from_hp_bam( hap_bam, hap_vcf_out, hap_bound, offset, ref_seq, uniq_reads ) os.remove(hap_bam) @@ -483,14 +507,194 @@ def run_step( def run(self): """Process haplotypes one by one""" call_sum = self.call_sum + if call_sum is None: + return vars = self.run_step( call_sum["final_haplotypes"], - call_sum["haplotype_details"], self.ref, self.offset, ) self.merge_vcf([vars]) + def get_hap_bound(self, hap_name): + """Get haplotype boundaries""" + hap_bound = list(self.call_sum["haplotype_details"][hap_name]["boundary"]) + # find the positions next to the existing boundaries + for var in self.call_sum["sites_for_phasing"]: + confident_position = int(var.split("_")[0]) + if confident_position > hap_bound[0]: + break + hap_bound.append(confident_position) + for var in reversed(self.call_sum["sites_for_phasing"]): + confident_position = int(var.split("_")[0]) + if confident_position < hap_bound[1]: + break + hap_bound.append(confident_position) + return hap_bound + + def pileup_to_variant( + self, + pileups_raw, + read_names, + uniq_reads, + refh, + offset, + hap_bound, + vcf_out, + ): + """ + Filter pileups and make variant calls. + """ + ref_name = refh.references[0] + variants = [] + for pos in pileups_raw: + all_bases = pileups_raw[pos] + if offset < 0: + true_pos = pos + refh_pos = pos + offset + else: + true_pos = pos + offset + refh_pos = pos + ref_seq = refh.fetch(ref_name, refh_pos - 1, refh_pos) + alt_all_reads = self.get_var(all_bases, ref_seq) + if None not in hap_bound and hap_bound[0] < true_pos < hap_bound[1]: + # use only unique reads for positions at the edge + if true_pos < hap_bound[2] or true_pos > hap_bound[3]: + bases_uniq_reads = [] + for i, read_base in enumerate(all_bases): + if read_names[pos][i] in uniq_reads: + bases_uniq_reads.append(read_base) + alt_uniq_reads = self.get_var(bases_uniq_reads, ref_seq) + if alt_uniq_reads[-1] != ".": + var_seq, dp, ad, gt = alt_uniq_reads + else: + var_seq, dp, ad, gt = alt_all_reads + gt = "." + else: + var_seq, dp, ad, gt = alt_all_reads + + ref_seq, var_seq = self.refine_indels( + ref_seq, var_seq, refh_pos, refh, ref_name + ) + var = f"{true_pos}_{ref_seq}_{var_seq}" + qual = "." + variants.append([true_pos, var, dp, ad, qual, gt]) + if gt == "1": + vcf_out_line = [ + self.nchr, + str(true_pos), + ".", + ref_seq, + var_seq, + str(qual), + "PASS", + ".", + "GT:DP:AD", + f"1:{dp}:{ad}", + ] + vcf_out.write("\t".join(vcf_out_line) + "\n") + return variants + + def run_without_realign(self): + """ + Make vcf from existing alignment. + This works for gene/pseudogene scenarios, + i.e. no need to realign to pseudogene reference. + """ + call_sum = self.call_sum + if call_sum is None: + return + final_haps = call_sum["final_haplotypes"] + uniq_reads = [] + for read_set in self.call_sum["unique_supporting_reads"].values(): + for read_name in read_set: + read_name_split = read_name.split("_") + # supplementary alignments + if self.use_supplementary and len(read_name_split) > 1: + uniq_reads.append("_".join(read_name_split[:-1])) + else: + uniq_reads.append(read_name) + vars = {} + two_cp_haplotypes = self.call_sum.get("two_copy_haplotypes") + nhap = len(final_haps) + len( + [a for a in two_cp_haplotypes if a in final_haps.values()] + ) + + bamh = pysam.AlignmentFile(self.bam, "rb") + refh = pysam.FastaFile(self.ref) + + i = 0 + for hap_name in final_haps.values(): + hap_bound = self.get_hap_bound(hap_name) + hap_vcf_out = os.path.join( + self.vcf_dir, self.sample_id + f"_{self.gene}_{hap_name}.vcf" + ) + vcf_out = open(hap_vcf_out, "w") + self.write_header(vcf_out) + + # by HP tag + pileups_raw = {} + read_names = {} + for pileupcolumn in bamh.pileup( + self.nchr, + truncate=True, + min_base_quality=30, + ): + pos = pileupcolumn.pos + 1 + this_pos_bases = [ + a.upper() for a in pileupcolumn.get_query_sequences(add_indels=True) + ] + this_position_hps = [] + this_pos_read_names = [] + this_pos_read_names_sup = [] + + for read in pileupcolumn.pileups: + read_tag = read.alignment.get_tag("HP") + this_position_hps.append(read_tag) + read_name = read.alignment.query_name + new_read_name = read_name + if self.use_supplementary and read.alignment.is_supplementary: + new_read_name = ( + read_name + f"_sup_{read.alignment.reference_start}" + ) + this_pos_read_names.append(read_name) + this_pos_read_names_sup.append(new_read_name) + + for read_num, read_hap in enumerate(this_position_hps): + if read_hap == hap_name: + pileups_raw.setdefault(pos, []).append(this_pos_bases[read_num]) + if self.use_supplementary: + read_names.setdefault(pos, []).append( + this_pos_read_names_sup[read_num] + ) + else: + read_names.setdefault(pos, []).append( + this_pos_read_names[read_num] + ) + + variants_called = self.pileup_to_variant( + pileups_raw, + read_names, + uniq_reads, + refh, + 0 - self.offset, + hap_bound, + vcf_out, + ) + vcf_out.close() + + for pos, var_name, dp, ad, qual, gt in variants_called: + vars.setdefault(pos, [None] * nhap) + vars[pos][i] = [var_name, dp, ad, qual, gt] + if hap_name in two_cp_haplotypes: + vars[pos][i + 1] = [var_name, dp, ad, qual, gt] + if hap_name in two_cp_haplotypes: + i += 1 + i += 1 + self.merge_vcf([vars]) + bamh.close() + refh.close() + class TwoGeneVcfGenerater(VcfGenerater): """ @@ -514,15 +718,15 @@ def run(self): in this two-gene scenario """ call_sum = self.call_sum + if call_sum is None: + return vars_smn1 = self.run_step( call_sum["smn1_haplotypes"], - call_sum["haplotype_details"], self.ref, self.offset, ) vars_smn2 = self.run_step( call_sum["smn2_haplotypes"], - call_sum["haplotype_details"], self.ref_gene2, self.offset_gene2, match_range=True, diff --git a/setup.py b/setup.py index f97638b..094a8c6 100644 --- a/setup.py +++ b/setup.py @@ -8,19 +8,18 @@ def readme(): setup( name="paraphase", - version="1.1.3", - description="paraphase: HiFi-based SMN1/SMN2 variant caller", + version="2.0.0", + description="paraphase: HiFi-based caller for highly homologous genes", long_description=readme(), url="https://github.com/PacificBiosciences/paraphase", author="Xiao Chen", author_email="xchen@pacificbiosciences.com", license="BSD-3-Clause-Clear", - packages=["paraphase"], - package_data={"paraphase": ["data/*", "data/smn1/*"]}, + packages=["paraphase", "paraphase.genes"], + package_data={"paraphase": ["data/*", "data/**/*"]}, install_requires=["pysam", "numpy", "scipy", "networkx", "matplotlib", "pyyaml"], setup_requires=["pytest-runner"], tests_require=["pytest"], entry_points={"console_scripts": ["paraphase=paraphase.paraphase:main"]}, - include_package_data=True, long_description_content_type="text/markdown", ) diff --git a/tests/test_data/HG00733_realigned.bam b/tests/test_data/HG00733_smn1_realigned.bam similarity index 100% rename from tests/test_data/HG00733_realigned.bam rename to tests/test_data/HG00733_smn1_realigned.bam diff --git a/tests/test_data/HG00733_realigned.bam.bai b/tests/test_data/HG00733_smn1_realigned.bam.bai similarity index 100% rename from tests/test_data/HG00733_realigned.bam.bai rename to tests/test_data/HG00733_smn1_realigned.bam.bai diff --git a/tests/test_phaser.py b/tests/test_phaser.py index 9cc88eb..c10e03e 100755 --- a/tests/test_phaser.py +++ b/tests/test_phaser.py @@ -10,7 +10,7 @@ class TestPhaser(object): sample_dir = os.path.join(cur_dir, "test_data") sample_id = "HG00733" data_dir = os.path.join(os.path.dirname(cur_dir), "paraphase", "data") - config_file = os.path.join(data_dir, "smn1", "config.yaml") + config_file = os.path.join(data_dir, "smn1", "smn1_config.yaml") with open(config_file, "r") as f: config = yaml.safe_load(f) data_paths = config.get("data") @@ -18,7 +18,8 @@ class TestPhaser(object): old_data_file = data_paths[data_entry] new_data_file = os.path.join(data_dir, "smn1", old_data_file) data_paths[data_entry] = new_data_file - phaser = Phaser(sample_id, sample_dir, config) + phaser = Phaser(sample_id, sample_dir) + phaser.set_parameter(config) def test_depth_prob(self): prob = Phaser.depth_prob(40, 20) diff --git a/tests/test_smn_phaser.py b/tests/test_smn1_phaser.py similarity index 97% rename from tests/test_smn_phaser.py rename to tests/test_smn1_phaser.py index 1d5b2b5..2c3500a 100755 --- a/tests/test_smn_phaser.py +++ b/tests/test_smn1_phaser.py @@ -1,16 +1,16 @@ import pytest import yaml import os -from paraphase.smn_phaser import SmnPhaser +from paraphase.genes.smn1_phaser import Smn1Phaser -class TestSmnPhaser(object): +class TestSmn1Phaser(object): cur_dir = os.path.dirname(__file__) sample_dir = os.path.join(cur_dir, "test_data") sample_id = "HG00733" data_dir = os.path.join(os.path.dirname(cur_dir), "paraphase", "data") - config_file = os.path.join(data_dir, "smn1", "config.yaml") + config_file = os.path.join(data_dir, "smn1", "smn1_config.yaml") with open(config_file, "r") as f: config = yaml.safe_load(f) data_paths = config.get("data") @@ -18,7 +18,8 @@ class TestSmnPhaser(object): old_data_file = data_paths[data_entry] new_data_file = os.path.join(data_dir, "smn1", old_data_file) data_paths[data_entry] = new_data_file - phaser = SmnPhaser(sample_id, sample_dir, config) + phaser = Smn1Phaser(sample_id, sample_dir) + phaser.set_parameter(config) def test_check_smn1_smn2_presence(self): self.phaser.check_smn1_smn2_presence()