m6A sites identified on transcript basis #377

baibhav-bioinfo · 2025-02-19T18:19:50Z

Hello,
I am using a DRS dataset to identify m6A sites in a plant dataset with the dorado + modkit tools.

If I want the m6A sites reported at transcript coordinates, can I align the unaligned basecalled BAM files to transcript.fasta instead of genome.fasta, then use modkit to produce bedMethyl files of the m6A sites for downstream analysis?

And then later work out the genomic locations of the sites for representation, visualisation, etc.?

ArtRand · 2025-02-20T02:31:41Z

Hello @baibhav-bioinfo,

Two options I can think of off the top of my head are:

1. Align to the transcriptome and to the genome, then use their pileups for individual tasks.
2. Align to the genome, then make a BED of the transcripts and use the --include-bed argument to modkit pileup so that only m6A positions within those transcripts are included. Of course, you'll have to make sure to get the BED file right so that you get the isoforms you want.

You could convert transcript coordinates to genome coordinates with LiftOver or a similar tool, but Modkit doesn't have any such functionality.
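
As a rough illustration of that coordinate conversion (again, not something Modkit does), here is a minimal Python sketch. It assumes you already have the exon intervals of the transcript, for example from a GTF, as 0-based half-open genomic coordinates, and that transcript positions are 0-based. The function name `transcript_to_genome` and the example coordinates are hypothetical.

```python
from typing import List, Tuple


def transcript_to_genome(tx_pos: int, exons: List[Tuple[int, int]], strand: str) -> int:
    """Map a 0-based transcript position to a 0-based genomic position.

    exons  : 0-based, half-open (start, end) genomic intervals of ONE transcript
             (assumed non-overlapping).
    strand : "+" or "-", the strand of the transcript on the genome.
    """
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    if tx_pos < 0:
        raise ValueError("transcript position must be non-negative")

    # Exons are traversed in transcript order:
    # ascending genomic order for "+", descending for "-".
    ordered = sorted(exons, reverse=(strand == "-"))

    remaining = tx_pos
    for start, end in ordered:
        length = end - start
        if remaining < length:
            # The position falls inside this exon.
            return start + remaining if strand == "+" else end - 1 - remaining
        remaining -= length

    raise ValueError("position lies beyond the end of the transcript")


if __name__ == "__main__":
    # Hypothetical two-exon transcript.
    exons = [(100, 200), (300, 400)]
    print(transcript_to_genome(150, exons, "+"))  # 350
    print(transcript_to_genome(150, exons, "-"))  # 149
```

Note that a site reported on the + strand of a transcript ends up on the genomic - strand when the gene itself is on the - strand, so the strand column needs converting along with the position.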

baibhav-bioinfo · 2025-02-25T01:05:31Z

Hi,
I tried aligning to the transcriptome.
Then I made a bedMethyl file with modkit, and now I see that every site is on the + strand.

Why is that? When I aligned to the genome earlier, I was getting roughly half of the sites on each strand.

Is there any particular reason for that?
(PS: for the alignment I selected only full-length reads that align to the reference transcriptome.)
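
A small Python sketch for checking the strand split in a bedMethyl file is given below. It assumes the strand is in the sixth column, as in the bedMethyl output of modkit pileup; the function name `strand_counts` and the two example rows are made up for illustration. An all-forward result is what one would expect with DRS reads aligned to transcripts, assuming the transcript FASTA stores each sequence in the sense orientation (as is typical): the sequenced RNA molecule is itself the sense strand, whereas on the genome the genes lie on both strands.

```python
from collections import Counter
from typing import Iterable


def strand_counts(lines: Iterable[str]) -> Counter:
    """Count bedMethyl rows per strand (column 6, i.e. index 5)."""
    counts: Counter = Counter()
    for line in lines:
        # Skip blank lines and comment/header lines.
        if not line.strip() or line.startswith("#"):
            continue
        # Split on any whitespace so tab- and space-delimited rows both work.
        fields = line.split()
        counts[fields[5]] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical rows, truncated to the first few bedMethyl columns.
    example_rows = [
        "tx1\t10\t11\ta\t20\t+\t10\t11\t255,0,0\t20\t50.00",
        "tx2\t57\t58\ta\t12\t+\t57\t58\t255,0,0\t12\t25.00",
    ]
    print(strand_counts(example_rows))
    # For a real file, something like:
    #   with open("sites.bed") as fh:   # "sites.bed" is a placeholder path
    #       print(strand_counts(fh))
```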

baibhav-bioinfo · 2025-02-28T16:20:54Z

Hi @ArtRand,
I think I will go with the second option you suggested (mapping to the transcriptome is too confusing).

So I will map the reads to the genome and then, when running modkit pileup, provide a BED file of the transcripts (--include-bed option) to get the m6A sites only in those regions.
I will use the GTF file to generate the transcripts.bed file.
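
The GTF-to-BED step could be sketched roughly as follows in Python. This is only an illustration under a few assumptions: the GTF has `transcript` feature rows carrying a `transcript_id` attribute, and a six-column BED is wanted. The function name `gtf_to_bed` and its `feature` parameter are hypothetical. GTF coordinates are 1-based and inclusive while BED is 0-based and half-open, so only the start is shifted by one.

```python
import re
from typing import Iterable, Iterator

# Pulls the transcript ID out of the GTF attribute column.
TRANSCRIPT_ID = re.compile(r'transcript_id "([^"]+)"')


def gtf_to_bed(lines: Iterable[str], feature: str = "transcript") -> Iterator[str]:
    """Yield BED6 lines for every GTF row whose feature type matches `feature`."""
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != feature:
            continue
        chrom, strand, attrs = fields[0], fields[6], fields[8]
        start = int(fields[3]) - 1  # GTF 1-based inclusive -> BED 0-based
        end = int(fields[4])        # the end stays the same (half-open)
        match = TRANSCRIPT_ID.search(attrs)
        name = match.group(1) if match else "."
        yield f"{chrom}\t{start}\t{end}\t{name}\t0\t{strand}"


if __name__ == "__main__":
    # Hypothetical GTF rows for demonstration only.
    example = [
        'chr1\tsrc\ttranscript\t101\t200\t.\t-\t.\tgene_id "g1"; transcript_id "t1";',
        'chr1\tsrc\texon\t101\t150\t.\t-\t.\tgene_id "g1"; transcript_id "t1";',
    ]
    for bed_line in gtf_to_bed(example):
        print(bed_line)
```

Transcript rows span introns as well as exons; passing `feature="exon"` restricts the BED to exonic intervals. The resulting file is what would then be given to modkit pileup through --include-bed.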

ArtRand added the "question" label (Looking for clarification on inputs and/or outputs) on Feb 20, 2025
