What's the format of the output MSA, fasta, a3m or a2m? #14
Comments
Hi, I don't think anything went wrong in your conversion. Otherwise: what exactly do you mean by "aligned terribly"? Could you send me the input, the learnMSA output, and the learnMSA version and settings (if non-default) you used?
The three motifs A, B and C of the RdRP domain weren't aligned, compared to the muscle super alignment. So I think I may have misunderstood the format of the output. I installed learnMSA as described in the README and ran the following command:
I have sent the input and output files to you by e-mail (beckerfelix94).
Update: I checked reformat.pl. Perhaps it's the character limit per line. learnMSA does not have a character limit, but reformat.pl seems to assume one (100 by default). Using the argument -l "alignment length" in reformat.pl could help. I'll also check your files. I will add more options to configure the learnMSA output soon and make the documentation clearer.
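A quick way to rule out line wrapping as the cause is to join the wrapped lines of each record and check that all rows have the same length. The following is a minimal sketch in plain Python; `read_fasta` and `row_lengths` are hypothetical helper names, not part of learnMSA or hhsuite, and the toy input is made up for illustration.

```python
def read_fasta(text):
    """Parse FASTA-like text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    # Concatenate the pieces so the per-line character limit no longer matters.
    return {h: "".join(parts) for h, parts in records.items()}


def row_lengths(records):
    """Return the set of distinct row lengths; an aligned file yields one value."""
    return {len(seq) for seq in records.values()}


# Toy input (hypothetical): two rows wrapped at different positions.
wrapped = ">s1\nAC-D\nEF\n>s2\nACGD\nE-\n"
records = read_fasta(wrapped)
print(records)
print(row_lengths(records))
```

If the set contains more than one length after conversion, the rows are not a valid aligned FASTA, which points to a format or wrapping problem rather than to the alignment itself.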
Thanks for the reply. I think that some sequences share low similarity with the other RdRP sequences (or have longer inserts), which made the alignment look bad. Actually, most of the sequences (about 2500 of 3265) are well aligned. PS: I expected three motifs to be aligned: DXXXX[D/E], [S/T]G and [G/S/A]D[D/N].
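To see which rows place the motifs in different columns, the three patterns can be written as regular expressions and located per row. This is only a rough sketch: the mapping of the patterns to the names A, B and C follows the order given above and is an assumption, `motif_columns` is a hypothetical helper, and only the first match of each pattern is reported, so short patterns such as [S/T]G will often hit unrelated positions in real sequences.

```python
import re

# Motif patterns from the thread, written as regular expressions.
# Assumption: they correspond to motifs A, B and C in the order listed.
MOTIFS = {
    "A": re.compile(r"D.{4}[DE]"),
    "B": re.compile(r"[ST]G"),
    "C": re.compile(r"[GSA]D[DN]"),
}


def motif_columns(aligned_row):
    """Return {motif: alignment column of the first match, or None}."""
    # Remember which alignment column each residue came from.
    cols = [i for i, ch in enumerate(aligned_row) if ch not in "-."]
    ungapped = "".join(aligned_row[i] for i in cols).upper()
    hits = {}
    for name, pattern in MOTIFS.items():
        match = pattern.search(ungapped)
        hits[name] = cols[match.start()] if match else None
    return hits


# Toy rows (hypothetical): same residues, different gap placement.
row1 = "--DAAAAE--SG-GDD"
row2 = "D-AAAAE---SGG-DD"
print(motif_columns(row1))
print(motif_columns(row2))
```

Rows whose reported columns differ from the majority are candidates for the "aligned off" sequences discussed here.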
I checked your files and I agree: learnMSA found your motifs, but some sequences seem to be aligned off, because of low amino acid similarity. I was curious if aligning with language model support ( To reproduce: I aligned with version 2.0.8 (published today) and Best,
I aligned ~3k RdRP domain sequences from RNA viruses. The output looks like the following:
The MSA seems to be in a2m format. So I tried to convert the a2m to fasta format with the reformat.pl script in hhsuite. However, the MSA seemed to be aligned terribly. What's the format of the output MSA: fasta, a3m or a2m?
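For reference, in the usual a2m/a3m convention uppercase letters and "-" are match columns, while lowercase letters are insertions; a2m pads insertions in other rows with ".", and a3m omits that padding. The following is a minimal sketch of expanding such rows to equal-length FASTA rows under that convention. It is not how learnMSA or reformat.pl do it; `expand_inserts` is a hypothetical name, and left-justifying the insert residues is an arbitrary choice, since insertions are not aligned to each other.

```python
def expand_inserts(rows):
    """Expand a2m/a3m rows into equal-length rows with '-' as the only gap."""
    parsed = []
    for row in rows:
        inserts, matches, buf = [], [], []
        for ch in row:
            if ch == ".":
                continue  # a2m padding carries no information
            if ch.islower():
                buf.append(ch)  # insert residue before the next match column
            else:
                inserts.append("".join(buf))
                matches.append(ch)  # uppercase residue or '-' (match column)
                buf = []
        inserts.append("".join(buf))  # trailing insert after the last match
        parsed.append((inserts, matches))

    n_match = {len(m) for _, m in parsed}
    if len(n_match) > 1:
        raise ValueError("rows disagree on the number of match columns")
    n_slots = n_match.pop() + 1 if n_match else 0
    # Each insert slot is as wide as the longest insert found in any row.
    widths = [max(len(ins[k]) for ins, _ in parsed) for k in range(n_slots)]

    out = []
    for inserts, matches in parsed:
        pieces = []
        for k, ins in enumerate(inserts):
            pieces.append(ins.upper().ljust(widths[k], "-"))
            if k < len(matches):
                pieces.append(matches[k])
        out.append("".join(pieces))
    return out


# Toy rows (hypothetical), once in a3m style and once in a2m style.
a3m_rows = ["ACD", "AghC-", "-CDk"]
a2m_rows = ["A..CD.", "AghC-.", "-..CDk"]
print(expand_inserts(a3m_rows))
print(expand_inserts(a2m_rows))
```

Both inputs expand to the same three rows of equal length, which is what a FASTA alignment viewer expects.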