SB Replace subseq
alicarea edited this page Oct 26, 2016
·
4 revisions
Search through the sequences in all records and replace each match with a new string, or delete it.
Provide a sequence or regular expression pattern to search the sequences with (case insensitive).
Optional. If provided, this string replaces every match found; otherwise, the matches are deleted.
>Dme-Panxδ1
YKLLGSLKSYLKWQIQTDNAVFRLHNSFTTVLLLTCSLIITATQYVGQPI
>Dme-Panxδ2
MDVFGSVKGLLKIDQVDNNVFRMHYKATVIILIAFSLLVTSRQYIGDPID
>Dme-Panxδ3
GFIKIDNMVFRCHYRITAILFTCCIIVTANNLIGDPISCIIPMHVINTFC
>Dme-Panxδ4
MAAVKPLSKYLQFKVHIYDAIFTLHSKVTVALLLACTFLLSSKQYFGDPI
>Mle-Panxα1
MYWIFEICQEIKRAQSCRKFAIDGPFDWTNRIIMPTLMVICCFLQTFTFM
>Mle-Panxα5
MIYWVWAVFKRMAPFKVVTLDDRWDQMNRSFMMPLTMSFAYLIDYGIIAG
>Mle-Panxα6
MLLEILANFKGATPFKEIVLDDKWDQINRCYMFLLCVIFGTVVTFRQYTG
>Mle-Panxα9
MLDILSKFKGVTPFKGITIDDGWDQLNRSFMFVLLVVMGTTVTVRQYTGS
Simple search and replace
$: sb C-terms.fa -rs "LL" "*"
>Dme-Panxδ1
YK*GSLKSYLKWQIQTDNAVFRLHNSFTTV*LTCSLIITATQYVGQPI
>Dme-Panxδ2
MDVFGSVKG*KIDQVDNNVFRMHYKATVIILIAFS*VTSRQYIGDPID
>Dme-Panxδ3
GFIKIDNMVFRCHYRITAILFTCCIIVTANNLIGDPISCIIPMHVINTFC
>Dme-Panxδ4
MAAVKPLSKYLQFKVHIYDAIFTLHSKVTVA*LACTF*SSKQYFGDPI
>Mle-Panxα1
MYWIFEICQEIKRAQSCRKFAIDGPFDWTNRIIMPTLMVICCFLQTFTFM
>Mle-Panxα5
MIYWVWAVFKRMAPFKVVTLDDRWDQMNRSFMMPLTMSFAYLIDYGIIAG
>Mle-Panxα6
M*EILANFKGATPFKEIVLDDKWDQINRCYMF*CVIFGTVVTFRQYTG
>Mle-Panxα9
MLDILSKFKGVTPFKGITIDDGWDQLNRSFMFV*VVMGTTVTVRQYTGS
Simple search and delete for all 'M' residues (i.e., blank replace argument)
$: sb C-terms.fa -rs "m"
>Dme-Panxδ1
YKLLGSLKSYLKWQIQTDNAVFRLHNSFTTVLLLTCSLIITATQYVGQPI
>Dme-Panxδ2
DVFGSVKGLLKIDQVDNNVFRHYKATVIILIAFSLLVTSRQYIGDPID
>Dme-Panxδ3
GFIKIDNVFRCHYRITAILFTCCIIVTANNLIGDPISCIIPHVINTFC
>Dme-Panxδ4
AAVKPLSKYLQFKVHIYDAIFTLHSKVTVALLLACTFLLSSKQYFGDPI
>Mle-Panxα1
YWIFEICQEIKRAQSCRKFAIDGPFDWTNRIIPTLVICCFLQTFTF
>Mle-Panxα5
IYWVWAVFKRAPFKVVTLDDRWDQNRSFPLTSFAYLIDYGIIAG
>Mle-Panxα6
LLEILANFKGATPFKEIVLDDKWDQINRCYFLLCVIFGTVVTFRQYTG
>Mle-Panxα9
LDILSKFKGVTPFKGITIDDGWDQLNRSFFVLLVVGTTVTVRQYTGS
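The two simple cases above (literal replacement and deletion) can be reproduced outside of SeqBuddy with Python's `re` module. This is only a minimal sketch: `replace_subseq` is a hypothetical helper written for illustration, and it assumes that `-rs` behaves like a case-insensitive `re.sub` over each sequence, which matches the outputs shown on this page but is not taken from the SeqBuddy source.

```python
import re


def replace_subseq(seq: str, pattern: str, replacement: str = "") -> str:
    """Replace every case-insensitive match of `pattern` in `seq`.

    Hypothetical helper for illustration, not SeqBuddy's own code.
    Leaving `replacement` empty deletes the matches.
    """
    return re.sub(pattern, replacement, seq, flags=re.IGNORECASE)


# Two of the sample records from this page
dme_delta1 = "YKLLGSLKSYLKWQIQTDNAVFRLHNSFTTVLLLTCSLIITATQYVGQPI"
dme_delta2 = "MDVFGSVKGLLKIDQVDNNVFRMHYKATVIILIAFSLLVTSRQYIGDPID"

# Literal search and replace: each "LL" becomes "*"
replaced = replace_subseq(dme_delta1, "LL", "*")

# Search and delete: lowercase "m" still matches uppercase "M"
deleted = replace_subseq(dme_delta2, "m")

print(replaced)
print(deleted)
```

Matches are found left to right and do not overlap, which is why the run `LLL` in Dme-Panxδ1 becomes `*L` rather than `**`.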
More complicated regular expression replacement
$: sb C-terms.fa -rs "[IL].{1,4}[IL]" "_motif_"
>Dme-Panxδ1
YK_motif_KSY_motif_QTDNAVFRLHNSFTTV_motif_TCS_motif_TATQYVGQ
PI
>Dme-Panxδ2
MDVFGSVKG_motif_DQVDNNVFRMHYKATV_motif_AFSLLVTSRQY_motif_D
>Dme-Panxδ3
GF_motif_DNMVFRCHYR_motif_FTCCIIVTANN_motif_SCI_motif_NTFC
>Dme-Panxδ4
MAAVKP_motif_QFKVH_motif_FTLHSKVTVA_motif_ACTFLLSSKQYFGDPI
>Mle-Panxα1
MYW_motif_CQEIKRAQSCRKFAIDGPFDWTNR_motif_MV_motif_QTFTFM
>Mle-Panxα5
MIYWVWAVFKRMAPFKVVTLDDRWDQMNRSFMMPLTMSFAY_motif_IAG
>Mle-Panxα6
M_motif_ANFKGATPFKE_motif_DDKWDQINRCYMF_motif_FGTVVTFRQYTG
>Mle-Panxα9
M_motif_SKFKGVTPFKG_motif_DDGWDQLNRSFMFVLLVVMGTTVTVRQYTGS
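To see which stretches the pattern `[IL].{1,4}[IL]` actually consumes, it can help to list the matches before replacing them. The sketch below uses Python's `re` module on the Dme-Panxδ1 record and assumes that SeqBuddy applies the pattern with the same semantics (case-insensitive, greedy, leftmost, non-overlapping); the variable names are illustrative only.

```python
import re

# Same pattern as in the command above, compiled case-insensitively
motif = re.compile(r"[IL].{1,4}[IL]", re.IGNORECASE)

seq = "YKLLGSLKSYLKWQIQTDNAVFRLHNSFTTVLLLTCSLIITATQYVGQPI"

# (start position, matched text) for every non-overlapping match
hits = [(m.start(), m.group()) for m in motif.finditer(seq)]

# Replace every match with the placeholder string
masked = motif.sub("_motif_", seq)

for start, text in hits:
    print(start, text)
print(masked)
```

The quantifier `.{1,4}` is greedy, so each match extends as far as it can while still ending on an `I` or `L`. The replacement lengthens this record to 62 characters, and in the output above it appears wrapped after 60 characters, which would explain the trailing `PI` on its own line.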
Retain part of a match in the replacement string
$: sb C-terms.fa -rs "[IL](.{1,4})[IL]" ">\1<"
>Dme-Panxδ1
YK>LGS<KSY>KWQ<QTDNAVFRLHNSFTTV>L<TCS>I<TATQYVGQPI
>Dme-Panxδ2
MDVFGSVKG>LK<DQVDNNVFRMHYKATV>IL<AFSLLVTSRQY>GDP<D
>Dme-Panxδ3
GF>K<DNMVFRCHYR>TAI<FTCCIIVTANN>IGDP<SCI>PMHV<NTFC
>Dme-Panxδ4
MAAVKP>SKY<QFKVH>YDA<FTLHSKVTVA>L<ACTFLLSSKQYFGDPI
>Mle-Panxα1
MYW>FE<CQEIKRAQSCRKFAIDGPFDWTNR>IMPT<MV>CCF<QTFTFM
>Mle-Panxα5
MIYWVWAVFKRMAPFKVVTLDDRWDQMNRSFMMPLTMSFAY>IDYG<IAG
>Mle-Panxα6
M>LEI<ANFKGATPFKE>V<DDKWDQINRCYMF>LCV<FGTVVTFRQYTG
>Mle-Panxα9
M>DI<SKFKGVTPFKG>T<DDGWDQLNRSFMFVLLVVMGTTVTVRQYTGS
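The `\1` in the replacement string refers to the first parenthesized group of the pattern. As a rough illustration, the same substitution can be written with Python's `re.sub`; this assumes SeqBuddy passes the pattern and replacement through with Python-style backreferences, which is consistent with the output above but not confirmed here. The second call, with three groups, is an extra example of the Python syntax and is not a documented SeqBuddy usage.

```python
import re

seq = "YKLLGSLKSYLKWQIQTDNAVFRLHNSFTTVLLLTCSLIITATQYVGQPI"

# Group 1 captures the residues between the flanking I/L positions;
# the flanking residues themselves are dropped from the result.
inner_only = re.sub(r"[IL](.{1,4})[IL]", r">\1<", seq, flags=re.IGNORECASE)

# Assumption-labelled variant: capture the flanks as well, so that
# nothing is lost and the brackets only mark the inner residues.
flanks_kept = re.sub(
    r"([IL])(.{1,4})([IL])", r"\1>\2<\3", seq, flags=re.IGNORECASE
)

print(inner_only)
print(flanks_kept)
```

Raw strings (`r"..."`) keep Python from interpreting `\1` as an escape sequence before the regular expression engine sees it; on the command line, the quoting shown in the example above serves the same purpose.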