
Genome Annotation: Software Usage

Author: Internet

1. RepeatMasker

1.1 Input

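Input sequences must be in FASTA format; other formats such as GenBank or Staden are not accepted. RepeatMasker can process a single batch file (one file containing many sequences), or it can process many files in one run (each file containing a single sequence).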
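Because only FASTA is accepted, a quick sanity check before a run can save time. The following is a minimal sketch, not part of RepeatMasker: the helper names looks_like_fasta and count_sequences are hypothetical, and the check only assumes that a FASTA file's first non-empty line is a header beginning with ">".

```shell
#!/usr/bin/env bash
# Hypothetical helpers for illustration; they are not part of RepeatMasker.

# Succeed only if the first non-empty line of the file starts with '>',
# as a FASTA header line does. A GenBank flat file (first line "LOCUS ...")
# fails this check.
looks_like_fasta() {
    local first
    first=$(grep -m 1 -v '^[[:space:]]*$' "$1") || return 1
    [[ $first == '>'* ]]
}

# Print the number of sequences in a FASTA file by counting header lines.
count_sequences() {
    grep -c '^>' "$1"
}

# Report on every file named on the command line (does nothing without arguments).
for f in "$@"; do
    if looks_like_fasta "$f"; then
        echo "$f: $(count_sequences "$f") sequence(s)"
    else
        echo "$f: does not look like FASTA" >&2
    fi
done
```

Saved under an assumed name such as check_fasta.sh, it could be run as `bash check_fasta.sh *.fasta`. A file reporting more than one sequence is a batch file in the sense described above.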

RepeatMasker *.fasta

This command masks every file in the current directory whose name ends in .fasta and writes a separate report for each file. If you have many small sequences, running RepeatMasker on one batch file is considerably faster than running it on many single-sequence files, and the summary file is more informative as well. Analysis of single files (when each is larger than 2 kb) can, however, be slightly more accurate, because the GC level of each sequence is calculated and used to choose appropriate parameters.
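When speed matters more than per-sequence GC tuning, many small FASTA files can be merged into one batch file before the run. The sketch below is one possible way to do that. The function name make_batch and the file names in the usage note are hypothetical, and RepeatMasker is invoked only if it is found on PATH.

```shell
#!/usr/bin/env bash
# Hypothetical helper for illustration; it is not part of RepeatMasker.
# Usage: make_batch OUTPUT INPUT...
# Concatenates the INPUT FASTA files into OUTPUT, one after another.
make_batch() {
    local out=$1 f
    shift
    : > "$out"                      # create or truncate the batch file
    for f in "$@"; do
        cat "$f" >> "$out"
        # If the file does not end with a newline, add one so that the next
        # header line starts on its own line.
        if [ -n "$(tail -c 1 "$f")" ]; then
            echo >> "$out"
        fi
    done
}

# When called with an output name and at least one input, build the batch
# file and run RepeatMasker on it if the program is installed.
if [ "$#" -ge 2 ]; then
    make_batch "$@"
    if command -v RepeatMasker >/dev/null 2>&1; then
        RepeatMasker "$1"
    else
        echo "RepeatMasker not found on PATH; only wrote $1" >&2
    fi
fi
```

Saved under an assumed name such as run_batch.sh, it could be run as `bash run_batch.sh all_sequences.fa small/*.fasta`. The output name deliberately does not match the input glob, so the batch file is never concatenated into itself. For sequences larger than 2 kb where accuracy matters more, running RepeatMasker on the individual files remains the alternative described above.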

 

Source: https://www.cnblogs.com/djx571/p/12340799.html