6. Option parameters
Common BLAST options
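| Option | Meaning |
| --- | --- |
| -h | Print a brief usage summary of the options |
| -help | Print the full help text |
| -db | Database to search against |
| -query | Query sequence file |
| -out | Output file name |
| -evalue | E-value threshold for reported hits |
| -outfmt | Output format |
| -task | Search task (e.g. megablast, blastn-short) |
| -num_threads | Number of threads to use |
| -subject | Subject sequence file for pairwise comparison (used instead of -db) |
| -remote | Run the search remotely on NCBI servers |
| -query_loc | Start and stop positions on the query sequence |
| -num_alignments | Number of database sequences to show alignments for |
| -strand | Query strand(s) to search (both, plus, or minus) |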
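
As a rough illustration of how these options combine, the sketch below assembles a `blastn` command line in Python without running it. The file names `query.fa` and `result.tsv`, the database name `my_db`, and the helper `build_blast_command` are hypothetical placeholders, not part of BLAST.

```python
import shlex

def build_blast_command(program, query, db, out,
                        evalue=1e-5, outfmt="6", num_threads=4):
    """Assemble a BLAST+ argument list from the common options in the table above."""
    return [
        program,
        "-query", query,
        "-db", db,
        "-out", out,
        "-evalue", str(evalue),
        "-outfmt", outfmt,
        "-num_threads", str(num_threads),
    ]

# All names below are placeholders for illustration only.
cmd = build_blast_command("blastn", "query.fa", "my_db", "result.tsv")
print(shlex.join(cmd))
```

If BLAST+ is installed and the files exist, the same list could be handed to `subprocess.run(cmd, check=True)`; keeping the arguments as a list avoids shell quoting problems with values such as the `-outfmt` string.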
  
7 Output formats
7.1 BLAST output formats
- 0 = Pairwise
- 1 = Query-anchored showing identities
- 2 = Query-anchored no identities
- 3 = Flat query-anchored showing identities
- 4 = Flat query-anchored no identities
- 5 = BLAST XML
- 6 = Tabular
- 7 = Tabular with comment lines
- 8 = Seqalign (Text ASN.1)
- 9 = Seqalign (Binary ASN.1)
- 10 = Comma-separated values
- 11 = BLAST archive (ASN.1)
- 12 = Seqalign (JSON)
- 13 = Multiple-file BLAST JSON
- 14 = Multiple-file BLAST XML2
- 15 = Single-file BLAST JSON
- 16 = Single-file BLAST XML2
- 17 = Sequence Alignment/Map (SAM)
- 18 = Organism Report
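
Formats 6 and 7 carry the same tab-separated rows; format 7 additionally writes comment lines that begin with `#`. The sketch below is one way to read either format by skipping those comments. The helper `tabular_rows` and the sample text are made up for illustration and only stand in for real BLAST output.

```python
def tabular_rows(text):
    """Yield the data rows of -outfmt 6 or 7 output as lists of fields."""
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and format 7 comment lines
        yield line.split("\t")

# Hypothetical format 7 output, shortened for illustration.
sample = (
    "# BLASTN 2.x\n"
    "# Query: q1\n"
    "# 1 hits found\n"
    "q1\ts1\t98.5\t200\t3\t0\t1\t200\t11\t210\t1e-100\t360\n"
)
rows = list(tabular_rows(sample))
print(len(rows), rows[0][:3])
```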
 
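7.2 Custom formats
Among the BLAST output formats (numbered 0 to 18), formats 6, 7, and 10 are delimited tables whose columns can be customized with a space-separated list of keywords. Format 17 (SAM) can also be configured, but it takes its own specifiers.
The syntax is: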
- -outfmt "6 qseqid sseqid pident"
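
To show how the keyword list maps onto output columns, the sketch below builds the `-outfmt` value from a list of keywords and reuses the same list to label each row. The helpers `make_outfmt` and `parse_custom` and the sample line are hypothetical, and the parser assumes tab-separated output (format 6, or the data rows of format 7), not the comma-separated format 10.

```python
def make_outfmt(fields, fmt_code=6):
    """Return the value for -outfmt, e.g. '6 qseqid sseqid pident'."""
    return f"{fmt_code} " + " ".join(fields)

def parse_custom(line, fields):
    """Label one tab-separated output row with the keywords that produced it."""
    return dict(zip(fields, line.rstrip("\n").split("\t")))

fields = ["qseqid", "sseqid", "pident"]
print(make_outfmt(fields))            # 6 qseqid sseqid pident

# Made-up output row for illustration.
record = parse_custom("q1\ts1\t98.5\n", fields)
print(record["pident"])               # 98.5
```

Keeping the column names in one list means the command and the parser cannot drift apart when a keyword is added or removed.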
 
The available output keywords are:
- qseqid means Query Seq-id
- qgi means Query GI
- qacc means Query accession
- qaccver means Query accession.version
- qlen means Query sequence length
- sseqid means Subject Seq-id
- sallseqid means All subject Seq-id(s), separated by a ';'
- sgi means Subject GI
- sallgi means All subject GIs
- sacc means Subject accession
- saccver means Subject accession.version
- sallacc means All subject accessions
- slen means Subject sequence length
- qstart means Start of alignment in query
- qend means End of alignment in query
- sstart means Start of alignment in subject
- send means End of alignment in subject
- qseq means Aligned part of query sequence
- sseq means Aligned part of subject sequence
- evalue means Expect value
- bitscore means Bit score
- score means Raw score
- length means Alignment length
- pident means Percentage of identical matches
- nident means Number of identical matches
- mismatch means Number of mismatches
- positive means Number of positive-scoring matches
- gapopen means Number of gap openings
- gaps means Total number of gaps
- ppos means Percentage of positive-scoring matches
- frames means Query and subject frames separated by a '/'
- qframe means Query frame
- sframe means Subject frame
- btop means Blast traceback operations (BTOP)
- staxid means Subject Taxonomy ID
- ssciname means Subject Scientific Name
- scomname means Subject Common Name
- sblastname means Subject Blast Name
- sskingdom means Subject Super Kingdom
- staxids means unique Subject Taxonomy ID(s), separated by a ';' (in numerical order)
- sscinames means unique Subject Scientific Name(s), separated by a ';'
- scomnames means unique Subject Common Name(s), separated by a ';'
- sblastnames means unique Subject Blast Name(s), separated by a ';' (in alphabetical order)
- sskingdoms means unique Subject Super Kingdom(s), separated by a ';' (in alphabetical order)
- stitle means Subject Title
- salltitles means All Subject Title(s), separated by a '<>'
- sstrand means Subject Strand
- qcovs means Query Coverage Per Subject
- qcovhsp means Query Coverage Per HSP
- qcovus means Query Coverage Per Unique Subject (blastn only)
 
When not provided, the default value is 'qaccver saccver pident length mismatch gapopen qstart qend sstart send evalue bitscore', which is equivalent to the keyword 'std'.
 
The supported format specifiers for option 17 are:
- SQ means Include Sequence Data
- SR means Subject as Reference Seq
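
The default 'std' column order can be turned into typed records with a few lines of Python. The following is a minimal sketch, assuming default `-outfmt 6` output; the function `read_std`, the two sample rows, and the 1e-5 cutoff are illustrative choices rather than anything BLAST prescribes.

```python
import csv
import io

STD_FIELDS = ["qaccver", "saccver", "pident", "length", "mismatch", "gapopen",
              "qstart", "qend", "sstart", "send", "evalue", "bitscore"]
INT_FIELDS = {"length", "mismatch", "gapopen", "qstart", "qend", "sstart", "send"}
FLOAT_FIELDS = {"pident", "evalue", "bitscore"}

def read_std(handle):
    """Parse default -outfmt 6 rows into dicts with numeric columns converted."""
    for row in csv.reader(handle, delimiter="\t"):
        if not row:
            continue  # ignore empty lines
        rec = dict(zip(STD_FIELDS, row))
        for key in INT_FIELDS:
            rec[key] = int(rec[key])
        for key in FLOAT_FIELDS:
            rec[key] = float(rec[key])
        yield rec

# Made-up rows standing in for a real result file.
sample = io.StringIO(
    "q1\ts1\t98.5\t200\t3\t0\t1\t200\t11\t210\t1e-100\t360\n"
    "q1\ts2\t80.0\t50\t10\t0\t5\t54\t1\t50\t0.5\t30.1\n"
)
# Keep only hits at or below an arbitrary example e-value cutoff.
hits = [r for r in read_std(sample) if r["evalue"] <= 1e-5]
print([(h["qaccver"], h["saccver"], h["pident"]) for h in hits])
```

For a real run, `sample` would be replaced by an open file handle on the file given to `-out`.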
 