TY - JOUR
T1 - AMAP
T2 - A pipeline for whole-genome mutation detection in Arabidopsis thaliana
AU - Ishii, Kotaro
AU - Kazama, Yusuke
AU - Hirano, Tomonari
AU - Hamada, Michiaki
AU - Ono, Yukiteru
AU - Yamada, Mieko
AU - Abe, Tomoko
N1 - Funding Information:
This research was supported by the Council for Science, Technology and Innovation (CSTI), Cross-ministerial Strategic Innovation Promotion Program (SIP), “Technologies for creating next-generation agriculture, forestry and fisheries” (funding agency: Bio-oriented Technology Research Advancement Institution, NARO); by the Japan Society for the Promotion of Science (JSPS) through the ‘Funding Program for Next Generation World-Leading Researchers (NEXT Program)’ to T. A. (GR096) and through a Grant-in-Aid for Scientific Research (B) (Y. K., No. 25292009); by the Ministry of Education, Culture, Sports, Science and Technology (MEXT) through KAKENHI (T. A., No. 221S0002); and by the RIKEN Biomass Engineering Program.
Publisher Copyright:
© 2016, Genetics Society of Japan. All rights reserved.
PY - 2016
Y1 - 2016
N2 - Detection of mutations at the whole-genome level is now possible by the use of high-throughput sequencing. However, determining mutations is a time-consum-ing process due to the number of false positives provided by mutation-detecting programs. AMAP (automated mutation analysis pipeline) was developed to overcome this issue. AMAP integrates a set of well-validated programs for mapping (BWA), removal of potential PCR duplicates (Picard), realignment (GATK) and detection of mutations (SAMtools, GATK, Pindel, BreakDancer and CNVnator). Thus, all types of mutations such as base substitution, deletion, insertion, translocation and chromosomal rearrangement can be detected by AMAP. In addition, AMAP automatically distinguishes false positives by comparing lists of candidate mutations in sequenced mutants. We tested AMAP by inputting already analyzed read data derived from three individual Arabidopsis thaliana mutants and confirmed that all true mutations were included in the list of candidate mutations. The result showed that the number of false positives was reduced to 12% of that obtained in a previous analysis that lacked a process of reducing false positives. Thus, AMAP will accelerate not only the analysis of mutation induction by individual mutagens but also the process of forward genetics.
AB - Detection of mutations at the whole-genome level is now possible by the use of high-throughput sequencing. However, determining mutations is a time-consum-ing process due to the number of false positives provided by mutation-detecting programs. AMAP (automated mutation analysis pipeline) was developed to overcome this issue. AMAP integrates a set of well-validated programs for mapping (BWA), removal of potential PCR duplicates (Picard), realignment (GATK) and detection of mutations (SAMtools, GATK, Pindel, BreakDancer and CNVnator). Thus, all types of mutations such as base substitution, deletion, insertion, translocation and chromosomal rearrangement can be detected by AMAP. In addition, AMAP automatically distinguishes false positives by comparing lists of candidate mutations in sequenced mutants. We tested AMAP by inputting already analyzed read data derived from three individual Arabidopsis thaliana mutants and confirmed that all true mutations were included in the list of candidate mutations. The result showed that the number of false positives was reduced to 12% of that obtained in a previous analysis that lacked a process of reducing false positives. Thus, AMAP will accelerate not only the analysis of mutation induction by individual mutagens but also the process of forward genetics.
KW - Arabidopsis thaliana
KW - Heavy-ion beam
KW - Mutation detection
KW - Pipeline
KW - Whole-genome re-sequencing
UR - http://www.scopus.com/inward/record.url?scp=85015656518&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85015656518&partnerID=8YFLogxK
U2 - 10.1266/ggs.15-00078
DO - 10.1266/ggs.15-00078
M3 - Article
C2 - 27452041
AN - SCOPUS:85015656518
SN - 1341-7568
VL - 91
SP - 229
EP - 233
JO - Genes and Genetic Systems
JF - Genes and Genetic Systems
IS - 4
ER -