عنوان
|
New Approximate Statistical Significance of Gapped Alignments Based on the Greedy Extension Model
|
نوع پژوهش
|
مقاله چاپ شده
|
کلیدواژهها
|
biological sequenceslocal,alignmentlocal score,Poisson clumping,sequence comparison,statistical significance
|
چکیده
|
Sequence alignment is a fundamental concept in bioinformatics to distinguish regions of similarity among various sequences. The degree of similarity has been considered as a score. There are a number of various methods to find the statistical significance of similarity in the gapped and ungapped cases. In this article, we improve the statistical significance accuracy of the local score by introducing a new approximate p-value. This is developed according to Poisson clumping and the exact distribution of a partial sum of random variables. The efficiency of the proposed method is compared with that of previous methods on real and simulated data. The results yield a remarkable improvement in accuracy of the p-value in the gapped case. This is an evidence for the method to be considered as a prospective candidate for sequences comparison.
|
پژوهشگران
|
لوییس فره (نفر چهارم)، سابین مرسیه (نفر سوم)، افشین فیاض موقر (نفر دوم)، امیر حسین کرمی (نفر اول)
|