Abstract
We model DNA count data as a multiple change point problem. This means that the data are divided into multiple segments based on the unknown number of change points. Each segment of the process is modeled by using a zero modified count data distribution. We observe that zeroinflated negative binomial (ZINB) model fits the data better than the competing negative binomial (NB) model. We propose an extension to the Cross-Entropy (CE) method that utilizes a beta distribution to simulate the locations of change points. Furthermore, parallel implementation of the extended CE method results a significant improvement in total processing time, in which the procedures are computationally highly intensive. We consider an artificially generated count data sequence to assess the performance of the propose method. Finally, a real DNA count data set is used to illustrate the usefulness of the proposed methodology.
Original language | English |
---|---|
Title of host publication | Proceedings of the 4th International Conference on Computational Methods |
Subtitle of host publication | ICCM 2012 |
Editors | YuanTong Gu, Suvash C. Saha |
Place of Publication | Brisbane |
Publisher | Queensland University of Technology |
Pages | 1-8 |
Number of pages | 8 |
ISBN (Print) | 9781921897542 |
Publication status | Published - 2012 |
Event | International Conference on Computational Methods (4th : 2012) - Gold Coast Duration: 25 Nov 2012 → 28 Nov 2012 |
Conference
Conference | International Conference on Computational Methods (4th : 2012) |
---|---|
City | Gold Coast |
Period | 25/11/12 → 28/11/12 |
Keywords
- Cross-Entropy method
- change-point problem
- combinatorial optimization
- zero-inflated negative binomial
- DNA count data