Abstract
Modern biological research increasingly recognises the importance of genome-wide gene regulatory network inference; however, a range of statistical, technological and biological factors make it a difficult and intractable problem. One approach that some research has used is to cluster the data and then infer a structural model of the clusters. When using this kind of approach it is very important to choose the clustering algorithm carefully. In this paper we explicitly analyse the attributes that make a clustering algorithm appropriate, and we also consider how to measure the quality of the identified clusters. Our analysis leads us to develop three novel cluster quality measures that are based on regulatory overlap. Using these measures we evaluate two modern candidate algorithms: FLAME, and KMART. Although FLAME was specifically developed for clustering gene expression profile data, we find that KMART is probably a better algorithm to use if the goal is to infer a structural model of the clusters.
Original language | English |
---|---|
Title of host publication | Research and Development in Intelligent Systems XXVI: Incorporating Applications and Innovations in Intelligent Systems XVII |
Publisher | Springer |
Pages | 137-149 |
Number of pages | 13 |
ISBN (Print) | 9781848829824 |
DOIs | |
Publication status | Published - 2010 |
Externally published | Yes |
Event | 29th SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, AI 2009 - Cambridge, United Kingdom Duration: 15 Dec 2009 → 17 Dec 2009 |
Conference
Conference | 29th SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, AI 2009 |
---|---|
Country/Territory | United Kingdom |
City | Cambridge |
Period | 15/12/09 → 17/12/09 |
ASJC Scopus subject areas
- Artificial Intelligence
- Information Systems