Robust consensus clustering for identification of expressed genes linked to malignancy of human colorectal carcinoma.
Journal Title: Bioinformation - Year 2011, Vol 6, Issue 7
Abstract
Previous studies have been conducted in gene expression profiling to identify groups of genes that characterize the colorectal carcinoma disease. Despite the success of previous attempts to identify groups of genes in the progression of the colorectal carcinoma disease, their methods either require subjective interpretation of the number of clusters, or lack stability during different runs of the algorithms. All of which limits the usefulness of these methods. In this study, we propose an enhanced algorithm that provides stability and robustness in identifying differentially expressed genes in an expression profile analysis. Our proposed algorithm uses multiple clustering algorithms under the consensus clustering framework. The results of the experiment show that the robustness of our method provides a consistent structure of clusters, similar to the structure found in the previous study. Furthermore, our algorithm outperforms any single clustering algorithms in terms of the cluster quality score.
Authors and Affiliations
Gatot Wahyudi, Ito Wasito, Tisha Melia, Indra Budi
Molecular modelling of the TSR domain of R-spondin 4.
R-spondin 4 is a secreted protein mainly associated with embryonic nail development. R-spondins have been recently identified as heparin-binding proteins with high affinity. Proteoglycan binding has been associated with...
sRNATarget: a web server for prediction of bacterial sRNA targets.
In bacteria, there exist some small non-coding RNAs (sRNAs) with 40-500 nucleotides in length. Most of them function as posttranscriptional regulation of gene expression through binding to their target mRNAs, in which Hf...
A database for allergenic proteins and tools for allergenicity prediction.
The AllergenPro database has developed a web-based system that will provide information about allergen in microbes, animals and plants. The database has three major parts and functions:(i) database list; (ii) allergen se...
Identification of Comamonas species using 16S rRNA gene sequence.
A bacterial strain Bz02 was isolated from a water sample collected from river Gomti at the Indian city of Lucknow. We characterized the strain using 16S rRNA sequence. Phylogenetic analysis showed that the strain formed...
Structural prediction and analysis of VIH-related peptides from selected crustacean species
The tentative elucidation of the 3D-structure of vitellogenesis inhibiting hormone (VIH) peptides is conversely underprivileged by difficulties in gaining enough peptide or protein, diffracting crystals, and numerous ext...