Large Scale Advanced Data Analytics on Skin Conditions from Genotype to Phenotype
Journal Title: Informatics - Year 2018, Vol 5, Issue 4
Abstract
A crucial factor in Big Data is to take advantage of available data and use that for new discovery or hypothesis generation. In this study, we analyzed Large-scale data from the literature to OMICS, such as the genome, proteome or metabolome, respectively, for skin conditions. Skin acts as a natural barrier to the world around us and protects our body from different conditions, viruses, and bacteria, and plays a big part in appearance. We have included Hyperpigmentation, Postinflammatory Hyperpigmentation, Melasma, Rosacea, Actinic keratosis, and Pigmentation in this study. These conditions have been selected based on reasoning of big scale UCSF patient data of 527,273 females from 2011 to 2017, and related publications from 2000 to 2017 regarding skin conditions. The selected conditions have been confirmed with experts in the field from different research centers and hospitals. We proposed a novel framework for large-scale available public data to find the common genotypes and phenotypes of different skin conditions. The outcome of this study based on Advance Data Analytics provides information on skin conditions and their treatments to the research community and introduces new hypotheses for possible genotype and phenotype targets. The novelty of this work is a meta-analysis of different features on different skin conditions. Instead of looking at individual conditions with one or two features, which is how most of the previous works are conducted, we looked at several conditions with different features to find the common factors between them. Our hypothesis is that by finding the overlap in genotype and phenotype between different skin conditions, we can suggest using a drug that is recommended in one condition, for treatment in the other condition which has similar genes or other common phenotypes. We identified common genes between these skin conditions and were able to find common areas for targeting between conditions, such as common drugs. Our work has implications for discovery and new hypotheses to improve health quality, and is geared towards making Big Data useful.
Authors and Affiliations
Maryam Panahiazar, Darya Fadavi, Jihad Aljabban, Laraib Safeer, Imad Aljabban and Dexter Hadley
Choosing a Model for eConsult Specialist Remuneration: Factors to Consider
Electronic consultation (eConsult) is an innovative solution that allows specialists and primary care providers to communicate electronically, improving access to specialist care. Understanding the cost implications of...
How The Arts Can Help Tangible Interaction Design: A Critical Re-Orientation
There is a long history of creative encounters between tangible interface design and the Arts. However, in comparison with media art, tangible interaction seems to be quite anchored into many of the traditional methodolo...
Analyzing Spatiotemporal Anomalies through Interactive Visualization
As we move into the big data era, data grows not just in size, but also in complexity, containing a rich set of attributes, including location and time information, such as data from mobile devices (e.g., smart phones),...
On Ensemble SSL Algorithms for Credit Scoring Problem
Credit scoring is generally recognized as one of the most significant operational research techniques used in banking and finance, aiming to identify whether a credit consumer belongs to either a legitimate or a suspic...
Visual Analysis of Relationships between Heterogeneous Networks and Texts: An Application on the IEEE VIS Publication Dataset
The visual exploration of large and complex network structures remains a challenge for many application fields. Moreover, a growing number of real-world networks is multivariate and often interconnected with each other...