Literature Survey on Outlier Detection Techniques For Imperfect Data Labels

Journal Title: International Journal of Science and Research (IJSR) - Year 2015, Vol 4, Issue 1

Abstract

A dataset may contain objects that do not comply with the general behaviour or model of the data. These data objects are outliers. Outlier detection has attracted increasing attention in the machine learning, data mining, and statistics literature. A well-known definition of "outlier" is "an observation which deviates so much from other observations as to arouse suspicions that it was generated by a different mechanism," which gives the general idea of an outlier and motivates many anomaly detection methods. Common general techniques for data classification include both unsupervised and supervised pattern classification methods. Some common approaches use clustering instead of simple feature selection, linear discriminant methods, neural networks, and support vector machines. Feature selection forms an important subset within the much larger area of data classification. Correctly identifying the relevant features in the data is of vital importance to the task of text classification. Our objective is to actively select instances with higher probabilities of being informative in determining feature relevance, so as to improve the performance of feature selection without increasing the number of sampled instances. Active sampling, as used in active feature selection, chooses instances in two steps: first, it partitions the data according to some homogeneity criterion; second, it randomly selects instances from these partitions.
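
The two-step active sampling described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the method of any surveyed paper: the function name `active_sample`, the `partition_key` callable standing in for the homogeneity criterion, and the `per_partition` quota are all hypothetical names introduced for this example.

```python
import random
from collections import defaultdict


def active_sample(instances, partition_key, per_partition, seed=0):
    """Two-step sampling: partition the data, then draw at random from each partition.

    `partition_key` is an assumed stand-in for the homogeneity criterion:
    instances that map to the same key are treated as one homogeneous group.
    """
    rng = random.Random(seed)

    # Step 1: partition the instances according to the homogeneity criterion.
    partitions = defaultdict(list)
    for inst in instances:
        partitions[partition_key(inst)].append(inst)

    # Step 2: randomly select up to `per_partition` instances from each partition.
    sample = []
    for key in sorted(partitions):
        members = partitions[key]
        k = min(per_partition, len(members))
        sample.extend(rng.sample(members, k))
    return sample


# Toy data: (feature value, group label); the label acts as the criterion here.
data = [(0.1, "a"), (0.2, "a"), (0.9, "b"), (1.1, "b"), (5.0, "c")]
picked = active_sample(data, partition_key=lambda inst: inst[1], per_partition=1)
```

With one draw per partition, every group contributes an instance to the sample, including the single-member group "c", which plain uniform sampling over the whole dataset could easily miss.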

How To Cite

(2015). Literature Survey on Outlier Detection Techniques For Imperfect Data Labels. International Journal of Science and Research (IJSR), 4(1), -. https://europub.co.uk./articles/-A-341745