MCIP: Mining Crop Image Data On pysparkdataframe Using Feature Selection and Cluster Based Techniques

Journal Title: International Journal of Experimental Research and Review - Year 2023, Vol 34, Issue 5

Abstract

Crop-related problems such as pests and diseases in India lead to yearly losses exceeding $500 billion. Leaf blight is identified as the principal factor responsible for the substantial financial losses amounting to $500 billion. Farmers engaged in the cultivation of forage and grain sorghum experience the greatest degree of hardship. This disease has a significant impact on various crops, including maize, rice, tomato, potato, millet, and onion. The timely detection and evaluation of disease in plants can contribute to mitigating the extent of associated losses. However, the task presents difficulties as a result of variations in crop species, varieteis of crop diseases, and environmental factors. The current methodologies lack generalizability in their ability to classify and predict diseases. All of the techniques employed in this study are applied to a dataset with predetermined input values and corresponding output values. The current methodologies involve preprocessing the images and performing segmentation for extracting the appropriate characteristics. The process of segmentation necessitates the implementation of pre-processing techniques, such as dilation and edge detection. As a consequence, the loss of crucial information occurs, which subsequently leads to inaccurate classification. Furthermore, the methodologies employed thus far have not been designed to evaluate the performance of the algorithm on specialised or specific datasets. Deep learning methodologies are susceptible to the issue of overfitting. This paper proposed an approach for extracting and analysing crop image data using the PySpark (MCIP) data frame. The MCIP framework employs Principal Component Analysis (PCA) as a method for selecting pertinent features. The PCA features that have been gathered are subsequently employed to identify homogeneous subgroups through the utilisation of the K-means algorithm. The utilisation of a categorised predictive output facilitates the identification and detection of diseases present in potato leaves. The utilisation of the Multispectral Crop Imaging Platform (MCIP) extends beyond the examination of potatoes exclusively, as it possesses the capability to identify diseases present in the foliage of various agricultural crops. In order to validate our assertion, we conducted an experiment utilising the MCIP algorithm on a dataset pertaining to rice diseases. In order to assess the robustness of MCIP, we conducted an evaluation of its Accuracy, Silhouette score, speed, and F1 score. The MCIP model demonstrated high performance in terms of both speed and accuracy compared to existed approaches. The level of accuracy is remarkably near 100 percent.

Authors and Affiliations

yashi chaudhary, Heman Pathak

Keywords

Related Articles

Raw fruit juice processing wastewater treatment using electrochemical coagulation followed by synthesis of CuO Nano sorbents using leaf extract

Stainless steel and aluminium electrode was utilized inside batch electro chemicals coagulations (BECCs) using current densities (CD) for the treatment of fruit juice processing wastewater (FJPW). During ECC, ~65-70% col...

Nutritional status and haemoglobin level among adult Bengalee women in a sub-urban area in West Bengal

Nutritional status measured by anthropometry has been a reliable indicator of individual as well as population health. It is associated with morbidities, reduced activity and fitness, impaired cognitive development and a...

A Hybrid Framework for Plant Leaf Region Segmentation: Comparative Analysis of Swarm Intelligence with Convolutional Neural Networks

Agriculture is important for the survival of humanity since about 70% of the world's population is engaged in agricultural pursuits to varying degrees. The previous and present methodology lacks ways to identify diseases...

A Novel Framework for Multilingual Script Detection and Pattern Analysis in Mixed Script Queries

A script detection system that is capable of handling several languages is becoming more necessary in today's world. The task of identifying scripts written in various languages has been substantially facilitated by the...

Enhancing Software Maintainability Prediction Using Multiple Linear Regression and Predictor Importance

Accurate maintenance effort and cost estimation are essential for effective software development. By identifying software modules with poor maintainability, Software Maintainability Prediction (SMP) plays a crucial role...

Download PDF file
  • EP ID EP722925
  • DOI https://doi.org/10.52756/ijerr.2023.v34spl.011
  • Views 66
  • Downloads 0

How To Cite

yashi chaudhary, Heman Pathak (2023). MCIP: Mining Crop Image Data On pysparkdataframe Using Feature Selection and Cluster Based Techniques. International Journal of Experimental Research and Review, 34(5), -. https://europub.co.uk./articles/-A-722925