Large Scale Cross-media Data Retrieval based on Hadoop

Journal Title: EAI Endorsed Transactions on Cloud Systems - Year 2015, Vol 1, Issue 2

Abstract

With the rapid development of the Internet and speedy increase of the data size, there are more and more data intensive applications which often involve hundreds of megabytes of data. It is important and necessary to obtain the retrieval results from cross-media data quickly and accurately. Large scale cross-media data retrieval based on Hadoop is proposed to speed up the retrieval in this paper. We divide cross-media feature extraction and cross-media retrieval into paralleled pipeline, and implement with the combination of the HDFS, HBase and MapReduce framework. To verify the performance of the proposed method, comparisons with stand-alone mode on different sizes of the image dataset are conducted, and the experimental results demonstrate the good performances of proposed method, which sharply decreases time-consuming, and meanwhile keeps the same query precision.

Authors and Affiliations

Wenchen Cheng, Jiang Qian, Zhicheng Zhao, Fei Su

Keywords

Related Articles

A Counterfeit Solution for Pharma Supply Chain

This paper provides a detailed overview of the blockchain technology and how it can be used to build a foolproof system in eliminating counterfeit products in pharmaceutical industries. Study by various reports indicate...

QoE Aware Resource Allocation for Video Communications over LTE Based Mobile Networks

As the limits of video compression and usable wireless radio resources are exhausted, providing increased protection to critical data is regarded as a way forward to increase the effective capacity for delivering video d...

Future Factories – Automated Welding Cell based on Cloud Computing Technology

The advent of cloud technology, machine learning and internet of things (IoT) has foreseen the possibility of completely autonomous factories. Future shop-floor operations are completely automated and controlled by cloud...

Defining an Elasticity Metric for Cloud Computing Environments

Elasticity is a key property of cloud computing environments and one of the features which distinguishes this paradigm from other ones. An elasticity metric could be used to define and to monitor Service Level Agreements...

Monitoring as-a-service to drive more efficient future system design

In the services world, the expected benefits are the fastest time to market, lower costs, greater consistency in the application, and increased agility. The reuse and sharing properties of software components are useful...

Download PDF file
  • EP ID EP45558
  • DOI http://dx.doi.org/10.4108/eai.19-8-2015.2260108
  • Views 406
  • Downloads 0

How To Cite

Wenchen Cheng, Jiang Qian, Zhicheng Zhao, Fei Su (2015). Large Scale Cross-media Data Retrieval based on Hadoop. EAI Endorsed Transactions on Cloud Systems, 1(2), -. https://europub.co.uk./articles/-A-45558