Designing an Efficient Deduplication Algorithm for Audio Files in Cloud Storage

Authors

  • Prof. Dr. Ammar Zakzouk Al-Wataniya Private University Author
  • Eng. Hasan Hasan Homs University Author

Keywords:

Deduplication, Hash Table, MD6, Audio Files, Cloud Storage

Abstract

Duplicate data poses a significant challenge in big data storage systems as it consumes storage space, affecting data organization, management, and processing. To solvethis problem, hashalgorithms are used to generate hashkeys for files. However, as theamount of data stored in the cloud increases, the search and matching process takes longer. Additionally, hashkeys can match different files, known as collisions, which are related to the length of the hashkey. The longer the key, the less likely collisions will occur.In this paper, we present a technique for eliminating duplicate data at the file level to reduce storage of duplicate audio data in the cloud storage system. The proposed technique aims to reduce the search time for hashvalues by creatinga reduction table with multiple indexes. These indexes are designed based on the audio file format. Therefore, the hashtable includes multiple indexes, each for a specific format. To minimize the probabilityof collisions, MD6 algorithm is used, which produces a key with a length of 512 bits.

Downloads

Download data is not yet available.

Downloads

Published

2023-12-03

Issue

Section

Research Articles – Volume 1, Number 1

Categories

How to Cite

Designing an Efficient Deduplication Algorithm for Audio Files in Cloud Storage. (2023). Journal of Al-Wataniya Private University , 1(1), 184-201. https://wpu.edu.sy/wpuj/index.php/wpuh/article/view/19

Similar Articles

You may also start an advanced similarity search for this article.