A pattern-driven Huffman encoding and positional encoding for DNA compression
Downloads
Published
Keywords:
Compression Ratio, Deoxyribonucleic Acid, Huffman Coding, Positional Encoding TechniqueDimensions Badge
Issue
Section
License
Copyright (c) 2025 The Scientific Temper

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Researchers from bioinformatics, biology, biotechnology, and medical sciences who are engaged in genetic data analysis face significant challenges in the manipulation and storage of large datasets. Compression algorithms are essential for increasing storage capacity and reducing the number of bits required to represent nucleotide bases. The Pattern-driven Huffman Encoding and Positional Encoding for DNA Compression (P2DNAComp) algorithm is designed to compress both non-repetitive and repetitive pattern bases within DNA sequences. This demonstrates the algorithm’s adaptability across various pattern types in genomic data. P2DNAComp employs a systematic approach to efficiently compress DNA sequences. It reads the sequences and constructs a symbol table to maintain the positional values of repeated patterns. Using Huffman coding, the algorithm determines the optimal bit representation for each repeated pattern to maximize storage efficiency. For non-repetitive patterns, a coded table is created to store positional values. Subsequently, a positional encoding technique is applied to minimize the number of bits needed for efficient representation. The maximum positional value is set as the upper limit, and the minimum number of bits required is computed using a binary logarithm function. The final compressed sequence is generated by encoding both repetitive and non-repetitive patterns. Using standard datasets from the GenBank database, the performance of the P2DNAComp algorithm was evaluated based on compression ratio, compression/decompression time, and compression gain. The algorithm achieved an average compression ratio of 1.09 bits per base (bpb), an average compression gain of 86.279%, and average compression and decompression times of 0.547 and 0.563 seconds, respectively.Abstract
How to Cite
Downloads
Similar Articles
- V. Babydeepa, K. Sindhu, Piecewise adaptive weighted smoothing-based multivariate rosenthal correlative target projection for lung and uterus cancer prediction with big data , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Belgundkar Babita, Kharde Sangeeta, Dodamani Suneel, Socio-demographic and reproductive determinants of spontaneous abortion- A cross-sectional comparative research at a tertiary care hospital in North Karnataka, India , The Scientific Temper: Vol. 15 No. 01 (2024): The Scientific Temper
- Mahima Srivastava, Chemical facets of environment-friendly corrosion impediment of low-carbon steel in aqueous solutions of inorganic mineral acid , The Scientific Temper: Vol. 14 No. 02 (2023): The Scientific Temper
- Sampa Mondal, Baibaswata Bhattacharjee, Tweaking of the morphological pattern in copper sulphide nanoparticles: How does it affect the optical properties? , The Scientific Temper: Vol. 15 No. 03 (2024): The Scientific Temper
- Ellakkiya Mathanraj, Ravi N. Reddy, Enhanced principal component gradient round-robin load balancing in cloud computing , The Scientific Temper: Vol. 15 No. 01 (2024): The Scientific Temper
- V.K. Pandey, R.N. Mishra, Shipra Upadhyaya, Anand Swaroop, TOXICITY OF PAPER MILL EFFLUENTS EFFECTS LIVER PROTEIN AND AMINO ACID DURING ANNUAL BREEDING CYCLE OF HETEROPNEUSTES FOSSILIS (BLOCH) , The Scientific Temper: Vol. 1 No. 01 (2010): The Scientific Temper
- Kiruthiga R., Bharathidasan R., Thiruneelakandan G., Molecular docking insights into the anticancer potential of bioactive compounds from Streptomyces coelicolor KR23 through regulation of apoptotic proteins , The Scientific Temper: Vol. 16 No. 01 (2025): The Scientific Temper
- Naveen Kumar, Sunder S. Arya, Mamta Sawariya, Ajay Kumar, Neha Yadav, Jyoti Sharma, Himanshu Mehra, Unraveling the effect of salicylic acid on Vigna radiata L. under PEG- induced drought stress , The Scientific Temper: Vol. 14 No. 04 (2023): The Scientific Temper
- B.V.Thacker, G.P. Vadodaria, G.V. Priyadarshi, M.H. Trivedi, Biopolymer-based fly ash-activated zeolite for the removal of chromium from acid mine drainage , The Scientific Temper: Vol. 14 No. 04 (2023): The Scientific Temper
- Atal Bihari Bajpai, Pragati Misra, Manjul Diman, Indra Rautela, Rajesh Rayal, Kamlesh Jeena, Manish Dev Sharma, Study on the Chemical Composition and Antioxidant Activity of Extracts from Wild and in vitro Raised Endangered Medicinal Plant Ephedra gerardiana , The Scientific Temper: Vol. 12 No. 1&2 (2021): The Scientific Temper
You may also start an advanced similarity search for this article.

