A NOVEL APPROACH FOR COMPRESSING DNA SEQUENCES USING SEMI-STATISTICAL COMPRESSOR

Ashutosh Gupta and Suneeta Agarwal

Keywords

DNA sequences, DNA compression, word-based tagged code

Abstract

In this paper, we present an algorithm for DNA sequence compression that uses a replacement method. The replacement method introduces words and a word-based compression scheme is used for encoding. The encoder uses ranks to assign the code of words. The developed statistical compression algorithm is competent and useful for DNA chain compression. We have experimentally showed that the designed algorithm is better than existing compressors on typical DNA sequence datasets.

Important Links:



Go Back