Lossless Text Compression Technique Using Syllable Based Morphology

No Thumbnail Available

Date

2011

Journal Title

Journal ISSN

Volume Title

Publisher

Zarka Private Univ

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Journal Issue

Abstract

In this paper, we present a new lossless text compression technique which utilizes syllable-based morphology of multi-syllabic languages. The proposed algorithm is designed to partition words into its syllables and then to produce their shorter bit representations for compression. The method has six main components namely source file, filtering unit, syllable unit, compression unit, dictionary file and target file. The number of bits in coding syllables depends on the number of entries in the dictionary file. The proposed algorithm is implemented and tested using 20 different texts of different lengths collected from different fields. The results indicated a compression of up to 43%.

Description

Bayindir, Hakan/0000-0003-4911-9056; Misra, Sanjay/0000-0002-3556-9331

Keywords

Algorithm, text compression technique, syllable, multi-syllabic languages

Turkish CoHE Thesis Center URL

Fields of Science

Citation

WoS Q

Q4

Scopus Q

Q2

Source

International Arab Journal of Information Technology

Volume

8

Issue

1

Start Page

66

End Page

74

Collections

Google Scholar Logo
Google Scholar™

Sustainable Development Goals

3

GOOD HEALTH AND WELL-BEING
GOOD HEALTH AND WELL-BEING Logo

5

GENDER EQUALITY
GENDER EQUALITY Logo

6

CLEAN WATER AND SANITATION
CLEAN WATER AND SANITATION Logo

7

AFFORDABLE AND CLEAN ENERGY
AFFORDABLE AND CLEAN ENERGY Logo

9

INDUSTRY, INNOVATION AND INFRASTRUCTURE
INDUSTRY, INNOVATION AND INFRASTRUCTURE Logo

14

LIFE BELOW WATER
LIFE BELOW WATER Logo

16

PEACE, JUSTICE AND STRONG INSTITUTIONS
PEACE, JUSTICE AND STRONG INSTITUTIONS Logo