12
http://www.iaeme.com/IJCET/index.asp 73 [email protected] International Journal of Computer Engineering & Technology (IJCET) Volume 9, Issue 4, July-Aug 2018, pp. 73–84, Article IJCET_09_04_008 Available online at http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=9&IType=4 Journal Impact Factor (2016): 9.3590(Calculated by GISI) www.jifactor.com ISSN Print: 0976-6367 and ISSN Online: 0976–6375 © IAEME Publication CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE Dr. G.Gayathri Devi Department of Computer Science, SDNB Vaishnav College for Women, Chennai, TamilNadu, India G.Sathyanarayanan Senior Professional Project Management, DXC Technology, ABSTRACT Speech and text is the significant intermediate for human communication. The Braille encoding system represents textual documents in a readable format for the visually challenged person. This paper projects a model to assist visually impaired or blind person in reading the text present on southern Indian Language text image by converting the text into corresponding Braille script. The experimentation of the algorithms was carried out on southern Indian Language text image dataset. Experimental results show that the projected method has a good performance on converting the extracted text regions in an image to corresponding Braille Script. Key words: Dravidian Language, Southern Indian Language, Braille conversion, Braille System, Visually challenged people, Image Processing. Cite this Article: Dr. G.Gayathri Devi and G.Sathyanarayanan, Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually Challenged People. International Journal of Computer Engineering & Technology, 9(4), 2018, pp. 73–84. http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=9&IType=4 1. INTRODUCTION The Braille encoding system represents textual documents in a readable format for the visually challenged persons. As there is a shortage of Braille compatible reading materials, visually challenged people face trouble in necessities like education and employment. Reading text documents is difficult for visually challenged people in various circumstances. Visually impaired persons can read only by use of Braille script. The majority of printed works does not include Braille or speech versions. There is a need of a system for automatic recognition of text documents to Braille and speech to reduce communication gap between the written text systems

CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Embed Size (px)

Citation preview

Page 1: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

http://www.iaeme.com/IJCET/index.asp 73 [email protected]

International Journal of Computer Engineering & Technology (IJCET) Volume 9, Issue 4, July-Aug 2018, pp. 73–84, Article IJCET_09_04_008

Available online at http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=9&IType=4

Journal Impact Factor (2016): 9.3590(Calculated by GISI) www.jifactor.com

ISSN Print: 0976-6367 and ISSN Online: 0976–6375

© IAEME Publication

CONVERSION OF TEXT IMAGE DOCUMENT

IN SOUTHERN INDIAN LANGUAGES INTO

BRAILLE FOR VISUALLY CHALLENGED

PEOPLE

Dr. G.Gayathri Devi

Department of Computer Science,

SDNB Vaishnav College for Women, Chennai, TamilNadu, India

G.Sathyanarayanan

Senior Professional Project Management,

DXC Technology,

ABSTRACT

Speech and text is the significant intermediate for human communication. The

Braille encoding system represents textual documents in a readable format for the

visually challenged person. This paper projects a model to assist visually impaired or

blind person in reading the text present on southern Indian Language text image by

converting the text into corresponding Braille script. The experimentation of the

algorithms was carried out on southern Indian Language text image dataset.

Experimental results show that the projected method has a good performance on

converting the extracted text regions in an image to corresponding Braille Script.

Key words: Dravidian Language, Southern Indian Language, Braille conversion, Braille

System, Visually challenged people, Image Processing.

Cite this Article: Dr. G.Gayathri Devi and G.Sathyanarayanan, Conversion of Text

Image Document In Southern Indian Languages Into Braille For Visually Challenged

People. International Journal of Computer Engineering & Technology, 9(4), 2018, pp.

73–84.

http://www.iaeme.com/IJCET/issues.asp?JType=IJCET&VType=9&IType=4

1. INTRODUCTION

The Braille encoding system represents textual documents in a readable format for the visually

challenged persons. As there is a shortage of Braille compatible reading materials, visually

challenged people face trouble in necessities like education and employment. Reading text

documents is difficult for visually challenged people in various circumstances. Visually

impaired persons can read only by use of Braille script. The majority of printed works does not

include Braille or speech versions. There is a need of a system for automatic recognition of text

documents to Braille and speech to reduce communication gap between the written text systems

Page 2: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Dr. G.Gayathri Devi and G.Sathyanarayanan

http://www.iaeme.com/IJCET/index.asp 74 [email protected]

used by sighted persons and access mechanisms through which visually impaired people can

communicate

2. BRAILLE SYSTEM

Braille [1,2] is a tactile writing system used by visually challenged people. Braille is a system

of raised dots that can be read with the fingers by people who have low vision or blind. Braille

is named after Louis Braille, the French man who designed. Braille symbols are formed within

units of space known as Braille cells. A complete Braille cell comprises of six raised dots

arranged in two parallel rows each having three dots. The locations of dot are recognized by

numbers from one through six and it is shown in the figure 1. Sixty-four combinations (2^6)

are possible using one or more of these six dots. A single cell can be used to denote an alphabet

letter, number, punctuation mark, or even an entire word.

Fig 1 Braille Cell

2.1. Southern Indian Language

The Dravidian languages are a language spoken mainly in Tamil Nadu, Kerala, Telangana,

Andhra, Karnataka and as well as in Bangladesh , Bhutan , Mauritius, Sri Lanka, some parts

of Pakistan, Burma, southern Afghanistan, Nepal, Malaysia, Africa, Indonesia and Singapore.

The Dravidian languages with the most speakers are Tamil, Kannada , Telugu and Malayalam.

Vowels and Consonants of Indian Languages Indian languages are phonetic in nature. All

the languages have their own set of Vowels and Consonants. They have a good mapping from

one language to the other with respect to the features. It is observed that the fundamental vowels

and consonants for all the four languages are same besides some specialty with respect to the

consonants for each of the languages and moreover the absence of some of the consonants in

the typical Dravidian language Tamil. Consonants and vowels are combined to form Composite

characters.

The Tamil script has 12 vowels 18 consonants and one special character, the ayudha

ezhuthu. The complete script, therefore, consists of the 31 letters in their independent form and

an additional 216 composite letters, for a total of 247 combinations of a consonant and a vowel,

a mute consonant, or a vowel alone.

The Telugu script consists of 60 symbols – 16 vowels, 3 vowel modifiers, and 41

consonants. The Malayalam alphabet has 13 vowel letters, 36 consonant letters, and a few other

symbols. Kannada language uses 49 phonemic letters and it is divided into 3-groups,15 Vowels

, 34 Consonants and modifier glyphs (Half-letter) .There are 510 (34 * 15)composite

characters.

2.2. Southern Indian Language Braille System

Braille is used by many people all over the world in their mother tongue, and gives a means of

knowledge for all. Bharati Braille [4] or Indian Braille is a largely unified Braille script for

writing the Indian languages. Initially 11 Braille scripts were in usage in various areas of the

Page 3: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually

Challenged People

http://www.iaeme.com/IJCET/index.asp 75 [email protected]

India and for different languages. Bharati Braille had become a nationwide standard Braille

script and it has been followed by Bangladesh., Sri Lanka, Nepal. Across the different Indian

languages there are up to sixteen vowels and about forty consonants. Figure 2, Figure 3, Figure

4 and Figure 5 shows the Tamil, Telugu, Malayalam and Kannada Braille alphabets sheets

respectively. The alphabets of Indian languages are divided into vowels and consonants. Across

the languages, fifteen vowels and thirty three consonants are common (with the exception of

Tamil) and hence the basic assignment in Bharati Braille is same. Bharati Braille assigns an

individual cell to each vowel and a consonant.

Fig 2 Tamil Braille Alphabets Sheet Fig 3 Telugu Braille Alphabets Sheet

Fig 4 Malayalam Braille Alphabets Sheet Fig 5Kannada Braille Alphabets Sheet

3. PROPOSED METHODLOGY

Gayathri et.al [4] discussed research work projected earlier for recognition the Southern Indian

Braille script from a Braille document.

This research proposes a successful conversion system from southern Indian Languages to

Braille and it contains i) Image Pre-processing, ii) OCR iii) Unicode Conversion, iv) Grade 1

Braille script conversion The flow of proposed system is shown in fig 5.

Page 4: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Dr. G.Gayathri Devi and G.Sathyanarayanan

http://www.iaeme.com/IJCET/index.asp 76 [email protected]

Fig 5 Flow of the Work

3.1. Work Flow

Mapping Table Creation - A database is created where it stores the alphabets of Southern

Indian Languages and its corresponding Braille Unicode and its Braille equivalent symbol.

Input - The system accepts scanned document image and the language as input

Stage 1: Preprocessing - Preprocessing is necessary for efficient recovery of the text

information from scanned image. A text extraction algorithm [5,6] was presented to preserve

text area and remove non-text area. The method was based on gamma correction method and

positional connected component labeling algorithm [7,8].

Stage 2: OCR Conversion – The language of scanned text document image is selected by the

user. OCR System is a system that automates the process of getting and processing the images

of text document into editable text file. The output of stage 1 is given to the corresponding free

open OCR Engine for text conversion.

Stage 3: Unicode Conversion system – The editable text is converted in to its expanded form

and those expanded text is converted into its corresponding Unicode equivalent.

Stage 4: Braille Conversion System – The expanded text Unicode is searched in look up table

and its corresponding Braille Unicode is identified and it is converted to its Braille Symbol.

3.2. Algorithm

The steps of the algorithm are as follows:

Algorithm – Text Image to Braille Conversion

Input: Scanned Text Image and the language of Scanned Image

Output: Braille Equivalent

1. The Scanned Image is Preprocessed and converted into editable String SLangChar[] using

appropriate OCR Engine.

2. Create a Mapping look-up database (Slang2SlangUni) where it stores Southern Indian language

alphabet and its corresponding Unicode

3. Create a Mapping look-up database (Charuni2BrailleUni where it stores Southern Indian

language Unicode and its corresponding Braille Unicode.

4. Create a Mapping look-up database (BrailleUni2BrailleSym) where it stores Braille Unicode

and its corresponding Symbol.

5. Buni [] = BSym[]= SLanguni[]=SLangCharExtend[]=NULL

6. Read the text string SLangChar[]

7. For i =1 to len of SLangChar

Is SLangChar[i] a Vowel or Consonant

Page 5: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually

Challenged People

http://www.iaeme.com/IJCET/index.asp 77 [email protected]

SLangCharExtend[]+= SLangChar[i]

else

Is SLangChar[i] a Syllable

Break the corresponding Syllable in to Consonant[s] +Vowel and strore in

temp[]

SLangCharExtend[]+=temp

End

End for

For i =1 to len of SLangCharExtend

Find the Match for SLangCharExtend[i] in Mapping look-up

database(Slang2SlangUni) and store in Tlang

SLanguni[]+=Tlang

End for 8. For i =1 to len of SLanguni

Find the Match for SLanguni[i] in Mapping look-up database

(Charuni2BrailleUni database) and store in Tuni

Buni[]+=Tuni

End for 9. For i =1 to len of Buni[]

Find the Match for Buni[i] in Mapping look-up database

(BrailleUni2BrailleSym ) and store in Tsym

Bsym[]+=Tsym

End for 10. Return BSym[].

Indian Language Braille follows the expanded format for representing the vowel-consonant

combined character. For ex, the Braille notation for த and இ/◌ி are written together to

represent the combined character தி. The normal text is converted in to expanded form using

look up table. The expanded form is converted in to Unicode form. Then the Unicode form is

converted in to its corresponding Braille Unicode version using look up table. At last, Braille

Unicode version is converted in to Braille version using look up table. Some of the examples

is shown below

Example 1

Tamil Word -

Expanded form - =

Tamil Unicode Version -

Braille Equivalent Unicode –

���பாைவ

த+ இ+ ர+ உ+ ப+ ◌+ ப+ ஆ+ வ+ ஐ

0BA4 0B87 0BB0 0B89 0BAA 0BCD 0BAA 0B86 0BB5 0B90

281E 280A 2817 2825 280F 2808 280F 281C 2827 280C

Page 6: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Dr. G.Gayathri Devi and G.Sathyanarayanan

http://www.iaeme.com/IJCET/index.asp 78 [email protected]

Tamil Braille Version -

Example 2

Telugu Word -

Expanded form - =

Telugu Unicode Version -

Braille Equivalent Unicode -

Telugu Braille Version -

Example 3

Kannada Word -

Expanded form - =

Kannada Unicode Version -

Braille Equivalent Unicode -

Kannada Braille Version -

Example 4

Malayalam Word -

Expanded form - =

Malayalam Unicode Version -

Braille Equivalent Unicode -

⠞ ⠊ ⠗ ⠥ ⠏ ⠈ ⠏ ⠜ ⠧ ⠌

�ర�

త ఇ ర ఉ ప ◌ ప ఆ వ ఐ

0C24 0C07 0C30 0C09 0C2A 0C4D 0C2A 0C06 0C35 0C10

281E 280A 2817 2825 280F 2808 280F 281C 2827

⠞ ⠊ ⠗ ⠥ ⠏ ⠈ ⠏ ⠜ ⠧

തിര�ാൈവ

ത ഇ ര എ പ ◌ പ ആ വ ഐ

0D24 0D3F 0D30 0D41 0D2A 0D4D 0D2A 0D3E 0D35

281E 280A 2817 2825 280F 2808 280F 281C 2827

�ರು�ಾ��ೖ

ತ ಇ ರ ಉ ಪ ◌ ಪ ಆ ವ ಐ

0CA4 0CBF 0CB0 0CC1 0CAA 0CCD 0CAA 0CBE 0CB5

281E 280A 2817 2825 280F 2808 280F 281C 2827

⠞ ⠊ ⠗ ⠥ ⠏ ⠈ ⠏ ⠜ ⠧

Page 7: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually

Challenged People

http://www.iaeme.com/IJCET/index.asp 79 [email protected]

Malayalam Braille Version -

4. EXPERIMENTAL RESULTS

The experimentation of the algorithms was carried out on Tamil, Telugu, Malayalam, Kannada

text Image. Recognition rate achieved is 90% and the accuracy rate of converting Braille from

recognized text is 100%. Some of the experimental results are shown below.

Input - Tamil Text Image

⠞ ⠊ ⠗ ⠥ ⠏ ⠈ ⠏ ⠜ ⠧

Expanded Tamil Text

வ ஆ ர ண ம ஆ ய இ ர ம ◌ ச ஊ ழ வ ல ம ◌ ச எ ய ◌ த உ

ந ஆ ர ண ந ம ◌ ப இ ந ட க ◌ க இ ன ◌ ற ஆ ன எ ன ◌ ற எ த இ ர ◌

ப ஊ ர ண ப ஒ ற ◌ க உ ட ம ◌ வ ஐ த ◌ த உ ப ◌ ப உ ற ம எ ங ◌ க உ ம ◌

த ஓ ர ண ம ◌ ந ஆ ட ◌ ட க ◌ க ன ஆ க ◌ க ண ◌ ட ஏ ன ◌ த ஓ ழ ஈ ந ஆ ன ◌

Tamil Unicode Equivalent

0BB5 0B86 0BB0 0BA3 0020 0BAE 0B86 0BAF 0B87 0BB0 0BAE 0BCD 0020 0B9A 0B8A 0BB4 0BB5 0020 0BB2 0BAE

0BCD 0B9A 0B8E 0BAF 0BCD 0BA4 0B89 000A

0BA8 0B86 0BB0 0BA3 0020 0BA8 0BAE 0BCD 0BAA 0B87 0020 0BA8 0B9F 0B95 0BCD 0B95 0B87 0BA9 0BCD 0BB1 0B86 0020

0BA9 0B8E 0BA9 0BCD 0BB1 0B8E 0BA4 0B87 0BB0 0BCD 000A

0BAA 0B8A 0BB0 0BA3 0020 0BAA 0B92 0BB1 0BCD 0B95 0B89 0B9F 0BAE 0BCD 0020 0BB5 0B90 0BA4 0BCD 0BA4 0B89 0BAA

0BCD 0020 0BAA 0B89 0BB1 0BAE 0B8E 0B99 0BCD 0B95 0B89 0BAE 0BCD 000A

0BA4 0B93 0BB0 0BA3 0BAE 0BCD 0020 0BA8 0B86 0B9F 0BCD 0B9F 0B95 0BCD 0020 0B95 0BA9 0B86 0B95 0BCD 0B95 0BA3

0BCD 0B9F 0B8F 0BA9 0BCD 0020 0BA4 0B93 0BB4 0B88 0BA8 0B86 0BA9 0BCD

Braille Unicode Mapping

2827 281C 2817 283C 0020 280D 281C 283D 280A 2817 280D 2808 0020 2809 2833 2837 2827 0020 2807 280D 2808 2809 2822

283D 2808 281E 2825 000A

281D 281C 2817 283C 0020 281D 280D 2808 280F 280A 0020 281D 283E 2805 2808 2805 280A 2830 2808 283B 281C 0020 2830

2822 2830 2808 283B 2822 281E 280A 2817 2808 000A

280F 2833 2817 283C 0020 280F 282D 283B 2808 2805 2825 283E 280D 2808 0020 2827 280C 281E 2808 281E 2825 280F 2808

0020 280F 2825 283B 280D 2822 282C 2808 2805 2825 280D 2808 000A

281E 2815 2817 283C 280D 2808 0020 281D 281C 283E 2808 283E 2805 2808 0020 2805 2830 281C 2805 2808 2805 283C 2808

283E 2821 2830 2808 0020 281E 2815 2837 2814 281D 281C 2830 2808

Editable Tamil Text after applying Preprocessing Technique & OCR

வாரண மாய�ர !ழவ ல ெச#$

நாரண ந ப� நட%கி&றா ென&ெறதி'

(ரண ெபா)*ட ைவ,$- .றெம/*

ேதாரண நா1ட% கனா%க2ேட& ேதாழ3நா&

Page 8: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Dr. G.Gayathri Devi and G.Sathyanarayanan

http://www.iaeme.com/IJCET/index.asp 80 [email protected]

Input – Telugu Text Image

Tamil Braille Equivalent

⠧⠜⠗⠼ ⠍⠜⠽⠊⠗⠍⠈ ⠉⠳⠷⠧ ⠇⠍⠈⠉⠢⠽⠈⠞⠥

⠝⠜⠗⠼ ⠝⠍⠈⠏⠊ ⠝⠾⠅⠈⠅⠊⠰⠈⠻⠜ ⠰⠢⠰⠈⠻⠢⠞⠊⠗⠈

⠏⠳⠗⠼ ⠏⠭⠻⠈⠅⠥⠾⠍⠈ ⠧⠌⠞⠈⠞⠥⠏⠈ ⠏⠥⠻⠍⠢⠬⠈⠅⠥⠍⠈

⠞⠕⠗⠼⠍⠈ ⠝⠜⠾⠈⠾⠅⠈ ⠅⠰⠜⠅⠈⠅⠼⠈⠾⠡⠰⠈ ⠞⠕⠷⠔⠝⠜⠰⠈

Expanded Telugu Text

చఇట�ఇ చఇలకమఆ అమ కఒట�ఇందఆ

తఓటకఎల�ఆవఆ పండఉ తఎచ�ఆవఆ

గఊట� ఓ పఎట�ఆవఆ గఉటఉక�ఉన మఇంగఆవవ ◌

Telugu Unicode Equivalent

0C1A 0C07 0C1F 0C4D 0C1F 0C07 0020 0C1A 0C07 0C32 0C15 0C2E 0C4D 0C2E 0C06 0020 0C05 0C2E 0C4D 0C2E 0020

0C15 0C12 0C1F 0C4D 0C1F 0C07 0C02 0C26 0C06 000A

0C24 0C13 0C1F 0C15 0C0E 0C32 0C4D 0C32 0C06 0C35 0C06 0020 0C2A 0C02 0C21 0C09 0020 0C24 0C0E 0C1A 0C4D

0C1A 0C06 0C35 0C06 000A

0C17 0C0A 0C1F 0C4D 0C32 0C13 0020 0C2A 0C0E 0C1F 0C4D 0C1F 0C06 0C35 0C06 0020 0C17 0C09 0C1F 0C09 0C15

0C4D 0C15 0C09 0C28 0020 0C2E 0C07 0C02 0C17 0C06 0C35 0C06

Braille Unicode Mapping

2809 280A 283E 2808 283E 280A 0020 2809 280A 2807 2805 280D 2808 280D 281C 0020 2801

280D 2808 280D 0020 2805 282D 283E 2808 283E 280A 2818 2819 281C 000A

281E 2815 283E 2805 2822 2807 2808 2807 281C 2827 281C 0020 280F 2818 282B 2825 0020

281E 2822 2809 2808 2809 281C 2827 281C 000A

281B 2833 283E 2808 2807 2815 0020 280F 2822 283E 2808 283E 281C 2827 281C 0020 281B

2825 283E 2825 2805 2808 2805 2825 281D 0020 280D 280A 2818 281B 281C 2827 281C

Editable Telugu Text after applying Preprocessing Technique & OCR

�ట�� �లకమ అమ !"ట�� ం#$

%&ట!'ల � ( పండ) %+$�(

గ,ట-� ./ట0� ( గ1ట2క3�న 4ం5(

Page 9: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually

Challenged People

http://www.iaeme.com/IJCET/index.asp 81 [email protected]

Input -Kannada Text Image

Telugu Braille Equivalent

⠉⠊⠾⠈⠾⠊ ⠉⠊⠇⠅⠍⠈⠍⠜ ⠁⠍⠈⠍ ⠅⠭⠾⠈⠾⠊⠘⠙⠜

⠞⠕⠾⠅⠢⠇⠈⠇⠜⠧⠜ ⠏⠘⠫⠥ ⠞⠢⠉⠈⠉⠜⠧⠜

⠛⠳⠾⠈⠇⠕ ⠏⠢⠾⠈⠾⠜⠧⠜ ⠛⠥⠾⠥⠅⠈⠅⠥⠝ ⠍⠊⠘⠛⠜⠧⠜

Expanded Kannada Text

ತ ಓ ಟ ಕ ಎ ಹ ಓ ಗ ಓ ತ ಇ ಮ ◌ ಮ

ತ ಓ ಳ ಬ ◌ಂ ದ ಈ ತ ಮ ◌ ಮ

ಹ ಸ ಉ ಮ ಐ ಸ ಓ ತ ಇ ಮ ◌ ಮ

ಹ ಸ ಉ ಹ ಆ ದ ಈ ತ ಮ ◌ ಮ

Kannada Unicode Equivalent

0CA4 0C93 0C9F 0C95 0C8E 0020 0CB9 0C93 0C97 0C93 0020 0CA4 0C87 0CAE 0CCD 0CAE 000A

0CA4 0C93 0CB3 0020 0CAC 0C82 0CA6 0C88 0CA4 0CAE 0CCD 0CAE 000A

0CB9 0CB8 0C89 0020 0CAE 0C90 0CB8 0C93 0020 0CA4 0C87 0CAE 0CCD 0CAE 000A

0CB9 0CB8 0C89 0020 0CB9 0C86 0CA6 0C88 0CA4 0CAE 0CCD 0CAE

Braille Unicode Mapping

281E 2815 283E 2805 2822 0020 2813 2815 281B 2815 0020 281E 280A 280D 2808 280D 000A

281E 2815 2838 0020 2803 2818 2819 2814 281E 280D 2808 280D 000A

2813 280E 2825 0020 280D 280C 280E 2815 0020 281E 280A 280D 2808 280D 000A

2813 280E 2825 0020 2813 281C 2819 2814 281E 280D 2808 280D

Editable Kannada Text after applying Preprocessing Technique & OCR

!ೂೕಟ$ %ೂೕ&ೂೕ �ಮ'

!ೂೕಳ ಬಂ(ೕತಮ'

ಹಸು )ೖ*ೂೕ �ಮ'

ಹಸು %ಾ(ೕತಮ'

Page 10: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Dr. G.Gayathri Devi and G.Sathyanarayanan

http://www.iaeme.com/IJCET/index.asp 82 [email protected]

Input - Malayalam Text Image

Kannada Braille Equivalent

⠞⠕⠾⠅⠢ ⠓⠕⠛⠕ ⠞⠊⠍⠈⠍

⠞⠕⠸ ⠃⠘⠙⠔⠞⠍⠈⠍

⠓⠎⠥ ⠍⠌⠎⠕ ⠞⠊⠍⠈⠍

⠓⠎⠥ ⠓⠜⠙⠔⠞⠍⠈⠍

Expanded Malayalam Text

കഒച ◌ചഉ പഊച ◌ച കഉഞ ◌ഞഇനഒരഉ കഒച ◌ചമളഇ പറ ◌റഇ കആച ◌ചഇ വച ◌ച ചഊട ◌ പആല ◌ ഓടഇ ചഎന ◌ന ◌ നക ◌കഇ കഉഞ ◌ഞഉ നആവ ◌ പഒള ◌ളഇപ ◌പഓയഇ കഉഞ ◌ഞഇ പഊച ◌ച കഏണഉ മ ◌യആവഉ മ ◌യആവഉ മ ◌യആവഉ മ ◌യആവഉ

Malayalam Unicode Equivalent

0D15 0D12 0D1A 0D4D 0D1A 0D09 0020 0D2A 0D0A 0D1A 0D4D 0D1A 0020 0D15 0D09 0D1E 0D4D 0D1E 0D07 0D28 0D12 0D30

0D09 0020 0D15 0D12 0D1A 0D4D 0D1A 0D2E 0D33 0D07 0020 0D2A 0D31 0D4D 0D31 0D07 000A

0D15 0D06 0D1A 0D4D 0D1A 0D07 0020 0D35 0D1A 0D4D 0D1A 0020 0D1A 0D0A 0D1F 0D4D 0020 0D2A 0D06 0D32 0D4D 0020

0D13 0D1F 0D07 0020 0D1A 0D0E 0D28 0D4D 0D28 0D4D 0020 0D28 0D15 0D4D 0D15 0D07 0020 000A

0D15 0D09 0D1E 0D4D 0D1E 0D09 0020 0D28 0D06 0D35 0D4D 0020 0D2A 0D12 0D33 0D4D 0D33 0D07 0D2A 0D4D 0D2A 0D13

0D2F 0D07 0020 0D15 0D09 0D1E 0D4D 0D1E 0D07 0020 0D2A 0D0A 0D1A 0D4D 0D1A 0020 0D15 0D0F 0D23 0D09 0020 000A

0D2E 0D4D 0D2F 0D06 0D35 0D09 0020 0D2E 0D4D 0D2F 0D06 0D35 0D09 0020 0D2E 0D4D 0D2F 0D06 0D35 0D09 0020 0D2E 0D4D

0D2F 0D06 0D35 0D09

Editable Malayalam Text after applying Preprocessing Technique & OCR

െകാ"# പ" ക%ിെനാര െകാ"മളി പ&ി കാ"ി വ" ച' പാ( ഓടി െച) ന*ി ക% നാ+ െപാ,ിേ�ായി ക%ി പ" േകണ മ.ാവ മ.ാവ മ.ാവ മ.ാവ

Page 11: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Conversion of Text Image Document In Southern Indian Languages Into Braille For Visually

Challenged People

http://www.iaeme.com/IJCET/index.asp 83 [email protected]

CONCLUSION

This proposed approach presents an algorithm to assist visually impaired or blind person in

reading the Southern Indian languages text present on printed image. Experimental results show

that the proposed method has a good performance on converting text regions in an image into

Braille Script. The research work successfully converts Tamil, Telugu, Kannada, Malayalam

languages to Braille script and this work can be further developed to recognize Indian language

present in an image and convert it to corresponding Braille system.

ACKNOWLEDGMENT

This work is a part of UGC Minor Research Project. Reference Number-No.F MRP-

6538/16(SERO/UGC), Dated: 30/06/2017.Authors thank the Management and Principal of

Shrimathi Devkunvar Nanalal Bhatt Vaishnav College for Women for extending their support

for this project.

REFERENCES

[1] Working with Braille: A study of Braille as a medium of communication BARRY

HAMPSHIRE,the UNESCO Press, 1981

[2] https://en.wikipedia.org/wiki/Braille

[3] http://www.acb.org/tennessee/braille.html

[4] G.Gayathri Devi,G.Sathyanarayanan , Braille Document Recognition in Southern Indian

Languages – A Review"", EEE Fourth International Conference on Advances in Electrical,

Electronics, Information, Communication And Bioinformatics at Prathyusha Engineering

College, Chennai , Feb 21-22 2018,

Braille Unicode Mapping

2805 282D 2809 2808 2809 2825 0020 280F 2833 2809 2808 2809 0020 2805 2825 2812 2808 2812 280A

281D 282D 2817 2825 0020 2805 282D 2809 2808 2809 280D 2838 280A 0020 280F 283B 2808 283B

280A 000A

2805 281C 2809 2808 2809 280A 0020 2827 2809 2808 2809 0020 2809 2833 283E 2808 0020 280F 281C

2807 2808 0020 2815 283E 280A 0020 2809 2822 281D 2808 281D 2808 0020 281D 2805 2808 2805

280A 0020 000A

2805 2825 2812 2808 2812 2825 0020 281D 281C 2827 2808 0020 280F 282D 2838 2808 2838 280A 280F

2808 280F 2815 283D 280A 0020 2805 2825 2812 2808 2812 280A 0020 280F 2833 2809 2808 2809 0020

2805 2821 283C 2825 0020 000A

280D 2808 283D 281C 2827 2825 0020 280D 2808 283D 281C 2827 2825 0020 280D 2808 283D 281C 2827

2825 0020 280D 2808 283D 281C 2827 2825

Malayalam Braille Equivalent

⠅⠭⠉⠈⠉⠥ ⠏⠳⠉⠈⠉ ⠅⠥⠒⠈⠒⠊⠝⠭⠗⠥ ⠅⠭⠉⠈⠉⠍⠸⠊ ⠏⠻⠈⠻⠊

⠅⠜⠉⠈⠉⠊ ⠧⠉⠈⠉ ⠉⠳⠾⠈ ⠏⠜⠇⠈ ⠕⠾⠊ ⠉⠢⠝⠈⠝⠈ ⠝⠅⠈⠅⠊

⠅⠥⠒⠈⠒⠥ ⠝⠜⠧⠈ ⠏⠭⠸⠈⠸⠊⠏⠈⠏⠕⠽⠊ ⠅⠥⠒⠈⠒⠊ ⠏⠳⠉⠈⠉ ⠅⠡⠼⠥

⠍⠈⠽⠜⠧⠥ ⠍⠈⠽⠜⠧⠥ ⠍⠈⠽⠜⠧⠥ ⠍⠈⠽⠜⠧⠥

Page 12: CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN … · CONVERSION OF TEXT IMAGE DOCUMENT IN SOUTHERN INDIAN LANGUAGES INTO BRAILLE FOR VISUALLY CHALLENGED PEOPLE ... Malayalam and Kannada

Dr. G.Gayathri Devi and G.Sathyanarayanan

http://www.iaeme.com/IJCET/index.asp 84 [email protected]

[5] Dr.C.P.Sumathi, G.Gayathri Devi, 2014, “Automatic Text Extraction from Complex

Colored Images Using Gamma Correction Method”, Journal of Computer Science, pages

706-715. (Scopus Indexed)

[6] G.Gayathri Devi, Dr.C.P.Sumathi, “Text Extraction from Images using Gamma Correction

Method and different Text Extraction Methods – A Comparative Analysis”, International

Conference on Information Communication and Embedded Systems ICICES 2014, S.A.

Engineering College, Chennai, Feb 27-28, 2014. (Scopus Indexed)

[7] G. Gayathri Devi, C. P. Sumathi, "Positional Connected Component Labeling Algorithm",

Indian Journal of Science & Technology, Volume 7, Issue 3, March 2014, pp 306-311.

[8] G Gayathri Devi, G Sathyanarayanan ,“A Connected Components Labeling Algorithm for

4-Connectivity Based On Position Matrix”, International Journal of Scientific Research in

Computer Science, Engineering and Information Technology Volume (2) : Issue (6) : Dec

2017 , ISSN:2456-3307 pg 50 to 58