Abstract
Braille is the system invented for visually impaired peoples to capable them for written communication. The basic constructs of each natural language can be transited to corresponding Braille characters. Optical Braille recognition system is capable to read the Braille pattern images and convert it to corresponding language text. The images captures from portable camera devices and scanners are generally geometrically and visually disturbed images. In this paper, a disruption robust model is presented to improve the scope and accuracy of Braille character recognition system. The proposed framework improved the image visibility using SD adaptive filter and used the polar cosine transformation to resolve geometric misalignment. Later on grid mapper and binary encoder are applied on emboss enhanced character image for transforming it to binary coded array. Finally, the peered map on this binary character is implied to recognize the corresponding language text. The proposed framework is implemented on three different sample sets of Braille character and document images collected randomly from the web. The comparative evaluation against PCA (principal component analysis) and LDA (linear discriminant analysis) method shows that the proposed model significantly improved the accuracy of Braille character and document recognition.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Visually impaired people use the Braille [1, 2] language for written communication. Braille was invented by Louis Braille in 1800 to empower the blind people to read and write books, letters and documents. In the traditional form of this language, 6 dots in 3 rows and 2 columns are used in different combinations to represent 26 alphabets uniquely shown in Fig. 1a. Later on various transcripts are also defined by various authors to map the Baillie to other languages like Spanish, German, and Italian etc. The character based Braille is referred as “grade 1” which occupy large space on pages and slow down the writing speed. To overcome these deficiencies, “grade 2” Braille [1,2,3] is defined to represent the “dotted characters” for words or group of characters shown in Fig. 1b. In more advanced form, “grade 3” Braille reflects the shorthand to provide symbolic form for larger words. But this type of Braille is not in common use. Braille can be written on embossed paper either manually or optically. The embossed paper, presents the dot pattern as raised symbols which can be recognized by touch sense. To write Braille manually or optically some specialized metal or plastic device and tools are required. These devices punch the holes on embossed paper to print the raised dots. The orientation and the punching pressure should be proper for faster, correct and optical reading of the Braille.
The quality of Braille recognition system [4,5,6,7] depends on various associated phenomenon. The dependency elements include the quality of capturing equipment, observed sample space, capturing environment, adapted pre-processing techniques etc. Various methods can be applied to resolve the irregularity issues and to generate the effective decisive features. The processing feature space can ensure the accuracy of textual prediction. The extracted features can be processed through some classifier to recognize the character. Various supervised and unsupervised [8] classifiers are available to recognize the class of dotted patterns. The testing data is once identified in specialized class, the natural language constructs can be mapped to recognize the actual character or word.
In this paper, an alignment and impurities robust model is presented to improve the capabilities and accuracy of Braille recognition system. In this model, various mathematical filters and operators are applied in composite form to perform a peered mapping of Braille character to text. In this section, the significance, scope and characterization of Braille recognition system is explained. In Sect. 2, various methods and framework provided by the earlier researchers to enhance the scope and accuracy of Braille recognition and character recognition is described. In Sect. 3, the proposed disruption and alignment robust model is provided with relative mathematical formulation. In Sect. 4, the implementation results applied on two different dataset is presented and discussed with comparative evaluation. In Sect. 5, the conclusion obtained from the work is presented.
2 Related work
Braille recognition system is valuable automated recognition method defined to read the documents or text written by visually challenged persons. To read these documents correctly, the visibility and alignment of the dotted pattern in scanned documents is the essential requirement. Various pre-processing methods are also applied by the researchers to highlight the pattern and extract the pattern accurately. Various classification methods are also proposed by the researchers to convert the Braille to English or other languages. In this section, various methods and formulations provided by the researchers is discussed. A study work on OBR (optical Braille recognition) for converting the Braille to Text script was provided by Hanumanthappa and Murthy [1]. The standard recognition model with each intermediate stage is also explained by the author. Key requirement of OBR is to identify the dot pattern accurately; a wider pre-processing and dot exploration stage is required. A study against skewness, noise, dot-cell exploration was provided by Isayed and Tahboub [2]. Author described the dot extraction based on horizontal and vertical projection profiling.
Various machine learning methods can be applied on extracted dots to identify Braille corresponding text correctly. Various supervised and unsupervised learning methods were invented by the researchers to handle the existing structural, quality and featured challenges. A work on Deep Learning [9] classifier against noise and corrupt input was introduced to minimize the recognition error. A fine regression tuner was applied against multi-class problem to generalize the classification process. A hybrid model using Haar Wavelet and SVM [4] was applied to tackle the geometric and color specific variations. The boundary separator and edge features from different orientations were processed to improve the accuracy of OBR. The contour [6] measure was used to generate the structural elements of Braille characters and map them over lookup table. Author applied the morphological operators and applied a serious of methods at pre-processing stage to extract the information accurately. Li and Yan [10] used the SVM classifier on segmented featured information extracted from Braille images. Author applied the contrast and geometric correction to reduce the recognition rate. A work on slanted image and deviation [11] preserved method was provided to improve the robustness of Braille recognition. The neighbour dot analysis and grid feature extraction and interpretation were implemented by the author to observe various aspects of image features. To extend the segmentation phenomenon for multi-class Braille recognition, the inter-class variation estimation and Gamma [12] distribution was applied to avoid the impact of asymmetric nature of captured pattern images. The statistical evaluation was also implied to increase the probability of segmentation process. The parameter specific estimation under rotation, indentation and orientation was provided by Tai et al. [13]. The extracted key elements were processed by belief propagation and hidden Markov model for probabilistic recognition. The orientation and angular adjustment was also implied to avoid the error in dot detection. Another work improves the Braille recognition against skewness was provided by Hassen and Assabie [14]. The scale and distance independent OBR was suggested by the author.
The ontology [15] theory was adopted to train the Braille characters for more effective and independent recognition. The expansion was provided to existing OBR system by learning the larger set of Braille characters and its mapping to Chinese text. A twin shadow [3] approach was considered in Braille character recognition under different illumination. An effective solution of Braille recognition was provided to resolve the problem of background texture and noise in real time capturing of characters. Yousefi et al. [16] has proposed a probabilistic statistical method to recognize the Braille document effectively. The method was adaptive to scaling, spacing and skewness parameters.
The capturing device plays an important role for effective extraction and recognition of Braille characters. A portable camera [17] based interactive Braille recognition was recommended for real time scenarios. The scanner is able to read the hand gesture and identify the required action based on action mapping. The interactive system is able to recognize the sign, dot pattern and the gesture for visually impaired people. A portable translation device was invented by Murray and Pasquale [18]. The device is able to recognize the dot area, compile it and translate it to the corresponding text. A pin-matrix device and graphical screen reader based HyperBraille [19] system was designed for Braille character recognition. The semi-graphic view was generated through widget to enable the interaction with blind users.
Braille is accepted by all languages, including German, Arabic, English, Italian etc., with specific dot pattern map to the language constructs. A work to generate the Chinese and Arabic [5] text from Braille by covering the neural and interlunation. Author applied the mathematical morphology to generate the structural elements and to extract the dot-cells. The structural information was processed one-to-one for recognition of text accurately. Another work on Braille to Urdu [7] transition using different image processing techniques was obtained. Author applied the grid pattern based mapping with threshold criteria specification to link the pattern vectors to corresponding Urdu alphabets. The conversion of Braille to Chinese [20] characters was provided segmented feature constraints. A chain of convolution operators, Gaussian method and boundary extractor were applied to enhance the pattern features and to extract the pattern accurately.
The basic formulation of each work stage of OBR is similar to the character recognition system. To understand the OBR, some study work on characterization of OCR and relative approaches is also done. A study work on character recognition system was provided by Govindan and Shivaprasad [21]. Author identifies the capabilities and gap of various character recognition methods. A recognition system on degraded characters using SIFT [22] feature processing was provided. The neighbour map was applied on segmented characters to recognize the relative class. Juneja and Gill [23] has presented an Art network based noise robust method for improving the digital character recognition. A work on Braille block [24] recognition was provided using PCA approach. Author has classified the blocks in four classes called vertical, right-slanted, left-slanted and dotted stop blocks. The pattern map was considered under camera height, angle and illumination issues. A transition method of scanned Braille book [25] to electronic Braille book was provided. The transition is performed for both the text and the graphics. Bebartta and Mohanty [26] has defined a method to recognize the Roman and Odia text based printed document recognition. The linear and circular content map was applied to identify the individual script from the bilingual documents.
Some other applications that increases the human–computer interactions include the hand sign, gesture, face expression or gaze recognition. A descriptive study on scope and directions of sign language [27] was provided to improve interaction capability of hearing and speech impaired people. Various methods for feature generation, template mapping and sign symbol recognition was provided. A color distribution and visual feature analysis based method for image saliency [28] detection was provided. The content and local feature based distance measures were applied for detection of objects. Kumar and Priyanka [29] has proposed the fingerprint recognition on corrupted and degraded images. The effect of environment, sink, sensor and other impression specific features were considered by the author. The image enhancement based mapping process was defined for effective fingerprint recognition.
3 Alignment and disruption robust mapper
Braille recognition system is required to read the documents, provided by visually challenged people, without the help of a Braille expert. The Braille script can be captured through specialized scanner or through portable camera devices. The alignment of the document placement or camera position results the misalignment of complete document. This misalignment can be identified as the skewed or rotated Braille characters. The dirt, dust or lighting conditions can also affects the capturing image quality in the form of noise. In this paper, a noise and alignment problem rectified peered solution is provided to recognize the Braille character correctly. In this paper, mathematical filters are applied in composite and sequential form to resolve the impact of image disturbance. Figure 2 shows the proposed Braille recognition model with specification of each integrated work stages. In this figure, the process stage is defined in gray box and the white box is representing the corresponding filter applied to implement that particular stage.
The document or the character image captured through scanner device or through portable camera device is taken as the input to the system. To enhance the content quality of dotted pattern, the SD adaptive filter is applied to reduce the noise from rough captured image. To apply the grid mapper, the input patterns must be defined in structured and symmetric form. The extracted images cannot ensure the proper aligned capturing. The PCT (polar cosine transformation) [30] and geometric map through horizontal and vertical axes. To highlight the content, the emboss filter is applied using mathematical operators. The aligned, clear and highlighted image processed using binarization to separate the content and the background. The paper specific disturbance in the capturing are removed in this stage. Now the dot-pattern contents are present in black and the white background represents the paper. The grid mapper is applied on this extracted pattern image to perform the content verification and based on this verification; the binary code is generated for each character image. This binary coded array is compared with each Braille pattern code and corresponding English character. In this section, each of the intermediate stage is also explained.
The scanner or camera quality can disturb the images in the form of inclusive noise. A filtration window tracked and standard deviation adaptive evaluation method is defined to evaluate the degree of impurity. The smoothing vector is also defined to remove the noise and to improve the quality of particular segment. The difference value evaluation based on standard deviation and relative threshold specification corresponding to the center pixel is done to smooth the pixels in filtration window. The evaluation specific smoothening process is shown in Algorithm 1
Algorithm 1 processed the raw Braille image and applied the block adaptive evaluation method based on the center difference and standard deviation vector to set the rule for smoothing the block pixels. The results on noise adjustment are shown in Fig. 3. Figure 3a is showing the raw scanned image and Fig. 3b is showing the enhanced image.
The scanned Braille document also suffers from the problem of misalignment. To locate the problem of misalignment, a geometric and content specific map is required respective to horizontal and vertical axis. To perform the geometric map, the peak points of the contents on the document are identified. At these peak points, the straight horizontal and vertical lines are placed for geometric evaluation. The angular evaluation is done based on mathematical mapping. Let (x1, y1) is the initial point and (x2, y2) is the peak point of horizontal line. The identify content peak points are represented by (cx1, cy1) and (cx2, cy2). From these coordinate points, m1 is the slope of line of geometric straight line and m2 is the slope of content touched line. The angular evaluation based on the slope vectors is shown in Eq. (1)
If θ is zero, the Braille document is already aligned. If θ ≠ 0, then PCT (polar cosine transformation) is applied to align the document. Figure 4 is showing the misalignment problem on partial document image and the aligned document obtained after geometric map and PCT.
PCT is the harmonic transformation that rotate the image using cos function. The rotational vector is evaluated based on the radius and base coordinate specification. PCT evaluation is shown in Eq. (2)
where, r is radius, n is number of coordinate points.
Based on this, the aligned points of the image are shown in Eqs. (3) and (4)
Once these real and imaginary values are obtained for the image, the result value is generated. The intensity evaluation is obtained for the image relative to real and imaginary value shown in Eqs. (5) and (6)
Based on this, the pixel rotated Braille document is obtained. This document is completely aligned. To extract the dot pattern from Braille document the emboss filter is executed. This filter highlights the core area using 3D shadow effect. In this method, the bump map is applied by subtracting the side area and to explore the key area. To apply this filter, the convolution matrix map is applied over the image. In this filter, the specific matrix of 3 × 3 size is tracked over the image as a window. The sum of products of window image pixels and the emboss matrix is substituted to each pixel of the window. The correlation adaptive matrix evaluates the symmetric measure for each pixel in the convolution window. Figure 5b is showing the embossed feature Braille image.
The emboss filter evaluate the gradient vector for each pixel respective to the light source. The positive projection is applied onto the vector pointing towards the light. The bump map is generated by applying the positional offset. The water level specific alpha channel is applied to highlight the emboss effect. After generating the embossed feature image, grid mapper is applied on each separated Braille character. The emboss filter separated the each of the dot pattern separately and quality it to recognize the Braille character accurately. For this recognition, the grid mapper is applied on each separated character. The separated character and relative grid mapper is shown in Fig. 5c, d. To perform the grid mapper, the positional and geometric estimation is applied over the character space. According to this evaluation, the grid structure is applied on the Braille. On each character, the grid cell is evaluated for the information containment. If dot exist, it is considered as binary 1 and if no data exist then it is marked as 0. In this way, each character with six dots is represented by six binary digits. This binary generated code is compared with available binary training set which represents each Braille character distinctively. Binary code of the test Braille character is mapped with Binary set code to recognize the equivalent textual character.
4 Analysis
In the previous section, the proposed robust model is suggested to enhance the Braille recognition under worst constraints. The proposed model is capable to handle the real time issues including the noise, file format, resolution and misalignment problem. The experimental evaluation is applied on three different sample sets. The Braille characters and document images are collected from random web sources. The features of these samplesets and corresponding comparative evaluation are also shown in Table 1. The impurities over these samplesets either exist initially or added explicitly to them.
Table 1, shows that these three sample sets able to defend the robust evaluation of the proposed framework. The samplesets are defined with different file formats, resolution, type and disruption inclusion. The comparative evaluation is taken against PCA and LDA methods. The comparative evaluation of accuracy for these sample sets is shown in Fig. 6.
To verify the significance of proposed model, the comparative evaluation is performed against PCA and LDA methods. The analysis is performed against available three sample sets of Braille documents and characters. Figure 6 depict that the proposed framework improved the accuracy over the existing evaluated methods.
5 Conclusion
Braille is the communication language adopted by visually impaired people for written conversation. To connect the Braille with normal communication, the transition between Braille and language text is required. In this paper, a disruption and alignment robust optical Braille recognition system is presented. The presented model, applied the mathematical, adaptive and geometric operators to extract and improve the Braille character features. PCT adaptive geometric filter is applied to correct the misalignment problem. Later, the emboss filter is applied using convolution filter to extract the dotted pattern clearly over the document. Finally the grid mapper is applied based on positional and structural measure to generate the binary code. This binary code is compared over the cookbook to recognize the character accurately. The experimentation is applied on three heterogeneous sample sets of Braille characters and documents. The comparative evaluation shows that the proposed model improved the accuracy extensively over PCA and LDA methods.
In this paper, the character mapping is done using grid adaptive content map. In future, more intelligent supervised learning such as neural network or probabilistic Bayesian networks can be applied to improve recognition rate. In future, the proposed recognition method can be applied on larger Braille character dataset. The work can be extended to perform character recognition or sign recognition.
References
Hanumanthappa M, Murthy VV (2016) Optical Braille recognition and its correspondence in the conversion of Braille script to text—a literature review. In: 2016 International conference on computation system and information technology for sustainable solutions (CSITSS), Bangalore, pp 297–301
Isayed S, Tahboub R (2015) A review of optical Braille recognition. In: 2015 2nd World symposium on web applications and networking (WSWAN), Sousse, pp 1–6
Hentzschel TW, Blenkhorn P (1995) An optical reading system for embossed Braille characters using a twin shadows approach. J Microcomput Appl 18(4):341–354
Li J, Yan X, Zhang D (2010) Optical Braille recognition with Haar wavelet features and support-vector machine. In: 2010 International conference on computer, mechatronics, control and electronic engineering, Changchun, pp 64–67
Yin J, Wang L, Li J (2010) The research on paper-mediated braille automatic recognition method. In: 2010 Fifth international conference on frontier of computer science and technology, Changchun, Jilin Province, pp 619–624
Subur J, Sardjono TA, Mardiyanto R (2015) Braille character recognition using find contour method. In: 2015 International conference on electrical engineering and informatics (ICEEI), Denpasar, pp 699–703
Wajid M, Abdullah MW, Farooq O (2011) Imprinted Braille-character pattern recognition using image processing techniques. In: 2011 International conference on image information processing, Himachal Pradesh, pp 1–5
Sahu SK, Kumar P, Singh AP (2017) Modified K-NN algorithm for classification problems with improved accuracy. Int J Inf Technol. https://doi.org/10.1007/s41870-017-0058-z
Li T, Zeng X, Xu S (2014) A deep learning method for Braille recognition. In: 2014 International conference on computational intelligence and communication networks, Bhopal, pp 1092–1095
Li J, Yan X (2010) Optical Braille character recognition with support-vector machine classifier. In: 2010 International conference on computer application and system modeling (ICCASM 2010), Taiyuan, pp V12-219–V12-222
Mennens J, Van Tichelen L, Francois G, Engelen J (1993) Optical recognition of Braille writing. In: Proceedings of the second international conference on document analysis and recognition, 1993, Tsukuba Science City, pp 428–431
AlSalman A, El-Zaart A, Al-Salman S, Gumaei A (2012) A novel approach for Braille images segmentation. In: 2012 International conference on multimedia computing and systems, Tangier, pp 190–195
Tai Z, Cheng S, Verma P, Zhai Y (2010) Braille document recognition using belief propagation. J Vis Commun Image Represent 21(7):722–730
Hassen M, Assabie Y (2012) Recognition of Ethiopic braille characters. In: Proceedings of the international conference on management of emergent digital EcoSystems (MEDES ‘12). ACM, New York, pp 158–165
Tang J (2013) Using ontology and RFID to develop a new Chinese Braille learning platform for blind students. Expert Syst Appl 40(8):2817–2827
Yousefi M, Famouri M, Nasihatkon B, Azimifa Z, Fieguth P (2012) A robust probabilistic Braille recognition system. Int J Doc Anal Recognit 15(3):253–266
Mihara Y, Sugimoto A, Shibayama E, Takahashi S (2005) An interactive Braille-recognition system for the visually impaired based on a portable camera. In: CHI ‘05 extended abstracts on human factors in computing systems (CHI EA ‘05). ACM, New York, pp 1653–1656
Murray I, Pasquale A (2006) A portable device for the translation of braille to text. In: Proceedings of the 8th international ACM SIGACCESS conference on computers and accessibility (Assets ‘06). ACM, New York, pp 231–232
Prescher D, Bornschein J, Köhlmann W, Weber G (2017) Touching graphical applications: bimanual tactile interaction on the HyperBraille pin-matrix display. Univers Access Inf Soc. https://doi.org/10.1007/s10209-017-0538-8
Ng CM, Ng V, Lau Y (1999) Regular feature extraction for recognition of Braille. In: Third international conference on computational intelligence and multimedia applications, 1999. ICCIMA ‘99. Proceedings, New Delhi, pp 302–306
Govindan VK, Shivaprasad AP (1990) Character recognition—a review. Pattern Recognit 23(7):671–683
Saleem S, Hollaus F, Sablatnig R (2014) Recognition of degraded ancient characters based on dense SIFT. In: Proceedings of the first international conference on digital access to textual cultural heritage (DATeCH ‘14). ACM, New York, pp 15–20
Juneja K, Gill NS (2013) An enhanced digital character recognition using art network classifier. Int J Comput Appl 64(07):28–33. ISSN:0975-8887 (Impact Factor: 0.814)
Park S-J, Shin D (2012) PCA-based image recognition of braille blocks for guiding the visually handicapped. Int J Precis Eng Manuf 13(12):2115–2120
Park T, Jung J, Cho J (2016) A method for automatically translating print books into electronic Braille books. Sci China Inf Sci 59:1–14
Bebartta HND, Mohanty S (2017) Algorithm for segmenting script-dependant portion in a bilingual Optical Character Recognition system. Pattern Recognit Image Anal 27(3):560–568
Sinha GR (2017) Indian sign language (ISL) biometrics for hearing and speech impaired persons: review and recommendation. Int J Inf Technol 9(4):425–430
Ambeth Kumar VD (2017) Coalesced global and local feature discrimination for content-based image retrieval. Int J Inf Technol 9(4):431–446
Kumar M, Priyanka (2017) Various image enhancement and matching techniques used for fingerprint recognition system. Int J Inf Technol. https://doi.org/10.1007/s41870-017-0061-4
Juneja K, Gill NS (2015) A PCT/PST improved HMM-PCA model for pose robust facial recognition. In: Proceedings of 2015 international conference on applied and theoretical computing and communication technology (iCATccT), Davangere, India, Oct 29–31, 2015, pp 131–136
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Juneja, K., Rana, C. Alignment and disruption robust binary mapper for optical Braille recognition. Int. j. inf. tecnol. 12, 1291–1298 (2020). https://doi.org/10.1007/s41870-017-0069-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-017-0069-9