Tag Archives: Perso-Arabic

Software to melt India, Pakistan’s Sindhi script barrier

By , TNN

PATIALA: Bringing down the script barrier between 25 lakh Sindhis in India and four crore in Pakistan, a first-of-its-kind software will enable Sindhis settled on both sides of the border to read each others’ literature despite the different scripts.

The yet-to-be-launched software has been developed by Punjabi researchers in Punjabi University, Patiala and Manchester University, England.

Despite having the same language, Sindhis residing on both sides of the border could not read each others’ literature since Pakistani Sindhis use Perso-Arabic script and those in India follow the Devnagari script.

The software, which is in trial stage, will remove this barrier as it will transliterate Perso-Arabic Sindhi into Devnagari and vice-versa.

“Like Punjabis, Sindhis also follow two scripts. Hence, the immense need to remove this language barrier. We had begun work on this project in March, last year. A Punjabi scholar form Manchester University is also collaborating on this,” said Dr GS Lehal of Punjabi University, coordinator of the project.

Dr Lehal said that the software will be equipped with over one crore Sindhi words in Perso-Arabic script and around 50 lakh Sindhi words in Devnagari script.

“Word bank of Sindhi words in Devnagari is smaller as the volume of Sindhi literature published in India is much less than that in Perso-Arabic. We found soft copies of numerous Sindhi magazines, newspapers and books published in Perso-Arabic script. These words were converted into data bank. Besides, there is dictionary of over 25,000 basic words, which is part of the word pool,” he added.

He said that phase I of the project is complete, which means that software has the capacity to transliterate with 90% accuracy. “We will launch it after we achieved accuracy rate of 95%, which likely in the next few months”, he added.

TRANSITION

Till 1850s, Sindhi was written in several scripts including Perso-Arabic and Gurmukhi by people of different religions residing in Sindh province of Pakistan. “However, in 1850s, a special committee constituted by British mandated use of Perso-Arabic script to write Sindhi, said Dr Lehal. The practice continued till 1947, when large number of Sindhis migrated to India and settled in Maharashtra, Gujarat and Rajasthan. Shortly after Partition, Indian Sindhis adopted the Devnagari script.

Courtesy: The Times of India
http://timesofindia.indiatimes.com/india/Software-to-melt-India-Pakistans-Sindhi-script-barrier/articleshow/41556896.cms

Sindhi Resource Grammar Released – funded by the European Union

We are happy to release the Sindhi Resource Grammar. It is 5th Indo-Iranian language added in the GF resource grammar library (Others are Urdu, Punjabi, Persian, and Nepali). The development took almost 6 months, and was developed as a Master thesis project. Sindhi belongs to the Indo-Aryan branch of the Indo-Iranian family. It is widely spoken in Pakistan and India. In Pakistan it is the official language of Sindh (province of Pakistan), and in India it one of the scheduled languages officially recognized by federal government of India. There are more than 22 million native Sindhi speakers. In Pakistan Sindhi is written in Perso-Arabic script from right-to-left, while in India it is written in Devanagari script from left-to-right. Rich morphology, verb-compounding, relatively free word-order are the major characteristics of Sindhi.

Courtesy: http://www.molto-project.eu/story/sindhi-resource-grammar-released