Center for Language Engineering (CLE)

Projects



Urdu Voice Enabled Assistive Technologies for Print Disabled Community of Pakistan

[On-going]

To improve Urdu TTS in the areas of Natural Language Processing (NLP) and speech synthesis.

To improve date formats that contain English slashes (/), commas (,), dashes (-), dots (.) and English names of months (e.g., March, Mar) will be…

Read More




Language Engineering Lab

[On-going]

  • Providing effective access to online and additional Urdu content.
  • Releasing Urdu language processing toolkit for accelerating research and development of Urdu language technology.
  • Delivering data analytics based on local users and local content for improving decision making for business…

    Read More




Local Language Speech Interfaces for Banking Sector of Pakistan

[On-going]

  • Design and develop a continuous speech corpus of 15 hours of read and spontaneous speech from multiple speakers.
  • Develop Urdu speech recognition system using state of the art machine learning techniques.
  • Design and develop advanced telephony and dialog framework
  • Read More




Automatic Pakistani Postal Address Recognition and Parcel Routing

[On-going]

  • Develop an android application to assist the user, to capture, enhance and validate the images containing addresses dispatched on the envelop
  • Develop address text area detection system by resolving skew, horizontal and vertical perspective distortion in camera captured images
  • Read More




Urdu Text to Speech: Integrating Prosody of Emotions

[On-going]

  • Develop emotional speech corpus for Urdu that is prosodically and syntactically annotated.
  • Develop linguistic resources to concretely represent such analyses, for further research and development in linguistics and computational work.
  • Conduct joint research and produce joint, collaborative publications with…

    Read More




Urdu Search Engine

[On-going]

  • To provide access patterns of our communities to commercial content development market
  • To strengthen industry-academia ties by providing solutions to Urdu language specific projects as well as the projects that demand larger distributed storage and computation infrastructure.
  • To initiate…

    Read More




Text to Speech for Urdu Understanding Intonation

[Completed]

  • Develop the capacity to do grammatical and semantic analyses of languages
  • Develop critical linguistic resources for Urdu to concretely represent such Analyses, for further use in linguistics, psycholinguistic and computational work
  • Develop a formal relationship between German and Pakistani…

    Read More




Translation of Websites

[Completed]

Translation and grammatical correction of the text, news, advertisement banners, sms msgs, email msgs and labels of the emirates post group EPG website and mobile application from English to Urdu language.

Read More




Investigating the Impact of OER on Secondary and tertiary Education in Pakistan

[Completed]

Ascertain the extent of OER use by secondary and tertiary students and teachers in Pakistan. Specific objectives of the project include:

  • Deepen understanding of the ways in which OER engagement influences teacher educators’ epistemological and pedagogical stance and to…

    Read More




Computerized Corpus of Persian texts along with their commentaries.

[Completed]

Keying and proofreading of Persian texts and their annotations.

Read More




Digital Dictionaries of South Asia

[Completed]

  • Make available the highest quality electronic dictionaries for South Asian languages as free public services via the Internet.
  • Encompasses South Asian languages of Pakistan: Kashmiri, Punjabi and Sindhi.

Read More




NDA Text Classification

[Completed]

  • Create a solution to apply natural language processing on NDA.
  • Create an application that allows users to create annotated NDA’s based on NDA golden rules.

Read More




Enabling Information Access through Mobile Based Dialog Systems and Screen Readers for Urdu (ASR)

[Completed]

Enabling Information Access through Mobile Based Dialog Systems and Screen Readers for Urdu project aims to equitable information access for the marginalized community in Pakistan, especially the non-literate, semi-literate and print impaired population for their socio-economic benefit. Specific objectives of…

Read More




Language Resources Production (Lexicon)

[Completed]

Production of a phonetic lexicon of Pashto

Read More




Urdu Nastalique Optical Character Recognition System

[Completed]

  • To develop and mature algorithms for analyzing and recognizing Urdu text images based on segmentation-based and ligature-based methods.
  • To develop automatic scaling algorithms for Urdu ligatures to make font size independent system.
  • To develop Urdu OCR for Nastalique style…

    Read More




Investigating the Long Term Residual Impact of ICT Integration across Gender for a Sustainable Project Design

[Completed]

  • To improve literacy level and overall education.
  • To investigate the impact of ICT at formal and non formal educational institutes by paying special attention to the gender.

Read More




Essential Linguistic Research Capacity and Resource Development for Urdu

[Completed]

  • Develop the capacity to do grammatical and semantic analyses of languages.
  • Develop critical linguistic resources for Urdu to concretely represent such analyses for further use in linguistics psycholinguistic and computational work.
  • Develop a formal relationship between German and Pakistani…

    Read More




Enabling Information Access for Rural Population through Urdu Dialog System

[Completed]

  • Undertake research and development of applications to provide access to relevant online content by Pakistani citizens using Urdu dialogue system with mobile phones, addressing the current literacy, language and connectivity barriers.
  • Develop research capacity in advanced speech and language…

    Read More




Pashto Translation Project

[Completed]

  • Develop speech database
  • Develop monolingual corpus for Pashto language

Read More




IDRC Research Chair in Multi Lingual Computing

[Completed]

  • To support, sustain and grow the online PAN Localization network for multilingual computing.
  • To formalize the PAN Localization network for sustained research collaboration through self or externally funded projects.
  • To explore commercial models for research and development at national…

    Read More




Online Torwali Dictionary

[Completed]

To resolve issues of orthography, documentation, and language preservation for future generations and promotes the community's identity and culture.

Read More




Subh-e-nau

[Completed]

Design and organize an intense training specifically customized for empowering women survivors of violence by equipping them with the necessary information communication technology (ICT) tools to voice their stories and for self healing.

Read More




Punjab Government's Flood Relief Website-Urdu Version

[On-going]

CLE in collaboration with Punjab Information Technology Board (PITB) has developed Urdu Language Version of Punjab Government's Flood Relief Website.

Read More




Asian Language Support on Mobile Platform

[On-going]

The project is researching to enable Asian languages on mobile platform. This minimally includes enabling complex Asian writing systems and input methods on this platform. Current mobile technology deploys bit map image based fonts for this purpose. However, over…

Read More




PAN Localization Project

[On-going]

This project is an initiative of International Development Research Center (IDRC). The Objective of this project is to build local language computing capacity in regional institutions of Asia.Phase II of PAN Localization project will research into challenges associated with…

Read More