Latest News

Latest News

The one-day SWiP event comprised an exciting panel discussion by experts in the fields of preservation of languages, culture, and digitisation of information, and an introductory mini workshop to Wikipedia focusing on editing, translating, and making content available online.

The one-day SWiP event comprised an exciting panel discussion by experts in the fields of preservation of languages, culture, and digitisation of information, and an introductory mini workshop to Wikipedia focusing on editing, translating, and making content available online.

Read More

  • Resource guidelines

    Our repository functions as a digital storage space for a range of language resources and is available open access. The repository is made up of three parts: The Index contains metadata for all assets in the repository, those that can be downloaded and those that can be requested by the person listed as the contact…

  • Guidelines

    SADiLaR, as a research infrastructure, drives technological innovation in the fields of humanities and social sciences. We do this through the creation, management and distribution of digital language resources which are freely available through our repository. We encourage all those who work in, or have an interest in, language technologies to access and use our…

  • Spelling checkers for South African languages

    SADiLaR, in collaboration with the Centre for Text Technology (CTexT®), is pleased to offer free spelling checkers and hyphenators for ten of South Africa’s official languages. The installer can be downloaded free of charge and is designed for use with Microsoft ® Office for Windows. Installation is in English and limited support is available.   Languages offered include:…

  • Third workshop on Resources for African Indigenous Languages (RAIL)

    Registration is now open The registration form can be found here: https://forms.gle/58KzRxGtMGQvDZt3A The South African Centre for Digital Language Resources (SADiLaR) is organising the 3rd RAIL workshop in the field of Resources for African Indigenous Languages. This workshop aims to bring together researchers who are interested in showcasing their research and thereby boosting the field…

  • Enabling localised language technology applications: A Computational Wide coverage resource grammar for isiZulu

    Project Type: NodeProject Start Date: 1 April 2020Project Status: Completed  Project Aims: The CSIR node of SADiLaR recently completed a project with as its main aim to deliver to the research community a high-quality, computational, wide coverage resource grammar (WCRG) for isiZulu.  WCRGs unlock opportunities for the South African languages to participate in multilingual research,…

  • Digitisation of Language resources

    Project Type: NodeProject Start Date: 1 April 2017Project Status: Ongoing Project Aims and Motivation: The UP digitisation node focuses on the preservation of invaluable language (and cultural) resources for the African languages by digitising textual, video and audio material, and providing language communities with access to these digital resources via the SADiLaR repository. Digitised content delivered by…

  • Linguistic corpus enrichment for conjunctively written South African languages

    Project Type: NodeStart Date: 1 October 2017Project Status: Completed and delivered   Project Aims: This project was developed under the Nodes Specialisation Project, makes linguistically enriched corpora available for the four official South African languages with a conjunctive orthography, i.e. isiNdebele, isiXhosa, isiZulu, and Siswati. The parallel corpora consist of approximately 50,000 tokens each, aligned…

  • Mobile Dictionary application framework

    Project Type: NodeProject Start Date: 1 August 2020Project Status: In Progress  Project Aims: The project aims to develop an open-source hybrid mobile application framework that will allow for online access to a TMS and dictionary API, managed through a TMS API manager (TAM) and offline access to local dictionary content. The framework will create a…

  • Parallel corpora for English-isiXhosa and English-Siswati

    Project Type: NodeProject Start Date: 1 July 2019Project Status: Completed and delivered English-Siswati corpus Project Aims: This project entailed the collection and processing of bilingual data to develop a 2-million-word English–Siswati parallel-aligned corpus that can be used to train machine translation systems. The data was acquired by crawling various South African web domains and human…