Latest News

The one-day SWiP event comprised an exciting panel discussion by experts in the fields of preservation of languages, culture, and digitisation of information, and an introductory mini workshop on Wikipedia focusing on editing, translating, and making content available online.

Resource guidelines

Our repository functions as a digital storage space for a range of language resources and is available open access. The repository is made up of three parts: The Index contains metadata for all assets in the repository, those that can be downloaded and those that can be requested by the person listed as the contact…
Guidelines

SADiLaR, as a research infrastructure, drives technological innovation in the fields of humanities and social sciences. We do this through the creation, management and distribution of digital language resources which are freely available through our repository. We encourage all those who work in, or have an interest in, language technologies to access and use our…
Spelling checkers for South African languages

SADiLaR, in collaboration with the Centre for Text Technology (CTexT®), is pleased to offer free spelling checkers and hyphenators for ten of South Africa’s official languages. The installer can be downloaded free of charge and is designed for use with Microsoft® Office for Windows. Installation is in English and limited support is available. Languages offered include:…
Third workshop on Resources for African Indigenous Languages (RAIL)

Registration is now open. The registration form can be found here: https://forms.gle/58KzRxGtMGQvDZt3A
The South African Centre for Digital Language Resources (SADiLaR) is organising the 3rd RAIL workshop in the field of Resources for African Indigenous Languages. This workshop aims to bring together researchers who are interested in showcasing their research and thereby boosting the field…
Development of a multi-level, multi-genre learner corpus academic writing

Project Type: Node
Project Start Date: 1 March 2017
Project Status: Completed
Project Aims:
Development of a multi-genre, multi-level learner corpus of academic writing in order to develop, refine and implement an online academic writing tool
Collect additional data from various universities in South Africa to grow the multi-level, multi-genre learner corpus of academic writing
Redevelopment…
Enabling localised language technology applications: A Computational Wide coverage resource grammar for isiZulu

Project Type: Node
Project Start Date: 1 April 2020
Project Status: Completed
Project Aims: The CSIR node of SADiLaR recently completed a project whose main aim was to deliver to the research community a high-quality, computational, wide coverage resource grammar (WCRG) for isiZulu. WCRGs unlock opportunities for the South African languages to participate in multilingual research,…
Digitisation of Language resources

Project Type: Node
Project Start Date: 1 April 2017
Project Status: Ongoing
Project Aims and Motivation: The UP digitisation node focuses on the preservation of invaluable language (and cultural) resources for the African languages by digitising textual, video and audio material, and providing language communities with access to these digital resources via the SADiLaR repository. Digitised content delivered by…
Linguistic corpus enrichment for conjunctively written South African languages

Project Type: Node
Start Date: 1 October 2017
Project Status: Completed and delivered
Project Aims: This project, developed under the Nodes Specialisation Project, makes linguistically enriched corpora available for the four official South African languages with a conjunctive orthography, i.e. isiNdebele, isiXhosa, isiZulu, and Siswati. The parallel corpora consist of approximately 50,000 tokens each, aligned…
Mobile Dictionary application framework

Project Type: Node
Project Start Date: 1 August 2020
Project Status: In Progress
Project Aims: The project aims to develop an open-source hybrid mobile application framework that will allow for online access to a TMS and dictionary API, managed through a TMS API manager (TAM) and offline access to local dictionary content. The framework will create a…
