{"id":12035,"date":"2025-12-02T10:49:14","date_gmt":"2025-12-02T08:49:14","guid":{"rendered":"https:\/\/sadilar.org\/?p=12035"},"modified":"2026-04-15T08:58:25","modified_gmt":"2026-04-15T06:58:25","slug":"seventh-workshop-on-resources-for-african-indigenous-languages-rail-2026","status":"publish","type":"post","link":"https:\/\/sadilar.org\/en\/seventh-workshop-on-resources-for-african-indigenous-languages-rail-2026\/","title":{"rendered":"Seventh Workshop on Resources for African Indigenous Languages (RAIL) 2026"},"content":{"rendered":"\n<figure class=\"wp-block-image size-full\"><img fetchpriority=\"high\" decoding=\"async\" width=\"480\" height=\"270\" src=\"https:\/\/sadilar.org\/wp-content\/uploads\/2025\/12\/rail_2026.gif\" alt=\"\" class=\"wp-image-12103\"\/><\/figure>\n\n\n\n<p>Co-located with <a href=\"https:\/\/www.elra.info\/lrec2026\">LREC 2026<\/a><br>RAIL Workshop date: 12 May 2026<br>RAIL website: <a href=\"https:\/\/sadilar.org\/en\/seventh-workshop-on-resources-for-african-indigenous-languages-rail-2026\/\">https:\/\/sadilar.org\/en\/seventh-workshop-on-resources-for-african-indigenous-languages-rail-2026\/<\/a><br>Submission link for the RAIL workshop: <a href=\"https:\/\/softconf.com\/lrec2026\/RAIL2026\/\">https:\/\/softconf.com\/lrec2026\/RAIL2026\/<\/a><br>LREC Conference dates: 11-16 May 2026<br>LREC website: <a href=\"https:\/\/www.elra.info\/lrec2026\/\">https:\/\/www.elra.info\/lrec2026\/<\/a><br>Venue: Palau de Congressos de Palma, Palma de Mallorca (Spain)<\/p>\n\n\n\n<p>The Resources for African Indigenous Languages (RAIL) workshop provides an interdisciplinary platform for researchers working on resources such as data collections and annotations, Human Language Technologies (HLT) and Natural Language Processing (NLP) tools, and their applications, specifically targeted towards African indigenous languages. In particular, it aims to create the conditions for the emergence of a scientific community of practice that focuses on data, as well as computational linguistic tools specifically designed for or applied to indigenous languages found in Africa. The seventh Resources for African Indigenous Languages (RAIL) workshop will be co-located with the Language Resources and Evaluation Conference (LREC) 2026 in Palau de Congressos de Palma, Palma, Mallorca (Spain).<\/p>\n\n\n\n<p>Many African languages are under-resourced while only a few are considered to be somewhat better resourced. These languages often share interesting properties such as writing systems, making them different from most high-resourced languages. From a computational perspective, these languages lack enough corpora to undertake high level development of NLP and HLT tools, which in turn impedes the development of African languages in these areas. During previous workshops, it was noted that the problems and solutions presented were not only applicable to African languages but were also relevant to many other low-resource languages across the world. Because these languages share similar challenges, this workshop provides researchers with opportunities to work collaboratively on issues of language resource development and learn from each other.<\/p>\n\n\n\n<p>The RAIL workshop has several aims. First, the workshop brings together researchers who work on African indigenous languages, forming a community of practice for people working on indigenous languages. Second, the workshop aims to reveal currently unknown or unpublished existing resources (corpora, NLP tools, and applications), resulting in a better overview of the current state-of-the-art, and also allows for discussions on novel, desired resources for future research in this area. Third, it enhances sharing of knowledge on the development of low-resource languages. Finally, it enables discussions on how to improve the quality as well as availability of the resources.<\/p>\n\n\n\n<p>The workshop theme is \u201cCreating resources for less-resourced African languages\u201d, but submissions on any topic related to properties of African indigenous languages (including related non-African languages) may be accepted. Suggested topics include (but are not limited to) the following:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Digital representations of linguistic structures<\/li>\n\n\n\n<li>Descriptions of corpora or other data sets of African indigenous languages<\/li>\n\n\n\n<li>Building resources for (under-resourced) African indigenous languages<\/li>\n\n\n\n<li>Developing and using African indigenous languages in the digital age<\/li>\n\n\n\n<li>Effectiveness of digital technologies for the development of African indigenous languages<\/li>\n\n\n\n<li>Revealing unknown or unpublished existing resources for African indigenous languages<\/li>\n\n\n\n<li>Developing desired resources for African indigenous languages<\/li>\n\n\n\n<li>Improving quality, availability and accessibility of African indigenous language resources<\/li>\n\n\n\n<li>Applications that make use of data collections of African indigenous languages<\/li>\n<\/ul>\n\n\n\n<p><strong>Submission requirements:<\/strong><br>We invite papers on original, unpublished work related to the topics of the workshop. Submissions, presenting completed work, should adhere to the LREC conference requirements. These requirements are described in LREC\u2019s authors kit: <a href=\"https:\/\/lrec2026.info\/authors-kit\/\">https:\/\/lrec2026.info\/authors-kit\/<\/a>. The submission should be double blind and each submission should be between four and eight pages. Only oral papers should be submitted. The maximum number of pages excludes a compulsory ethics statement, discussion on limitations, and references and optional acknowledgements, as well as data and code availability statements if applicable. Appendices or supplementary material are allowed, but this information will not necessarily be taken into account during the review process.<\/p>\n\n\n\n<p>The submission link for the RAIL workshop: <a href=\"https:\/\/softconf.com\/lrec2026\/RAIL2026\/\">https:\/\/softconf.com\/lrec2026\/RAIL2026\/<\/a><\/p>\n\n\n\n<p>Authors are encouraged to upload their datasets to the SADiLaR repository: <a href=\"https:\/\/repo.sadilar.org\/\">https:\/\/repo.sadilar.org\/<\/a>. In case of difficulties uploading the datasets, please reach out to Benito Trollip (benito.trollip@nwu.ac.za).<\/p>\n\n\n\n<p><strong>Important dates:<\/strong><br>Submission deadline: 1 March 2026 AoE<br>Date of notification: 18 March 2026 AoE<br>Camera ready copy deadline: 30 March 2026 AoE<br>Workshop: 12 May 2026<\/p>\n\n\n\n<p><strong>Organising Committee:<\/strong><br>Muzi Matfunjwa, South African Centre for Digital Language Resources (SADiLaR), South Africa<br>Mmasibidi Setaka, South African Centre for Digital Language Resources (SADiLaR), South Africa<br>Rooweither Mabuya, South African Centre for Digital Language Resources (SADiLaR), South Africa<br>Menno van Zaanen, South African Centre for Digital Language Resources (SADiLaR), South Africa<\/p>\n\n\n\n<p>Note that the RAIL workshop is part of a series of workshops. You can find information on the other workshops at <a href=\"https:\/\/sadilar.org\/en\/rail\/\">https:\/\/sadilar.org\/en\/rail\/<\/a>.<\/p>\n\n\n\n\t<h2 id=\"program\">RAIL 2026: Program<\/h2>\n\n    <table>\n      <tbody><!-- DAY Tuesday, May 12, 2026 -->\n\n<tr bgcolor=\"#707070\">\n  <td colspan=\"2\">\n    <font color=\"#e9e9e9\">\n      <b>Tuesday, May 12, 2026<\/b>\n    <\/font>\n  <\/td>\n<\/tr><tr bgcolor=\"#d0d0d0\">\n\t<td valign=top>\n\t  14:00  &#8211; 18:00 &nbsp;&nbsp;\n\t<\/td>\n\t<td valign=top>\n\t  Session RAIL &#8211; \n\t  <b>The Seventh Workshop on Resources for African Indigenous Languages 2026<\/b>\n\t   &#8211; Room #2\n\t  \n\t  \n\n\t  \n\t  <br>Chair:  Muzi Matfunjwa, South African Center for Digital Language Resources\n\t  \n\t  \n\t  \n\t  \n\n\t  \n\t  <br>Co-Chair:  Mmasibidi Setaka and Menno van Zaanen, South African Center for Digital Language Resources\n\t  \n\t  \n\t  \n\t  \n\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#a0a0a0\">\n\t<td>\n\t  14:00  &#8211; 14:15 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  \n\t  Opening\n\n\t  \n\t  <br><i>Menno van Zaanen<\/i>\n\t  \n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  14:15 &#8211; 14:30 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>A Morpho-Syntactically Annotated Corpus of \u00d2g\u00e8 Folk Narratives with a Focus on Nominal Structure<\/b>\n\t  <br>\n\t  <em>Priscilla Adenuga<\/em><br>\n\t  Independent Researcher<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  14:30 &#8211; 14:45 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Extension of Linguistic Resources for South African Languages: Part-of-Speech Annotated Domain-Specific Data<\/b>\n\t  <br>\n\t  <em>Tanja Gaustad<sup>1<\/sup>,&nbsp;Roald Eiselen<sup>2<\/sup>,&nbsp;Cindy Arlene McKellar<sup>3<\/sup><\/em><br>\n\t  <sup>1<\/sup>Centre for Text Technology (CTexT), North-West University, <sup>2<\/sup>Centre for Text Technology, North-West University, <sup>3<\/sup>Centre for Text Technology, North-West University, Potchefstroom Campus<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  14:45 &#8211; 15:00 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Mining Large Language Models for Low-Resource Language Data: Comparing Elicitation Strategies for Hausa and Fongbe<\/b>\n\t  <br>\n\t  <em>Pericles Adjovi<sup>1<\/sup>,&nbsp;Prasenjit Mitra<sup>1<\/sup>,&nbsp;Roald Eiselen<sup>2<\/sup><\/em><br>\n\t  <sup>1<\/sup>Carnegie Mellon University Africa, <sup>2<\/sup>Northwestern University<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  15:00 &#8211; 15:15 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Comparing Source Language Selection Strategies for Multi-Source Cross-Lingual Transfer to African Languages<\/b>\n\t  <br>\n\t  <em>Tewodros Kederalah Idris<sup>1<\/sup>,&nbsp;Prasenjit Mitra<sup>2<\/sup>,&nbsp;Roald Eiselen<sup>3<\/sup><\/em><br>\n\t  <sup>1<\/sup>Carnegie Mellon University Africa, <sup>2<\/sup>Leibniz University of Hannover, <sup>3<\/sup>Centre for Text Technology, North-West University<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  15:15 &#8211; 15:30 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Benchmarking text embedding models for South African languages<\/b>\n\t  <br>\n\t  <em>Ockert de Villiers and Roald Eiselen<\/em><br>\n\t  Centre for Text Technology, North-West University<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  15:30 &#8211; 15:45 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Improving Amharic Information Retrieval with Translative and Multi-Agent Debate Retrieval Augmented Generation<\/b>\n\t  <br>\n\t  <em>Abel Alemu Jotie and Prasenjit Mitra<\/em><br>\n\t  Carnegie Mellon University Africa<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  15:45 &#8211; 16:00 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Less can be More: Towards a Parameter-Efficient Fine-Tuning of Wav2Vec2 XLSR for Low-Resource Cape Verdean Creole ASR<\/b>\n\t  <br>\n\t  <em>Mateus Neves Andrade<sup>1<\/sup>,&nbsp;Mouhamadou Lamine BA<sup>2<\/sup>,&nbsp;Idy Diop<sup>2<\/sup>,&nbsp;Arlindo Oliveira da Veiga<sup>3<\/sup><\/em><br>\n\t  <sup>1<\/sup>University of Cape Verde, <sup>2<\/sup>Uiversit\u00e9 Cheikh Anta Diop, Senegal, <sup>3<\/sup>University of Cape Verde, Cape Verde<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#a0a0a0\">\n\t<td>\n\t  16:00  &#8211; 16:30 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  \n\t  Afternoon Coffee Break\n\n\t  \n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  16:30 &#8211; 16:45 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>From Script to Semantics: Prompting Strategies for African NLI<\/b>\n\t  <br>\n\t  <em>Anuj Tiwari<sup>1<\/sup>,&nbsp;Terry Oko-odion<sup>2<\/sup>,&nbsp;Hannah Nwokocha<sup>2<\/sup><\/em><br>\n\t  <sup>1<\/sup>Noida Institute of Engineering and Technology, <sup>2<\/sup>ML Collective<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  16:45 &#8211; 17:00 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>HaYo: Repurposing DiaSafety Dataset for Dialogue Safety Evaluation in Hausa and Yoruba<\/b>\n\t  <br>\n\t  <em>Tunde Oluwaseyi Ajayi<sup>1<\/sup>,&nbsp;Bolade Deborah Ashaolu<sup>2<\/sup>,&nbsp;Falalu Ibrahim Lawan<sup>3<\/sup>,&nbsp;Daud Olamide Abolade<sup>4<\/sup>,&nbsp;Amina Imam Abubakar<sup>5<\/sup>,&nbsp;Oluwatosin Ayomide Akinrinde<sup>6<\/sup>,&nbsp;Murja Sani Gadanya<sup>7<\/sup>,&nbsp;Omodolapo Dorcas Ashaolu<sup>4<\/sup>,&nbsp;Abubakar Khalid Auwal<sup>7<\/sup>,&nbsp;Adewumi Awujoola<sup>2<\/sup>,&nbsp;Shamsuddeen Umaru Adamu<sup>8<\/sup>,&nbsp;Israel Olawole Ashaolu<sup>2<\/sup>,&nbsp;Mihael Arcan<sup>9<\/sup>,&nbsp;Paul Buitelaar<sup>10<\/sup><\/em><br>\n\t  <sup>1<\/sup>Insight Research Ireland Centre for Data Analytics, Data Science Institute, University of Galway, <sup>2<\/sup>University of Ilorin, <sup>3<\/sup>Federal University of Technology Babura, <sup>4<\/sup>Masakhane, <sup>5<\/sup>University of Abuja, <sup>6<\/sup>Ladoke Akintola University of  Technology at Ogbomosho, <sup>7<\/sup>Bayero University Kano, <sup>8<\/sup>Kaduna State University, <sup>9<\/sup>Lua Health, <sup>10<\/sup>University of Galway<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  17:00 &#8211; 17:15 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Reclaiming African Voices: Surveying Indigenous Writing Systems for Inclusive NLP<\/b>\n\t  <br>\n\t  <em>Mamady Traore<sup>1<\/sup>,&nbsp;Ngoc Tan Le<sup>2<\/sup>,&nbsp;Fatiha Sadat<sup>1<\/sup><\/em><br>\n\t  <sup>1<\/sup>UQAM, <sup>2<\/sup>Universite du Quebec a Montreal<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  17:15 &#8211; 17:30 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Getting Close to Cloze: Towards Readability Resources for Afrikaans<\/b>\n\t  <br>\n\t  <em>Susan Lotz,&nbsp;Rik van Noord,&nbsp;Gertjan van Noord<\/em><br>\n\t  University of Groningen<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  17:30 &#8211; 17:45 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>The Hundzula Retreat-Based Infrastructure Model for African Natural Language Processing<\/b>\n\t  <br>\n\t  <em>Johannes Sibeko<sup>1<\/sup>,&nbsp;Seani Rananga<sup>2<\/sup>,&nbsp;Neo N Putini<sup>3<\/sup>,&nbsp;Dan Masethe<sup>4<\/sup><\/em><br>\n\t  <sup>1<\/sup>Nelson Mandela University, <sup>2<\/sup>University of Pretoria, <sup>3<\/sup>University of KwaZulu-Natal, <sup>4<\/sup>Tshwane University of Technology<\/a>\n\t<\/td>\n\n<\/tr><tr bgcolor=\"#ededed\">\n\t<td nowrap valign=top>\n\t  17:45 &#8211; 18:00 &nbsp;&nbsp;\n\t<\/td>\n\t<td>\n\t  Session RAIL &#8211; \n\t    \n\t    <b>Open but Incompatible: A License Compatibility Analysis of Corpora for Low-Resource African Languages<\/b>\n\t  <br>\n\t  <em>Ernst A.P. van Gassen<\/em><br>\n\t  Arktos Applied Intelligence<\/a>\n\t<\/td>\n  \n<\/tr><\/tbody>\n<\/table>\n\n","protected":false},"excerpt":{"rendered":"<p>Co-located with LREC 2026RAIL Workshop date: 12 May 2026RAIL website: https:\/\/sadilar.org\/en\/seventh-workshop-on-resources-for-african-indigenous-languages-rail-2026\/Submission link for the RAIL workshop: https:\/\/softconf.com\/lrec2026\/RAIL2026\/LREC Conference dates: 11-16 May 2026LREC website: https:\/\/www.elra.info\/lrec2026\/Venue: Palau de Congressos de Palma, Palma de Mallorca (Spain) The Resources for African Indigenous Languages (RAIL) workshop provides an interdisciplinary platform for researchers working on resources such as data collections and [&hellip;]<\/p>\n","protected":false},"author":291,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[881],"tags":[],"class_list":["post-12035","post","type-post","status-publish","format-standard","hentry","category-uncategorized-en"],"acf":[],"_links":{"self":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts\/12035","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/users\/291"}],"replies":[{"embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/comments?post=12035"}],"version-history":[{"count":14,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts\/12035\/revisions"}],"predecessor-version":[{"id":12333,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts\/12035\/revisions\/12333"}],"wp:attachment":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/media?parent=12035"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/categories?post=12035"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/tags?post=12035"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}