{"id":6294,"date":"2020-05-16T17:51:07","date_gmt":"2020-05-16T17:51:07","guid":{"rendered":"https:\/\/sadilar.org\/ctext-node\/"},"modified":"2023-12-12T13:46:16","modified_gmt":"2023-12-12T13:46:16","slug":"ctext-node","status":"publish","type":"post","link":"https:\/\/sadilar.org\/en\/ctext-node\/","title":{"rendered":"NWU Centre for Text Technology: Text Node"},"content":{"rendered":"\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe title=\"SADiLaR: CTexT\u00ae Node\" width=\"800\" height=\"450\" data-src=\"https:\/\/www.youtube.com\/embed\/4UMLUlec9XI?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" allowfullscreen src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" class=\"lazyload\" data-load-mode=\"1\"><\/iframe>\n<\/div><\/figure>\n\n\n\n<p>Technology offers an opportunity for South Africans to cross the many language barriers that exist in a country with 11 official languages, and to communicate more effectively with one another. The Centre of Text Technology (CTexT\u00ae) focuses on the research and development of Human Language Technology (HLT) within the South African context. Their goal is to improve the interaction between human and computer by developing modern technological programs and applications. This in turn can improve the communication between people of different language backgrounds.<\/p>\n\n\n\n<p>Based at the North-West University, Potchefstroom Campus, CTexT\u00ae combines research expertise and the essential technical and administrative support for expanding the much-needed resources for HLT. They conduct cutting-edge research in text technology and use that as the basis for the development of innovative and relevant technological applications for resource-scarce languages.<\/p>\n\n\n\n<p>As the official text node of SADiLaR, CTexT\u00ae focuses on the advancement of multilingualism and building indigenous South African languages. For under-resourced languages, text data isn\u2019t as available as it is for English, for example, and yet this is needed if new technologies within big data and artificial intelligence (AI), responsive to the unique South African context, are to be developed.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Key projects<\/h2>\n\n\n\n<p><strong>Linguistic corpus enrichment project<\/strong><\/p>\n\n\n\n<p>In this long-term project CTexT\u00ae sources, collects, and processes resources which include corpora, i.e., larger collections of texts, and develops core technologies, such as&nbsp;<a href=\"https:\/\/www.languagehumanities.org\/what-is-morphological-analysis.htm#:~:text=Morphological%20analysis%20refers%20to%20the,the%20meaningful%20parts%20contained%20within\" target=\"_blank\" rel=\"noopener noreferrer\">morphological analysers<\/a>&nbsp;or part-of-speech taggers, which make up the building blocks for language technologies for these indigenous languages.<\/p>\n\n\n\n<p><strong><a href=\"https:\/\/mt.nwu.ac.za\/\" target=\"_blank\" rel=\"noopener noreferrer\">Autshumato project<\/a><\/strong><\/p>\n\n\n\n<p>In our other key long-term project, we focus on parallel corpus development for use in machine translation systems. This series of projects which fall under the Autshumato umbrella funded by SADiLaR and the Department of Sports, Arts and Culture, has provided easy-to-use, open-source technologies that simplify the translation process, promote terminology standardisation and shorten translation time.&nbsp;<a href=\"https:\/\/literator.org.za\/index.php\/literator\/article\/view\/1766\/3532\" target=\"_blank\" rel=\"noopener noreferrer\">This provides the public with improved access to information in their mother-tongue and aids effective public service delivery<\/a>.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Other key technologies<\/h2>\n\n\n\n<p>Examples of the noteworthy technologies that CTexT\u00ae has developed include spelling checkers for 10 official SA languages (excluding English), automatic machine translation systems, various machine learning-based core technologies, and collections of text data and tools (i.e. technology for optical character recognition, identification of languages or parts of speech etc.) for future development.<\/p>\n\n\n\n<p><a href=\"https:\/\/humanities.nwu.ac.za\/ctext\" target=\"_blank\" rel=\"noopener noreferrer\">For more information visit the CTeXT website<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Technology offers an opportunity for South Africans to cross the many language barriers that exist in a country with 11 official languages, and to communicate more effectively with one another. The Centre of Text Technology (CTexT\u00ae) focuses on the research and development of Human Language Technology (HLT) within the South African context. Their goal is [&hellip;]<\/p>\n","protected":false},"author":246,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[730],"tags":[],"class_list":["post-6294","post","type-post","status-publish","format-standard","hentry","category-general"],"acf":[],"_links":{"self":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts\/6294","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/users\/246"}],"replies":[{"embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/comments?post=6294"}],"version-history":[{"count":7,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts\/6294\/revisions"}],"predecessor-version":[{"id":8573,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts\/6294\/revisions\/8573"}],"wp:attachment":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/media?parent=6294"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/categories?post=6294"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/tags?post=6294"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}