{"id":5800,"date":"2018-04-16T10:57:36","date_gmt":"2018-04-16T10:57:36","guid":{"rendered":"https:\/\/sadilar.org\/rma-integration\/"},"modified":"2018-04-16T10:57:36","modified_gmt":"2018-04-16T10:57:36","slug":"rma-integration","status":"publish","type":"post","link":"https:\/\/sadilar.org\/en\/rma-integration\/","title":{"rendered":"Integrating the RMA into SADiLaR with new technologies"},"content":{"rendered":"<div class=\"googlefontscall\"><\/div>\n<div class=\"rowck ckstack3 ckstack2 ckstack1 uick-sortable\" id=\"row_ID1524041935751\" data-gutter=\"2%\" data-nb=\"1\" style=\"position: relative;\">\n<style class=\"ckcolumnwidth\">[data-gutter=\"2%\"][data-nb=\"1\"]:not(.ckadvancedlayout) [data-width=\"100\"] {width:100%;}[data-gutter=\"2%\"][data-nb=\"1\"].ckadvancedlayout [data-width=\"100\"] {width:100%;}[data-gutter=\"2%\"][data-nb=\"1\"]:not(.ckadvancedlayout) .blockck:not(:first-child) {margin-left:2%;}<\/style>\n<div class=\"inner animate clearfix\">\n<div class=\"blockck\" id=\"block_ID1524041935751\" data-real-width=\"100%\" data-width=\"100\" style=\"position: relative;\">\n<div class=\"ckstyle\"><\/div>\n<div class=\"inner animate resizable\">\n<div class=\"innercontent uick-sortable\">\n<div id=\"ID1524041935767\" class=\"cktype\" data-type=\"text\" style=\"position: relative;\">\n<div class=\"ckstyle\"><\/div>\n<div class=\"cktext inner\" style=\"position: relative;\" spellcheck=\"false\">\n<p><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">Over the past five years the Language Resource Management Agency (RMA) has been the central repository for the distribution and management of language resources, data and software tools, for the official languages of South Africa. The RMA has provided an excellent foundation for SADiLaR to build on. With the knowledge and skills obtained from the RMA project, we are now ready to advance to a new phase, and integrate the RMA into an international platform that will have not only national, but also global impact.<\/span><\/p>\n<p><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">SADiLaR provides a platform to access linguistic data and reuse this data, while also offering researchers technologies and software to simplify linguistic analysis. The main distribution channel for resources will be a repository that allows interested parties to access any of the language resources distributed by SADiLaR. \u201cThe repository will also link to larger international infrastructures and language distribution agencies, such as the European Language Resource Association (ELRA)&nbsp;and CLARIN in Europe, and the Language Data Consortium (LDC)&nbsp;in the USA,\u201d says Dr Roald Eiselen, SADiLaR\u2019s technical manager.<\/span><\/p>\n<p><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">The move to an institutional repository for the distribution of language resources has primarily been done for the following four reasons:<\/span><\/p>\n<ol>\n<li><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">to simplify the access and download procedures for users by moving away from the \u201cshopping cart\u201d experience;<\/span><\/li>\n<li><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">to provide all resources with a digital object identifier (DOI), which is integrated into the international digital handle system;<\/span><\/li>\n<li><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">to allow easy integration of the data resources into other repositories and data infrastructures, such as CLARIN and LDC; and<\/span><\/li>\n<li><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">as a first step in the process of getting the \u201cdata seal of approval\u201d for SADiLaR, which will give the repository a more solid standing in the data distribution community.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">SADiLaR will also make available several research enabling technologies such as:<\/span><\/p>\n<ul>\n<li><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">metadata and data processing infrastructures that are specifically linked to particular projects;<\/span><\/li>\n<li><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">general language data analytic platforms made available online; and<\/span><\/li>\n<li><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">automatic language analysis modules that support the development of more complex language technologies.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">Although a substantial number of open-source technologies are reused and adapted to the South African context, several of the technologies and services that are being developed will be new technologies that will be distributed for further use by language communities both in Africa and around the world.<\/span><\/p>\n<p><span style=\"font-family: 'trebuchet ms', geneva, sans-serif;\" data-mce-style=\"font-family: 'trebuchet ms', geneva, sans-serif;\">Over the coming year, SADiLaR will expand the set of available language resources on an ongoing basis, while also extending the set of automatic analysis tools that are available via web interfaces. It is expected that these technologies will enable end-users to more easily analyse their own linguistic data, or search and analyse the data available from SADiLaR.<\/span><\/p>\n<\/div>\n<p><input type=\"hidden\" name=\"mce_0\">\n\t\t<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"ckstyle\"><\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Over the past five years the Language Resource Management Agency (RMA) has been the central repository for the distribution and management of language resources, data and software tools, for the official languages of South Africa. The RMA has provided an excellent foundation for SADiLaR to build on. With the knowledge and skills obtained from the [&hellip;]<\/p>\n","protected":false},"author":246,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[733],"tags":[],"class_list":["post-5800","post","type-post","status-publish","format-standard","hentry","category-news"],"acf":[],"_links":{"self":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts\/5800","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/users\/246"}],"replies":[{"embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/comments?post=5800"}],"version-history":[{"count":0,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/posts\/5800\/revisions"}],"wp:attachment":[{"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/media?parent=5800"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/categories?post=5800"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sadilar.org\/en\/wp-json\/wp\/v2\/tags?post=5800"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}