October 2022 – We launch a Nyingarn Project poster illustrated by Allyra Murray.
Early manuscripts contain wordlists and language information. These manuscripts are dispersed across Australia and are often held off-country. Being far away from the communities presents several challenges, and community access to their information is one of them. The manuscripts are also written in cursive handwriting, which can be difficult to read. The Nyingarn project is working to locate and access these manuscripts. Following protocols developed in line with community wishes, manuscript images are transcribed, converted into text format and stored in one place. This process makes manuscripts searchable, more ‘user-friendly’ and more easily accessible to communities. Nyingarn is an important resource for language revitalisation, maintenance and sustainability.
July 2022 – Our team continues to test the Nyingarn Workspace. We can ingest manuscript images for automated optical character recognition (OCR), and join manuscript images to their corresponding transcriptions created in DigiVol or FromThePage. The image below shows the Nyingarn Workspace user view: manuscript image on the left and transcription text on the right.
Our latest work allows existing manuscript transcriptions created in Word format by our CIs to be converted into tei.xml and added to the Workspace. The Nyingarn Project is working with communities and our partner institutions to gather early language manuscripts and make them accessible in one place.
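To give a sense of what a conversion into tei.xml can involve, here is a minimal sketch in Python. It is not the Nyingarn Workspace's own converter: the function name `pages_to_tei`, the file names and the sample words are all hypothetical, and the output has not been checked against the project's schema. It assumes the text of each page has already been extracted from the Word document, and it simply wraps each page in a small TEI document, with a page break (`pb`) pointing at the matching manuscript image.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"


def _q(tag):
    """Return the tag name qualified with the TEI namespace."""
    return f"{{{TEI_NS}}}{tag}"


def pages_to_tei(title, pages):
    """Build a small TEI document as a string.

    `pages` is a list of (image_filename, transcription_text) pairs.
    This is an illustrative helper, not part of any Nyingarn tool.
    """
    ET.register_namespace("", TEI_NS)
    tei = ET.Element(_q("TEI"))

    # Minimal header: a title plus placeholder publication and source notes.
    header = ET.SubElement(tei, _q("teiHeader"))
    file_desc = ET.SubElement(header, _q("fileDesc"))
    title_stmt = ET.SubElement(file_desc, _q("titleStmt"))
    ET.SubElement(title_stmt, _q("title")).text = title
    pub_stmt = ET.SubElement(file_desc, _q("publicationStmt"))
    ET.SubElement(pub_stmt, _q("p")).text = "Unpublished working transcription."
    source_desc = ET.SubElement(file_desc, _q("sourceDesc"))
    ET.SubElement(source_desc, _q("p")).text = "Transcribed from manuscript images."

    body = ET.SubElement(ET.SubElement(tei, _q("text")), _q("body"))
    for image_filename, transcription in pages:
        # One page break per page, linked to its image through `facs`.
        ET.SubElement(body, _q("pb"), {"facs": image_filename})
        # Each non-blank line of the transcription becomes a paragraph.
        for line in transcription.splitlines():
            if line.strip():
                ET.SubElement(body, _q("p")).text = line.strip()

    return ET.tostring(tei, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical file names and placeholder text, for illustration only.
    sample_pages = [
        ("manuscript-001.jpg", "first entry\nsecond entry"),
        ("manuscript-002.jpg", "third entry"),
    ]
    print(pages_to_tei("Sample wordlist", sample_pages))
```

Keeping the image file name on each page break is what allows a viewer to show the manuscript image beside its transcription, as in the Workspace user view described above.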
March 2022 – The Nyingarn Platform workspace has been built, and the Project Team are now testing ways to convert images of manuscripts to text using either Optical Character Recognition (OCR) or crowdsourced transcription platforms. We have tested two OCR systems, with differing results for typed and handwritten manuscripts.
Manuscripts with existing permissions have also been transcribed through the online crowdsourcing platform, DigiVol, hosted by the Australian Museum. We have created manuscript transcription ‘expeditions’ in DigiVol that allow us to refine transcribing guides for volunteers and then export this language data into our workspace. To date, expeditions of 20-40 pages have been transcribed by volunteers within days. At Nyingarn, we will continue refining these transcription techniques as more manuscripts in state and national institutions are identified by CIs and Steering Committee members.
Registered users can access the workspace