CLDF dataset of Enggano word list from 1895 in Stokhof and Almanar’s (1987) Holle List

14/06/2023 Views : 272

Gede Primahadi Wijaya Rajeg

This is a digitised and processed word list of the 19th-century Enggano (collected in c1895 by Abs vd Noord). The word list appears in Stokhof and Almanar's (1987) as part of the collection of word lists in the Indonesian archipelago collected based on the reference word list (the Holle List) (Stokhof 1980). This Holle List contains around more than 1000 lexical items in Dutch and their English and Indonesian translations. The original version of the Enggano word list (Stokhof and Almanar 1987) is not matched with the Dutch, English, and Indonesian translations in the reference Holle List, but appears as a separate publication. However, the Enggano words in the list are provided with Index numbers that originally should be matched "manually" with the Index number of their Dutch, English and Indonesian translations in the reference Holle List. As part of the current project, we have digitised the reference Holle List (https://engganolang.github.io/digitised-holle-list/) as well as this Enggano word list to enable computational and automatic matching between the Enggano forms and their translations in Dutch, English, and Indonesian in the reference Holle List. This dataset also comes with the programmatic R codes to process the raw Enggano word list into a table and then match it with the corresponding translations in the reference Holle List. The Enggano word list in this dataset is conformant to the Cross-Linguistic Data Format (CLDF) specification and we provided the Python code to validate the CLDF specification of the dataset.


The dataset and programmatic codes are maintained and curated on GitHub (https://github.com/engganolang/holle-list-enggano-1895). This work is part of the AHRC-funded project (https://gtr.ukri.org/project/8AB0C3DC-F1C9-4CFA-BB4D-5BE748213372) on the lexical resources for Enggano, led by the Faculty of Linguistics, Philology and Phonetics at the University of Oxford, UK. Visit the central webpage of the Enggano project at https://enggano.ling-phil.ox.ac.uk/


Keywords:

Enggano, Indonesian Language, Endangered Language, Language Documentation, Digital Humanities, Cross-Linguistic Data Format, CLDF, Holle List, Word List