About the CHIRILA database

Welcome to CHIRILA, a database of the languages of Australia.

What does CHIRILA stand for?

CHIRILA stands for Contemporary and Historical Reconstruction in the Indigenous Languages of Australia.

Why that name?

The name is based on the word tyirilya, a widespread term for ‘echidna’ in the languages of Australia’s Western Desert Region. We wanted an acronym that describes the database, that relates to the languages we are working with, and that is pronounceable for English speakers.

What’s in the CHIRILA database?

The database is a lexical database (a database with words from different languages). Currently there are about 780,000 words, from all over Australia. The publicly available portion of the database, at present, is considerably smaller, representing the materials that we have received explicit permission to release.

We have a map of data holdings.

I am including published language materials, and other materials which are freely available (for example, on unrestricted view through the archive at AIATSIS in Canberra). As part of doing this, I have been looking at old sources (that is, materials published in the 19th century), other published materials, and fieldnotes. My students and I have been typing these materials into the computer and building a database program to hold them.

How will this database be useful to Aboriginal people?

There are a couple of ways that this project might be useful to Aboriginal and Torres Strait Islander groups. If you are working on reviving your language and looking for materials, there might be some words from your language in the database. If your language was mostly written down a while ago, sometimes it can be hard to know exactly how words should be said. Sometimes we can work out what the right pronunciation is through looking at other languages. The database is digital, which means it is easy to produce word lists for individual languages. My team and I can help in making language materials like wordlist books, which could be the start of a dictionary. Please drop us a line if we can help with your program.

Can I download data from the database?

Yes! You can download portions of the database: those lists for which we have received permission from the authors and/or communities to distribute data. At present, that is a bit under 20% of the total data holdings. I expect that to change over the next year or so, as we will be continuing to contact people and to prepare data for future releases. Please also see the acknowledgments.

How do I cite the data?

See citing CHIRILA data.

Contributing to CHIRILA

We accept data contributions of all types relating to the languages of Australia. You can upload files directly to our site. The more data we have in the database, the more generally useful it will be. We would like, eventually, to have a database that contains words from all the Aboriginal and Torres Strait Islander languages in Australia. There are some parts of the country where we don’t have much information at the moment. These include:

  • Cape York Peninsula
  • Western Arnhem Land and the Central Top End
  • Southwest Western Australia

We’d be particularly interested in getting in contact with researchers and communities from these regions.

Why can’t I see data for my language?

The first release of the data contains about 150,000 records. This is less than 20% of the data in the full database. So, if there isn’t any material for your language in CHIRILA yet, there might be several reasons:
  • It might be included in the Phase II release (in mid-2016); languages in this group currently include Dieri (Diyari), Iwaidja, Kurtjar and Duuŋidjawu.
  • I don’t have any data for that language (yet) in the database. This is particularly likely for languages of the Cape York and Central Arnhem Land regions. If you know of a data source that I could use, please let me know, or even better, send it to me!
  • It could be that I have data, but it’s not processed yet. It’s a little harder to give a full list of languages for this, but it includes Mawng, Wambaya, Iwaidja, Anjumarla and Tiwi. If you would like one of these languages prioritized, please let me know and we’ll try our best to include it in the next release. I’ll put up a list of sources which we want to prioritize here.
  • I have data, and it’s processed, but I haven’t (yet) received permission to release it, because I haven’t been able to contact the rights holders (or they haven’t yet got back to me, or they needed to contact others).
  • The final group is a small number of languages where the rights holders did not give me permission to release any data. This is a very small group compared to the others; almost everyone that I have been able to contact has been enthusiastic about having their data included.

Who is developing the database?

The main developer is Claire Bowern, Associate Professor of Linguistics at Yale University.

Who funds it?

We gratefully acknowledge support from the USA’s National Science Foundation through grants BCS-0844550 and BCS-1423711. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

What work has come out of the database so far?

See our resources page for information.

Where can I learn more about Australian languages?

See our links page for more information.

Why can’t I download all the data?

Because we don’t have permission from the authors and/or language communities to release all of it (yet). Different people have different opinions about how accessible information about their language and research should be. We cannot accommodate all views (given that those views range from the position that all Aboriginal materials should be freely available to the position that no one who is not from that particular group should have access). The vast majority of Aboriginal people from around Australia that I have talked to about the project felt that the decisions should rest with communities, and that in general, as long as appropriate respect for the language is shown, it’s ok for others to learn about those languages too.

Click here to be taken to the terms of use, registration, and data download.

License
The CHIRILA Database by Claire Bowern is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

About our logo
The CHIRILA echidna was designed by Yale’s STC Studios.
The graphic to the left is a detail of a painting Kurrajong Bush Medicine Leaves by Anmatyerre artist Sharon Numina.