TE PAPAKUPU O TE TAITOKERAU
TAI TOKERAU MÄORI DICTIONARY
 
Work on this dictionary was begun in 1997, with the assistance of a grant from the Environment and Heritage Committee of the New Zealand Lottery Grants Board. Extensive fieldwork was conducted in Tai Tokerau, and meetings of contributors and advisors were held regularly in Auckland and Northland from 1997 to 1999. Almost 200 people were directly involved during this time. This phase of the work was supported with further grants from Te Puni Kökiri and the University of Auckland. A huge amount of information was collected, and the next few years were spent transferring the data from written records and tapes into a database. This work was finally completed at the end of 2002. A year earlier the on-line version of the dictionary was launched at a hui in Kaeo, so that whatever work was completed would be quickly available to as many people as possible, in Taitokerau and elsewhere. During 2003 work commenced on checking and augmenting the existing entries, and adding new material. This work has been continued by volunteers since the James Henare Maori Research Centre went into temporary recess at the beginning of 2004.
   
  Dictionary-making, if it is to be done properly, is a very time-consuming process. We cannot allow the Tai Tokerau Dictionary to be a hurried, sloppy job. At the same time, people are naturally very impatient to see some results from their work. Fortunately, the World Wide Web gives us the opportunity to let people have access to work in progress without having to wait until everything is edited and ready for publication.
   
  These web pages contain the latest printouts from the dictionary databases. They will be replaced by updated versions as more new entries are added and existing ones edited. Please remember that all of this material still requires extensive editing, and that there are still many gaps to be filled in each file. Right now, we cannot guarantee the accuracy of any of the material in these files. We would however like to have your comments, questions and suggestions, including new items which should be added or looked for. There is an e-mail link for you to get in touch with us electronically included on this page.
   

If this is the first time that you have looked at this dictionary, please take the time to read these introductory files:

   
  <
  Copyright Notice
  General Background Information
.... Te Röpü Äwhina 1997-2004 (List of contributors)
  Abbreviations and Sources
 

Nga Whakapapa o nga Kupu Mäori

  Kinds of Words and Their Meanings [including an important note on hapü names]
  Searching for Words
   
NGA KUPU KÖRERO, NGA PÄTAI HOKI
  You can contact us by e-mail at the address below to send us any comments, questions, suggestions, or new material for inclusion in the dictionary
   
   
   
   
   
  E-mail kupu at rakiora.org
   

STAGES OF EDITING

   
 

Collecting the material is one thing, but preparing it for an audience is quite another. All the material collected in the initial stages of the research was entered in a DOS--based computer database. More recent entries are being entered using a much more modern and flexible program which has been configured with the help of some overseas colleagues. Eventually all entries will be "migrated" to the new database, but in the mean time they have all been exported to a text file from which they can be reformatted for transfer to web files -- eventually, they will be able to be exported directly from the database to the web files with the proper formatting attended to automatically. Each individual entry needs to be checked for accuracy and completeness. As well as that, the entries need to be checked against other dictionaries and word lists to ensure that there is adequate coverage of the words and ideas that people learning the language are likely to need. Many of the words needed have buried in the examples in the dictionary but overlooked as headwords, so the contents of the dictionary itself have to checked in this regard.

In their final form, it is planned that entries will consist of

  • (1) a headword (in bold type);
  • (2) a number in square brackets to distinguish entries for words with the same form but expressing different ideas (e.g. hau [1] "wind", hau [2] "chop", hau [3] "I, me");
  • (3) Spelling variants -- we are going to use the macron consistently in all entries, but to make it easier to find headwords, those which are written with macrons will also be followed by the "double vowel" and unmarked variants, inside angle brackets. (At present, all three variants are scattered through the data, as everything was originally transcribed exactly as it was written by each of our 200 plus contributors.)
  • (4) Information about passive and nominal suffixes (where appropriate -- for example, in the entry for inu, below) and also reduplicated forms when these have very similar meanings to the base form;
  • (5) Where applicable, the list or file from which the word was incorporated in the original database (a three-character abbreviation in braces);
  • (6) Part of speech (in square brackets);
  • (7) An explanation and/or English equivalents. (Eventually, we want to have explanations in English and Maori, but to start with most of these are in English;
  • (8) Examples of how the word is used, contributed by our native-speaker advisors and from writings and recordings of Taitokerau Maori speakers;
  • (9) An English translation of each example (in italic type -- this is a formatting detail which still has to be completed for most entries), followed by the source of the example (a three or four character abbreviation enclosed in square brackets);
  • (10) Notes on idiomatic expressions incorporating the word, and/or special uses of derived forms;
  • (11) Notes (in parentheses) on the history or origin of the word, where this is known;
  • (12) Cross-references to related words and ideas.

Even when the editing is completed, not all these components will be in every entry -- the essentials are items 1, 2,6,7,8 and 9. Most of these, except (4), which does not apply to the word hapü, and (12), cross-references, are present in the first example, the second is more basic, but it does include a cross-reference, and the third (just a portion of which is reproduced here) includes the information about affixes (4) since this word does have passive and nominalized forms.

hapü [1] < hapuu, hapu > {CL1} [Stative] pregnant Ko te wahine nei kua hapü. This woman is pregnant . [TWK] Kua hapü taku hunaonga. My daughter-in-law is pregnant . [TTU] Tuatahi he tokorua kia hapü, ka whänau, ka tokotoru, puta ake he iwi. Firstly when two people are going to have a child, they give birth to a third person, a new generation . [TTU] (Nuclear Polynesian *sapuu “pregnant”).

hapü [2] < hapuu, hapu > {CL1} [Noun] The basic traditional political grouping, a small tribe, or an autonomous division of a larger tribal grouping. Ko ngä kaarangaranga hapü o Ngäti Whätua he maha. There are many hapü in Ngati Whatua [NWH]. (Probably derived from hapü [1].) Cf. iwi, waka .

inu [1] ~mia, ~manga [Universal] drink, absorb E tamariki ma, ma koutou nga hoiho e whiu atu ki te inu wai i te awa. Children, you can take the horses for a drink at the river. [TTU] ...

There is more information about the editing process in the General Background Information file. Towards the end of 2004 all the existing entries were sorted in alphabetical order in a master file, and formatting of individual entries had commenced, although details like bolding and italics had been included only in a few hundred entries.

As editing progresses, an increasing number of entries will have an "encyclopaedic" quality. That is, either the definition will include a very full description of the object, place, state or activity that the word refers to, as in the example below, or else there will be similar information included in a note at the end of the entry.

punawaru [2] {HWW} [Noun]. A woody herb with small sticky flowers and fruit which most people regard as a weed (botanical name Siegesbeckia orientalis ), but which has become an important source of ingredients in some cosmetics and anti-aging preparations because of the aspirin-like chemicals in its sap; it is a member of the daisy family, native to Southeast Asia, Aotearoa and Chile; it is used in China to treat rheumatic pain; in England it is called the holy herb . [040105]

Headwords in the current version of Te Papakupu o te Taitokerau

Notes on the number of entries included in each file and progress made from then on are included here for each letter of the Maori alphabet:

 
 

Stage reached November 26, 2004 [with notes on some subsequent developments]

A 194 basic entries and 19 derived forms. Still plenty of editing needed and additions to be made. [Further work done in 2005 increased the basic entries to 222, excluding derived words with whaka- and kai-]

E 30 basic entries and 9 derived forms. A bit more advanced than "A", but still plenty of editing needed and additions to be made

H 269 basic entries and 30 derived forms. Still plenty of editing needed and additions to be made.

I 53 basic entries and 11 derived forms. Still plenty of editing needed and additions to be made.

K 620 basic entries and 25 derived forms, plus 49 words prefixed by kai- and kaiwhaka- which are also included in the same files as the base words to which the prefixes are attached. Still plenty of editing needed and additions to be made.

M 405 basic entries and 49 derived forms. Still plenty of editing needed and additions to be made.

N 96 basic entries and 4 derived forms. Still plenty of editing needed and additions to be made.

NG 50 basic entries and 11 derived forms. Still plenty of editing needed and additions to be made.

O 69 basic entries and 4 derived forms. Still plenty of editing needed and additions to be made.

P 1542 basic entries and 69 derived forms. Parts of this file have been carefully edited and others look messy as this is the first to have a big influx of possible new words from alternative databases. It will become much tidier as editing proceeds, and probably have fewer entries (we estimate that there may be 400-500 items here that are either duplicates or should be consolidated with other existing entries).

R 200 basic entries and 22 derived forms. Still plenty of editing needed and additions to be made

T 723 basic entries and 87 derived forms. This file includes all the 1997-2002 database entries, cross-checked against each other, plus others, and has about twice as many entries as the first set of headwords beginning with "T" to go on-line in 2001. As with the other files, a lot more editing still needed.

U 109 basic entries and 9 derived forms. This file has more of the specifically Taitokerau material, that is, names of places, iwi and hapü, and ancestral names than the others at this time, and a higher proportion of entries have been checked and re-formatted.

W 68 basic entries and 12 derived forms. Editing still "under way".

WH 96 basic entries and 25 forms derived from these, plus another 331 words prefixed with whaka- which are also included in other files. A mixture of edited words and others which still need attention.

   

NUMBER OF ENTRIES

  Target 1997: 3,000
.... Revised mid-1999: 6,000
  Long-term target: 15,000+
  (The average 5 year old child knows about 5,000 different words)
 
  Estimated entries in database, excluding duplicates
 
  • October 1999: 825
    September 2001: 2,734
    February 2003: 3,274

    November 2004: 4,441
  • December 2005: 4,561 [+ 10,000 in "Kupu Hou" database]
 
 

This page was last revised January 19, 2006, 11.45pm.

 
E-mail kupu at rakiora.org