14
Nikolaus P. Himmelmann
Table 1. Basic format of a
language documentation
Primary data
Apparatus
Per session
For documentation as a whole
recordi
n
gs
/r
ecords
o
f obs
er
va
bl
e l
ingui
st
ic
behavi
o
r
and met
al
in
g
u
is
ti
c know
le
dge
(pos
si
bl
e bas
ic format
s:
s
es
si
o
n
and l
exi
cal
dat
abas
e)
Metadata
–
time and location of
recording
–
participants
–
recording team
–
recording equipment
–
content descriptors
…
Annotations
–
transcription
–
translation
–
further linguistic and
ethnographic glossing
and commentary
Metadata
–
location of documented
community
–
project team(s) contributing
to documentation
–
participants
in documentation
–
acknowledgements
…
General access resources
–
introduction
–
orthographical conventions
–
ethnographic sketch
–
sketch grammar
–
glossing conventions
–
indices
–
links to other resources
…
3.2. What’s new?
Language documentation in the way depicted in Table 1 is not a totally new
enterprise. The compilation of annotated collections of written historical
documents and culturally important speech events (legends, epic poems,
and the like) was the major concern of philologists in the nineteenth cen-
tury. Linguistic and anthropological fieldwork in the Boasian tradition has
also always put major emphasis on the recording of speech events. Within
linguistic anthropology, recording and interpreting oral literature is a major
task. All of these traditions have had a major influence on documentary
linguistics as developed in this book.
Chapter 1 – Language documentation: What is it and what is it good for?
15
Nevertheless, the idea of a language documentation
as sketched above is
new for mainstream linguistics, and even compared to these earlier ap-
proaches, it is new with regard to the following important features:
–
Focus on primary data: The main goal of a language documentation is
to make primary data available for a broad group of users. Unlike in the
philological tradition, there is no restriction to culturally or historically
“important” documents, however such importance may be defined.
–
Explicit concern for accountability: The focus on primary data implies
that considerable care is given to the issue of making it possible to
evaluate the quality of the data. This in turn implies that the field situa-
tion is made transparent and that all documents are accompanied by
metadata which detail the recording circumstances as well as the further
steps undertaken in processing a particular document.
–
Concern for long-term storage and preservation of primary data: This
involves two aspects. On the one hand, metadata are crucial for users of
a documentation to locate and evaluate a given document, as just men-
tioned. On the other hand, long-term storage is essentially a matter of
technology, and while compilers of language documentations do not
have to be able to handle all the technology themselves, they need to
have a basic understanding of the core issues involved so that they avoid
basic mistakes in recording and processing primary data. Among other
things, the quality of the recording is of utmost importance for long-
term storage and hence needs explicit attention. See further Chapters 4,
13, and 14.
–
Work in interdisciplinary teams: Work on a truly comprehensive lan-
guage documentation needs expertise in a multitude of disciplines in ad-
dition to the basic linguistic expertise required in transcription and trans-
lation. Such disciplines include anthropology, ethnomusicology, oral
history and literature, as well as all the major subdisciplines of linguis-
tics (socio- and psycholinguistics, phonetics, discourse analysis, corpus
linguistics, etc.). There are probably no individuals who are experts in
all of these fields, and few who have acquired significant expertise in a
substantial number of them. Hence, good documentation work usually
requires a team of researchers with different backgrounds and areas of
expertise.
–
Close cooperation with and direct involvement of speech community:
The documentation format sketched above strongly encourages the active
involvement of (members of) the speech community in two ways. On