Vol. 10 (2016), pp. 641–681
http://nlrc.hawaii.edu/ldc
http://hdl.handle.net/10125/24721
Revised Version Received: 16 Nov 2016
Series: Emergent Use and Conceptualization of Language Archives
Michael Alvarez Shepard, Gary Holton & Ryan Henke (eds.)
Bringing User-Centered Design to the Field of Language
Archives
Christina Wasson
University of North Texas
Gary Holton
University of Hawai‘i at Mānoa
Heather S. Roth
Independent Consultant
This article describes indings from a workshop that initiated a dialogue between
the ields of user-centered design (UCD) and language archives. One of the challenges facing language archives is the fact that they typically have multiple user
groups with signiicantly different information needs, as well as varying cultural
practices of data sharing, access and use. UCD, informed by design anthropology,
can help developers of language archives identify the main user groups of a particular archive; work with those user groups to map their needs and cultural practices; and translate those insights into archive design. The article describes indings from the workshop on User-Centered Design of Language Archives in February 2016. It reviews relevant aspects of language archiving and user-centered
design to construct the rationale for the workshop, relates key insights produced
during the workshop, and outlines next steps in the larger research trajectory initiated by this workshop. One major insight from the workshop was the discovery
that at present, most language archives are not meeting the needs of most users.
Representatives from all user groups expressed frustration at the current design of
most language archives. This discovery points to the value of introducing a usercentered approach, so that the design of language archives can be better informed
by the needs of users.
Licensed under Creative Commons
Attribution-NonCommercial 4.0 International
E-ISSN 1934-5275
Bringing User-Centered Design to the Field of Language Archives
642
1. Introduction1
1.1 Overview This article describes indings from the workshop on User-Centered
Design of Language Archives, held in February 2016. It reviews relevant aspects of
language archiving and user-centered design to construct the rationale for the workshop, relates key insights produced during the workshop, and outlines next steps
in the larger research trajectory initiated by this workshop. The article is based on
a white paper that was produced immediately after the workshop to document the
foundational insights and initial conceptual frameworks that emerged from the workshop.
One major insight from the workshop was the discovery that at present, most
language archives are not meeting the needs of most users. Representatives from all
user groups expressed frustration at the current design of most language archives.
This discovery points to the value of introducing a user-centered approach, so that
the design of language archives can be better informed by the needs of users.
The indings presented here are based on careful analysis of video recordings of
the February 2016 workshop and transcript summaries of those recordings. The
recordings and transcripts have been deposited with the University of North Texas
Library’s digital repository and are accessible for research and reference; see §1.4 for
more details.
1.2 Introduction to research topic: User-centered design of language archives As
readers of this journal know, an alarming number of languages are at risk of no longer
being spoken by the end of this century. Language archives provide the promise of
long-term preservation of linguistic materials, while also facilitating access to these
resources. Language archives typically include recordings of spoken language, as well
as transcriptions, translations, annotated texts, and ieldnotes. In some cases, such
materials provide the only surviving record of now sleeping languages. Language
archives may also include related forms of cultural documentation, such as photos,
genealogies, and historical documents. In this way, they may preserve broader cultural information, including traditional knowledge (Holton 2012).
In the last twenty years or so, there has been a movement toward creating digital
language archives, many of which are accessible online, through the Internet. Such
archives can facilitate access to resources even among dispersed user communities.
They are the main focus of this article.
One of the complex aspects of digital language archives is that they seek to cater
to diverse user groups. Most importantly, they are not merely academic repositories
1We would like to extend our deepest appreciation to the workshop participants for giving so generously of
their time, insights, and ideas. Our collaborative discussions were engaging, exciting, and productive; the
collective knowledge generated was truly impressive. Any remaining errors of fact or interpretation found
in this paper are solely the responsibility of the authors and should not be attributed to the workshop
participants. Christina and Gary would also like to express our profound gratitude to Heather Roth,
Research Assistant extraordinaire. She was amazingly helpful every step of the way, from event planning
to literature reviews to preparation and analysis of workshop transcripts. Finally, we are grateful to the
U.S. National Science Foundation’s Documenting Endangered Languages Program, whose support made
this workshop possible through grants BCS-1543763 and BCS-1543828.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
643
but also resources for members of the language community. These members may
use archives to support language revitalization efforts, or more broadly for accessing
information about cultural heritage. In addition, language archives are intended for
use by linguists, who may combine linguistic data from multiple archives for the
purpose of cross-linguistic comparisons. There are usually additional user groups as
well. The challenge is to design language archives that accommodate the needs of all
user groups. A user-centered design process could help make language archives more
accessible and more useful to larger numbers of users.
1.3 Research trajectory initiated by workshop The workshop initiated a research
trajectory that brings the ields of language archives and user-centered design (UCD)
into dialogue, with the long-term goal of improving users’ experiences with language archives and making archives more accessible and useful to diverse user groups.
Through this research trajectory, we ultimately aim to encourage a paradigm shift in
language archives toward the adoption of UCD principles, parallel to the shift that
occurred in the corporate world about twenty years ago.
The co-organizers of the workshop represented the two ields that were being
brought into dialogue. Christina Wasson, a linguistic anthropologist, has been active in UCD and design anthropology since 1996 (Wasson 2000, 2002, 2005, Wasson & Metcalf 2013, Wasson & Squires 2012, Aiken et al. 2014). Gary Holton, a
linguist, has been involved in international efforts to develop best practices for the
digital preservation and access of endangered language materials since 1999 (Dobrin
& Holton 2013, Holton 2011a, Holton 2011b, Holton 2012, Holton 2014, Holton
et al. 2007).
1.4 The workshop The immediate goal of the workshop was to map the terrain at
the intersection of language archives and user-centered design by engaging representatives of key stakeholder groups in guided discussions. The key outcomes that emerged
from our conversations were 1) mapping the diverse perspectives of different stakeholder groups, 2) mapping types of language archives and their varying relationships
with user groups, and 3) identifying current access issues.
The workshop took place February 20–21, 2016, at the University of North Texas.
Further details are at https://designinglanguagearchives.com. Video recordings and
transcript summaries of the workshop have been placed in the UNT Digital Library,
which can be accessed at http://digital.library.unt.edu/explore/collections/LANGAR/.
1.5 Workshop participants Participants were selected to represent the main stakeholder groups that engage with archives for endangered languages. It was challenging
to narrow down the list of potential participants, since there are so many individuals
in all the stakeholder groups who have valuable contributions to make on this topic.
Many of the participants we selected belonged to more than one stakeholder group.
They are listed in Table 1. In addition, the workshop was attended by the organizers,
Christina Wasson and Gary Holton, and Research Assistant Heather Roth.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
644
Table 1. Invited workshop participants
Name
Afiliation
Edward Alexander
Dinjii Zhu K’yaa (Gwich’in Language Center and Archive) | Executive Director
Gwich’in Council International | Co-Chair
Daryl Baldwin
Myaamia Center | Director
Miami Tribe of Oklahoma | Member
Santosh Basapur
Illinois Institute of Technology (IIT) Institute of Design (ID) | Researcher | Faculty
Member
Andrea Berez-Kroeker
University of Hawai‘i at Mānoa—Department of Linguistics | Associate Professor
Kaipuleohone—University of Hawai‘i Digital Language Archive | Director
Shobhana Chelliah
University of North Texas—Program in Linguistics | Professor
Documenting Endangered Languages (DEL) Program at the National Science
Foundation (NSF) | Former Program Director, 2012–2015
Susan Kung
University of Texas at Austin Archive of the Indigenous Languages of Latin America
(AILLA) | Manager
Wesley Leonard
Southern Oregon University—Native American Studies Program | Associate
Professor and Chair
Miami Tribe of Oklahoma | Member
Crysta Metcalf
Independent Consultant—Design Anthropologist
Interactive Media User Research, Motorola Mobility | Former Director
Jennifer O’Neal
University of Oregon Libraries | Corrigan Solari University Historian and Archivist
National Museum of the American Indian, Smithsonian Institution | Former Head
Archivist
Felix Rau
University of Cologne—Department of Linguistics | Researcher
Language Archive Cologne | Team Member
Loriene Roy
University of Texas at Austin—School of Information | Professor
Minnesota Chippewa Tribe—Anishinabe | Member
Mandana Seyfeddinipur
Endangered Languages Documentation Programme | Director
Endangered Languages Archive (ELAR) | Head
SOAS, University of London—Department of Linguistics | Research Fellow
Michael Shepard
Goucher College—Cultural and Environmental Sustainability Program | Professor
of Practice
Mohave Community College | Anthropology Instructor
Justin Spence
University of California, Davis—Native American Studies | Assistant Professor
Alexander Wadsworth
FirstVoices | Manager | Information Technology | Language Mapping
2. Review of language archives and user-centered design This section reviews the
aspects of language archives and user-centered design that provided the rationale for
the workshop. The contents are based on Gary Holton’s and Christina Wasson’s
presentations at the start of the workshop. The slide decks are available at https://de-
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
645
signinglanguagearchives.com/workshop-products/ and are also deposited in the UNT
Digital Library at http://digital.library.unt.edu/ark:/67531/metadc848634/.
2.1 The role of language archives in providing access to language materials While
the application of user-centered design to the development of language archives may
be a new concept, language archives themselves have a long history and have become
integral to the ield of language documentation. The idea that documentation should
be preserved for future generations of scholars is fundamental to the ield and has
now become reiied in the policies of granting agencies, academic programs, and
publishers. Moreover, a number of dedicated digital language archives have been
created to implement these best practices and serve the increasing need for language
archiving. Henke & Berez-Kroeker (2016) provide a thorough history of language
archives from the late 19th century to the present day, noting several milestones along
the way. The most signiicant event in this history is the transition from brick and
mortar archives which are relatively dificult to access, to modern digital archives
which can in theory provide ready access to a variety of user communities. Yet for
the most part, this potential for wider access to and use of language archives has yet
to be realized.
Part of the explanation for this inattention to access can be found in the history
of the emergence of digital language archiving. Digital language archiving emerged
over the past two decades during a time of two unprecedented historic shifts. The
irst of these shifts was the realignment of the ield of linguistics with its documentary roots, as linguists came to acknowledge that many of the world’s languages and
indeed the world’s linguistic diversity were under threat (Krauss 1992). Prior to this
shift, the documentary focus of linguistics in the irst half of the 20th century had
largely been forgotten by mainstream linguistics, as the profession became fascinated
by theoretical models and universal explanations for the nature of language.
At the same time that linguists were awakening to the problem of endangered languages, the ield was also facing an unprecedented technological transformation. As
linguists rushed to record endangered languages, technology was evolving so quickly
that recording devices speciied in a grant proposal became obsolete before the proposal was funded. Documentary linguists were thus forced to grapple with technological standards in order to ensure that the records they were creating would be of
lasting value. An exponential increase in the ability to create language documentation data in digital form necessitated the development of standards for formats and
metadata, so that digital data could be effectively managed. Faced with increasing
language endangerment and threats of technological obsolescence, it is not surprising
that the ield prioritized the development of digital preservation standards over access. This is even more understandable given the absence of accepted digital standards
and best practices within the mainstream archiving world at the time. When linguists
convened in December 2000 to create the Open Language Archives Community, the
Library of Congress preservation recommendation for digital data was to copy the
data to open reel analog tape. For 21st century linguists living and working in an
increasingly digital world, that simply wasn’t an option.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
646
Today, preservation standards and practices for digital (language) archives are
well-established, allowing archives to devote greater attention to issues of access.
Much recent discussion has focused on the potential consequences of a looming
paradigm shift that will transition linguistics to a more data-oriented science, in which
theoretical claims are supported by citable, archival data (cf. Berez 2015). The reuse
of archival linguistic data lies at the heart of the documentary linguistics paradigm
(Himmelmann 1998: 163), but the feasibility of such reuse has only recently begun
to be explored. For example, a 2011 workshop hosted at the Max Planck InstituteLeipzig addressed the “potentials” of language documentation by examining how the
data accumulated in the massive archive associated with the Documentation of Endangered Languages (DoBeS) project can be used by linguists other than those who
collected the data (Seifart et al. 2012). However, even these efforts have been criticized as being too narrow in scope, focusing primarily on academic users (Nathan
2014: 189). Henke & Berez-Kroeker (2016: 412) go so far as to contend that this
limited-access model is inherent in modern linguistic work.
At the same time there is increasing recognition among archivists that language
archives are being used by people other than linguists for purposes which were not
originally envisioned by the creators of the documentation. Language archives contain much more than just language data, and users often approach language archives
in search of information about topics as seemingly devoid of linguistic content as family history and photographs (Holton 2012). Anecdotal reports suggest that most users
of existing digital language archives are speakers of the languages being archived or
their descendants (Austin 2011). Archives have made explicit attempts to serve this
community; however, the vast majority of these attempts have been top-down in nature, inspired by the archives’ vision of the needs of these user communities (Dobrin
& Holton 2013; Holton 2014). Language archives have yet to attempt to understand
their user communities in a structured way.
The emergence of participatory archive frameworks offers language archives a
way to better engage with user communities. The participatory framework acknowledges that “usability does not denote use alone, but also denotes a deeper level of
involvement in the sense of actual participation in the archive and in the archival
process” (Huvila 2008: 25). Although a relatively recent phenomenon, this renewed
interest in usability has important implications for language archives, especially with
respect to language conservation efforts. As work continues to document endangered
languages and to ensure that documentation is properly archived, there is a concomitant need to support language conservation efforts. Often the urgency is such that
documentation and conservation activities become closely intertwined. In this sense
the mission of the language archive is not fundamentally different from archives more
generally. Access is a fundamental part of the archive mission to identify, preserve and
make available relevant resources (Hunter 2003, Green 2003).
For communities with few if any remaining luent speakers, language archives represent important repositories of linguistic and cultural knowledge. Those language
communities do not merely want to be consumers of the knowledge in these repositories; rather, they desire to actively engage with and shape how the record is accessed
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
647
and used. If language archives fail to facilitate this kind of participatory access they
risk remaining in an era when, in the words of William Hagan, “to be an Indian is
to have non-Indians control your documents from which other non-Indians write
their versions of your history” (1978: 135). Noting that “archives have continuously
evolved to remain relevant” in the face of changing technologies and user demographics, Linn observes that “endangered language archives are well-placed to participate
in and articulate these shifts” (2014: 65). As users repeatedly make clear, language
archives are not merely repositories of past knowledge but rather living, forwardlooking platforms for propagating Indigenous languages and cultures.
“Our digital archives are only useful if they enable us to have analog
meetings and connections between actual people conversing and sharing”
(Edward Alexander, quoted in Shepard 2015: 226).
User-centered design offers the promise of a participatory approach, in which
users can have a voice in determining how they interact with and access their linguistic
legacy. In this way language archives can help traditionally marginalized language
communities to “preserve empowered narratives” (Shilton & Srinivasan 2007: 90) by
taking a more active role in the documentation and conservation of their languages.
2.2 User-centered design Although linguists and managers of language archives
have repeatedly acknowledged the need—and indeed obligation—to provide improved
access to language documentation, they lack a structured methodology for understanding their user communities and translating that understanding into the design
of an archive.
A well-developed methodology for understanding user communities does exist,
and it could be productively applied to the design of language archives. User-centered
design (UCD) is a structured process for ensuring that technologies (and other products) are designed to meet the needs and constraints of their users. It is an interdisciplinary endeavor that involves close collaboration among users, researchers who
analyze user needs, designers, and other relevant specialists. UCD emerged in the
business context in the 1980s and 1990s as a response to criticisms that designers
were developing products and technologies without adequately understanding the
needs of users (Norman 1988; Wasson 2000). The chief difference from other product design philosophies is that UCD tries to optimize the product around how users
can, want, or need to use the product, rather than forcing the users to change their
behavior to accommodate the product (Wikipedia 2016).
In the business world, the adoption of UCD principles constituted a signiicant
paradigm shift about 20 years ago (Robinson 1993, 1994; Ritter et al. 2014). This
paradigm shift fundamentally altered business understandings, from the assumption
that designers’ intuitions were an adequate foundation for new product ideas, to the
recognition that the products resulting from such a process had an unacceptably high
rate of failure. The new work processes that emerged in industry required designers
to base their product ideas on user research in order to mitigate the risk of launching
new products.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
648
Anthropologists have played a prominent role in the development of UCD, and
the term “design anthropology” emerged to describe anthropologists working in this
area (Squires & Byrne 2002; Wasson 2000). The role of anthropologists in UCD is
to work with user groups to identify their needs and constraints, and then translate
those insights into actionable implications for designers and others involved in the
development of new products and technologies. Prior to the rise of UCD, designers
had mainly worked with cognitive psychologists who tested new product ideas in laboratory settings, focusing on the interaction between one human and one product or
technology (Bannon 1991; Robinson 1993). Anthropologists, by contrast, observed
product use “in the wild,” in the actual contexts where people were using those products. Researchers were able to see how interactions among groups of people shaped
product use, and how users’ engagements with a product were shaped by cultural systems of meaning. From the beginning, anthropological studies in the ield of design
showed major discrepancies between designers’ intended uses of their products, and
consumers’ everyday behaviors (Suchman 1995; Wasson 2000; Wasson & Squires
2012).
Four key components of the UCD process are:
1. Start by identifying user groups
2. Work with users to identify needs
3. Process is interdisciplinary and collaborative
4. Process is iterative
2.2.1 Start by identifying user groups Early in the UCD process, the UCD team
needs to ask:
• What groups of people are currently using the technology?
• What other groups might beneit from using it?
This information is needed in order to ensure that the design responds to the needs
of all signiicant current and potential user groups. It is especially important when
a resource is used by groups of people with distinctly different needs. Surprisingly
few publications on language archives have sought to systematically identify user
groups for language archives. The primary reference seems to be Austin (2011), a
blog post on PARADISEC that reported on interviews with managers of six language
archives. A few other scholarly articles examine the needs of either linguists or local
communities, but do not engage in a holistic investigation of the full range of users.
2.2.1.1 Linguists and language communities A irst pass at identifying the main
user groups of language archives might be to say that there are two groups: linguists
and language communities. This is already interesting from a UCD point of view,
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
649
because the needs of these two user groups are strikingly different and sometimes
at odds. An informal review of user logs at the Alaska Native Language Archive
conducted by Gary Holton reveals that academic researchers typically seek linguistic
information in a form which allows ready comparison with other languages, thus
facilitating philological and typological analyses, while members of local language
communities often seek information about particular people, such as a story or personal narrative recorded by a relative.
For linguists, language archives offer the raw data which provide the very foundation of linguistic science. In the case of endangered languages, archives may be the
only source of data. Where languages are still spoken but in decline, archives may
contain data which record a more vibrant stage of the language. Where publications
such as dictionaries and reference grammars already exist, language archives provide
the raw data which allow veriication and falsiication of claims made in published
sources. Archival data thus provide the scientiic underpinning to linguistic research
(Gezelter 2009). However, within the ield of linguistics, the ability to effectively use
archival language data remains limited (Berez 2015), in ways that will be discussed
below.
For local language communities, archives provide a repository of cultural heritage.
In many indigenous communities, traditional languages are being used less and less,
due to the effects of colonial practices and now globalization. Such communities are
increasingly turning to archives to support language and culture revitalization efforts.
For these users, language may be only one component embedded in a larger matrix
of traditional culture that includes history, genealogy, music, and other resources.
Though language archives may not have been designed to collect this broader array
of materials, they often become valued sources of such information (Holton 2012).
Documentary linguistics records information about language; yet at the same time,
that language provides a medium for the transmission of other culturally relevant
information. In most cases, archival linguistic data have been removed from the
source communities, complicating access by local language communities. Language
archives are often located within academic institutions which may not view local language communities as a primary user group. The return or repatriation of language
documentation resources has become an increasing priority, but many of these efforts
at repatriation have been top-down, based on archivists’ and linguists’ perceptions
of how and why local language communities access language archives (Dobrin &
Holton 2013; Holton 2014).
2.2.1.2 Complexifying the user groups The constellation of user groups of language archives actually has a much more intricate structure than just a binary opposition between linguists and language communities. Complexifying factors include:
• Users may belong to multiple groups—for instance, a user may be both a linguist and a member of a language community
• There may be additional user groups—historians, artists, students, the general
public, etc.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
650
• There may be factions within user groups
• Some users are yet unborn—there may be a concern for future generations of
the language community
• There may be users who aren’t users—for instance, members of a language
community who don’t actually want to use an archive themselves, but want to
know it is there for their children and grandchildren
• Other stakeholder groups need to be considered, such as archivists; they play
an important role in the design and maintenance of language archives and in
guiding users through the archives
Finally, we have some major concerns about how user-centered the very concept
of an archive is. As many researchers have noted, archives are constructed within a
paradigm of Western scientiic concepts and assumptions (Foucault 1982; Isaacman,
Lalu, & Nygren 2005; Povinelli 2011; Stoler 2010; Zeitlyn 2012). This includes
curation practices that serve as a form of control or even suppression when decisions
as to what is put in or kept out of an archive are made solely by archivists and
linguists, rather than by members of the communities whose language data are being
placed in the archive (Zeitlyn 2012). Understandings of what should be included in
an archive, and how the materials should be organized, may vary signiicantly across
these stakeholder groups. The assumptions and priorities of linguists and archivists
are frequently not a good match with cultural models of the Indigenous groups whose
languages are being archived. In this sense, there is a colonial aspect to the whole
principle of archiving. And the more rigid the conception is, based on predetermined
deinitions of what is an archive and what are the linguistic categories of analysis, the
less it is user-centered for language communities.
This situation creates challenges for the development of a UCD approach to language archives. However, we feel hopeful that there can be creative solutions to designing language archives that accommodate the needs of multiple user groups with
disparate interests. To give just one example, a linguist expressed concern that a language community might prioritize funding for access over funding for preservation.
Yet a language community member expressed concern in the opposite direction. We
envision win-win solutions that could address both needs. For instance, a language
archive might have a front end and a back end. The back end could be a simple, stable database structure that would address the preservation concerns and conceptual
frameworks of archivists and linguists. At the same time, there could be multiple
front end applications that would facilitate access for various user groups, for instance a game for a mobile device to help teach the language to young people. Such
games could include elements to involve young users, such as glitzy graphics and the
possibility of “leveling up,” the process of moving to ever higher levels of challenge
and reward that, for many users, contributes to the fascination of gaming.
2.2.2 Work with users to identify needs To return to a general description of the
UCD process: once user groups have been identiied, the UCD research process be-
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
651
gins. The goal is to map user experience. Sometimes this is called user research or
design research. The process involves a holistic examination of the lives of users of a
particular technology. Part of the research is learning how that technology may play
a role in users’ lives, but the research is not narrowly focused around just the use
of that technology. For instance, we cannot understand language archives without
examining broader issues such as the work lives and professional norms of linguists,
or the history and cultural values of language communities.
Typical UCD research questions might include:
• What is the broader cultural context within which the technology can help
people achieve their goals?
• What are the main ways that people use that technology?
• What would they like the technology to do that it doesn’t do?
• What problems do they encounter, and how do they work around those problems?
For instance, for particular local language communities, we might ask, what does
their traditional language mean to them? What practices do they engage in with
regard to the language? Are other aspects of cultural heritage interwoven? What are
their goals and concerns regarding language preservation and revitalization? What
forms of technology are most accessible to them? For linguists, we might ask, what
are their goals in accessing language archives? What kinds of information and search
capabilities are important to them? What kinds of comparisons do they want to be
able to make across languages?
The methods of UCD research are based on the anthropological approach termed
“ethnography.” Data collection methods typically include participant observation
and in-depth interviews. Both observations and interviews are often video or audio
recorded, and transcribed; at a minimum, researchers write detailed ield notes about
them. The UCD team subsequently analyzes the transcripts and ield notes (and any
other materials collected) in order to identify patterns and themes.
The level of collaboration with user groups varies according to the continuum
illustrated in Figure 1. In small communities or organizations, it is often possible to
engage in a participatory research process, where members of the community are active members of the UCD team from beginning to end. For the UCD of a technology
that is used by a huge, amorphous population, this is not so easy. Participatory research requires a somewhat cohesive community to partner with the researchers; only
such a group can arrive at consensus (or at least majority decisions) about desired research goals and approaches. For instance, when Wasson led a study on car use, there
was no way to select a small sample of users who could authoritatively speak for the
whole population of Americans who drive cars. It was certainly possible to select a
representative sample. But the population of car drivers does not constitute an internally cohesive community that could choose authorized representatives and delegate
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
652
them to speak for the community. The population is simply too large, too amorphous,
and too diverse. For this latter type of population, UCD research typically follows
a (non-participatory) ethnographic approach, where the UCD team does its best to
select a representative sample of study participants, and accurately represent the perspectives of the population, but the study participants are not active members of the
UCD team throughout the research process. The study participants are still given as
much opportunity to participate as possible. For instance, in the car study, we had
participants look at the video recordings we had made of them driving, and explain
what was happening from their point of view. We invited their analytical insights.
Figure 1. Levels of participation
2.2.3 Process is interdisciplinary and collaborative The third key component of
the UCD process is that it is profoundly collaborative and interdisciplinary. UCD is
a complex endeavor and there is no one person who has all the skills necessary to
create a good product on their own. So a UCD team should include, at a minimum:
• Members of the user groups whose needs the technology is supposed to meet
(when the UCD team follows a participatory research approach)
• User researchers, who translate user needs into design recommendations
• Designers, who implement insights from user groups and researchers in the
design of the user interface and navigational structure of e.g. a language archive
• Technologists, who do the software development
• Other experts and stakeholders as relevant; for language archives, this might
include archivists, curators, and experts in preparation of language learning
materials
By including all of these voices, a product can be developed that accommodates
the needs and constraints of each user group, while also working with the organizational needs of the archive and the constraints of available technologies.
2.2.4 Process is iterative The inal component of UCD is that it is an iterative process. The UCD literature is full of images with different kinds of circular patterns,
as illustrated in Figure 2. The point is to show that the process is cyclical. People
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
653
who design technology have learned that plans never work quite as expected. While
earlier approaches to the design process were top-down and linear, newer approaches
are spiral. Best practice in design is now conceptualized as the process of developing
a prototype, having users test the prototype, and then modifying the design based on
what worked and where users encountered dificulty. The process is usually repeated
several times before product launch. Also, because technology changes so rapidly,
technology designs have to be revisited every few years and updated to accommodate new developments.
Figure 2.
Iterative UCD process (Image used by permission of UserTesting: https://www.usertesting.com/blog/2015/07/09/how-ideo-uses-customer-insightsto-design-innovative-products-users-love/)
2.3 Bringing language archives and UCD together The application of UCD principles to archive design has particular relevance for language archives. From the outset,
the ield of language archiving has emphasized the portability of language resources,
ensuring that materials do not become locked in obsolete, inaccessible formats, and
providing for language identiication through standardized codes (Bird & Simons
2003). Many of the largest language archives in North America target speciic regions, including Alaska (Alaska Native Language Archive), Oklahoma (Sam Noble
Oklahoma Museum of Natural History), California (Survey of California and Other
Indian Languages), and Latin America (Archive of the Indigenous Languages of Latin
America). Linn (2014) argues that such regional language archives have been at the
forefront of what she refers to as “participatory delivery” of language resources. This
makes language archives an ideal domain in which to apply UCD principles, since
they are already open to engaging with the participatory framework.
As the use of language archives has burgeoned in recent years, making these
archives more accessible has become an urgent need. In spite of recent advances in
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
654
the standards and protocols for language archives, users of these archives often have
dificulty accessing the information they seek (Woodbury 2014). Until now, developers of language archives have been unfamiliar with the principles of user-centered
design (UCD). Bringing the ields of language archives and UCD into dialogue has the
potential to signiicantly improve users’ experiences with language archives, in terms
of their ability to navigate the sites and ind the information they are looking for. Informed by UCD, language archives could be designed or redesigned to accommodate
user groups’ cultural practices of data sharing, access, and use.
3. Typology of language archives A key outcome of the workshop was the development of a tentative typology for language archives. It was constructed through
a group activity led by UCD practitioner Crysta Metcalf. The typology is valuable
because it suggests how a UCD process might be customized to different types of
language archives.
Workshop participants grouped language archives into ive types; the irst four are
summarized in Table 2, while the ifth is discussed below. Everyone at the workshop
recognized that this typology was a simpliication of the actual situation. Nevertheless, the typology is useful in that it helps UCD practitioners learn about patterns of
similarity and difference across language archives.
Workshop participants noted that language archive Types 1–3 tend to be managed
by linguists, while Type 4 will probably hire someone (if they have funds) with a more
general background, probably a community member, and their responsibilities will
encompass more than linguistics/archiving.
Another dimension of variation was the extent to which a language archive is
integrated into an institution such as a university or state archive. Type 3 language
archives are the most deeply embedded. While Types 1–2 are associated with an
organization, they have more independence. For instance, Susan Kung indicated that
even though Type 2 AILLA is part of the University of Texas Library and on the UT
Library server, it operates independently from the rest of their digital repositories
and collections. Type 4 language archives may or may not be associated with an
institution; even when they are, such as the Myaamia Center, which has a relationship
with Miami University (Ohio), they maintain a fair amount of independence.
3.1 Type 1: Large language archives with global collections The main examples
of Type 1, ELAR and DoBeS, are ambitious endeavors to develop global collections
of materials on endangered languages. They are or were both funded by private
sources. Since DoBeS is in lux due to shifts in funding, we will mainly describe
ELAR. Mandana Seyfeddinipur, head of ELAR, explained that ELAR is closely connected to a funding source that provides grants for language documentation. ELAR
receives deposits from all grantees since the deposits are a condition of their funding.
Seyfeddinipur identiied the following user groups for ELAR:
• Depositors
• Linguists interested in conducting research
Language Documentation & Conservation Vol. 10, 2016
655
Bringing User-Centered Design to the Field of Language Archives
Table 2. Typology of language archives
Type
Type 1
Type 2
Type 3
Type 4
Descriptor
Large language
archives with
global collections
Large language
archives with
regional
collections
Language archives
embedded in larger
digital repositories
Single language
community archives
Contents
Global collection
Regional
collection
Often regional but
not always
Single language (or
possibly 2 or 3
languages from a
single community);
may include cultural,
historical, etc.
materials
Primary
Mission
Preservation/
documentation
Preservation/
documentation
Preservation/
documentation; may
extend to language
revitalization
Serving a language
community;
contributing to
language
revitalization
Source of
Collections
Linguist
depositors,
typically tied to
funded research
projects
Linguist
depositors
Linguist depositors
May come directly
from linguist
depositors; may be
copies of collections
in other archives
Intended
Users
Broad base of
users
Broad base of
users
Broad base of users;
parent repository
may be required to
serve certain
constituents, e.g.
university serves
students, faculty
Language
community; may
also choose to be
accessible to broad
base of users
Funders
Private grants
Government or
private grants or
university support
Parent repository,
most often a state
institution
Community, maybe
grants
Examples
ELAR
DoBeS/Language
Archive Cologne
PARADISEC
Calif Lang Archive
AILLA
Sam Noble ANLA
U Oregon Libraries
U Hawai‘i
Kaipuleohone
Dinjii Zhuh K’yaa
Myaamia Center
FirstVoices is
conglomeration of
Type 4s
• Language communities
• Researchers from ields like history, anthropology, political science, etc.
• Artists
• The general public
ELAR funds are largely earmarked for documentation, and this was also the case
for DoBeS. Therefore, it might make sense for Type 1 language archives to partner
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
656
with other organizations to prepare materials for language revitalization in cases
where that would be useful.
A challenge that pertains especially to Type 1s is that language communities may
not speak the language of the archive’s interface. For instance, Seyfeddinipur pointed
out that ELAR is only available in English, but the archive includes materials of language communities from around the world whose members may not speak English.
3.2 Type 2: Large language archives with regional collections Type 2 language
archives are similar to Type 1, but with a regional focus to their collections. There
are quite a few Type 2s, and they may be funded privately, by government grants, or by
public sector institutions such as universities. Like Type 1s, their primary mission and
funding tend to be focused on language documentation, so it might make sense for
them to partner with other organizations that could develop materials for language
revitalization.
Workshop participants felt that Type 1 and Type 2 language archives were distinct from each other and merited separate categories for several reasons. First, Type
1 archives are quite rare; workshop participants only came up with two examples.
Such archives therefore have a strong experimental aspect; they are still in the process of developing an effective operational model. By contrast, Type 2 archives are
much more common. Second, both of the Type 1 language archives were created
with funding from private foundations, while the majority of Type 2 archives were
developed in university settings, with public funds. This led to differences in their
development and operation.
3.3 Type 3: Language archives embedded in larger digital repositories Although
the majority of language archives are associated with an institution to some degree,
Type 3s are fully subservient to a parent repository that belongs to an institution such
as a university, historical society, state archive, or national archive. Their high level
of embeddedness can lead to challenges in customizing Type 3 language archives for
the characteristics of linguistic materials or the needs of users. Andrea Berez-Kroeker
said that the user interface of Kaipuleohone was determined by the University of
Hawai‘i library system: “As for the front end, I have no real control over what kind of
information gets displayed, or how things can be searched. It’s really geared towards
traditional library publications, not media.”
The parent repositories for these language archives often have a regional focus,
but not always. Type 3s range from well-established language archives to small, emergent collections. As described in §4, their level of collaboration with language communities can vary.
3.4 Type 4: Single language community archives The language archives created by
and for speciic language communities display signiicant differences from Types 1–3.
The primary mission of Type 4 archives is to serve their community in whatever ways
are relevant for that community. Collections typically focus on a single language, or
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
657
on a few languages if the community encompasses more than one. The activities of
Type 4s commonly include language revitalization. Furthermore, Type 4s are often
not just language archives; they may combine linguistic collections with cultural and
historical materials to provide a more holistic set of resources for the community.
The archives may be part of an information center that includes functions such as a
library, a museum, and classes. These venues are typically located on tribal lands.
Daryl Baldwin noted that creating single language community archives was important for tribes because “it allows for the development of those archives in such a
way that larger institutions either don’t have the time, staff or money to develop for
community use. For instance, all of our language documents were/are unpublished,
untranscribed, and in some cases untranslated. With our own digital copies we are
able to organize based on our need, and create tools that process them in such a way
that we are able to gain access to the information without having to work through
another institution’s protocols and procedures.” At the same time, he considered it
likely that such language archives will be afiliated with a larger tribal archive, or
another institution such as a university.
Type 4 language archives often seek to bring together all the materials on their
language that they can locate around the world. Unlike Types 1–3, therefore, a signiicant part of their contents may be copies of collections from other archives. An
example from the Miami-Illinois Digital Archive is illustrative: in 1999, researcher
Michael McCafferty discovered a Jesuit manuscript by Pierre-Francois Pinet about
the Miami-Illinois language, dating from about 1700, in the Archives De La Compagnie De Jesus in Quebec, Canada. After a process of negotiation, the Miami Tribe
of Oklahoma obtained a digital copy of the manuscript in 2003.
Three Type 4 language archives were represented at the workshop, and each was
different from the others in signiicant ways, displaying the range of variation that
can be found across Type 4s. Hence, it is useful to describe each one separately.
• Dinjii Zhuh K’yaa (Gwich’in Language Archive and Language Revitalization
Center) emerged from discussions at a Gwich’in gathering in 2010, so it is relatively new. It is located in Fort Yukon, Alaska, a town primarily inhabited by
Gwich’in people. Its collections focus on language materials, but it also functions more broadly as a community center. Because the Internet is problematic
in Fort Yukon, visiting the physical site of this language archive is currently
the main way for community members to access its materials. Dinjii Zhuh
K’yaa does have a public Facebook page at https://www.facebook.com/dinjiizhuhkyaa/.
• For the Miami Tribe of Oklahoma, language and cultural preservation and revitalization functions are spread across multiple, interconnected entities.
– The Myaamia Center at Miami University in Ohio advances the Miami
Tribe of Oklahoma’s language and cultural revitalization efforts. Created
in 2001, the center offers a wide range of educational experiences and
programs for students who are members of the Miami Tribe. One of the
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
658
goals of the Myaamia Center is to collect high-resolution copies of all
Miami-Illinois language documents.
– The Miami-Illinois Digital Archive (MIDA) is an ongoing research project
that is now managed by the Myaamia Center (http://www.ilaatawaakani.org).
MIDA was founded in 1999 to transcribe, translate and make accessible
manuscripts written by Jesuit missionaries about the Miami-Illinois language. It is intended as a resource for researchers.
– The online Myaamia Dictionary is intended as a resource for language
learners (https://myaamiadictionary.org/dictionary2015/). It is also managed by the Myaamia Center. The Myaamia Dictionary is linked to MIDA,
so materials are shared across the two sites.
– Finally, there is also a more general tribal archive, which collects and shares
a range of historical and cultural materials pertaining to the Miami Tribe.
In summer 2016, the Tribe started to make materials from this archive
available online, using the Mukurtu platform.
Daryl Baldwin, director of the Myaamia Center, noted that because MIDA is a language archive that focuses speciically on in-depth analyses of historical documents,
the documents it makes available are not in themselves particularly useful for community members who want to engage in language learning, unless they already have
some prior training. The Myaamia Dictionary, by contrast, is intended as a tool for
language learners. In addition, language and cultural revitalization for the Miami
Tribe is advanced by the many educational activities of the Myaamia Center.
• FirstVoices was launched in 2003 as an online language archive for the First Nations of British Columbia (http://www.irstvoices.com). So it is different from
the other two examples in that it caters to multiple language communities. It
includes 34 languages from British Columbia (61 dialects), and has also added
a few languages from the rest of Canada and from California. FirstVoices has
an especially strong pedagogical focus in its online offerings. It provides not
only an alphabet, dictionary, and grammar for each language, but also extensive
educational materials and games for both children and adult learners. While
its ofices are physically located in Victoria, British Columbia, FirstVoices is
primarily an online entity.
In the U.S., the government does not provide ongoing inancial support for Type
4 language archives. Members of language communities articulated nuanced positions regarding the pros and cons of government funding. On the one hand, they
noted the U.S. government’s failure to recognize the importance of funding archives.
FirstVoices, based in Canada, formed a contrast since it does receive government funding. On the other hand, members of language communities expressed a preference
for being self-suficient. As Daryl Baldwin put it, “you have to feed yourself.” They
were wary of accepting money from outside sources due to the strings attached. Yet
they also recognized that some tribes had more inancial resources than other tribes,
and that those with few resources were in a dificult situation.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
659
For Type 4 archives associated with small language communities that have limited
resources, it can make sense to partner with a Type 2 language archive. The Type 2 can
provide secure and stable long-term storage of materials, while the Type 4 can make
materials accessible to local community members. Such a partnership was developed
between Dinjii Zhuh K’yaa (Gwich’in Language Center and Archive) and ANLA.
Sometimes this is referred to as a “hub and spokes” model. Workshop participants
seemed to think this might be a future growth area. Type 4 language archives may also
be more likely than other types to use cultural heritage platforms such as Mukurtu
(http://mukurtu.org). Mukurtu is gaining in popularity due to its free availability
and sensitivity toward cultural protocols and access restrictions. Further insights on
Type 4 archives are described in §4.1 from the perspective of language community
members.
3.5 Type 5: Elder archives Workshop participants also identiied a ifth type of
linguistic repository that is neither a digital repository nor a repository in a building. Instead, it is a person who holds knowledge—a living person. Such people are
referred to as elders. They are usually senior in age but not always. Their knowledge may include linguistic, cultural, historical, and genealogical dimensions. It may
include knowledge about who in the community is the keeper of different kinds of
information, and how information is transmitted across generations. This is living
knowledge; it may not be written down. While Type 5 is technically not a formal
archive, the concept can provide useful lessons and implications for the design of
language archives. For instance, knowledge is not stored all in one individual but
rather split among various “experts” or holders of knowledge on topics important to
that particular community. This concept can help in the design of additional access
mechanisms.
4. Stakeholder groups and their perspectives Another key outcome from the workshop was mapping the diverse perspectives of different stakeholder groups concerning language archives. These groups included users as well as other stakeholders. For
purposes of the workshop, we identiied ive main stakeholder groups, of whom the
irst two were user groups:
1. Language communities
2. Linguists
3. Archivists
4. User-centered design practitioners
5. Representatives of funding agencies
All workshop participants were able to represent at least one of these groups;
many participants belonged to several groups.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
660
In order to map the perspectives of each group, we took the following steps:
• Prior to the workshop, we divided the participants into the ive groups listed
above. We gave members of each group a set of questions about their perspective as a representative of that group. We asked them to prepare a short
presentation for the workshop answering those questions.
• At the workshop, participants delivered presentations illed with rich insights.
• The entire workshop was video recorded.
• A detailed transcript summary of the workshop was prepared based on the
video recordings.
• We coded the transcript using the qualitative analysis software Dedoose to capture all statements by participants that revealed the perspectives of each stakeholder group. In this way we not only examined participants’ presentations but
also considered relevant remarks made at other times during the workshop.
• By grouping the statements coded for each stakeholder group, we were able to
identify common themes and patterns in the perspectives of each group.
The results of our analysis are summarized below.
4.1 Language communities The workshop participants who were asked to represent the perspective of language communities in their presentations were Loriene Roy,
Daryl Baldwin, Edward Alexander, Wesley Leonard, and Michael Shepard. The irst
four are members of tribal communities; Michael is a non-indigenous anthropologist
who has worked extensively with several tribal communities on archiving issues. In
addition, Jennifer O’Neal contributed insights as a member of a tribal community.
In planning this workshop, the organizers decided to limit language communities to
those of Native North America in order to create more coherence in the discussions.
The discussion in this section overlaps somewhat with §3.4, above, where Type 4
language archives were described.
Both Daryl Baldwin and Edward Alexander had experience developing Type 4
single language community archives. Baldwin played an instrumental role at the
Myaamia Center, which was founded in 2001 at Miami University in Ohio. Alexander guided the development of Dinjii Zhuh K’yaa (Gwich’in Language Archive and
Language Revitalization Center) in Fort Yukon, Alaska. The idea for the language
archive emerged from a Gwich’in gathering in 2010.
Language community members made it clear that language was fundamental to
the identity of tribal groups and their members. Wesley Leonard shared this quote
from L. Frank:
Language is the basket that holds all of our culture…in order to understand why this oak tree sitting on a hilltop is so critical to my afterlife,
the language is the only thing that explains that and carries that and is
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
661
that…I need the language to understand. I don’t want to take a wrong
road when I get to the edge of the land of the dead…so language is pretty
much everything (quoted in Leonard to appear).
The history of U.S. colonial policies and practices (including the boarding schools)
created signiicant hurdles for language use in the majority of Native American communities. The after-effects of colonialism undergirded many workshop discussions.
For instance, Edward Alexander commented, “right now it seems like archives, and
this kind of information, it still feels like the BIA days. Because we don’t have selfdetermination over the information.” He compared the poor quality of services under
Bureau of Indian Affairs (BIA) management with the improved quality of services after the Indian Self-Determination and Education Assistance Act, when tribes obtained
more control. Loriene Roy said, “we forgive you for taking our words. We forgive
you for recording them sometimes without our permission.”
Michael Shepard pointed out that archives are sites of struggle for power, control,
access, and ownership. Language communities wish to exercise sovereignty over their
cultural and linguistic heritage. Some forms of knowledge may not be appropriate
to share. Some kinds of information can play an important role in supporting a
community’s legal rights, including land claims.
At an ideological level, Wesley Leonard argued that there can be mismatches between the values and categories that guide archiving practices, and those that guide
language reclamation efforts. “A key for language reclamation is appropriate integration of cultural beliefs, including community notions of language—so not a top-down
‘this is what language is’ as deined in this archive, but from the bottom up, from the
community.” A number of workshop participants suggested that language communities needed to step away from traditional Western linguistic and archiving frameworks in order to conceptualize Type 4 language archives in the ways that would be
most useful to the communities.
4.1.1 Perspective on language archives of Types 1–3 The representatives of language communities spoke of encountering numerous challenges in their efforts to
work with languages archives of Types 1–3. First of all, it is often very dificult to
locate all the materials pertaining to their languages. These materials can be scattered
across archives all over the world. Archives may use spellings for language names that
users are not familiar with, or completely different names. Materials in large-scale
repositories, like federal archives, are typically easiest to ind.
Another major challenge is the dificulty of understanding and interpreting many
of the materials found in language archives. As Edward Alexander said, “you need
a key to be able to understand what you’re looking at. Not only is it in another language but there’s another language of linguistics on top of it.” Susan Kung pointed
out that for older documents, the linguistics “language” of 200 years ago could be
indecipherable to today’s linguists as well. So there is a need to translate academic
and archaic language into accessible terms. Daryl Baldwin talked about the work he
has done to transform source documents into a form that members of his community can easily use: “that’s a huge process... There’s no easy way to do that.” Much
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
662
of the Myaamia language was irst recorded by the Jesuits and annotated in French.
So each source document requires transfer of handwritten materials into typed documents, and translation of French annotations from the 1700s into modern English.
Then the Miami-Illinois materials are further analyzed using current linguistic methods. The analyzed documents are uploaded to the Miami-Illinois Digital Archive in a
searchable format, and insights gained from the documents are used to create entries
for the online Myaamia Dictionary. It is this dictionary that is the most useful to the
majority of tribal members.
With respect to the legacies of colonialism, sometimes Type 1–3 language archives
included materials taken in earlier times without the permission or understanding of
tribal members, making information publicly available that the language community
would prefer not to share openly. Sometimes access to a Type 1–3 language archive
was limited for members of a tribal community by the expense of traveling (to a
non-digital archive), copying materials, and so forth.
4.1.2 Perspective on Type 4 language archives As described in §3, the primary
mission of Type 4 language archives is to serve their communities. Michael Shepard noted that Type 4s can be thought of as a reservoir that a language community
draws on for a variety of purposes, from land claims to education. Each community shapes its language archive according to local needs and interests. For instance,
Edward Alexander described Dinjii Zhuh K’yaa (Gwich’in Language Archive and
Language Revitalization Center) as a place to share, a source of innovation where
community members make new things from the past. It emerged from Gwich’in
concerns that their language, and therefore their very existence, were under threat.
Alexander worked with ANLA to bring copies of materials to Fort Yukon. Many
of these materials had not previously been digitized or placed online, so they had
not been accessible locally. Gwich’in community members beneit from Dinjii Zhuh
K’yaa by gaining access to previously unknown materials. Alexander himself discovered a recording of his grandfather that he hadn’t known existed. One of the
innovative things Dinjii Zhuh K’yaa has done is to record knowledge and then imprint it on the land using augmented reality. For instance, the building that houses the
archive is scannable; visitors can pull up information about it on their phone. This
helps share information between elders and the younger generation.
4.1.2.1 Language revitalization/reclamation Preparing language learning materials from the linguistic materials stored in language archives was accorded great importance by workshop participants. As a result of American colonial policies, including
the infamous boarding schools, many members of Native American language communities no longer speak their tribal languages in the home. Yet tribal communities
understand their languages to be closely linked to cultural knowledge systems and
values. Community members often have great interest in language learning activities
that can promote “revitalization” or increased use of such languages. To highlight
tribal sovereignty, Wesley Leonard has adopted the term “language reclamation” to
describe the right of community members to claim, learn and speak their language,
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
663
and to deine what success means for their language learning efforts (Leonard 2011:
141). During workshop discussion, Michael Shepard argued that “there’s a real role
for a pedagogical specialist to be involved” in the development of language archives,
since the skill set needed to prepare educational materials is not widely found among
language community members, linguists, or archivists. FirstVoices is a great example
of how language learning materials can be developed from language archives.
4.1.2.2 Create engagement Workshop participants pointed out that most current
language archives are not very engaging for most language community members, outside of programs designed speciically for language learning. Daryl Baldwin shared
his experience of education trips or NAGPRA-related trips with both elders and
young people. He contrasted their engagement with physical objects to their lack
of connection with language documents:
In most cases, when community members step into the archive, they’re
looking for objects…in every case it’s been a very powerful moment for
them to engage with or interact with physical objects that are directly not
only tied to their culture, but in many cases tied to their genealogies. So
these are objects that came from their great-grandparents or whatever it
may be. What makes the experience rich really hinges on what they know
about the object. In some cases they don’t know a lot about the object.
So if there’s somebody there from the community that can serve as an
historian or cultural expert they can contextualize their experience. And
when that is happening, their interaction with the object is strengthened.
Baldwin has found that while the average community member may ind looking at original language documents cool and interesting, such documents often don’t
have the same impact as a cultural object (like a basket), and the person may quickly
lose interest. However, students who are more engaged in the language learning
process ind materials from language archives highly meaningful. For instance, the
Myaamia Center sent three tribal students to work with the Smithsonian’s Recovering Voices program, doing research in preparation for the 2015 National Breath of
Life Archival Institute for Indigenous Languages (http://nmnh.typepad.com/recoveringvoices/2015/01/). These students had been taking classes on Myaamia language
and culture. Seeing Myaamia language documents helped them contextualize their
experiences in the larger framework of indigenous language revitalization.
Workshop participants brainstormed ideas for how to make language archives
more engaging. Suggestions included the use of audio and video recordings, and
making abstract information more tangible. Loriene Roy gave an example of working
with a group of 8–10 year olds to narrate a virtual tour of an online exhibit, using
their language. One boy chose to talk about a piece of pottery from Maria Martinez.
He started his narration by saying, “my auntie made this plate. And irst she went
to the river and she gathered the clay and she sifted everything,” and continued from
there. Having children tell stories that would be viewed by other children seemed
like a great engagement technique.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
664
4.1.2.3 Let users annotate language archives Language community representatives expressed a wish for users to be able to annotate language archives. For instance,
Edward Alexander said there were recordings in Gwich’in at ANLA that have only
minimal descriptions. There is no way for him to add more detailed information that
might be helpful for other users. Likewise, Daryl Baldwin expressed a wish to work
with archives to:
• Update their descriptions and keywords for Miami-Illinois (the broader language of which Myaamia is a dialect)
• Share the transcription and analysis work done by linguists and tribal researchers,
and connect this work with the archival documents
• Create a list of ongoing research associated with certain collections so that
others who visit the archives become aware of who is doing what
Workshop participants explored ideas for enabling community members to annotate materials themselves, and even to debate interpretations with each other online.
One of our reviewers noted that some archives have crowdsourced archival information in a manner that was accessible to a broad range of users, and that oral historians
have had some success crowdsourcing the transcription of recorded interviews.
4.1.2.4 Let users be depositors Edward Alexander suggested that it would be
helpful if users could be depositors—in other words, the language archive could let
them upload materials themselves. He pointed out that recording devices are now
ubiquitous, unlike earlier days when they were rare and expensive. Alex Wadsworth
indicated that FirstVoices already has a self-deposit tool that allows users to upload
materials. Alexander also pointed out that users needed to be thoughtful about possible future uses of recordings they upload. “You don’t know how people are going to
use information.” When his grandfather was recorded, he probably just regarded it as
a novelty, without realizing that years later his grandson would ind this information
on a computer.
4.1.2.5 Constraints: Literacy, costs, technical support Workshop participants
identiied a number of constraints that Type 4 language archives need to negotiate.
Loriene Roy noted that literacy might be an issue; sometimes recordings may be more
useful than written materials. Many workshop participants talked about the inancial challenges of Type 4 archives. As noted in §3.4, the U.S. government does not
currently provide funding for the language archives of tribal communities. Similarly,
some communities have limited Internet access or face other constraints in their technological infrastructure.
4.2 Linguists The workshop participants we asked to represent a linguist perspective were Andrea Berez-Kroeker and Justin Spence. In addition, participants Shobhana Chelliah, Mandana Seyfeddinipur, Susan Kung, Felix Rau, Daryl Baldwin, Wesley Leonard, and Gary Holton are also linguists, and at times their remarks were
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
665
shaped by this perspective. One of the goals of the workshop organizers was to treat
linguists the same as any other stakeholder group. Because many digital language
archives have been designed by linguists, there often seems to be a default assumption that they must be working well for linguists. We did not want to make this
assumption. Rather, we wanted to ask linguists what their experience was like. And
we recognized that linguists might have different experiences depending on whether
they were interacting with a language archive as a depositor or as a researcher.
Some aspects of a linguist perspective on language archives were described in the
irst half of §2, which summarized Holton’s presentation at the start of the workshop.
The current section focuses on additional contributions from Berez-Kroeker’s and
Spence’s presentations, and comments made by linguists in later discussions.
4.2.1 Linguists as researchers One signiicant inding was that the linguists at the
workshop did not seem to be using language archives much as researchers. The most
common way that linguists engaged with language archives was to deposit their materials. Aside from Daryl Baldwin’s and Edward Alexander’s engagement with language
archives to develop their own community’s archive and to bring more knowledge
about their language to their community, Justin Spence was the only workshop participant who used language archives extensively for linguistic research. While many
authors have noted the great potential of language archives to contribute to linguistic
research (cf. Seifart et al. 2012), low usage of language archives by linguists seems to
a generalizable inding. The authors of a recent study who explicitly searched for linguists who use language archives as a source of research data found such individuals
to be quite rare (Al Smadi et al. 2016).
Justin Spence acknowledged that he was willing to “slog through sometimes hours
of recordings just because I kind of enjoy it!” His careful approach to navigating
materials that could be “very haphazard” almost sounded like a form of detective
work. The old ield notes, recordings, and transcripts he examined were useful in
recovering variation, examining language change, and understanding people’s lived
experiences of the languages they knew while processes of language shift were in
motion.
A second, related inding was that linguists as a user group had a great deal of
dificulty using language archives effectively for research purposes. In some ways,
this was surprising, since most digital language archives were created by linguists.
However, Shobhana Chelliah noted that “even as an academic I feel like I’m shut out
a lot.” Workshop participants said that the major dificulty in using existing deposits
for linguistic research was that they often lack suficient annotations. Annotations are
needed both to ind relevant materials during the search process, and to interpret the
materials once they are found. Such annotations could range from a text paragraph
summarizing the contents, to speciic metadata to facilitate language comparisons,
to a gloss for every single morpheme in a text. Different linguists have different
annotation needs, depending on their speciic research topics. Voice recordings are
most accessible when they are accompanied by transcripts, because transcripts can
be searched in a way that recordings currently cannot be.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
666
4.2.2 Linguists as depositors Linguists also encounter challenges in their role as
depositors. One challenge is that until recently linguists have received little training
in how to prepare deposits in a way that will be useful even to other linguists, let
alone language community members. As Chelliah put it:
In my education as a linguist I was really never taught how to fashion my
data so that community members could access it and use it. And we’re
still really lacking in our education, as linguists, how to do that properly.
So we can’t expect archivists to know that for us. I’m called on as a
linguist to do two things. One is to produce theoretically or descriptively
exciting information that pushes forward the science of what we know
about language and cognition. And secondly, for the people who invited
me into their communities to come and help them with the creation of
a history of their language, to work on that information with them…I
don’t have the training to know how to use my limited time to do both
of those things.
These comments suggest that archiving of linguistic data should be more fully
integrated into linguistic training. There are signs that this is already happening.
Summer institutes such as CoLang now routinely provide training in preparation of
archive deposits, and graduate programs are beginning to require archiving of data
collected as part of the research process (Berez 2015). Archived collections of linguistic data are increasingly recognized as valid academic contributions (cf. Salffner
2015), and evaluation metrics for scholarly collections of linguistic data are being developed (Thieberger et al. 2015). Nevertheless, the experiences of ordinary working
linguistics suggests that much more needs to be done to improve the curriculum with
respect to preparation of archive deposits.
Another challenge is that language archives may not have a clear submission process for deposits. Berez-Kroeker noted that she had encountered opaque submission
procedures, especially for metadata creation, and poor communication in the “black
box” period between submission and ingestion. This suggests that it might be productive for language archives to provide detailed guides about how to prepare deposits
on their websites.
A third issue is that developing a collection for a language archive is a lot of
work, and linguists often get little credit for such efforts in academic merit reviews.
Mandana Seyfeddinipur pointed out that there isn’t much of a reward for linguists
to deposit their data. This issue is being addressed by a group of linguists, including
some workshop participants, who are seeking to gain recognition for deposits as a
form of publication (Thieberger et al. 2015). As a related issue, linguists may be reluctant to share their data in an archive because of concerns about intellectual property
and preserving a publishing advantage.
4.3 Archivists The workshop participants we asked to represent an archivist perspective were Mandana Seyfeddinipur, Felix Rau, Susan Kung, Jennifer O’Neal, and
Language Documentation & Conservation Vol. 10, 2016
667
Bringing User-Centered Design to the Field of Language Archives
Alex Wadsworth; Table 3 provides more information about them. Andrea BerezKroeker (Kaipuleohone), Daryl Baldwin (Myaamia Center), Edward Alexander (Dinjii Zhuh K’yaa/Gwich’in Language Center and Archive), and Gary Holton (ANLA)
also brought experience in managing archives to the table. One thing to keep in mind
is that language archives are often managed by people who don’t, in fact, have formal training in archiving. Among workshop participants, only Jennifer O’Neal and
Loriene Roy had degrees in library and information science.
In their presentations and subsequent discussions, the archivists painted a rich
portrait of each of the archives they worked with and presented a nuanced view of
the complexities of managing a language archive. It was useful to have ive examples,
because each one was different in signiicant ways. In this section we describe patterns
that emerged across the examples. Most of these patterns applied to some but not all
of the ive language archives under discussion.
Table 3. Workshop participants asked to represent archivist perspective
Mandana Seyfeddinipur
Felix Rau
Susan Kung
Jennifer O’Neal
Alex Wadsworth
ELAR
Language Archive Cologne
AILLA
University of Oregon
FirstVoices
Type 1
Type 1
Type 2
Type 3
Type 4
4.3.1 Feeling disconnected from users (other than depositors) Seyfeddinipur, Kung
and Rau all talked about feeling disconnected from their users, except for some of
their depositors. These three archivists manage Type 1 and Type 2 language archives.
Kung said:
As someone who runs an archive, the biggest issue I have, since it’s a digital archive, is knowing who’s using the archive. When somebody contacts
me by email or phone…then I know who they are and what they’re trying to do, but otherwise I can track the downloads, I know people are
logging in every day and downloading materials, but I have no idea who
they are or what they’re using these materials for, what their agendas are,
what they’re researching. So I feel like I’m just totally disconnected from
most of my users, I have no insight into their needs or their wants.
While ELAR collects more information about users when they create an account,
Seyfeddinipur still expressed frustration about her limited knowledge of users. “We
are trying our best with the knowledge that we have, and we need more information.”
She also expressed concern that ELAR is underutilized. Speciically, she noted that
the restricted collections, which mainly belong to Native American and Australian
Aboriginal communities, are not getting any use at all. “User statistics are actually
zero, which is heartbreaking.” For Seyfeddinipur, not only do language archives need
to become more engaging for users, but they also need to do a better job of alerting
people to their existence.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
668
At the same time, Seyfeddinipur pointed out that sometimes there is no longer
any community associated with a language whose materials have been deposited in
ELAR. The few remaining speakers may be dispersed, or there may be none left.
The one set of users that did regularly communicate with these archivists was the
depositors. Rau noted that at the Language Archive Cologne, the majority of users
are, in fact, depositors: “it’s bordering to the ridiculous sometimes how the whole
thing is focused on the producer side.” ELAR also works closely with depositors;
ELAR gives grants for language documentation that require the recipients to deposit
their data with ELAR. Since ELAR provides training to the grant recipients in data
management, there is a fair amount of contact with them.
It was notable that Seyfeddinipur, Kung and Rau all manage archives of Types 1
and 2, which have a primary mission of language documentation and preservation
rather than community engagement. For instance, Kung said that while AILLA has
an open collection policy, it does not actively seek out collections to acquire. “We
don’t hunt it down, it comes to us.”
4.3.2 Collaboration with language communities By contrast, Alex Wadsworth and
Jennifer O’Neal collaborate closely with language communities. FirstVoices, where
Wadsworth works, was speciically created to support the Indigenous languages of
British Columbia. It is a conglomeration of Type 4 language archives. The University
Archives of the University of Oregon, where O’Neal works, has a much broader
mission; language archives constitute only a small part of the holdings. We labeled
language archives that are embedded within a parent repository like this Type 3. At
the time of the workshop, O’Neal was working on two projects related to endangered
languages in the Northwest.
It seemed like O’Neal’s collaboration with language communities was, at least to
some extent, a personal choice. She spoke of how important it was to her to be an
advocate for tribal communities; she herself is a member of one. She described the
ongoing challenge of persuading her Dean to dedicate resources to language archives
rather than other urgent university priorities.
O’Neal’s case shows that an archivist’s collaboration with language communities
is not only shaped by institutional structures (such as our typology), but that it can
also be shaped by the agency and personal commitment of the archivist. There may
be Type 3 archivists whose experience more closely resembles that of the Type 1 and
2 archivists described above.
4.3.3 Scarce resources A common theme in the comments of archivists was the
challenge of inding adequate resources to maintain and improve their archives. Most
language archives have few employees and limited inancial resources. They are already doing a lot with a little. Several archivists expressed concern that their funding
sources could dry up or become totally inadequate—a terrible scenario for an archive,
whose most fundamental mission is after all to preserve collections for hundreds and
even thousands of years. In terms of external funding sources, they noted that a
language archive’s need for guaranteed, long-term sustainability is a poor it with
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
669
the dominant funding model of short-term grants for speciic projects. In terms of
institutional funding, they expressed concern about the need to compete with other
institutional priorities; O’Neal noted that many of her colleagues had to “ight” for
support because language archives are often not a high priority at their institutions.
In their concluding remarks at the workshop, archivists made comments such as:
• “Things are changing faster than we can keep up with”
• “I know the problems users have…but at the same time, with very few resources, and feeling very over-extended, this wasn’t really something that I
wanted to address”
• “We’re trying to do the best job we can with the small resources and stafing
that we have”
• “I’ll echo what has been said about the strain on all of us that we’re dealing
with”
UCD efforts will need to take this context into account, and strive to ease the
workloads of those who are involved with language archives, rather than making
things more burdensome for them. As described in §2.2.2, the UCD process entails
close collaboration with stakeholder groups to identify their speciic needs and tailor
design solutions to their particular context and external constraints.
4.3.4 Problems encountered by users of language archives In their presentations,
the archivists provided a rich list of problems that might be encountered by users of
language archives. The most frequently mentioned items were:
• A lack of contextual information at the deposit level, or metadata
• Incomplete materials—missing annotation, missing translations
• Inadequate search/browse functions
• Problems with the interface/information display
• Users may be frustrated when they don’t have access to data; it may be hard
for the archivist to get hold of a collection owner to request access for a user
• Technology issues—outdated, broken scripts, Flash/Java problems, etc.
• Interface language(s) may not include a language spoken by would-be users
4.4 UCD practitioners The UCD practitioners invited to the workshop were Crysta
Metcalf, a design anthropologist, and Santosh Basapur, a designer. Both worked at
the Motorola Mobility Applied Research Center until 2013, when the center was sold
and disbanded. In addition, Christina Wasson, workshop co-organizer, and Heather
Roth, research assistant, brought a UCD perspective to the conversations. The UCD
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
670
group was different from the other stakeholder groups in that it was the only one
whose members did not have a history of working with language archives. And it
was the only stakeholder group that did have deep experience with UCD.
Many aspects of a UCD perspective on language archives were described in §2
under User Centered Design, which summarized Wasson’s presentation at the start of
the workshop. The current section focuses on additional contributions from Metcalf
and Basapur, articulated during their presentations and in later discussions.
Throughout the workshop, all UCD practitioners emphasized the bottom-up nature of the UCD approach. As Metcalf stated on one of her presentation slides:
I think that the focus on the users in User-Centered Design is the key.
In UCD we don’t assume needs or goals or worklows, we don’t design
what WE want or would use—we talk to the people who will be using the
technology, and engage in a dialectical design process, so the result should
meet the needs of all of the users in an intuitive way for each group.
A consequence of this bottom-up approach was that none of the UCD practitioners was ready to say much about speciic design solutions for language archives at
the workshop. Such design solutions will only emerge after in-depth user research
has been conducted.
4.4.1 Explaining UCD to other stakeholder groups During their presentations, Metcalf and Basapur sought to explain the UCD process to an audience unfamiliar with
this approach. Both used visual aids with circular arrows, emphasizing the cyclical
and iterative nature of the UCD process. The steps they listed can be summarized as:
• Develop initial understanding of problem
• Conduct research with users in the context of use
• Identify research indings
• Identify design concepts based on research indings
• Create design prototypes
• Circle back with the user groups for their assessment of the design
• Iterate on the design with continued input from user groups
Metcalf and especially Basapur provided examples of previous projects to illustrate the UCD process. Basapur reviewed projects on:
• Social TV, which allows geographically distant friends and family to watch TV
together and socialize around it
• A location-based mobile app that allowed people to access videos about particular locations when they approached those spots; the videos were recorded
by the parents and grandparents of the users, so this app promoted intergenerational communication as well as knowledge of the urban landscape
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
671
• A hospital’s operations theater, where researchers discovered that nurses were
the group most lacking in support
• A city’s bike sharing service, speciically how it could be made more attractive
to residents of marginalized communities
The goal of explaining UCD to the other stakeholder groups was to generate
shared knowledge and frameworks that all workshop participants could build on
during the workshop. Since UCD practitioners regularly engage in interdisciplinary
collaboration, they were familiar with both the joys and the challenges involved.
4.4.2 Mapping the terrain of language archives For the UCD practitioners, their
most urgent task at the workshop was to map out the key features of the (to them)
new and unknown world of language archives, in ways that would help them plan
further research and design activities. Two dimensions emerged as key features of
this map:
• Identifying the stakeholder groups and their perspectives
• Identifying different types of language archives.
The irst dimension is addressed in the present section of this report; the second dimension was addressed in the previous section.
4.4.3 Moving forward While the UCD practitioners were not prepared to identify
design solutions during the workshop, they shared ideas concerning research methods
and other elements of a UCD approach for language archives, as those topics became
relevant during various discussions. For instance, when they learned that language
archives don’t always know much about their users, they brainstormed methods for
identifying current language archive users and collecting information about them.
Santosh Basapur suggested that when we start to conduct research, we can place
intercepts on a language archive website, i.e., pop-up questions that users can respond
to. Crysta Metcalf pointed out that information about language archive users could
be aggregated across multiple language archives.
UCD practitioners noted the potential value of partnerships between language
archives and other organizations as a way of accomplishing goals such as language
revitalization activities. They recognized the need to address funding challenges. They
let workshop participants know how useful metaphors such as “opening the door”
(described in §5) were—such analogies can help inspire design solutions.
Finally, the UCD practitioners stressed the importance of collaboration across
stakeholder groups. In a humorous way, Basapur said he had seen linguists feeling
sorry for archivists, archivists feeling sorry for language community members, and
language community members feeling sorry for the technical people. He pointed out
that this kind of empathy is central to the UCD process.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
672
Everybody has to have that empathy…Usually design gets a reputation
for new ideas and fresh ideas, but actually what we usually do is bring
people on the same page and then you do something with it. By the time
they all come onto the same page they bring their own ideas and all the
ideas start compiling and [then we can] say what will really make sense.
So developing empathy and then having the empathy distributed among
all the stakeholders is very important.
4.5 Funding agency representatives The workshop participant representing U.S.
funding agencies was Shobhana Chelliah, who recently completed two terms as program oficer for the NSF Documenting Endangered Languages program (DEL), 2012–
2015. It was also useful to hear Mandana Seyfeddinipur’s perspective as a funder/archive manager outside the U.S. Comments from other workshop participants
illuminated the ways in which U.S. federal grants are embedded in a larger system
of institutions, archives, language communities, and other groups whose needs and
activities are intertwined in complex ways.
4.5.1 DEL Data Management Plan Chelliah explained that one mission of DEL is
to create resources for science and humanitarian purposes. This mission can only be
fulilled if grant recipients make their documentation results available to the public,
which usually means depositing them in a language archive. In 2012, when she became DEL program oficer, Chelliah started to realize that many grant recipients were
not in fact making their materials available. She talked to language archive managers,
and they discovered that “we really didn’t know what was being archived, how much
was being archived, and what quality those deposits were.”
As a result, the DEL program added a requirement to the existing NSF Data Management Plan (DMP). The DEL-speciic DMP required applicants for DEL standard
research and dissertation development research grants to include a plan for placing
their materials in an archive at the conclusion of the funding period. They had to
budget time and money to accomplish those tasks, and they had to get a letter from
an archivist approving this plan.
Here is the relevant text from the DEL Program Solicitation (NSF 15-567):
The DMP should provide evidence that the applicant has contacted a
trusted repository to arrange for long-term archiving of documentation
generated by the DEL project. The language archive selected by a DEL
project must have a long-term institutional commitment to data preservation and access. While the DEL Program does not sponsor or have
an oficial arrangement with any language archive, these services are provided by DELAMAN member archives (http://www.delaman.org)) and
by institutions holding the Data Seal of Approval (http://www.datasealofapproval.org/en/)). Regular data backup should be an integral part of
the DMP, but this is not to be equated with archiving in a trusted repos-
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
673
itory. Backing up data on hard drives, servers, optical media, and cloud
based services does not constitute archiving.
The DMP should include a time-line for completion of archiving activities.
It is expected that archiving should be completed prior to the submission
of the inal project report.
Applicants should include a letter of support from the archive indicating
their willingness to archive project materials and outlining any speciic
arrangements which have been made. This statement must be uploaded
under “Other Supplementary Documents”.
Language documentation is of little value if it cannot be accessed. To that
extent the DEL Program expects that the vast majority of data generated
by the DEL project will be publicly accessible with minimal restrictions
for non-commercial, educational purposes. (Restrictions on commercial
use are acceptable.) The DMP should indicate how archived materials
will be accessible to the public. Any restrictions to be placed on access
should be clearly indicated. If the applicant expects access to some materials to be restricted to certain user groups, the DMP should indicate
the criteria delineating such user groups and provide an estimate of the
percentage of materials which will be so restricted. If time limits are to
be placed on access to materials, the DMP should indicate the period of
time after which access restrictions will be removed (NSF 2015).
While the new DMP was an important step forward in ensuring that language
materials would be placed in an archive, it also had unintended consequences. Some
archivists are now feeling overwhelmed by requests to take language materials, while
the funding they receive from their institutions has not increased. Chelliah noted that
DEL allows grant applicants to add a line item to their budget to offset costs associated with archiving, typically 8% of direct costs, as suggested by the DELAMAN
group. Such funds could help alleviate stresses to archives in accepting new materials.
Susan Kung said that she had indeed started to ask for 8% of direct costs from NSF
grant recipients who want to put their collections in AILLA. However, she has yet to
receive any of the funds that have supposedly been budgeted. The grant applicants
are not even letting her know whether they have received the grants. Chelliah commented that protocol and communication between archivists and grant applicants
needed further development.
4.5.2 Limitations of U.S. funding model Workshop participants identiied several
limitations of the U.S. approach to funding the development of language archives.
One issue was that funding takes the form of “soft money”, meaning short-term
grants, usually for three years. Gary Holton pointed out that “within the U.S., at
least, this is the way we fund science…this is the model.” Yet the concept of preservation is central to the notion of a language archive, and for preservation, the temporal
horizon is not three years but hundreds or thousands of years. Mandana Seyfeddinipur argued that “the problem [with soft money] is that’s not sustainable, this is
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
674
not something that will save the archive. This is something that gives you money for
a certain amount of time.” She noted that one of the major language archives with a
global collection, DoBeS, was funded on soft money, and when its money was pulled
the archive had to undergo a major transformation and search for a new institutional
home.
A second and related point is that the U.S. lacks a central archive to deposit materials funded by the NSF. Seyfeddinipur suggested the Smithsonian. Such an archive
would avoid the problems of soft money and lack of sustainability, especially for
small communities with limited funds. Jennifer O’Neal also stated that many tribal
communities don’t have the funds to create their own language archive. She said all
of her archivist colleagues in the Northwest are dealing with the same issue: language
communities are coming to them for assistance, but the additional funding needed to
help them is not the highest priority for their universities.
However, Chelliah pointed out that the U.S. Congress may not be interested in
funding a national language archive. Their current priorities are national security,
health, and economics. She also noted that some depositors may want to preserve
the freedom to choose where they deposit materials, and that a centralized archive
would constrain the format of deposits to a single set of guidelines.
A third limitation to the U.S. funding approach is that DEL draws a line between
language documentation and access/use/revitalization. DEL only funds documentation, not the other aspects. However, Chelliah pointed out that it was possible to
partner with other funding sources for education/revitalization activities. She mentioned that the director of the NSF Tribal Colleges and Universities Program was very
interested in language and linguistics, and in funding the education of tribal university
students.
To receive DEL funding, Chelliah suggested the following approaches. She noted
that she was only speaking about NSF funding priorities so her advice might sound
“cold.” At a general level, she said that you could say you are developing ideas on
how language works which would contribute to our understanding of all languages,
including languages of interest to national security. More speciic suggestions for
inding funding for the creation of language archives were:
• Apply for funding to create a computationally sophisticated, annotated archive
that facilitates cross-language comparisons. NSF programs in Linguistics and
Documenting Endangered Languages (DEL) would be interested in funding that
kind of archive.
• Apply for funding to develop an archive to help advance Natural Language Processing or Forced Alignment and Transcription. NSF Computer and Information Science and Engineering (CISE) programs would be interesting in funding
that kind of archive.
• Apply for funding to the Social, Behavioral and Economic Science Directorate’s
special program called RIDIR, Resource Implementations for Data Intensive Research. This program “seeks to develop user-friendly large-scale next-generation
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
675
data resources and relevant analytic techniques to advance fundamental research” (NSF Website).
5. Access: Opening the door
5.1 The twin missions of preservation and access In the world of archiving, the two
main missions of archives are framed as “preservation” and “access” (Hunter 2003).
According to workshop participants, the language archives we termed Types 1–3 are
mainly funded to engage in preservation work, so that tends to be their primary
focus, but they are also interested in promoting the use of their materials. For Type 4,
preservation remains important, but there is likely to be a greater emphasis on access.
For preservation, the application of UCD is a somewhat abstract concept. Preservation is accorded a high value in linguistics and archiving regardless of the current
needs of particular users—in this sense, preservation is not a user-centered activity,
although it may be end up being extremely useful to many people. In the broadest
sense, “users” who may beneit from preservation include potential users who have
not yet been born, as well as non-users such as members of a language community
who don’t themselves use the language archive, but value its existence. The application of UCD to facilitate access to language archives is more obvious.
We should also keep in mind that uses of a language archive may be emergent and
unexpected. For instance, the Dinjii Zhuh K’yaa (Gwich’in Language Center and
Archive) ended up getting National Institutes of Health funds because it became a
community center, a place for people to have meetings about managing health issues.
5.2 Opening the door During the workshop, Edward Alexander initiated the idea
of access as a “door” to language archives; the metaphor was quickly adopted and
extended by the group. Alexander initially remarked:
Academics are building these archives…and so you build it for people
like yourself. So the door is an academic door, right? So other academics
walk along and say, ‘oh! I know how to open this door. And it’s for me!
And everything in there is for me!’ And for other people who are not
academics, they look at these archives and they’re like looking at tools
from some foreign thing…the door isn’t made for them.
As other workshop participants built on this notion, it became clear that none
of the workshop participants, not even the linguists, felt that the doors to language
archives were easy to open. The UCD practitioners found the door metaphor very
helpful as a way to think about designing for access. The design of a language archive
could create doors in the irst place, help users open a door and enter, and escort users
inside the language archive to help them make sense of the offerings and ind what
they are looking for.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
676
5.3 Escort Alexander also suggested that it would be helpful for users to have an
escort once they entered the language archive, to help them ind and interpret items
of interest. When he irst visited ANLA, he was guided by an archivist. Libraries have
reference librarians. The Siri personal assistant on iPhones can help people ind their
way around a city. During his presentation, Daryl Baldwin also expressed the view
that while search engines have increased access, nothing replaces the knowledge of
archivists. This notion of an escort found resonance with workshop participants.
5.4 Collection guides The workshop participants with archiving expertise pointed
out that the creation of inding aids such as collection guides is a normal part of the
archiving process. Usually a curator writes a guide for each collection, including a
description of the contents, a list of inventory, who has access rights, and how to
navigate the collection. Even when a collection is not digital, the collection guide is
usually available online; this is what allows people to ind collections that have not
been digitized. Jennifer O’Neal noted that collection guides may be placed in federal
databases to make the materials more indable.
Digital language archives are unusual in the world of archives in that collection
guides are often missing. The reason is probably that many of these language archives
were created by linguists, who may have lacked expertise in archiving practices. At
this point, there is also a shortage of time and staff to prepare detailed inding aids
for all collections. Susan Kung noted that she has a few guides on AILLA. But they
are dependent on depositors writing the guides for their material.
Workshop participants discussed whether such collection guides would resolve
some of the access challenges for language communities and linguists. They concluded that it would depend on how much information was included in a collection
guide, and whether that information was targeted to the search needs and interpretation needs of particular users.
6. Conclusions The workshop described here was valuable in generating the foundational insights and initial conceptual frameworks that will guide us in our further
research on the user-centered design of language archives. We see three main steps
ahead of us to complete the research trajectory.
1. UCD of individual language archives
This will be the most time-consuming task. We plan to work with several contrasting
types of language archives in order to identify the range of variation involved in UCD
processes. We are in conversation with heads of several existing or planned language
archives. For each language archive, the UCD team will join with archive staff and
representatives of user groups to conduct user research, develop a design framework,
and implement the design.
As a irst step, Wasson has joined Shobhana Chelliah’s effort to develop a language
archive termed the Computational Resource for South Asian languages, or CoRSAL.
Wasson’s contribution will be to ensure that its development is informed by UCD
principles. Her Design Anthropology class conducted user research for CoRSAL in
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
677
Fall 2016. This project allowed us to develop a more ine-grained understanding of
the needs of user groups targeted by a particular archive. We also moved forward
in our understanding of the weaknesses of the “archive” concept, and how language
archives might be productively reconceptualized. As the class noted in their inal
report:
The concept of an “archive” and its associated practices are a poor it with
the work practices of linguist depositors. While the logic of archiving
requires the deposit of a completed, unchanging artifact, linguists engage
in a never-ending process of updating and revising their transcriptions
and annotations.
CoRSAL (and ideally all language archives) should permit endless, easy
annotation of deposits. A model of data storage that is dynamic and
interactive, such as a relational database, would be more appropriate (Al
Smadi et al. 2016: 74).
The project also allowed us to further explore the colonial underpinnings of
archiving and linguistics, the importance of developing a participatory research and
design process with language community members, and the practical challenges of
such collaboration in the context of local constraints with regard to Internet access
and other resources.
2. Develop guidelines for UCD of language archives
Based on the knowledge gained in the previous step, we will prepare a guidelines/best
practices document. The initial draft will be circulated to diverse stakeholders and
revised based on their comments.
We expect the guidelines to describe a collaborative process among language
archive stakeholder groups, including user researchers, designers, and whatever other
specialists may be needed. The guidelines will recognize that the design of each
archive needs to be customized to the concerns of its language communities. They
will seek to accommodate the potentially conlicting needs of different user groups.
The guidelines may also include recommendations that go beyond archive design
per se. For instance, they may suggest ways to encourage linguists to deposit data,
and ways to encourage linguists to use archive data for their research.
3. Disseminate the guidelines
We will work with a variety of professional organizations and language communities
to encourage a paradigm shift or culture change toward adoption of UCD principles.
We will seek to have our guidelines adopted or endorsed by organizations such as
• Committee on Endangered Languages and their Preservation, Linguistic Society
of America (CELP, LSA)
• Digital Endangered Languages and Musics Archives Network (DELAMAN)
• Association of Tribal Archives, Libraries, and Museums (ATALM)
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
678
References
Al Smadi, Duha, Sebastian Barnes, Molly Blair, Miyoung Chong, Robin Cole-Jett,
Aaron Davis, Samantha Hardisty, Jenny Hooker, Corderon Jackson, Tori Kennedy,
Janette Klein, Brittany LeMay, Melanie Medina, Kenneth Saintonge, Anh Vu &
Christina Wasson. 2016. Exploratory User Research for CoRSAL. Report prepared
by students of UNT Design Anthropology class for S. Chelliah, Director of the
Computational Resource for South Asian Languages (CoRSAL). https://designinglan
guagearchives.iles.wordpress.com/2017/03/exploratory-research-for-corsal.pdf.
Aiken, Jo, Victoria Schlieder, & Christina Wasson. 2014. “No more cakes and ale?”:
Discovering ethical gray areas in a design anthropology class. Journal of Business
Anthropology, Special Issue 1, 38–61.
Austin, Peter K. 2011. Who uses digital language archives?
http://www.paradisec.org.au/blog/2011/04/who-uses-digital-language-archives/.
Bannon, Liam. 1991. From human factors to human actors: The role of psychology
and human-computer interaction studies in system design. In J. Greenbaum & M.
Kyng (eds.), Design at work: cooperative design of computer systems, 25–44. Hillsdale, NJ: Lawrence Erlbaum Associates.
Berez, Andrea. 2015. Reproducible research in descriptive linguistics: Integrating
archiving and citation into the postgraduate curriculum at the University of Hawai‘i
at Mānoa. In Amanda Harris, Linda Barwick, & Nicholas Thieberger (eds.), Research, Records and Responsibility: Ten years of PARADISEC, 39–51. Sydney: University of Sydney Press.
Bird, Steven & Gary Simons. 2003. Seven dimensions of portability for language documentation and description. Language 79(3). 557–582.
Dobrin, Lise M. & Gary Holton. 2013. The documentation lives a life of its
own: The temporal transformation of two endangered language archive projects.
Museum Anthropology Review 7. 140–154. http://scholarworks.iu.edu/journals/in
dex.php/mar/issue/view/233.
Foucault, Michel. 1982. The archaeology of knowledge and the discourse on language.
New York: Pantheon.
Gezelter, Dan. 2009. Being scientiic: Falsiiability, veriiability, empirical tests, and
reproducibility. http://www.openscience.org/blog/?p=312.
Green, David. 2003. The NINCH guide to good practice in the digital representation
and management of cultural heritage materials. New York: Humanities Advanced
Technology and Information Institute, University of Glasgow and National Initiative
for a Networked Cultural Heritage, New York University.
Hagan, William T. 1978. Archival captive—The American Indian. American Archivist,
41(2), 135–146.
Henke, Ryan & Andrea Berez-Kroeker. 2016. A brief history of archiving in language
documentation, with an annotated bibliography. Language Documentation & Conservation 10. 411–457. http://hdl.handle.net/10125/24714.
Himmelmann, Nikolas P. 1998. Documentary and descriptive linguistics. Linguistics,
36, 161–195.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
679
Holton, Gary. 2011a. The role of information technology in supporting small and
endangered languages. In Peter K. Austin & Julia Sallabank (eds.), The Cambridge
Handbook of Endangered Languages, 371–399. Cambridge: Cambridge University
Press.
Holton, Gary. 2011b.‘Unknown unknowns’ and the retrieval problem in language documentation and archiving. Language Documentation & Conservation 5. 157–168.
http://hdl.handle.net/10125/4496.
Holton, Gary. 2012. Language archives: They’re not just for linguists any more. In F.
Seifart, G. Haig, N. P. Himmelmann, D. Jung, A. Margetts & P. Trilsbeek (eds.), Potentials of Language Documentation: Methods, Analyses, and Utilization, 111–117.
Honolulu: University of Hawai‘i Press.
Holton, Gary. 2014. Mediating language documentation. In David Nathan & Peter K.
Austin (eds.), Language Documentation and Description, Volume 12: Special Issue
on Language Documentation and Archiving, 37–52. London: SOAS.
Holton, Gary, Andrea Berez, & Sadie Williams. 2007. Building the Dena’ina language
archive. In Laural E. Dyson, Max Hendriks, & Stephen Grant (Eds.), Information
Technology and Indigenous Peoples, 205–209. Hershey, Pennsylvania: Information
Science Publishing.
Hunter, Gregory. 2003. Developing and Maintaining Practical Archives: A how-to-doit manual. Chicago: Neal-Schuman Publishers.
Huvila, Isto. 2008. Participatory archive: Towards decentralised curation, radical user
orientation, and broader contextualisation of records management. Archival Science
8. 15–36.
Isaacman, Allen F., Lalu, Premesh, & Nygren, Thomas. I. 2005. Digitization, history
and the making of a postcolonial archive of Southern African liberation struggles:
The Aluka Project. Africa Today 52(2). 55–77.
Krauss, Michael. E. 1974. Alaska Native Language Legislation. International Journal
of American Linguistics 40(2). 150–152.
Krauss, Michael. E. 1992. The world’s languages in crisis. Language 68(1). 4–10.
Leonard, Wesley. Y. 2011. Challenging “extinction” through modern Miami language
practices. American Indian Culture and Research Journal 35(2). 135–160.
Leonard, Wesley. Y. to appear. Producing language reclamation by decolonizing “language.” In Peter K. Austin (ed.), Language Documentation and Description.
Linn, Mary. 2014. Living archives: A community-based language archive model. In
David Nathan & Peter K. Austin (eds.), Language Documentation and Description,
Volume 12: Special Issue on Language Documentation and Archiving, 53–67. London: SOAS.
Nathan, David. 2014. Access and accessibility at ELAR, an archive for endangered languages documentation. In David Nathan & Peter K. Austin (eds.), Language Documentation and Description, Volume 12: Special Issue on Language Documentation
and Archiving, 187–208. London: SOAS.
National Science Foundation. 2015. Documenting Endangered Languages (DEL)—
Data, Infrastructure and Computational Methods. Program Solicitation. NSF 15567. Posted May 5, 2015. https://www.nsf.gov/pubs/2015/nsf15567/nsf15567.pdf.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
680
Norman, Donald A. 1988. The psychology of everyday things. New York: Basic Books.
Povinelli, Elizabeth A. 2011. The woman on the other side of the wall: Archiving the
otherwise in postcolonial digital archives. Differences 22(1). 146–171.
Ritter, Frank E., Gordon D. Baxter & Elizabeth F. Churchill. 2014. Chapter 2:
User-centered systems design: A brief history. In Foundations for Designing UserCentered Systems, 33–54. London: Springer Verlag.
Robinson, Rick E. 1993. What to do with a human factor: a manifesto of sorts. Special
issue: New human factors. American Center for Design Journal 7(1). 63–73.
Robinson, Rick E. 1994. The origin of cool things. In Design that packs a wallop:
Understanding the power of strategic design. American Center for Design conference
proceedings, 5–10. New York: American Center for Design.
Salffner, Sophie. 2015. A guide to the Ikaan language and culture documentation. Language Documentation & Conservation 9. 237–67. http://hdl.handle.net/10125/24639.
Seifart, Frank, Sebastian Drude, Bruna Franchetto, Jürg Gasché, Lucía Golluscio,
& Elizabeth Manrique. 2008. Language documentation and archives in South
America. Language Documentation & Conservation 2(1). 130–140. http://hdl.handle.net/10125/1775.
Seifart, Frank, Geoffrey Haig, Nikolas Himmelmann, Dagmar Jung, Anna Margetts,
& Paul Trilsbeek (eds.). 2012. Potentials of Language Documentation: Methods,
analyses, and utilization. Honolulu: University of Hawai‘i Press.
Shepard, Michael. 2015. “The substance of self-determination”: Language, culture,
archives and sovereignty. Vancouver: University of British Columbia Ph.D. dissertation.
Shilton, Katie & Ramesh Srinivasan. 2007. Counterpoint: Participatory appraisal and
arrangement for multicultural archival collections. Archivaria 63. 87–101.
Squires, Susan & Bryan Byrne (eds.). 2002. Creating breakthrough ideas: The collaboration of anthropologists and designers in the product development industry.
Westport: Bergin & Garvey.
Stoler, Ann L. 2010. Along the archival grain: Epistemic anxieties and colonial common sense. Princeton: Princeton University Press.
Suchman, Lucy. 1995. Making work visible. Communications of the ACM 38(9).
56–65.
Thieberger, Nick, Anna Margetts, Stephen Morey and Simon Musgrave. 2015. Assessing annotated corpora as research output. Australian Journal of Linguistics 36(1).
1–21.
Wasson, Christina. 2000. Ethnography in the ield of design. Human Organization
59(4). 377–388.
Wasson, Christina. 2002. Collaborative work: Integrating the roles of ethnographers
and designers. In S. Squires & B. Byrne (eds.), Creating breakthrough ideas: The
collaboration of anthropologists and designers in the product development industry,
71–90. Westport: Bergin & Garvey.
Wasson, Christina. 2005. Celebrating the cutting edge. Proceedings of the Ethnographic Praxis in Industry Conference, 140–145. Redmond, WA.
Language Documentation & Conservation Vol. 10, 2016
Bringing User-Centered Design to the Field of Language Archives
681
Wasson, Christina & Crysta Metcalf. 2013. Teaching design anthropology through
university-industry partnerships. In W. Gunn, T. Otto, & R. C. Smith (eds.), Design
Anthropology: Between Theory and Practice, 216–231. London: Berg.
Wasson, Christina & Susan Squires. 2012. Localizing the global in technology design.
In Christina Wasson, Mary O. Butler, & Jacqueline Copeland-Carson (eds.), Applying Anthropology in the Global Village, 251–284. Walnut Creek: Left Coast Press.
Wikipedia. 2016. User-centered design. https://en.wikipedia.org/wiki/Usercentered_design.
Woodbury, Anthony C. 2003. Deining documentary linguistics. In P. Austin (ed.), Language Documentation and Description, Volume 1, 33–51. London: SOAS.
Woodbury, Anthony C. 2014. Archives and audiences: Toward making endangered
language documentations people can read, use, understand, and admire. In David
Nathan & Peter K Austin (eds.), Language Documentation and Description, Volume
12, 19–36. London: SOAS.
Zeitlyn, David. 2012. Anthropology in and of the archives: Possible futures and contingent pasts. Archives as anthropological surrogates. Annual Review of Anthropology
41. 461–480.
Christina Wasson
christina.wasson@unt.edu
Gary Holton
holton@hawaii.edu
Heather S. Roth
heathersroth@gmail.com
Language Documentation & Conservation Vol. 10, 2016