(PDF) The Researcher and the Never-Ending Field: Reconsidering Big Data and Digital Ethnography

This chapter focuses on the concept of data to clarify how it operates on our research sensibilities. By deconstructing the concept, we can better situate it, consider whether or not we should use the term at all, or be more clear in our definitions of what we mean when we explain our research to others. Drawing on current critical academic responses to the rise of data and big data, I posit that data operates on at least two levels; as thing and as ideology. Though inextricable in practice, we can separate these concepts momentarily to begin to identify how quite different meanings might be operating in our theoretical frameworks, research design, and everyday activities. Once these dual levels are recognized-a process that requires conscious and critical self-reflexivity, one can more strategically frame and use the interpretation of data in multiple and nuanced ways to add layers of meaning or augment the analytical processes.

Emerging digital data sources provide opportunities for explaining social processes, but also challenge knowledge production practices within social sciences. this article contributes to the ‘end of theory’ discussions, which have intensified in the social sciences since the widening practice of big data and computational methods. Adopting a systematic literature review of 120 empirical articles through a combined quantitative and qualitative approach, this article strives to contribute to the ongoing discussions on the epistemological shifts in social media big data (smBD) studies. This study offers an insight into the development of analytical methods and research practices in smBD studies during their rapid growth period in 2012 –2016. The study findings only partially revealed the ‘end of theory’ claim: the problem setting of the studies is rather weakly related to theory, often neither hypothesis nor research questions are formulated on the basis of previous theories or research. However, this relatively weak relatedness to theory has not led to the descriptive type of inference, but rather exploratory, or predictive ways of reasoning. instead of enabling predictions in social science research, smBD raises issues of understanding the causes and effects in predictions for evaluating the social mechanisms of global disruptions. Developing ‘human research machines’ that exploit the cognitive resources of individuals should not be the aim of smBD production. the outcome should be to recognise that the cognitive abilities of researchers, access to data, and developing novel methods are necessary for evaluating the global impact of social behaviour.

The digitization of communications technology has led to an intense interaction between human and digital-based technology. A large number of digital data traces produced by humans as a result of that activity. Such data is commonly referred to Big Data. The availability of Big Data as a digital data source in turn, opens opportunities for communication scientists to be able to use that data to get the patterns and trends of human activities that have been done through social research. It is necessary to understand the basic concept of the Big Data, using appropriate tools and adequate access to the data, and appropriate research method in order to be able to conduct research by using such digital data. This paper aims to describe the potential of Big Data for the purposes of communication research, the use of appropriate tools, techniques and methods and to identify potential research directions in the digital realm. Some limitations and critical issues related to the research vali...

Christine Lohmeier, The Researcher and the never-ending field: reconsidering big data and digital ethnography Chapter forthcoming in Martin Hand & Sam Hillyard (eds) (in press) Big Data? Qualitative approaches to digital research. Emerald Press This chapter considers the challenges and potentials of using so called big data in communication research. The chapter first grapples the task of clearly defining big data in the context of communication and media studies. It then moves on to analyse and critique processes associated with the dealings of big data: datafication and dataism. The challenges of data-driven research are juxtaposed with qualitative perspectives on research regarding data gathering and context. These thoughts are further elaborated in the second part of the chapter where the lessons learned in digital ethnography are linked to challenges of big data research. As a method, digital ethnography has evolved in the past years, particularly with the so-called material turn – a call for communication researchers to consider the wider context in which (digital) communication takes place. The chapter emphasises the need to ground research in wider practices in overlapping media and material realms. As digital media and communication researchers, we are facing the challenge as well as enjoying the richness of a never-ending digital field which constantly generates multitudes of data. It is proposed that by including the materialities of contexts and transitions between these contexts and material and mediated realms, we can ask more relevant research questions and gain more insights compared to a purely data-driven approach.

ng THE RESEARCHER AND THE NEVER-ENDING FIELD: RECONSIDERING BIG DATA AND DIGITAL ETHNOGRAPHY Pu bl is hi Christine Lohmeier up ABSTRACT al d G ro Purpose This chapter considers the challenges and potentials of using so called big data in communication research. It asks what lessons big data research can learn from digital ethnography, another method of gathering digital data. (C )E m er Design/methodology/approach The chapter first takes on the task of clearly defining big data in the context of communication and media studies. It then moves on to analyse and critique processes associated with the dealings of big data: datafication and dataism. The challenges of data-driven research are juxtaposed with qualitative perspectives on research regarding data gathering and context. These thoughts are further elaborated in the second part of the chapter where the lessons learned in digital ethnography are linked to challenges of big data research. Big Data? Qualitative Approaches to Digital Research Studies in Qualitative Methodology, Volume 13, 75 89 Copyright r 2014 by Emerald Group Publishing Limited All rights of reproduction in any form reserved ISSN: 1042-3192/doi:10.1108/S1042-319220140000013005 75 76 CHRISTINE LOHMEIER Findings It is proposed that by including the materialities of contexts and transitions between material and mediated realms, we can ask more relevant research questions and gain more insights compared to a purely data-driven approach. Practical implications This chapter encourages researchers to reflect upon their relations to the object of study and the context in which data was produced through human/human technical interaction. is hi ng Originality/value This chapter contributes to debates about qualitative and quantitative research methods in communication and media studies. Moreover, it proposes that methods which are in the widest sense used in the never-ending digital field benefit from the mutual consideration of both qualitative and quantitative approaches. up Pu bl Keywords: Digital ethnography; big data; qualitative research; communication research; material turn G ro INTRODUCTION (C )E m er al d Big data is hyping. The possibilities of big data have received a lot of attention by communication scholars. One of the most recent pieces of evidence for this is the publication of a special issue on big data by the Journal of Communication, one of the most prominent and well-respected publications in the field. The magazine Research Trends (Halevi & Moed, 2012, p. 5) attests to ‘an explosion of publications since 2008’. This chapter considers how big data is used in communication research. Following an assessment of what is meant by ‘big data’, it outlines the potentials and challenges of (communication) research with big data. In a second step, big data as well as digital ethnography are re-considered from a qualitative research perspective. Over the past two decades, digital ethnography another research method with a strong focus on the digital world and online activities has experienced increasing popularity. I propose that approaches to and with big data can benefit from what has been learned in developing and refining digital ethnographies. BIG DATA IN COMMUNICATION RESEARCH Big data stands at the intersection of technology and social reality. It is a ‘cultural, technological, and scholarly phenomenon’ (boyd & Crawford, 77 The Researcher and the Never-Ending Field (C )E m er al d G ro up Pu bl is hi ng 2011, p. 663). The term is used to refer to a method and an approach to science and research as well as to large datasets themselves. In the past, big data has caused some over-excitement and even mythologising, meaning a ‘widespread belief that large data sets offer a higher form of intelligence and knowledge’ with a previously unachieved ‘aura of truth, objectivity, and accuracy’ (boyd & Crawford, 2011, p. 663). Parks (2014, p. 355) even calls what we are witnessing right now a ‘Big Data movement’. As the term suggests, we are talking about ‘big’ data, but there have always been data sets which in their time were considered relatively large, so size ‘alone is therefore an insufficient descriptor’ (Parks, 2014, p. 355). Even before the term became fashionable, larger datasets than those which are now referred to as big data were already available, such as census data (boyd & Crawford, 2011, p. 663). For communication scholars, the datasets in questions can be ‘large social networks (including online networks such as Twitter), automated data aggregation and mining, web and mobile analytics, visualization of large datasets, sentiment analysis/opinion mining, machine learning, natural language processing, and computer-assisted content analysis of very large datasets’ (Parks, 2014, p. 355). In communication research, the analysis of big data stemming from Twitter is particularly common at this point in time. This is partly due to the fact that large datasets of tweets are relatively easy to get hold of. Nevertheless, even with regards to Twitter, researchers are somewhat dependent on the benevolence of Twitter Inc. and its regulations; the challenge of data availability will be discussed in more detail below. CHALLENGES OF DATAFICATION AND DATAISM Why has big data been given such a prime spot in debates about social sciences over the past few years? The coming together of technological developments, that is computers having the capacity to store and carry out analysis of large datasets, promises new findings hopefully followed by new insights that could not be obtained at an earlier stage. At the same time, big data which is of particular interest to communication scholar is continuously being generated by people using and ‘feeding’ information and communication technologies. This process has been coined as ‘datafication’ (Mayer-Schönberger & Cukier, 2013). Data is being generated by users and being conceived as something worth looking at by (communication) researchers. These developments are indeed exciting as they allow for new types of research questions. 78 CHRISTINE LOHMEIER (C )E m er al d G ro up Pu bl is hi ng A second aspect of ‘datafication’ is linked to the new computational prowess in analysing large datasets. These new capacities allows for the bringing together of multiple ‘datasets of different times, from different places, or gathered at different times’. Big data has evoked scholars and commentators to refer to what we are experiencing now as a ‘big data revolution’ (Mayer-Schönberger & Cukier, 2013). No doubt, the benefits of big data analysis might be ground-breaking in some disciplines and possibly lifesaving, for example when it comes to analysing medical data sets. However, big data is also a continuation of how science, including the social sciences, has evolved over the past 100 years (boyd & Crawford, 2012; Parks, 2014). As with other technologies and types of information and data, what was once only accessible to few is now available for more agents, including ‘scholars, marketers, governmental agencies, educational institutions, and motivated individuals’ (boyd & Crawford, 2011, p. 664). The question of how to go about an analysis of large data sets does not require a trip to the local library: tricks and pitfalls can now be easily found in blog posts (Bar-Joseph, 2013). The process of datafication, alongside questions on how to deal with the big data sets in question, brings several challenges. Anderson’s bold assertion that ‘[w]ith enough data, the numbers speak for themselves’ (2008) has been widely refuted, even in circles of researcher that are strongly associated with quantitative research. Moreover, if we think about the social world from an epistemological perspective, ‘data’ is ubiquitous; the (digital) ethnographer in the field just like the big data analyst is surrounded by data. The challenge then becomes to relate different pieces of data, trace and confirm patterns and make sense of what was found in the larger scheme of things. But often the assumption when it comes to large data sets is that they are (a) intrinsically relevant, (b) holistic and complete in describing phenomena that can be distinguished from other occurrences disconnected to or at least not effected by them and (c) clean meaning that there are no corrupted data. This type of thinking, the underlying assumption that all answers are to be found by looking at data alone has been coined ‘dataism’. While working towards my PhD, I remember sitting in a doctoral workshop at the University of Glasgow, during which, a senior scholar encouraged us to ‘trust our data’. For me, this meant trusting what I have observed during times of ethnographic field work, taking seriously field notes and what research participants had told me in interviews and focus groups. Interpretations, of course, need thinking, re-thinking, questioning. As in other areas of life (Turkle, 2011), there is a latent assumption that 79 The Researcher and the Never-Ending Field (C )E m er al d G ro up Pu bl is hi ng technology can do better than humans, that is technically generated or mechanically selected data sets are more reliable than those collected by personally and physically going to a field and gathering data. Dataism is an expression of the tendency to value technically generated or selected data higher, to view it as more objective and therefore more reliable, making theory obsolete. The famous case of correlation between S&P 500 stock index and butter production in Bangladesh (Leinweber, 2007) demonstrates that everything data suggests is neither true nor necessarily significant. As has been shown for example in the case of large datasets gathered from Twitter, the data received is problematic. First of all, Twitter, like Facebook, offers very limited archiving capacities (boyd & Crawford, 2012). Consequently, there is a bias towards working with fairly recent data or data of the immediate past. Secondly, the data sets obtained are not necessarily complete or selected in a traceable manner. For example, to gather tweets and feed them into a data set, researchers work with an application programme interface (API). The majority of researchers have access to about 10 per cent of public tweets. This is due to terms and conditions set by Twitter Inc. So how are these 10 per cent of all public tweets selected? ‘It could be that the API pulls a random sample of tweets or that it pulls the first few thousand tweets per hour or that it only pulls tweets from a particular segment of the network graph. Without knowing, it is difficult for researchers to make claims about the quality of the data they are analysing’ (boyd & Crawford, 2012, p. 669). For many data sets relevant to communication research, the quality and therefore the reliability of the data is limited and access often depends on the goodwill of companies: ‘[O]nly social media companies have access to really large social data especially transactional data. An anthropologist working for Facebook or a sociologist working for Google will have access to data that the rest of the scholarly community will not’ (Manovich, 2011). Alongside questions of access and data reliability, it is doubtful that research questions can always be answered in the best possible manner purely because of researchers working with a large data set. Java et al. (2007) found that people’s motivations for using Twitter were the need to share and seek information as well as to sustain and conserve friendships. These results were based on the analysis of 1.3 million tweets from 76,177 users. But as Marwick (2014) rightly points out, conducting qualitative interviews and participant observation with Twitter users, is likely to bring out a much more refined picture of motivations, human technology interactions, relationships and other issues at stake. The hype about big data and methods including computational analysis should not mean a turning 80 CHRISTINE LOHMEIER (C )E m er al d G ro up Pu bl is hi ng away from small data sets. They hold very valuable insights too (boyd & Crawford, 2012). More often than not, the true promise of big data research might become apparent in combining big data research with other, perhaps especially, with qualitative research methods. In the case of big data research on tweets, Axel Bruns and colleagues (Bruns, 2012; Bruns & Burgess, 2012; Bruns, Burgess, Crawford, & Shaw, 2012) have, among others, used big data analyses to map the shape and dynamics of large networks. While this is extremely useful for our understanding of the workings of large networks, such type of analyses tell us little about the meaning of networks, tweets, platforms in people’s everyday life. By purposefully taking a small data approach, Stephanson and Couldry (2014) demonstrate that great insights can be gained on Twitter’s influence on community and (collective) identity by combining a number of methods and by analysing a relatively small and context-specific number of tweets. The aim here is not to praise the virtue of one kind of research in contrast to the shortcomings of another but to acknowledge that each and every one method and approach comes with advantages as well as shortcomings. Drawing on the work of Florian Znaniecki on ‘the human coefficient’, Christians and Carey (1989, p. 360) remind us that ‘data always belong to somebody, that they are constructed in vivo and must be recovered accordingly’. Capturing data in vivo is of course a challenge in and of itself and it is certainly not essential for every type of research question. However, Christians and Carey’s (1989) point reminds us of two important aspects of data: For one, every insight gained through big data analysis gives information about the past. This is not specific to big data all forms of content analyses do not provide first-hand information on how data was produced in vivo (e.g. in newsroom, in living rooms, on the go with mobile devices). However, when it comes to big data because of the sheer amount of users considered we know little about individual circumstances in which data was produced. Answering the question of whether we can use our understanding of the past to predict the future goes beyond the remits of this contribution. But nevertheless, with only a rudimentary understanding or a good estimate of what goes on ‘on the ground’ where data originates, the quality of predictions and even of the analyses are likely to decline. The second point raised by Christians and Carey (1989) relates back to dataism. At times there seems to be an unconscious detachment regarding the origin of data. As social and cultural researchers, we are generally interested in data directly or indirectly generated by humans or through human technology interaction. Big data research in the field of communication 81 The Researcher and the Never-Ending Field d G ro up Pu bl is hi ng makes use of people’s digital footprints or data trails. However, the question of ownership of these data is highly contentious. The recent verdict of the European Court of Justice forced Google Inc. to delete certain information about a Spanish citizen (Travis & Arthur, 2014). In a similar vein, the ‘right to be forgotten’ a concept originally coined by Victor Mayer-Schönberger (2009) and taken up by policy makers as has been diswell as NGOs and civil liberties groups (Rosen, 2012) cussed widely and, in fact, is a concern to many users. From an ethical perspective, big data then does not happen in a void. Can we imagine a scenario where permissions to use tweets have to be sought from each and every single user in a large data set? For medical records, that is certainly the case. But access to and power over data is not straightforward. Will we allow companies such as Twitter, Facebook and Google to negotiate ethical concerns or even to simply ignore them? The huge promises of big data are therefore accompanied by a number of serious challenges. The following section will approach challenges big data poses in light of discussions surrounding digital ethnography and the aims of qualitative research more generally. )E m er al RECONSIDERING BIG DATA AND DIGITAL ETHNOGRAPHY FROM A QUALITATIVE PERSPECTIVE (C From a communication scholar’s perspective, digital ethnography and big data are both linked to processes of digitisation and mediatisation. We live with what Couldry (2011) has called a ‘media manifold’ in which the majority of highly diverse aspects of everyday life are directly or indirectly mediated (Hepp, 2010; Livingstone, 2009). The dynamic configurations of mobile and more or less stationary technical devices form part of everyday life and allow for a ‘connected presence’: We can now, if we wish, be permanently open (and potentially responsive) to content from all directions. Many writers see the practice (or even compulsion) of continuous connectivity as characteristic of the ‘digital native’ generation. […] Keeping all channels open means permanently orienting oneself to the world beyond one’s private space and the media that are circulated within it. (Couldry, 2012, p. 55) Communication devices are either at the centre of our actions and attention or on the periphery. Most significantly though, they are ubiquitous 82 CHRISTINE LOHMEIER (C )E m er al d G ro up Pu bl is hi ng (Hand, 2012) and they intersect, influence, form and arrange aspects of our material world. I will return to this point in greater detail below. With this in mind, researching media and communication is a highly complex undertaking and several methods have been developed to adhere to research questions and capture the needed data. Along interviews, focus groups, surveys all methods common to the social sciences more generally, there are some which are more specific to media and communication research, such as different forms of content analysis and media ethnography. Media ethnography is used to gather data on websites or digital media more generally. Of course ethnographies as well as large datasets are possible outside of the digital realm; examples could be large datasets on television viewing habits in a pre-Internet era and ethnographies of newspaper readers. But it cannot be ignored that both of these approaches to research media ethnography and big data analyses have gained momentum in the digital era. After introducing digital ethnography in more detail, the chapter will move on to consider in which ways big data research might benefit by considering some of the challenges which digital ethnographic researchers have had to face. Digital ethnography1 is based on the anthropological and sociological approach of treating a certain space as a field. In traditional anthropology, this was generally speaking a certain locale which the researcher would travel to and make him or herself ‘at home’ as far as that was possible in order to gather data. An exemplary anthropologist was supposed to ‘go native’, live just like or at least alongside the ‘tribe’ she was researching and, once substantial amounts of data were gathered, return home to interpret field notes, recorded conversations and so on. A pivotal characteristic of this type of research is the close, embodied and personal relationship between researcher and researched (see Coffey, 1999). Interestingly, and perhaps in contrast to what one might come to expect, field relations do not end with the researcher leaving the field. A very common experience of ethnographic work is that the field turns out to be ‘sticky’ as it stays present on the researcher’s mind much longer than could be expected. Okely (1994, p. 32) eloquently describes this process: [T]he experience of anthropological material is, like fieldwork, a continuing and creative experience. The research has combined action and contemplation. Scrutiny of the notes offers both empirical certainty and intuitive reminders. Insights emerge also from the subconscious and from bodily memories, never penned on paper. […] The author is not alienated from the experience of participant observation, but draws upon it both precisely and amorphously for the resolution of the completed text. 83 The Researcher and the Never-Ending Field (C )E m er al d G ro up Pu bl is hi ng Following this approach, ethnographic research consists of a mix of methods, including interviews as well as informal chats with people encountered in the field, focus groups and (participant) observation. While the individual methods employed might vary strongly depending on the field and the research questions, the main commonality of ethnographic studies is that the researcher makes a conscious effort of understanding the field and the people he or she researches from their perspective. In an ideal scenario, the researcher simultaneously manages to keep a certain level of objectivity and a critical capacity of what he or she encounters which is not easy as field relations quickly become complex and multi-dimensional (Lohmeier, 2014). Among others, Christine Hine must be acknowledged as one of the pioneers of media or virtual ethnography. Like ‘regular’ ethnography, media ethnography is a mix of method (see for example Hine, 2000) which has gained ever higher levels popularity in communication research. The opportunity to examine communities and interactions in social information and communication technologies (SICT) has led to a steep increase on studies focusing on communication practices online. In traditional ethnographies, scholars distinguish between emic and etic approaches to the field. While the former indicates that the researcher is part of the community he or she investigates, the latter implies that the researcher is in fact an intruder who has not been socialised in the context s/he now examines. Both types of field relations have advantages and disadvantages. An emic researcher, for example a person researching the community he or she has been brought up in, might be highly familiar with certain behavioural patterns and structures encouraging or hindering certain actions. In this case, the researcher will need a lot less time of familiarising himself or herself with the field and with what is at stake. Then again, the fact of belonging somewhere and being seen as ‘one of us’ in the widest sense by research participants, might also have certain disadvantages. If, the field in question is highly polarised, research participants are likely to assign the researcher to a ‘side’. Whether this is justified or not, is another matter. Imagine a research project on the memories of the Troubles in Northern Ireland. Clearly, an etic researcher, who in an ideal scenario even comes from outside of the United Kingdom and Ireland, might have more success of building rapport with informants than someone who is perceived as biased right from the start. On the other hand, there might be complexities and intricacies of the field that the etic researcher might completely miss out on because certain phenomena which are relevant in this particular field 84 CHRISTINE LOHMEIER are not familiar from her own background. Similarly, there might be prejudices among informants about a researcher coming from a different background. So both ways of doing research, emic and etic, have benefits as well as drawbacks. But whether one or the other, good research ends with insights, understanding and in all likelihood more questions to answer and follow up on. The term ‘understanding’ is often linked back to qualitative or ‘soft’ science. However, as Wax (1971, pp. 10 11) points out, understanding is not meant in the sense of empathy: Pu bl is hi ng Understanding does not refer to a mysterious empathy between human beings. Nor does it refer to an intuitive or rationalistic ascription of motivations. Instead, it is a social phenomenon a phenomenon of shared meanings. Thus a fieldworker who approaches a strange people soon perceives that this people are saying and doing things which they understand but he does not understand. One of the strangers may make a particular gesture, whereupon all the other strangers laugh. They share in the understanding of what the gesture means, but the fieldworker does not. When he does share it, he begins to ‘understand’. He possesses a part of the insider’s view. (C )E m er al d G ro up The distinction between emic and etic field relations forms part of practising reflexivity. In ethnographic work, this conscious reflection of field relations and potential blind spots and biases is clearly encouraged. In the case of digital ethnographies, it is not common to make explicit one’s relationship to the subject of study. But what could be gained by doing so, by reflecting on the researcher’s relation the subject? What is striking when considering digital ethnography as well as big data, is the prominence of data in our relating to it. But would it not make sense to also consider how we relate to this data at the start and throughout the research process? This is not meant to encourage a normative stance in researchers, labelling something as good or bad. What I’m aiming for here is a subjective perspective of the data analysed. If we stick with an analysis of tweets, short messages published through Twitter as described above, does it make a difference if the researcher uses Twitter himself or herself ? Does it matter if he enjoys using it or not? Obviously, for crunching numbers in quantitative analyses, this might not matter so much as the actual calculation seems fairly standardised. But just like in a digital ethnography, the researchers’ insights about the way Twitter can be used and put to use for individuals, has an influence on the sort of research questions she might ask. A bit more than a decade ago, Marc Prensky (2001) coined the concept of ‘digital natives’ and ‘digital immigrants’. In communication research, the distinction of those having grown up with digital technologies and gadgets as opposed to those who have learned how to live with these technologies 85 The Researcher and the Never-Ending Field at a later point in life has been useful. When considering digital data, be it in the form of digital ethnography or big data, the distinction where a researcher stands could be useful too. Drawing on the work of Lash and Lunenfeld, Beneito-Montagut (2011, p. 720) emphasises the following with regard to (digital) ethnography in today’s world: ng Ethnography in this dynamic arena eventually necessitates a ‘technologized’ researcher (Lash, 2002; Lunenfeld, 2000). Moreover, paradoxically, in order to achieve reflexive, critical, precise descriptions of internet phenomena we need both to ‘speed-up’ to follow our fast-moving objects of analysis and to ‘slow down’ to understand them properly. […][Some studies] are more concerned with the features of the technology than with the forms and meaning of social interaction online. (C )E m er al d G ro up Pu bl is hi The danger is indeed that a focus on technologies and data becomes an end in itself. As researchers we are at times so enthralled with the wealth of digital data and what could possibly be done with it, that there is a danger is to forget what the most pressing research questions are. Moreover, being critical and reflective of a researcher’s relation to technology can be highly useful. Returning to the case of Twitter as an example, boyd and Crawford (2012, p. 669) remind us that, for one, Twitter does not ‘represent “all people”’ and it is wrong to assume that ‘“people” and “Twitter users” are synonymous’ as some users might have multiple accounts and some accounts have multiple users. In addition, some accounts are so-called ‘bots’ which ‘produce automated content without directly involving a person’. Some ‘users’ might never establish a Twitter account but ‘listen in’ via the web (Crawford, 2009). What do definitions of ‘user’, ‘participation’ and ‘active’ mean in this context? Understanding the technical side of Twitter and its affordances, that is how this technology is and can be used, is absolutely essential when considering the results that come out of big data analyses. This background information is not only highly useful but also essential in making sense of the results. A second challenge digital research has to face is a re-focusing on contexts. In what has become known as the material turn, researchers are encouraged to pay attention to how objects and the physicality as well as different spaces of life interact with what was originally called the virtual life. What we are experiencing are two simultaneous but highly related developments; for one, there is the increasing mediatisation of everyday life; it seems that for some individuals, all areas of life are mediated and life without media seems unthinkable. Secondly, the material turn in the humanities reminds us that despite digitisation and the mediatisation of everyday life, objects and the physicality of what surrounds us is still highly 86 CHRISTINE LOHMEIER Pu bl is hi ng significant and should not be neglected in our conceptualisations. The challenge of course lies in creating research methods, which capture online and offline life and their intersections. Studies relying solely or to a great extent on big data or digital ethnography alone, run the risk of being disconnected from social reality. In other words, according to proponents of the material turn, these kinds of studies tell us only about a very limited interaction which research subjects engage with in their everyday life. What it takes, is a multi-sited and userfocused way of research, that does not hold data and thereby datafication in a more esteemed sense than social reality. In the case of digital ethnography, Beneito-Montagut and others (2011, p. 730; see also Christine Hine on the University of Surrey Youtube Channel, 2013) argue for what Beneito-Montagut calls an extended or ‘expanded ethnography’ which goes beyond looking at single-media use and even viewing the digital world as a field in itself: G ro up [A]n extended ethnography is multi-situated, user centred, flexible and multimedia. It requires highlighting again that the strength of expanded ethnography lies in its capacity to analyse in-depth complex interactions, avoiding artificial divisions of linked social phenomena and problems for their analysis. Meanwhile, it needs to be considered that such a user-centered approach requires a clear ethic guideline. (C )E m er al d Following this criticism and the re-focusing of communication research in digital times, we need theories and research methods which place people and their social practices at the heart of research activities. In times of digital/big data, online and offline spaces overlap to such a great extent and they are so vastly interdependent, that the next big challenge is for research to develop methodologies which allow us to capture these realities: Social practices change as digital spaces become embedded in a culture. People may feel anxious if a smart phone is lost or an internet connection gets disrupted, and making a New Year’s resolution or celebrating Lent may involve forgoing access to electronic devices. (Hallett & Barber, 2014, p. 310) The challenge for digital ethnography has been to move away from the one-dimensionality of data. For convenience sake, online activity has often been viewed as an isolated action. Online ethnographies of one particular site are still a legitimate way of gathering data and depending on the research question they can indeed bring new insights. However, there is also a strong calling to not view certain media practices as isolated events but see them in the context of a wider media ecology (Hoskins & O’Loughlin, 2010) in which individuals use, read, consume, produce, contribute, collect, share, comment, like, link, create and so on, and in which 87 The Researcher and the Never-Ending Field collectives come together, grow, decline and disintegrate over the space of time. Even with the promises of big data analyses, the challenges will be similar to the ones that digital ethnographers have to address and are still in the process of solving. CONCLUSION NOTE er al d G ro up Pu bl is hi ng After an overview of big data use in communication research, this chapter addressed some of the myths and thinking surrounding big data. The criticism of processes coined ‘dataism’ and ‘datafication’ is a reminder to refocus and to not get carried away by the sheer availability of relatively large data sets. The never-ending field to be found by the (digital) researcher does not make all data and the results they yield relevant or every sample desirable for analysis. The challenge remains to find methodologies that capture, record and analyse the complexities of media practice as opposed to reducing them. (C )E m 1. Depending on the time and context of writing, the term used might also be ‘virtual’ or ‘media’ ethnography. REFERENCES Anderson, C. (2008). The end of theory: Will the data deluge makes the scientific method obsolete? Retrieved from http://edge.org/3rd_culture/anderson08/anderson08_index.html Bar-Joseph, U. (2013). Big data = big trouble: How to avoid 5 data analysis pitfalls. Retrieved from http://searchenginewatch.com/article/2289574/Big-Data-Big-Trouble-How-toAvoid-5-Data-Analysis-Pitfalls. Accessed on April 30, 2014. Beneito-Montagut, R. (2011). Ethnography goes online: Towards a user-centred methodology to research interpersonal communication on the internet. Qualitative Research, 11(6), 716 735. doi:10.1177/1468794111413368 boyd, d., & Crawford, K. (2011). Six Provocations for Big Data. A Decade in Internet Time: Symposium on the Dynamics of the Internet and Society. Retrieved from http://ssrn. com/abstract=1926431. Accessed in September 2011. boyd, d., & Crawford, K. (2012). Critical questions for big data. Information, Communication & Society, 15(5), 662 679. doi:10.1080/1369118X.2012.678878 88 CHRISTINE LOHMEIER (C )E m er al d G ro up Pu bl is hi ng Bruns, A. (2012). How long is a tweet? Mapping dynamic conversation networks on Twitter using Gawk and Gephi. Information, Communication and Society, 15(9), 1323 1351. doi:10.1080/1369118X.2011.635214 Bruns, A., & Burgess, J. E. (2012). Researching news discussion on Twitter: New methodologies. Journalism Studies, 13(56), 801 814. doi:10.1080/1461670X.2012.664428 Bruns, A., Burgess, J. E., Crawford, K., & Shaw, F. (2012). #qldfloods and @QPSMedia: Crisis communication on Twitter in the 2011 South East Queensland floods. Research report (pp. 1 58). Retrieved from http://eprints.qut.edu.au/48241/. Accessed on June 16, 2014. Christians, G., & Carey, J. W. (1989). The logics and aims of qualitative research. In G. H. I. Stempel & B. H. Westley (Eds.), Research methods in mass communication (pp. 354 374). Englewood Cliffs, NJ: Prentice Hall. Coffey, A. (1999). The ethnographic self: Fieldwork and the representation of identity. London: Sage. Couldry, N. (2011). The necessary future of the audience … and how to research it. In V. Nightingale (Ed.), The handbook of media audiences (pp. 213 229). Oxford: Wiley-Blackwell. doi:10.1002/9781444340525.ch10 Couldry, N. (2012). Media, society, world: Social theory and digital media practice. Boston, MA: Polity Press. Crawford, K. (2009). Following you: Disciplines of listening in social media. Continuum: Journal of Media & Cultural Studies, 23(4), 525–535. Halevi, G., & Moed, H. (2012). The evolution of big data as research and scientific topic: Overview of the literature. Research Trends. Special Issue on Big Data. (30), 3 6. Retrieved from http://www.researchtrends.com/wp-content/uploads/2012/09/Research_ Trends_Issue30.pdf. Accessed on April 30, 2014. Hallett, R., & Barber, K. (2014). Ethnographic research in a cyber era. Journal of Contemporary Ethnography, 43(3), 306 330. Hand, M. (2012). Ubiquitous photography. Cambridge: Polity. Hepp, A. (2010). Researching ‘mediatised worlds’: Non-mediacentric media and communication research as a challenge. In N. Carpentier, I. Tomanic Trivundza, P. PruulmannVengerfeldt, E. Sundin, T. Olsson, R. Kilborn, H. Nieminen, & B. Cammaerts, (Eds.), Media and communication studies interventions and intersections (pp. 37 48). Tartu, Estonia: Tartu University Press. Hine, C. (2000). Virtual ethnography. London: Sage. Hoskins, A., & O’Loughlin, B. (2010). War and media. The emergence of diffused war. Cambridge: Polity. Lash, S. (2002). Critique of information. London: Sage. Leinweber, D. (2007). Stupid data miner tricks: Overfitting the S&P 500. The Journal of Investing, 16(1), 721 723. Livingstone, S. (2009). On the mediation of everything: ICA presidential address 2008. Journal of Communication, 59(1), 1 18. Lohmeier, C. (2014). Cuban Americans and the Miami media. Jefferson, NC: McFarland. Lunenfeld, P. (2000). Snap to grid: A user’s guide to digital arts, media and cultures. Cambridge, MA: MIT Press. Manovich, L. (2011). Trending: the promises and the challenges of big social data. In M. K. Gold (Eds.), Debates in the digital humanities. Minneapolis, MN: The University of Minneapolis Press. Retrieved from http://www.manovich.net/DOCS/Manovich_ trending_paper.pdf. Accessed on April 30, 2014. 89 The Researcher and the Never-Ending Field (C )E m er al d G ro up Pu bl is hi ng Marwick, A. E. (2014). Ethnographic and qualitative research on twitter. In K. Weller, A. Bruns, J. Burgess, M. Mahrt, & C. Puschmann (Eds.), Twitter and society. New York, NY: Peter Lang. Mayer-Schönberger, V. (2009). Delete: The virtue of forgetting in the digital age. Princeton, NJ: Princeton University Press. Mayer-Schönberger, V., & Cukier, K. (2013). Big data: A revolution that will transform how we live, work, and think. Boston, MA: Houghton Mifflin Harcourt. Okely, J. (1994). Thinking through fieldwork. In A. Bryman, & R. G. Burgess (Eds.), Analyzing qualitative data (pp. 1 34). London: Routledge. Parks, M. R. (2014). Big data in communication research: Its contents and discontents. Journal of Communication, 64, 355 360. doi:10.1111/jcom12090 Prensky, M. (2001). Digital native, digital immigrant. Retrieved from http://www.marcprensky. com/writing/Prensky%20-%20Digital%20Natives,%20Digital%20Immigrants%20-% 20Part1.pdf. Accessed on May 1, 2014. Rosen, J. (2012, February 13). The right to be forgotten. 64 STAN. L.REV.ONLINE 88. Retrieved from http://www.stanfordlawreview.org/sites/default/files/online/topics/64SLRO-88.pdf. Accessed on June 9, 2014. Stephanson, H., & Couldry, N. (2014). Understanding micro-processes of community building and mutual learning on twitter: A ‘small data’ approach. Information, Communication & Society. doi:10.1080/1369118X.2014.902984 Travis, A., & Arthur, C. (2014). EU court back ‘right to be forgotten’: Google must amend results on request. The Guardian, May 13. Retrieved from http://www.theguardian. com/technology/2014/may/13/right-to-be-forgotten-eu-court-google-search-results. Accessed on June 9. Turkle, S. (2011). Alone together: Why we expect more from technology and less from each other. New York, NY: Basic Books. University of Surrey Youtube Channel. (2013). Christine Hine on online research methods. Retrieved from http://www.youtube.com/watch?v=No8RZOebhX8. Accessed on May 1, 2014. Wax, R. (1971). Doing fieldwork: Warnings and advice. Chicago, IL: University of Chicago Press.

RELATED PAPERS

RELATED TOPICS

Log In

The Researcher and the Never-Ending Field: Reconsidering Big Data and Digital Ethnography

The Researcher and the Never-Ending Field: Reconsidering Big Data and Digital Ethnography

Related Papers

RELATED PAPERS

RELATED TOPICS