Abstract
Big data has been a focus of research in science, technology, economics, and social studies. Many countries have already incorporated big data research into their national strategies. This paper elaborates upon the origin, connotation, and development of big data from both a spatial and temporal perspective. It proposes that scientific big data will become a new solution in scientific research as the paradigm changes from being model-driven to data-driven. This paper defines the concept of “scientific big data” and proposes strategies for solving “big data problems”. Theoretical frameworks and data systems for Digital Earth are discussed with a clear conclusion that scientific big data is a prominent feature of Digital Earth. As an example, spatial cognition of the formation mechanism of China’s Heihe-Tengchong Line—a geo-demographic demarcation line dividing China into two parts—is discussed within the context of big data computation and analysis for Digital Earth.




Similar content being viewed by others
References
EMC (2011) Big data: big opportunities to create business value. EMC Corporation, Hopkinton
Bryant RE, Katz RH, Lazowska ED (2008) Big-data computing: creating revolutionary breakthroughs in commerce, science, and society. Computing Community Consortium. http://www.cra.org/ccc/files/docs/init/Big_Data.pdf. Accessed 10 Nov 2013
Wigan MR, Clarke R (2013) Big data’s big unintended consequences. Computer 46:46–53
Frankel F, Reid R (2008) Big data: distilling meaning from data. Nature 455:30
Gantz J, Reinsel D (2012) The digital universe in 2020: big data, bigger digital shadows, and biggest growth in the far east. IDC Analyze the Future, Framingham
Meng XF, Ci X (2013) Big data management: concepts, techniques and challenges. J Comput Res Dev 50:146–169 (in Chinese)
Nature (2008) Big data. Nature 455:1–136
Hey T, Tansley S, Tolle K (2009) The fourth paradigm: data-intensive scientific discovery. Microsoft Research, Washington
Cukier K (2010) Data, data everywhere: a special report on managing information. Economist. http://www.economist.com/node/15557443#sthash.QG9d5Gkr.dpbs. Accessed 22 Aug 2013
Science (2011) Dealing with data. Science 331:639–806
Manyika J, Chui M, Brown B et al (2011) Big data: the next frontier for innovation, competition, and productivity. Mckinsey Global Institute. http://www.mckinsey.com/~/media/McKinsey/dotcom/Insights%20and%20pubs/MGI/Research/Technology%20and%20Innovation/Big%20Data/MGI_big_data_full_report.ashx. Accessed 03 Sep 2013
United Nations Global Pulse (2012) Big data for development: challenges and opportunities. http://www.unglobalpulse.org/sites/default/files/BigDataforDevelopment-UNGlobalPulseJune2012.pdf. Accessed 05 Aug 2013
Douglas L (2012) The importance of big data: a definition. Gartner. https://www.gartner.com/doc/2057415?ref=SiteSearch&sthkw=The%20importance%20of%20big%20data%3A%20a%20definition&fnl=search&srcId=1-3478922254. Accessed 30 Dec 2012
Goodchild MF, Guo HD, Annoni A et al (2012) Next-generation digital earth. Proc Natl Acad Sci USA 109:11088–11094
Executive Office of the President (2014) Big data: seizing opportunities, preserving values. http://www.whitehouse.gov/sites/default/files/docs/big_data_privacy_report_5.1.14_final_print.pdf. Accessed 01 May 2014
Executive Office of the President, President’s Council of Advisors on Science and Technology (2014) Report to the president big data and privacy: a technological perspective. http://www.whitehouse.gov/sites/default/files/microsites/ostp/PCAST/pcast_big_data_and_privacy_-_may_2014.pdf. Accessed 01 May 2014
Office of Science and Technology Policy Executive Office of the President (2012) Obama administration unveils “big data” initiative: announces $200 million in new r&d investments. http://www.whitehouse.gov/sites/default/files/microsites/ostp/big_data_press_release.pdf. Accessed 25 Jun 2012
Fang JQ (2013) Network science and engineering faced with a new challenge and developing opportunity under the wave impact of big data. Chin J Nat 35:345–354 (in Chinese)
Bai CL (2013) Preface. In: Chinese Academy of Sciences (eds) China’s e-science blue book 2013. Science Press, Beijing, i-ii
Mayer-Schönberger V, Cukier K (2013) Big data: a revolution that will transform how we live, work and think. Houghton Mifflin Harcourt Publishing Company, New York
Li GJ, Cheng XQ (2012) Research status and scientific thinking of big data. Bull Chin Acad Sci 27:647–657 (in Chinese)
Zhao Y (2013) NSF official on new supers, data-intensive future. e-Sci Technol Appl 4:91–93 (in Chinese)
Abarbanel HDI, Brown R, Sidorowich JJ et al (1993) The analysis of observed chaotic data in physical systems. Rev Mod Phys 65:1331–1392
Cressie N, Wikle CK (2011) Statistics for spatio-temporal data. Wiley, New Jersey
Rocha LM (1999) Complex systems modeling: using metaphors from nature in simulation and scientific models. Los Alamos National Laboratory, Los Alamos
Barry RG, Chorley RJ (2009) Atmosphere, weather and climate. Routledge, London
Kennedy MC, O’Hagan A (2001) Bayesian calibration of computer models. J R Stat Soc Ser B (Stat Methodol) 63:425–464
Han XX, Li JZ et al (2013) Efficient skyline computation on big data. IEEE Trans Knowl Data Eng 25:2521–2535
Donoho DL (2006) Compressed sensing. IEEE Trans Inf Theory 52:1289–1306
Candès EJ, Wakin MB (2008) An introduction to compressive sampling. IEEE Signal Proc Mag 25:21–30
Aharon M, Elad M, Bruckstein A (2006) K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans Signal Process 54:4311–4322
Gore A (1992) Earth in the balance: ecology and the human spirit. Plume, Boston
Grossner KE, Goodchild MF, Clarke KC (2008) Defining a digital earth system. Trans GIS 12:145–160
Guo HD (2012) China’s earth observing satellites for building a digital earth. Int J Digit Earth 5:185–188
Craglia M, Ostermann F, Spinsanti L (2012) Digital earth from vision to practice: making sense of citizen-generated content. Int J Digit Earth 5:398–416
Foresman T, Schade S, Georgiadou Y et al (2014) Does DE need a C? A proposal for a DE curriculum. Int J Digit Earth 7:88–92
Guo HD, Wang CL (2013) Digital earth: fifteen years experience and perspective. Bull Chin Acad Sci 28:59–66 (in Chinese)
Craglia M, Bie KD, Jackson D et al (2012) Digital earth 2020: towards the vision for the next decade. Int J Digit Earth 5:4–21
Ehlers M, Woodgate P, Annoni A et al (2014) Advancing digital earth: beyond the next generation. Int J Digit Earth 7:3–16
De Longueville B, Annoni A, Schade S et al (2010) Digital earth’s nervous system for crisis events: real-time sensor web enablement of volunteered geographic information. Int J Digit Earth 3:242–259
Wang LZ, Tao J, Rajiv R et al (2013) G-Hadoop: MapReduce across distributed data centers for data-intensive computing. Future Gener Comput Syst 29:739–750
Zhang WF, Wang LZ, Liu DS et al (2013) Towards building a multi-datacenter infrastructure for massive remote sensing image processing. Concurr Comput Pract Exp 25:1798–1812
Chen J, Xiang LG, Gong JY (2013) Virtual globe-based integration and sharing service method of geospatial information. Sci China Earth Sci 56:1780–1790 (in Chinese)
Ma Y, Wang LZ, Zomaya AY et al (2013) Task-tree based large-scale mosaicking for remote sensed imageries with dynamic dag scheduling. IEEE Trans Parallel Distrib Syst. doi:10.1109/TPDS.2013.272
Yang CW, Xu Y, Nebert D (2013) Redefining the possibility of digital earth and geosciences with spatial cloud computing. Int J Digit Earth 6:297–312
Wang LZ, Chen D, Liu WY et al (2012) Parallel simulation of threat management for urban water distribution systems with MapReduce in clouds. Comput Sci Eng. doi:10.1109/MCSE.2012.89
Kim IH, Tsou MH (2013) Enabling digital earth simulation models using cloud computing or grid computing: two approaches supporting high-performance GIS simulation frameworks. Int J Digit Earth 6:383–403
Naughton B (2006) The Chinese economy: transitions and growth. The MIT Press, Cambridge
Guo HD (2014) Digital earth: big earth data. Int J Digit Earth 7:1–2
Acknowledgments
This work is supported by the International Cooperation and Exchange of the National Natural Science Foundation of China (41120114001).
Conflict of interest
The authors declare that they have no conflict of interest.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
About this article
Cite this article
Guo, H., Wang, L., Chen, F. et al. Scientific big data and Digital Earth. Chin. Sci. Bull. 59, 5066–5073 (2014). https://doi.org/10.1007/s11434-014-0645-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11434-014-0645-3