The WDS Asia–Oceania Conference 2019 will be held on 7–8 May 2019 in Beijing, China; hosted by the Institute of Geographic Sciences and Nature Resources Research, Chinese Academy Of Science. The purpose of the conference is to establish a collaborative network to share information and to do bottom up of data-oriented activities in the Asia-Oceania area. Towards realizing this goal, we are ...
The a new WDS Strategic Plan for 2019–2023 has now been published . This document, produced in consultation with the ISC Governing Board and ISC Members, reduces the number of strategic targets from five previously, to the three seen by the WDS Scientific Committee (WDS-SC) as absolutely at the core of the WDS mission. Operating in unison with the new five-year Strategic Plan is an ...
We hope that you will join us in welcoming Dr Karen Payne, who was officially appointed in November 2018 as Associate Director for the WDS International Technology Office (WDS-ITO), hosted by Ocean Networks Canada (ONC) at the University of Victoria. ONC is part of the Canadian consortium of three WDS Regular Members selected to host the WDS-ITO. Karen's first role as ITO Director was ...
The WDS International Programme Office (IPO) is pleased to announce the recent appointment of Ms. Nobuko Miyairi as Communications Officer. Nobuko will be closely working with the IPO staff to accelerate our communications to current and future WDS Members, as well as to the Scientific Committee . On behalf of WDS, we would like to welcome Nobuko to the WDS-IPO. If you would like to ...
Scientific Data Systems in Latin America and the Caribbean
A Blog post by Alfredo Tolmasquim (WDS Scientific Committee member)
Several countries in the Latin American and Caribbean (LAC) region, such as Argentina, Brazil and Mexico, among others, have relevant scientific production and participation in important multinational associations and projects. However, this scientific activity has not been reflected in effective participation in forums related to the management of scientific data. An example is that no LAC country has a scientific data system accredited as a Regular Member of the World Data System (or certified by the CoreTrustSeal). At both the International Data Week held in September 2016 in Denver, Colorado and the recent one held in November 2018 in Botswana, Gaborone, the number of researchers from these countries was very limited.
With the objectives of surveying the scientific data management initiatives in the region and increasing its integration with other international forums, WDS organized—together with the Brazilian Academy of Sciences (ABC) and the Museum of Tomorrow—the Latin America and the Caribbean Scientific Data Management Workshop, which took place on 17–18 April 2018 in Rio de Janeiro. The Workshop was supported by some multilateral bodies such as the Inter-American Network of Academies of Science, ICSU Regional Office for LAC, Inter-American Institute for Global Chance Research, Institut de Recherche pour le Developpement; as well as Brazilian institutions such as the Brazilian Institute of Information in Science and Technology, the Brazilian Institute for Scientific Computing, and the Network for Research and Development.
The Workshop brought together more than 150 participants, comprising researchers from countries in the region such as Brazil, Chile, Argentina, Costa Rica, Uruguay, Guatemala, Paraguay, Panama, El Salvador and Honduras; members of the WDS Scientific Committee; representative of the Research Data Alliance; and policy-making and funding institutions, including the Brazilian Ministry of Science, Technology, Innovation and Communication and the Foundation for Research Support of São Paulo State (FAPESP). Unfortunately, representation from Mexico, the second largest producer of science in the region, was not present, despite numerous efforts to identify and attract researchers.
The Workshop consisted of presentations on the state-of-the-art and future perspectives for scientific data management; on ways to accredit scientific data repositories; and on opportunities for funding, training, and infrastructure in scientific data management. In addition, 44 scientific data system initiatives in the region were presented from a wide range of fields—soil and agriculture, biodiversity, climate, astronomy, geology, health sciences, anthropometry, and the humanities, to name but some—providing a good landscape of what is being done. The full programme, the papers presented, and videos of all the sessions can be accessed on the Workshop website.
It became clear from the presentations that there are major disparities in the LAC region. On the one side, there are highly structured and complex data systems, such as the Inter-Institutional Laboratory of e-Astronomy developed by the National Observatory, Center for Integration of Data and Health Knowledge of the Oswaldo Cruz Foundation, and Biodiversity Research Program coordinated by National Institute of Amazonian Research. On the other side, there are universities or countries that are taking the first steps towards creating their repositories of scientific publications. For the latter, the La Referencia project has been fundamental, since it is creating a network of open access repositories for science and involves many countries in the LAC region.
The Workshop also highlighted the existence of simultaneous initiatives in the same field. This is the case for agronomy and soil sciences in Brazil, which has robust systems being developed at the same time at the Federal Rural University of Rio de Janeiro, the Federal University of Santa Maria, and the Brazilian Company on Agriculture and Livestock Research, but with no cooperation among them thus far.
The Workshop, moreover, brought to the fore some of difficulties face by countries in the region to develop scientific data systems. One is that the structuring and management of scientific data is not perceived by institutions as bringing benefits. As a result, they are the concern of individual initiatives, which lack adequate infrastructure and sustainability. This problem is especially aggravated by the discontinuity of project funding by governments and agencies. Another aspect mentioned is the lack of rules and standards established by agencies, and fostered at the time when financial support is given to a project. The exception is FAPESP, which in 2017 implemented a rule whereby projects receiving up to approximately 50,000 USD must plan in advance the destination of all scientific data collected or produced during the course of the research.
In addition to exchanges of information and the development of greater synergies among the participants, the Workshop provided a number of concrete results. The first was the creation of a Working Group at the ABC to elaborate norms and guidelines on the treatment of open data for science and technology in Brazil. This Working Group is currently in the process of planning a follow-up meeting to the Workshop to be held in 2019. A further consequence of the Workshop is that ABC has been invited to participate in debates in the National Congress of Brazil on a new law regarding the treatment and protection of personal data. There are likely many more such examples beyond Brazil, and Workshop participants will be contacted to discover the positive outcomes of the Workshop in promoting scientific data systems in countries and projects throughout the LAC region.
The Importance of Advisory Bodies for Data Repositories: Learning from the NASA Experience
A Blog post by H. K. 'Rama' Ramapriyan (Science Systems and Applications, Inc. contractor for NASA Earth Science Data and Information System Project) and Alex de Sherbinin (WDS Scientific Committee member)
Any service organization exists to serve its customers. For a science data repository, the customers are users of its data, be they researchers or applied users in the scientific domain of that repository. For the data repository to serve the community best, it is essential that its managers understand the requirements of the users, respond to their changing needs, and evolve with technological changes as well. Many different mechanisms can be used to interact with users to continually maintain an understanding of their needs. NASA’s Earth Science Data and Information System (ESDIS) Project and its constituent Distributed Active Archive Centers (DAACs) engage with User Working Groups (UWGs) to fulfil this function.The purpose of this Blog post is to discuss how UWGs have benefitted ESDIS and the DAACs, and to share lessons gleaned from 25 years of UWG experience to demonstrate how advisory bodies can improve data curation and management for the benefit of diverse user communities.
Figure 1. Map of the NASA DAACs.
NASA’s ESDIS Project, a WDS Network Member, is responsible for 12 DAACs, ten of which are Regular Members of ICSU-WDS. The DAACs serve user communities in various Earth Science disciplines and are geographically distributed across the United States as shown in Figure 1. The majority of the DAACs have UWGs consisting of experts who represent broad user communities in the respective Earth Science disciplines; specifically, the members of a UWG are regular users of the data served by that particular DAAC. These UWGs have typically existed since the mid-1990’s when the DAACs started their operations, although their memberships have changed during that time with regular rotation off of members and the recruitment of new members.
Each UWG has a charter that specifies its role in providing advice to its DAAC regarding the data and services offered. A summary of the common aspects of these charters is given below:
• Ensure science user involvement in the planning, development, and operations of the DAAC.
• Define the DAAC's science goals.
• Provide recommendations on annual work plans and long-range planning.
• Represent the science user community in reviewing and guiding the DAAC activities.
• Review progress and performance of the DAAC relative to its missions.
• Assess data-products and service quality by periodically reviewing applications of the data products made by the broad user community, and by sampling the confidence of the user community.
• Communicate users’ assessment of the DAAC performance to the DAAC and NASA.
• Advise the DAAC on the levels of service provided to the user community.
• Advise the DAAC on improvements to the user access, user interface, and relative priorities for DAAC-related functions.
• Recommend to the DAAC and NASA the addition of new data products and new services based upon documented NASA research needs.
• Provide advice on research and development in support of product prototyping and generation.
The UWGs generally hold annual in-person meetings attended by representatives of the responsible DAAC and the ESDIS Project. Staff members from some of the other DAACs also attend the meetings to benefit from the discussions that may apply to their own activities. The UWGs also hold teleconferences a few times each year. UWG meetings consist of presentations by the DAAC staff addressing data and services offered, on-going developments, responses to prior recommendations by the UWG, and the status of action items. Following the presentations, the UWG provides comments on implementation of past recommendations and advice regarding improvements for the future. The advice can either be DAAC-specific or apply to the broader cross-DAAC and ESDIS Project activities.
Some examples of advice provided by the UWGs in the past year are given below. 'Data search' is highlighted separately because this is the primary way the users find the data they need in repositories that are growing larger by the day, and so getting this right has important implications for the user community.
• Obtain user input on the design and usability of the Earthdata Search Client. Search relevance should be based on user experience in addition to the characteristics of the data.
• Provide multiple avenues to access data holdings so that different types of users have tools appropriate for their needs.
• Make data more readily searchable to a non-technical audience. Develop a data search page intended for inexperienced and non-specialist users, containing popular data products and explanations of these products.
• Make filtering and refining more obvious on webpages ('Amazon style').
• Add ability to save a search/search parameters.
• Enable users to rank search results in order of 'popularity' based on, for example, the number of previous downloads (search relevance by popularity).
• Enable users to rank search results by relevance in order to help them deal with the large number of search returns.
• Add the ability to filter by spatial and temporal resolution.
• Create a simpler and more intuitive user interface that is optimized for touchscreen.
• Convene a focus group to determine how current users have performed analyses. The focus group would also solicit guidance from data producers.
• Communicate with Principal Investigators (who provide datasets to the DAAC) at least twice a year to ensure data and metadata are current and accurate.
• Work within security requirements, but have a reliable, easy to access system that provides users with direct access to all the data holdings. Include an Application Programming Interface (API) and OPeNDAP with all data holdings available using these tools.
• Focus on increasing the capacity of the service so that the entire satellite data record can be accessed using a secure but easy to use interface, and which includes an API to script data access.
• Expand tools that facilitate coordinate system changes. (Context: The impact of the Digital Elevation Map choice and coordinate system on the ability for users to intercompare radar and optical imagery is currently poorly understood.)
• Explore ways to reorganize high value datasets into analytics-optimized storage, while assessing the impact on the user community, and emphasizing variable-level analysis.
• Pursue/develop a plan to become an event-driven repository for major hydrometeorological events (e.g., Hurricane Harvey), providing bundled datasets that enable researchers to analyze specific events.
• Examine issues involved with multisensor analyses, including cross-DAAC coordination. (Context: User needs are becoming increasingly complex. Analyses often include multisensor data from a variety of sources.)
• Explore methods to improve and enhance data and computing, and tools using alternative access points; for example, Commercial cloud systems.
• Provide users more consistency DAAC-wide (e.g., comparable product descriptions, more unified terminology, and similar 'look and feel').
As these examples suggest, significant involvement by knowledgeable users in advising the DAACs has been extremely valuable for maintaining the high level of their performance. UWGs is one of several mechanisms that the NASA ESDIS Project and the DAACs use for receiving user inputs and feedback regarding their data and services. As enshrined in CoreTrustSeal Requirement VI, it is recommended that data repositories in all disciplines engage advisory groups in their respective areas to ensure responsiveness to their user communities.
Towards an operational glacier monitoring service: Europe’s Copernicus Climate Change Service
A Blog post by Isabelle Gärtner-Roer (WDS Scientific Committee member)
In the 'European State of the Climate 2017' report, presented at the European Parliament in April 2018, glaciers represent one-out-of-five 'Headline Climate Indicators', which focus on long-term key indicators for global and regional climate change.
On the global level, the World Glacier Monitoring Service (WGMS; WDS Regular Member) is in charge of compiling changes in glacier length, area, volume, and mass, based on in-situ (length, mass) and remote sensing (area, volume) measurements. An increasing number of glacier-related research projects produce data from air- and space-borne sensors. Combined with the existing in-situ network, these studies complement the multi-level glacier monitoring strategy of the Global Terrestrial Network for Glaciers. WGMS actively fosters glacier observations from space through several initiatives, such as its cooperation with the Copernicus Climate Change Service (C3S) or the Climate Change Initiative of the European Space Agency.
Figure 1. The Copernicus Services implemented by ECMWF
C3S is implemented by the European Centre for Medium-Range Weather Forecasts (ECMWF) within the Copernicus programme, which is Europe’s flagship project to monitor the Earth and her many ecosystems. C3S delivers freely accessible operational data and information services that provide users with reliable and up-to-date information related to environmental issues, and that are based on the Sentinel satellites and other contributing space missions, as well as 'in-situ' (meaning 'in the field' or 'on-site') measurement sensors on land, sea, and air. Collectively, these sources provide huge amounts of Earth observation data that are converted into products for up to 20 Environmental Climate Variables. The products can be freely accessed from a one stop portal: the Copernicus Climate Data Store (CDS). This wealth of climate information forms the basis for generating a wide variety of climate indicators aimed at supporting adaptation and mitigation policies in Europe in a number of sectors; for example, water management, tourism, energy, and health (Fig. 1).
WGMS and the Department of Geography at the University of Zurich, together with Gamma Remote Sensing, the University of Oslo, and the US National Snow and Ice Data Center are contributing glacier data and information to C3S. The partners compile and produce information on the global distribution of glaciers (inventories), as well as their volume and mass changes, using field and remote sensing observations at a global scale. Starting in 2017, they integrated both the Randolph Glacier Inventory 6.0 and the available glacier volume and mass change series from the Fluctuations of Glaciers database maintained by WGMS into the CDS. Furthermore, C3S enables the improvement and extension of existing glacier inventory datasets and will also boost the compilation and computation of glacier volume changes (geodetic method) using space-borne sensors.
Figure 2. Cumulative glacier mass changes in Europe from 1967–2017 for glaciers with long-term records in nine different regions. Cumulative mass balance values are given in the unit ‘metre water equivalent (m w.e.)’ relative to 1997.
Data source: WGMS (2017, updated),.Credit: WGMS / C3S.
Similar to the global mean, glaciers in Europe experienced relative stable to slightly negative mass balances in the 1970s and 1980s with short periods of mass gain (e.g., in the Alps at end of the 1980s or in coastal Scandinavia in the 1990s), followed by strong and continued mass losses after the year 1997 (Fig. 2). Since 1997, the monitored glaciers in Europe have lost between 7 m and 23 m of water equivalent or tons per square metre. In the mean, this is a loss of about 14 tons of water per square metre of ice cover, or when multiplied by the total glacier area (51250 km2, excluding peripheral glaciers on Greenland), about 690 km3; namely, 14-times the water volume of Lake Constance. In other words, the mean annual ice loss of these glaciers (35 km3) would cover the fresh water needs of New York City (see NYC OpenData 2018), the city with the highest consumption worldwide (Kennedy et al. 2015), over a period of about 25 years. The graph also highlights a strong regional variability; in particular, when comparing Northern Scandinavia and the Alps.
– Global Terrestrial Network for Glaciers: http://www.gtn-g.org
– World Glacier Monitoring Service: http://www.wgms.ch
– Copernicus Climate Change Service: https://climate.copernicus.eu/about-c3s
– Earlier WDS Blog post: https://www.icsu-wds.org/news/blog/essential-climate-variables
CASEarth: A Big Earth Data Project Launched by Chinese Academy of Sciences
A Blog post by Guoqing Li (WDS Scientific Committee member)
On 12 February 2018, the Chinese Academy of Sciences (CAS) announced the launch of its 'Big Earth Data Science Engineering' (CASEarth) project in Beijing. With a total funding of approximately RMB 1.76 billion (USD 279 million), and greater than 1,200 scientists from 130 institutions around the world involved, this five-year project (2018–2022) is headed by Huadong Guo, an academician of the CAS Institute of Remote Sensing and Digital Earth.
The overall objective of CASEarth is to establish an International Centre of Big Earth Data Science with the mission of
1. Building the world’s leading Big Earth Data infrastructure to overcome the bottlenecks of data access and sharing.
2. Developing a world-class Big Earth Data platform to drive the discipline.
3. Constructing a decision-support system to serve high-level government authorities and solve multiple issues.
This joint centre will provide a very large volume of data, comprehensively enhancing national technological innovation, scientific discovery, macro-decision making, and public knowledge dissemination, as well as other significant outputs.
Framework of the Big Earth Data Science Engineering Project (CASEarth)
To achieve its goals, CASEarth consists of the eight Work Packages (research components) to help it achieve technological advances and obtain innovative results, paying special attention to data sharing and encouraging interested scholars to rely on the platform to carry out research. For example, CASEarth Small Satellites will provide continuous observing data from space according to the missions of the project; the Big Data and Cloud Service Platform will build a cloud-based Big Data information infrastructure, and processing engines with 50 PB storage and 2 Pflops data-intensive computing resources; and the Digital Earth Science Platform will integrate multidisciplinary data and information into a visualization facility to meet the requirements of decision-making and application in areas such as Biodiversity and Ecological Security, Three-Dimensional Information Ocean, and Spatiotemporal Three-Pole Environment. In particular, it will provide a comprehensive display and dynamic simulation for sustainable development processes and ecological conditions along the Silk Belt and Road, and provide accurate evaluation and decision support for Beautiful China’s sustainable development.
In summary, CASEarth is expected to increase open data and data sharing; realize comprehensive assimilation of data, models, and services in the fields of resources, the environment, biology, and ecology; and build platforms with global influence. CASEarth will explore a new paradigm of scientific discovery involving Big Data-driven multidisciplinary integration and worldwide collaboration, and will constitute a major breakthrough in Earth Systems Science, Life Sciences, and related disciplines.
For more information, please look at the following references:
– A paper about Big Earth Data:
– The Big Earth Data Journal: https://www.tandfonline.com/toc/tbed20/current