Personal tools

Community

WDS–RDA Publishing Data Case Statement

The Publishing Data Interest Group is focused on all issues related to the data publication concept. It brings together all stakeholders involved in data publication activities such as data repositories, science publishers, and service providers. Every effort will be made to get a good representation from the major international programmes, their working groups and other private or institutional activities involved in this area. The Publishing Data Interest Group will build on existing resources, reports and other shared experiences from the different stakeholders and will nurture more specific and targeted working groups addressing practical aspects of the data publication concept. The Publishing Data Interest Group can be regarded as a broad and inclusive forum for interested individuals engaged in testing, validating and promoting the findings of the Working Groups. The main objectives of the Publishing Data Interest Group are to promote and establish the data publication concept among scientists, data repositories, science publishers, and bibliometric service providers by addressing the issues of workflows for publishing data and establishing corresponding services as part of scholarly publishing.

Data Publication

In the empirical sciences, data has traditionally been an integral part of scholarly publishing. In the last decades, rapid technical developments, such as digital data and high-throughput techniques, dramatically changed the scholarly publishing paradigm. This requires new approaches in order to ensure availability and usability of science data. However, existing approaches to address this issue are mostly technically dominated and lack success because they do not supply the necessary benefit for data producers. Instead, the concept of Data Publication is undergoing a renaissance as part of scholarly communication and on the base of new and proven technologies. Publishing data is a new and strong incentive for scientist to share their data and has positive effects on the data quality. The impact on citation rates could be shown in recent bibliometric studies on science articles having supplementary data.

Publishing data can be similar to the conventional publication of articles in journals that includes online submission, quality checks, peer-review, editorial decisions, and an equivalent of ‘page proofs’. In fact, storage of data in public repositories and the ability to reference the datasets is getting increasingly important. It is already mandatory for the acceptance of peer-reviewed publications in specific fields of research such as molecular sciences or ecology. However, Data Publication as a generally accepted new publication type self-standing or supplementary to literature is not without controversy.

For data centres, science publishers, and service providers it is a challenge in terms of organization, technical developments, and funding. Compared to science articles the economic value of data is generally higher but they also need more resources for production, processing, long-term archiving and publication. If published data are to be usable as reliable as peer-reviewed science articles, they should not only meet scientific requirements. Archiving and publishing procedures have to be transparent and accepted as part of the science culture. Moreover, published data should conform to current content and interoperability standards thus allowing for efficient usage and integration of data from various sources.

Workflows for archiving and publishing data

  • Investigate current workflows for archiving and publishing data
  • The role of QA/QC and peer-review in the publication process
  • The role of science publishers and journals in the data publication process

Deliverable: Provide generic workflow models for data publication

Bibliometrics including published data

  • General requirements for citability of scientific data (granularity, citation information and persistent identification) 
  • Current citation practice in data centres and literature

Deliverable: Recommendations for data publishers & academic publishers

Data publication services

  • Existing service components to be used as building blocks
  • Relevant content and interoperability standards
  • Interoperability requirements for data centres (registration, metadata & data services)

Deliverable: Infrastructure and organization for a one-for-all cross-referencing service for academic publishers and providers of bibliometric services

Cost Recovery for Data Centres (IG)

  • Investigate current cost structures for archiving and publishing data.
  • Elaborate a business model based on open access which compensates for the additional costs due to data publication

Deliverable: Recommendations for funding organizations

Objectives

Publishing data can follow good practices of conventional publication of articles in journals that includes online submission, quality checks, peer-review, editorial decisions, and an equivalent of ‘page proofs’. In fact, storage of data in public repositories and the ability to reference datasets is getting increasingly important. It is already mandatory for the acceptance of peer-reviewed publications in specific fields of research such as molecular sciences or ecology.  Data Publication as a generally accepted new publication type—self-standing or supplementary to literature—is not without controversy. For data centres, science publishers, and service providers data publication is a challenge in terms of organization, technical developments, and funding. Compared to science articles, the economic value of data is generally higher but they also need more resources for production, processing, long-term archiving and publication. If published data are to be usable and as reliable as peer-reviewed science articles they should not only meet scientific requirements, but also archival and longevity requirements. Archiving and publishing procedures for data must be transparent and accepted as part of the science culture. Moreover, published data should conform to generally accepted content and interoperability standards, thus allowing for efficient usage and integration of data from various sources.

The overall objectives of the WGs under the umbrella of the WDS–RDA Publishing Data IG are to incentivize and enable researchers to publish data by:

  • Promote and establish the data publication concept among data centres: What are possible workflows for publishing data? What are the experiences gained so far? Are there generic models applicable to the various data centres? What should be the difference between publicly accessible data and published data? What is the role of QA/QC and peer-review? What is the role of certification? What are the costs? WDS as an umbrella organization for data publishers?
  • Promote and establish the data publication concept among science publishers and bibliometric service providers: What are the experiences gained so far? Is there a common workflow for the editorial of data and articles? What are the implications for journal editors and publishers in general? What are the implications for peer-review? What is the role and perspective of supplementary materials compared to published data related to an article? What are the benefits? What are the costs
  • Establish data publishing services as part of scholarly publishing: Who are the relevant stakeholders? Which services are needed to embed data publications into the current framework of scholarly publishing - on the side of data centres / science publishers / bibliometric service providers? What are the organizational and technical requirements for the different stakeholders? Which are the relevant standards for content and interoperability? How do data publication services fit into the globally evolving data infrastructures? Is there a common model for a service infrastructure? What are the benefits? What are the costs? 

Co-chairs:

  • Michael Diepenbroek (Germany, PANGAEA, WDS-SC)
  • Eefke Smit (The Netherlands, STM)

Members:

  • David Anderson (US, NOAA's NCDC–WDS Regular Member)
  • Geoffrey Bilder (UK, CrossRef)
  • Theodora Bloom (UK, The BMJ)
  • Jan Brase (DataCite)
  • Ian Bruno (Cambridge Crystallographic Data Centre: CCDC)
  • Lukasz Bolikowski (Poland, Centre for Open Science, University of Warsaw) 
  • Adrian Burton (Australia, ANDS)
  • Sarah Callaghan (UK, BADC)
  • Ross Cameron (UK, Scopus)
  • David Carlson (UK, ESSD)
  • Cyndy Chandler (US, Woods Hole Oceanographic Institution)
  • Merce Crosas (US, Harvard University)
  • Mikael Karstensen Elbæk (Denmark, OpenAIRE/Technical University of Denmark)
  • Janine Felden (Germany, MARUM–WDS Regular Member)
  • Bettina Görner (Germany, Springer)
  • Laurel Haak (ODIN - ORCID and DataCite Interoperability Network)
  • John Helly (US, UCSD)
  • Francisco Hernandez (Belgium, VLIZ Data Centre–WDS Regular Member)
  • Simon Hodson (UK, CODATA)
  • Brian Hole (Ubiquity Press Ltd.)
  • Hylke Koers (The Netherlands, Elsevier)
  • Rebecca Lawrence (UK, F1000 Research Ltd.)
  • Paolo Manghi (Italy, OpenAIRE, CNR)
  • Caroline Martin (France, IRSTEA)
  • Jo McEntyre (UK, EBI)
  • Ingeborg Meijer (The Netherlands, Leiden University)
  • Fiona Murphy (UK, Wiley-Blackwell–WDS Associate Member)
  • Elizabeth Newbold (UK, British Library)
  • Fiona Nielsen (UK, DNAdigest.org)
  • Amy Nurnberger (US, Columbia University Libraries)
  • Lyubomir Penev (Bulgaria, Pensoft Publishers)
  • Howard Ratner (Clearinghouse for the Open Research of the United States)
  • Lisa Raymond (US, Library Woods Hole Oceanographic Institution)
  • Nigel Robinson (UK, Thomson Reuters)
  • Sergio Ruiz/Jan Brase (Germany, DataCite–WDS Associate Member)
  • Jochen Schirrwagen (Germany, OpenAIRE)
  • Johanna Schwarz (Germany, Springer)
  • Barbara Sierman/Marcel Ras (The Netherlands, Koninklijke Bibliotheek)
  • Mike Taylor (UK, Elsevier Labs)
  • Mark Thorley (UK, NERC)
  • Frank Toussaint and Martina Martina Stockhause (Germany, DKRZ-WDC Climate–WDS Regular Member)
  • Mary Vardigan (US, ICPSR–WDS Regular Member)
  • Anita de Waard (The Netherlands, Elsevier–WDS Associate Member)
  • Angus Whyte (Digital Curation Centre, University of Edinburgh)
  • Juanle Wang (China, WDC RRE–WDS Regular Member)
  • Eva Zanzerkia (US, NSF)

Ex-officio

  • Mustapha Mokrane (Ex officio, WDS-IPO)
  • Mark Parsons (Ex officio, RDA Secretariat)