Our Data and Code Guidelines are grounded in the principles of transparency and reproducibility established in the Guidelines for Transparency and Openness Promotion (TOP). To enable others to reproduce findings and potentially reuse data for further research, it is essential for authors to provide comprehensive access to their underlying data, analytic methods (code), and research materials. Below, we outline guidelines for sharing data and code, along with valid exceptions to full access.
Authors are expected to provide full access to the underlying data, analytic methods (code), and research materials necessary to reproduce the study’s findings. These should be deposited in a trusted third-party repository, such as OSF, or Dataverse that guarantees discoverability, accessibility, usability, and long-term preservation. Websites maintained by authors do not meet these requirements due to potential issues with data preservation and accessibility.
There are exceptions where full access may not be possible, including:
Your dataset(s) must be reusable by others, adhering to any relevant data sharing standards in your discipline and aligning with the FAIR Data Principles (Findable, Accessible, Interoperable, and Reusable). If you have developed in-house software, ensure the source code is written in (or compatible with) an Open Source programming language, archived under an open license and shared.
Your dataset(s) must not contain any sensitive information that could compromise privacy or confidentiality unless proper anonymization or redaction has been performed.
Data, software and code must be openly licensed to facilitate reuse. Appropriate licenses include, but are not limited to, Creative Commons licenses such as CC0 or CC BY.
All datasets and program code used in a publication must have a persistent identifier, such as a Digital Object Identifier (DOI), to ensure that they can be reliably located.
All datasets and program code used in a publication must be cited in the text and listed in the reference section.
Citations should include:
Your dataset(s) should link back to your article to ensure that the data and article are connected and that readers can easily find the associated data.
You must provide an Availability Statement within your manuscript that clearly describes where and how underlying data, software and other research materials can be accessed.
The Statement should include:
Examples formulations
Data archived in a repository:
'The datasets generated during and/or analysed during the current study are available in the [NAME] repository, [PERSISTENT WEB LINK TO DATASETS].'
Data available in a repository with restricted access:
'The data that support the findings of this study are available from [THIRD PARTY NAME] but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of [THIRD PARTY NAME].'
Data cannot be shared openly but are available on request from authors:
'The datasets generated during and/or analysed during the current study are not publicly available due to [REASON(S) WHY DATA ARE NOT PUBLIC] but are available from the corresponding author on reasonable request.'
Data shared with manuscript or Supplementary Information:
'All data generated or analysed during this study are included in this published article (and its supplementary information files).'
Data sharing is not applicable:
'No underlying data are available for this article, since no datasets were generated or analysed during this study.'
Restrictions
If any restrictions to the underlying data, code or materials apply, authors must provide a clear explanation for the restriction in their Availability Statement. The statement should detail the specific limitations and include all necessary information required for a reader or reviewer to access the data/code by the same means as the authors: