Research Data Management
- What is data sharing and why should I share my data?
- What do I do about sensitive data, or data I don't own?
- How do I use a licence to specify the conditions for reusing my data?
- Are there data-sharing recommendations specific to my discipline?
- How do I share data in PRISM Dataverse, the University of Calgary's data repository?
- How do I find other repositories specific to my discipline?
Data sharing entails depositing your data in a publicly accessible repository. Data repositories accept raw data files and the accompanying metadata that allow other parties to use and understand the data. These repositories usually assign a DOI (Digital Object Identifier) or other unique identifier, and ensure that your data is securely stored, can be found by search engines, and is downloadable by interested parties. Some repositories also provide private working space that you can use to securely store your data, remotely access it, and collaborate with colleagues without making this data publicly available.
Why share data?
Sharing data is useful to both you and everyone else as well:
- Making data accessible allows others to verify your research
- Available data encourages others to cite your research
- Deposited data is a primary research object and can be cited just like a publication
- Sharing a dataset can lead to new contacts from potential collaborators, funders, and other interested parties
- It provides a securely-stored, authoritative copy of your data that will be easy to find in future
- Funders are beginning to mandate data sharing, and some journals also require data-sharing as a condition of publication
- Sharing data may increase the likelihood of your paper being cited.
There are several factors that may inhibit sharing your data:
- Ethics considerations concerning human research subjects or sensitive information, such as breeding grounds of endangered species.
- Data licensing issues where you might have integrated data that you are not licensed to distribute
- Confidentiality agreements where collaborating scholars or organizations do not want data publicly distributed.
There are tools for anonymizing your data, such as Amnesia from OpenAIRE.
Even in these circumstances it may be possible to share parts of the data that are not sensitive.
- University of Minnesota Library guidance on creating a de-identified data set that complies with HIPAA (the Health Insurance Portability and Accountability Act)
- EDUCAUSE Guidelines for Data De-Identification and Anonymization
If you are worried about others publishing articles based on your data before you are able to, that many repositories offer embargo periods where the data is kept private until you are ready for its release.
You can stipulate the terms and conditions for use of your data by applying a licence. One commonly used set of licences are the Creative Commons licences, which offer several options for both unrestricted reuse, and reuse under certain specific conditions. The Open Data Commons also offers data-specific licences.
For more information, see this guide to data licensing from the Open Data Institute.
- Last Updated: Oct 23, 2020 2:28 PM
- URL: https://library.ucalgary.ca/guides/researchdatamanagement
- Print Page