The openness of research data increases the visibility and impact of your research, creates new research opportunities, and facilitates disciplinary and interdisciplinary collaboration. Open data improves the transparency and reliability of science, empowering and democratizing science.
Publishing research data may create more opportunities for the researcher to gain merit as a researcher, e.g. through more citations or registered downloads. In this way, the researcher can gain acknowledgement for other stages of the research process apart from the published article.
Research data and related published research results produced at Arcada should be published openly and made available for shared use. The discoverability and citability of research data are to be ensured. When reusing data, normal citation practice applies.
When opening your data, consider the following questions:
1. How to describe and publish the metadata of your data?
Metadata are data about data and describe the context, content, structure, compilation, and management of research data (See the section on Metadata and data documentation on this page). Informative metadata is the key to making data open, understandable, and reusable.
2. What part of the data will be opened and published?
3. Where will the data be published?
Apply for storage space in IDA by contacting datamanagement@arcada.fi
Or choose another suitable repository for sharing and opening your data at the start of the project.
4. When will the data be available? Do you need to set any embargo period?
5. Which license will you use to open and share your data? Licensing is necessary for publishing data. It is recommended to use Creative Commons (CC) licenses for open research data.
6. Will some part of the data be destroyed? See Data disposal by the Finnish Social Science Data Archive (FSD) and Five steps to decide what data to keep by the Digital Curation Centre (DCC).
Data documentation means describing the data, is data about data, and provides information about the who, what, when, where, why, how of the data. Investing time in documenting the data makes it easy to understand them for both others and yourself, and decrease the risk of false interpretation of the data. Data documentation can be a readme file (human readable) and metadata (computer readable):
Write a readme file about your data and data files. Put the readme file in the most obvious place in the data file folders to ensure that it can be noticed and seen immediately.
Metadata should be machine-readable. There are standard methods available for data documentation called metadata standards, which should be used if suitable for the data. The Fairdata Qvain metadata tool makes describing and publishing research data smooth and effortless for researchers without requiring technical skills.
Data described and published by Qvain metadata tool are automatically transferred to Finnish metadata warehouse Metax, which is integrated with both Etsin (research dataset finder) and the Finnish National Research Information Hub (in Finnish: Tutkimustietovaranto, a service also commissioned by the Ministry of Education and CSC).
See Qvain User Guide.
Other important issues include data formats, file naming conventions, version control, and directory structure. See Data formats and organizing.
For more information, see:
Long-term preservation means that data is preserved for more than 25 years. When creating your data, you need to consider how long it will be preserved. Also remember to check discipline-specific, funder-related, and publishers' data preservation time length requirements.
Finnish Ministry of Education and Culture has established the Fairdata-PAS service (Digital Preservation Service for Research Data, DPS for Research Data) for Finnish research organizations for long-term preservation of the nationally most significant research data.
See Digital Preservation (Fairdata-PAS): Guidelines for UH Evaluators by the University of Helsinki.
If you are interested in Fairdata-PAS, contact datamanagement@arcada.fi.
The FAIR data principles, formulated and published by Force 11, are a set of guiding principles for good data management and open access to research data. FAIR is an acronym that stands for Findable, Accessible, Interoperable and Reusable. Research data that are published according to the FAIR principles should be easy to find, access, transfer or combine, and reuse.
In order to ensure that your data and/or their metadata are FAIR, follow the following steps:
To learn more:
It is recommended to use the Fairdata services offered by the Ministry of Education and Culture and produced by CSC – IT Center for Science Ltd for data management, data storage, metadata creation, dataset dissemination and distribution as well as digital preservation of research data. The services include:
Read How to make the research dataset FAIR? and learn more about the Fairdata services.