Good data management practices occur throughout the research process. Even if your project did not require a formal data management plan, thinking about data management during your research will help ensure your data are secure, well-documented, and better prepared for later dissemination and archiving.
This page will examine the following issues:
Now imagine if you had to leave the project and come back after six months or a year. What else would you add to the list?
Need Help? Download an Example Readme.txt (plain text file) template that can be adapted for your data.
With the “raw material” documenting your project down, the next step is to standardize the formatting. The standard to use depends on the discipline and/or format of your data. A few standards are listed below. Again this isn’t intended to be exhaustive, but rather descriptive.
Type of Data |
Discipline/s |
Standard |
---- |
Social and Behavioral Sciences |
|
---- |
Ecology |
|
Spatial |
---- |
Content Standard for Digital Geospatial Metadata (CSDGM)/FGDC/ISO 19115 |
Biodiversity |
Life Sciences |
A more comprehensive list of disciplinary metadata standards is availablefrom the Digital Curation Centre.
Naming your files
File names should be:
One goal for file naming is to give enough information so that either the creator or a new user can figure out where the information in the file fits into the project.
Elements that may be included in your file names are date, project name, type of data, location, and version. There are other features to consider as you design your file naming plan described on this google doc.
The library subscribes to many data archives and resources for you to find and access essential data for your research.
Consult this guide to get a comprehensive overview of the many archives & resources CSUN University Library subscribes to.
One of the challenges of sharing human subjects data is the risk that your data may identify an individual, either directly or indirectly. Additionally, the information in your dataset may be legally protected or sensitive, which could lead to legal repercussions for you and/or bring harm to the individual if that information is released and linked to that individual’s identity.
Disclosure is the unauthorized release of information that may identify an individual research participant or organization. Examples of disclosive information include:
Legally protected data have restrictions placed on them by law. Examples include:
Sensitive data include any information that may cause harm, legal jeopardy, or reputational damage to the subject if disclosed. Such data may or may not be legally protected. Examples include:
Before sharing human subjects data publicly, the dataset should have a low disclosure risk or be free of disclosive information. This involves removing both direct identifiers AND indirect identifiers that may pose a disclosure risk.
If your data contain legally protected or sensitive data, or if the removal of identifiers limit the usefulness of your data, consider sharing through archives with restricted access repositories, such as the Inter-University Consortium for Political and Social Research (ICPSR).
In addition to the content of the data, the agreement made with participants in your IRB can also limit the extent to which human subjects data can be shared.