Properly citing data assists in the research process by giving data creators proper credit for their work, aids replication, provides permanent and reliable information about the data source, helps track the impact of the data, and facilitates resource discovery and access.
Provide citations for data sets when you have either conducted secondary analyses of publicly archived data or archived your own data being presented for the first time in the current work.
If you are citing existing analysis or statistics, cite the publication in which the data were published (e.g., a journal article, report, or webpage) rather than the data set itself.
Example Citation
O’Donohue, W. (2017). Content analysis of undergraduate psychology textbooks (ICPSR 21600; Version V1) [Data set]. ICPSR. https://doi.org/10.3886/ICPSR36966.v1
Parenthetical citation: (O’Donohue, 2017)
In-text citation: O’Donohue (2017)
- The date in the reference is the year of publication for the version of the data used.
- Provide the title of the data set in italics. Then provide any numerical identifier and version number for the data in parentheses without italics, separated by a semicolon.
- The bracketed description is flexible (e.g., “[Data set],” “[Data set and code book]”).
- Provide the publisher of the data set in the source element. ICPSR is one common example.