![]() |
|
|
|
|
|
Data citationWho should read this?This guide is intended for eResearch infrastructure support providers and researchers. It is not so much a guide to how to cite data, but a guide to the issues around it, and activities underway to change the culture around data citation in order to support improved data management and sharing. What do we mean by data citation?Data citation refers to the practice of providing a reference to data in the same way as researchers routinely provide a bibliographic reference to printed resources. The need to cite data is starting to be recognised as one of the key practices underpinning the recognition of data as a primary research output rather than as a by-product of research. While data has often been shared in the past, it is rarely, if ever, cited in the same way as a journal article or other publication might be. If data sets were cited, they would achieve a validity and significance within the cycle of activities associated with scholarly communications and recognition of scholarly effort. How do you cite data?At present, there is no generally recognised way of citing data, despite the growing need. ISO690 2 dates from 1997. This ‘specifies the elements to be included in bibliographic references to electronic documents. It sets out a prescribed order for the elements of the reference and establishes conventions for the transcription and presentation of information derived from the source electronic document. [It] is intended for use by authors and editors in the compilation of references to electronic documents for inclusion in a bibliography, and in the formulation of citations within the text corresponding to the entries in that bibliography. It does not apply to full bibliographic descriptions as required by librarians, descriptive and analytic bibliographers, indexers, etc.’1 A recent OECD Publishing White Paper2 by Toby Green sets out the need for a recognised standard and proposes a model which will be used by the OECD for its own data and data tables. Altman and King3 proposed a standard for citing quantitative data in 2007. This contains many of the elements common to print citations, to which are added components specific to quantitative data sets. Similar to the recommendations of the OECD White Paper and the citation supplied by ICPSR, their standard includes a permanent identifier (whether DOI or other) as an essential element. Their minimum citation includes only six elements, including the permanent identifier. Various data repositories provide a recommended format for citing data from that repository. For example: ICPSR and other social science data centres provide a citation for each of their datasets as follows:
The following citation comes from PANGAEA, the Publishing Network for Geoscientific & Environmental Data in Germany, and covers both the publication and the data on which it was based.
The issue is further complicated by the fact that bibliographic management systems such as EndNote and Zotero do not provide a template for a data citation. Wouldn’t it be lovely if …
The ANDS approach to data citationAn important aim of ANDS is to enable more researchers to re-use research data more often. To achieve this aim, ANDS is engaged in activities that will make it easier to share data, to recognise the importance of making data available and to make data citation a standard procedure.
Directions around data publication
1 http://paedpsych.jk.uni-linz.ac.at/internet/ARBEITSBLAETTERORD/LITERATURORD/ZitationISO690.html 2 Green, T. (2009). ‘We Need Publishing Standards for Datasets and Data Tables.’ OECD Publishing White Paper. http://dx.doi.org/10.1787/603233448430 3 Micah Altman and Gary King. 2007. ‘A Proposed Standard for the Scholarly Citation of Quantitative Data.’ D-Lib Magazine, Vol. 13, No. 3/4 (March/April), http://www.dlib.org/dlib/march07/altman/03altman.html
This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 2.5 Australia License. |