Data Citation Resources
This web page provides extra information in the form of guides, explanations, templates, aids in decision making and more to help individuals and organisations learn about data citation and Digital Object Identifiers (DOIs).
Overview : http://ands.org.au/cite-data
What is Data Citation
Data Citation Benefits
Data Citation Benefits for Researchers
Using Identifiers to Cite Data
Digital Object Identifiers for Datasets
DOI Data Citation in Action
ANDS and DOIs: Services, Projects and Connections
Implementing DOIs
Presentations and Training Material
Related Links
What is Data Citation?
Data citation refers to the practice of providing a reference to data in the same way as researchers routinely provide a bibliographic reference to printed resources. An important aim of ANDS is to enable more researchers to re-use research data more often. To achieve this aim ANDS is engaged in activities that will make it easier to share data, to recognise the importance of making data available and to make data citation a standard procedure.
More:
- Poster [PDF] - multiple free printed copies of the poster in pamphlet form are available by emailing: guides@ands.org.au
- Data Citation Guide ANDS awareness level: This two page guide covers: defining data citation, the ANDS approach to data citation, how to cite data.
- Mons, B., Haagen, H. van, Chichester, C., Hoen, P.-B. 'T, Dunnen, J. T. den, Ommen, G. van, et al. (2011). The value of data. Nature genetics, 43(4), 281-3. Nature Publishing Group. doi: 10.1038/ng0411-281.
Data Citation Benefits?
|
CrossRef Animation |
The need to cite data is starting to be recognised as one of the key practices underpinning the recognition of data as a primary research output rather than as a by-product of research.
|
More:
- Whitlock, M. C. (2010). Data archiving in ecology and evolution: best practices. Trends in Ecology & Evolution, 1-5. Elsevier Ltd. doi: 10.1016/j.tree.2010.11.006.
- Ball, A.& Duke, M. (2011). ‘Data Citation and Linking'. DCC Briefing Papers. Edinburgh: Digital Curation Centre.
- Ball, A. & Duke, M. (2011). ‘How to Cite Datasets and Link to Publications’. DCC How-to Guides. Edinburgh: Digital Curation Centre.
Data Citation benefits for Researchers:
(Information from Oak Ridge / NASA Poster)
- Makes data publications more acceptable for CVs and creates more recognitions for data sharing
- Provides a means of tracking publications who are using or citing researchers dataset
- Can search for DOIs in search engines to find grey literature
- Journals are more accepting of DOIs
More:
- Piwowar HA, Day RS, Fridsma DB, 2007 Sharing Detailed Research Data Is Associated with Increased Citation Rate. PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308
Using Identifiers to Cite Data
To be cited accurately and to ensure other researchers can find your dataset in the future, datasets need one or more identifiers (preferably a persistent identifier) to be attached to the dataset. Research data outputs don't stay in the one place: they have a lifecycle which involves the data moving around. Persistent identifier links to the data outputs shouldn't break, whether they are raw data or publications.
There are many types of persistent identifiers which can be used to cite data and datasets. Some institutions may use several different types of persistent identifiers. Digital Object Identifiers (DOIs) are a type of persistent identifier which will be available through the ANDS Cite My Data Service from the end of 2011.
More:
- Persistent Identifiers Guide ANDS awareness level. This 2 page guide covers: defining persistent identifiers, reasons for persistent identifiers, how they work, an example of one type of persistent identifier (Handle), and what needs to be done by whom to ensure persistence.
- Persistent Identifiers decision tree:
Is a DOI preferable to another type of persistant identifier for data citation? YES if:
- This data will be exposed
- This data forms part of the scholarly record
- This data can be kept persistent
- The minimum metadata required by DataCite can be supplied.
Digital Object Identifiers (DOIs) for Datasets
A DOI Name is a Persistent Identifier, but also provides extra benefits.
- DOI Names can be used to uniquely identify either digital or non-digital objects, whether or not they have any internet presence.
- DOI Names are supported by the International DOI Foundation (IDF) and Registration Agencies infrastructure, which provides ongoing DOI services and allows for a high level of confidence in the quality and accuracy of DOI Names.
- DOIs are used by CrossRef and other publishers to track citation metrics.
DOIs are intended to be persistent. The DOI name syntax is a NISO standard but allows the incorporation of any form of existing identifier. The DOI name suffix can be any alphanumeric string that the Registrant chooses. This can simply be a sequential number, or it can make use of an existing (legacy) identifier.
Example DOIs for datasets: doi:10.4232/1.10079 -or- doi:10.1594/WDCC/dphase_mpeps
Note: Currently the Cite My Data service will mint DOI suffixes in the form of random digits. This policy will be revisited in future releases of the service.
More:
- The Digital Object Identifier System and DOI Names: This 2 page ANDS guide covers: defining DOIs, DOI names, difference between DOIs and other persistent identifiers, advantages of DOIs for datasets, ANDS and DOIs.
- International DOI Foundation has links to the DOI Handbook and FAQ
- The DOI Handbook on how a DOI® name is constructed and assigned. It discusses the use of the DOI name prefix as a naming authority, and the DOI name suffix as a mechanism for assigning individual numbers within that naming authority, incorporating (if required) existing identifiers. The ability of the DOI name to incorporate existing identifiers and the benefits of that approach are discussed in detail. Character sets, case sensitivity, uniqueness, and Internet identifier specifications are also discussed.
- CrossRef is a DOI Registration agency: http://www.crossref.org/
- Watch: Scott Edmunds announce BGI's new GigaScience journal which uses DOIs
DOI Data Citation in Action
Data citation standards vary across disciplines. However, DataCite has undertaken some work in this area and recommends the following format.
Creator (Publication Year): Title. Publisher. Identifier
Denhard, Michael (2009): dphase_mpeps: MicroPEPS LAF‐Ensemble run by DWD for the MAP D‐PHASE project. World Data Center for Climate. http://dx.doi.org/10.1594/WDCC/dphase_mpeps
They recognise that it may also be desirable to include two optional properties, Version and ResourceType (as appropriate). If so, the recommended form is as follows:
Creator (PublicationYear): Title. Version. Publisher. ResourceType. Identifier
ISSP Research Group: International Social Survey Programme 2007: Leisure Time and Sports (ISSP 2007). GESIS Data Archive, Cologne, Germany, ZA4850 Data file Vers. 2.0.0, doi:10.4232/1.10079
More:
For citation purposes, the Identifier may optionally appear both in its original format and in a linkable, http format.
- Citations form: doi:10.1594/PANGAEA.484677
- Hypertext link form: http://dx.doi.org/10.1594/PANGAEA.484677
Examples of DOIs for PANGAEA datasets in a variety of settings:
- World Cat
- Google Scholar
- Scopus (- see references list)
Resources:
- A short article about how DOIs will be used in the journal Gigascience.
- DataCite Metadata Schema (PDF) for the Publication and Citation of Research Data: an 18 page PDF examining the broader issues of data citation and associated metadata
- A short guide to data citation by the Michigan State University Library.
- DataCite-International Initiative to Facilitate Access to Research Data
- An example of how Nature journal publisher uses DOIs in their publication process
- An example of how DOIs are used in collaboration endeavours between scientists: Use of DOIs in solving the German e coli outbreak in July 2011.
- Watch: A Guelph Humber Librarian shows how various academic journals use DOIs in their reference structure (Where's the DOI?)
ANDS and DOIs: Services, Projects, Connections
- ANDS is a member of DataCite
ANDS has joined DataCite, a group of leading research libraries and technical information providers, that aims to make it easier for research datasets to be handled as independent, citable, unique scientific objects via the use of DOIs. ANDS is also participating in the DataCite Metadata Standards Working Group. - Cite My Data service
The ANDS Cite My Data service is offered to Australian research sector organizations who need to manage citations to their datasets. There is no charge for using this service. The Cite My Data service is built on the global DataCite data citation infrastructure . ANDS is currently the only Australian member of the DataCite Consortium - ANDS is keen to provide a service which helps Australian researchers, research organisations and discipline communities to address some of the policy, workflows and scholarly communications issues around data citation. The service would support research organisations to:
- Develop and implement policy
- Build consensus on citation practice
- Trial self- data citation in journal articles
- Track third party data citation
- Provide input into trials with automated data citation indexation services
ANDS is still exploring service provision opportunities in this area - we would appreciate your interest, feedback and input. Please email: contact@ands.org.au
Implementing DOIs
ANDS has a wealth of material and services to support the use of DOI's in promoting reusability and connectedness of Australian research data.
| Institutional policy & procedures |
Issues to consider when developing institutional policy and procedures Persistence into the future is the strength of the DOI concept - institutions need to have policies and procedures in place to define and assure persistence Institutions can incorporate an alpha component into the DOI name (eg doi:10.1234/FACULTYABC.654321) but note that this also implies persistence of the name into the foreseeable future Persistence may incur ongoing direct and indirect costs for the institution. "Persistent Identification" is an accepted concept: what does the DOI System add to this? |
| IT Tools & Infrastructure (hardware & software) |
ANDS IT Projects Institutions need to meet minimum requirements before DOIs can be minted - Identifiers decision TREE (PDF) DOIs can be minted through ANDS using the Cite My Data machine-to-machine serviceManaging Metadata DOIs need to be stored and managed with other metadata about the dataset in local stores or repositories. DOIs should be included in RIF-CS metadata feeds to Research Data Australia in the Identifier element to ensure maximum access for researchers to the data
Guides |
| Support services (people & advice) |
Awareness and Advice DOIs are well known within the scholarly journal world, but promulgation of the benefits and use of DOIs for research datasets may require an awareness campaign across the institution. Guides and articles
Posters and Pamphlets
People and Community Building contacts and networks
Conference presentations
|
Presentations and Training Material
Data Citation workshop at eResearch Australasia 2011
More:
This workshop was designed for all those with an interest in data citation and how it can be supported by their institutions. The workshop included:
- An overview of data citation in scholarly communication
- A practical demonstration of how to use the ANDS DOI service, "Cite My Data" and integrate it into eresearch systems
- A discussion of the value of being part of the international DataCite consortium and DOI system
- Examples of emerging practice internationally
- Opportunities to learn from the experience of early implementers in Australia
- Strategies for policy implementation at research organisations and data archives
Presenters included Jan Brase from DataCite, early adopters of the Cite My Data service and ANDS staff.
The presentation slides are available from our data citation presentations and training page.
Related Links
If you are interested in DOIs, you may also be interested in the following related concepts. Information is available through:
The ANDS website:
- Data management: planning, policies, procedures
- IT: the Data curation continuum
- Licensing to encourage the reuse of data
- Research data policy and the Australian Code for the Responsible Conduct of Research
External:
- QUT library blog post - linking your publications to your data















