How to Cite a Dataset in Vancouver Format
Understanding Dataset Citations in Vancouver Format
Research datasets are collections of data created through research, observation, or computational processes. Proper citation of datasets allows readers to access the same data and verify findings. Vancouver format for datasets includes creator information, dataset identifier, repository, and access information.
Vancouver Format for Datasets
Creator/Author. Dataset title [type of dataset]. Repository Name. Year. Dataset Identifier. Available from: URL.
Example:
Smith JK, Johnson ML. Global temperature measurements 1900-2023 [dataset]. Zenodo. 2023. DOI: 10.5281/zenodo.1234567. Available from: https://zenodo.org/record/1234567.
Detailed Citation Elements
Creator/Author
Include the person or organization that created the dataset.
Dataset Title
Use the official title of the dataset.
Dataset Type
Specify the type of data (numerical, image, genomic, etc.).
Repository Name
Include the name of the data repository hosting the dataset.
Publication Year
Include the year the dataset was published or made available.
Dataset Identifier
Include the DOI or other persistent identifier.
Access URL
Include the URL to access the dataset.
Detailed Examples
Environmental Science Dataset
Garcia MS, Lee JH, Thompson KA. Climate station precipitation data 2000-2023 [dataset]. NOAA Data Catalog. 2023. DOI: 10.7289/V5D798BF. Available from: https://www.ncei.noaa.gov.
Genomic Dataset
Genome Sequencing Consortium. Human genome reference data [dataset]. GenBank. 2023. Accession GCA_000001405.29. Available from: https://www.ncbi.nlm.nih.gov/grc/.
Social Science Survey Dataset
University Survey Research Center. National household survey 2023 [dataset]. Inter-university Consortium for Political and Social Research. 2023. DOI: 10.3886/ICPSR37654. Available from: https://www.icpsr.umich.edu/.
Open Access Research Dataset
Anderson BJ, White MK, Chen L. Behavioral data from cognitive psychology experiment [dataset]. Open Science Framework. 2023. DOI: 10.17605/OSF.IO/ABCDE. Available from: https://osf.io/abcde/.
In-Text Citations for Datasets
Use the creator’s name and year:
(Smith & Johnson, 2023)
Or reference the specific aspect of the dataset:
(Global temperature measurements, 2023)
Special Cases in Dataset Citation
Large Collaborative Dataset
International Astronomical Consortium. Exoplanet survey data catalog [dataset]. Exoplanet Database. 2023. DOI: 10.5555/exoplanet.2023. Available from: https://exoplanetdb.org/.
Version-Specific Dataset
Martinez SR, Garcia MS. RNA sequencing results v2.1 [dataset]. Figshare. 2023. DOI: 10.6084/m9.figshare.1234567. Available from: https://figshare.com/articles/1234567.
Supplementary Dataset to Published Study
Lee JH, Williams RC. Supplementary data for "Climate impacts on agriculture" [dataset]. Dryad. 2023. DOI: 10.5061/dryad.12345678. Available from: https://datadryad.org/.
Using GenText for Dataset Citation Management
Managing citations for multiple datasets from various repositories requires systematic organization. GenText helps by:
- Formatting dataset citations in proper Vancouver style
- Tracking dataset identifiers and DOIs
- Organizing by repository, discipline, or subject
- Managing dataset version information
- Converting between citation styles
- Generating reference lists for data-driven research
- Maintaining access URLs and availability information
Common Mistakes with Dataset Citations
- Incorrect DOI format: Verify the exact DOI structure.
- Missing dataset version: Specify version if multiple versions exist.
- Incomplete creator information: List all creators or the organization.
- Wrong repository name: Verify the exact name of the data repository.
- Broken or outdated URLs: Verify URLs are current and functional.
Checklist for Dataset Citations
- Creator/author name(s) are correct
- Dataset title is accurate
- Dataset type is specified
- Repository name is correct
- Publication year is accurate
- DOI or identifier is complete
- Access URL is functional
- Version number is included (if applicable)
Finding Dataset Information
To gather dataset citation information:
- Check the repository’s metadata page
- Locate the dataset DOI or identifier
- Note the creator or author
- Verify the publication date
- Record the correct repository name
- Copy the persistent URL
Major Research Data Repositories
Common repositories for research data:
- Zenodo: Multidisciplinary open science platform
- Figshare: Open access repository for research outputs
- Dryad: Curated digital repository
- Open Science Framework: Free collaboration platform
- Data.gov: US government open data
- ICPSR: Social science data archive
- GenBank: Genetic sequence database
Research datasets are critical evidence for scientific reproducibility. Proper citation allows others to verify findings and build upon existing research.
Frequently Asked Questions
What information is needed to cite a dataset?
Include the creator/author, dataset title, publication year, repository, dataset identifier (DOI), and access URL.
Should I cite the specific version of a dataset?
Yes, if different versions exist, specify which version you used.
Where are research datasets typically stored?
Research datasets are stored in repositories like Zenodo, Figshare, Open Science Framework, and discipline-specific repositories.
Related Guides
Format Citations Automatically
Format citations in APA, MLA, Chicago and more—all inside Microsoft Word.
Install Free