Dynamic Data Solutions for Advanced Biodiversity Research
BOLD Data Packages give researchers, policymakers, and industry professionals structured, ready-to-use data for a range of applications, from the latest snapshots for immediate analysis to comprehensive historical datasets and project-specific data for in-depth studies.
Optimizing Data Accessibility for Scalable Research
By making large datasets easy to access and manage, these packages support analysis at any scale, from individual research to large international projects. Because they follow Frictionless Data standards, the data is easy to access, integrate, and reuse across platforms, which improves interoperability between research tools and supports reliable, reproducible biodiversity research. The packages are also flexible enough to serve both rapid-response work and detailed historical analysis, significantly reducing the time and resources usually spent on data collection and preparation.
Enhancing Research Efficiency and Cost-Effectiveness
This streamlined approach not only accelerates research timelines but also improves the economic feasibility of biodiversity projects, making BOLD Data Packages essential for driving impactful decisions and advancing global biodiversity research.
Recent Data:
Data is provided in TSV and FASTA formats along with metadata files in JSON format.
Specimens: 16,078,145
Sequences: 16,407,007
Latest
The latest BOLD DNA Barcode Reference Library snapshot. Data is provided in TSV and FASTA formats along with metadata files in JSON format.
Specimens: 16,079,148
Sequences: 16,407,477
Second Latest
The second latest BOLD DNA Barcode Reference Library snapshot. Data is provided in TSV and FASTA formats along with metadata files in JSON format.
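Because each snapshot ships as TSV and FASTA files with JSON metadata, it can be read with standard-library tools alone. The following is a minimal Python sketch, not an official BOLD client: the function names are hypothetical, and the column names and record IDs in the sample data (`processid`, `family`, `DEMO001-24`) are illustrative placeholders rather than a statement of the actual package schema.

```python
import csv
import io
import json
from typing import Dict, Iterator, TextIO, Tuple


def read_metadata(handle: TextIO) -> dict:
    """Load the JSON metadata file that accompanies a snapshot."""
    return json.load(handle)


def iter_tsv_records(handle: TextIO) -> Iterator[Dict[str, str]]:
    """Yield one dict per TSV row, keyed by the header line."""
    # QUOTE_NONE keeps stray quote characters in field values intact.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    yield from reader


def iter_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs, joining wrapped sequence lines."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# In-memory stand-ins for real package files (placeholder content only).
sample_tsv = io.StringIO("processid\tfamily\nDEMO001-24\tApidae\n")
sample_fasta = io.StringIO(">DEMO001-24\nACGT\nTTGA\n")

for record in iter_tsv_records(sample_tsv):
    print(record["processid"], record["family"])
for header, sequence in iter_fasta(sample_fasta):
    print(header, len(sequence))
```

All three readers take open file handles and the two record readers are generators, so a multi-gigabyte snapshot can be streamed row by row instead of being loaded into memory at once.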
Historical Data:
Quarterly snapshots of the public data on BOLD over the past year.
Dataset from 27-SEP-2024
BOLD DNA Barcode Reference Library snapshot taken on Sep 27, 2024. Data is provided in TSV and FASTA formats along with metadata files in JSON format following BCDM.
Specimens: 16,070,594
Sequences: 16,397,881
Dataset from 19-JUL-2024
BOLD DNA Barcode Reference Library snapshot taken on Jul 19, 2024. Data is provided in TSV and FASTA formats along with metadata files in JSON format following BCDM.
Specimens: 15,708,115
Sequences: 16,033,052
Dataset from 29-MAR-2024
BOLD DNA Barcode Reference Library snapshot taken on Mar 29, 2024. Data is provided in TSV and FASTA formats along with metadata files in JSON format.
Specimens: 10,459,761
Sequences: 10,783,019
Dataset from 29-DEC-2023
BOLD DNA Barcode Reference Library snapshot taken on Dec 29, 2023. Data is provided in TSV and FASTA formats along with metadata files in JSON format.
Specimens: 10,404,811
Sequences: 10,726,287
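The specimen counts listed for the four historical snapshots can be compared directly to see how the public library grew between releases. The sketch below is one simple way to do that arithmetic in Python. The counts are copied from the listing above; the `Snapshot` type and `specimen_growth` function are hypothetical names introduced for illustration.

```python
from typing import List, NamedTuple, Tuple


class Snapshot(NamedTuple):
    date: str
    specimens: int
    sequences: int


# Counts copied from the historical snapshot listing, oldest first.
SNAPSHOTS = [
    Snapshot("2023-12-29", 10_404_811, 10_726_287),
    Snapshot("2024-03-29", 10_459_761, 10_783_019),
    Snapshot("2024-07-19", 15_708_115, 16_033_052),
    Snapshot("2024-09-27", 16_070_594, 16_397_881),
]


def specimen_growth(
    snapshots: List[Snapshot],
) -> List[Tuple[str, str, int, float]]:
    """Return (from_date, to_date, added, percent) for consecutive snapshots."""
    rows = []
    for prev, curr in zip(snapshots, snapshots[1:]):
        added = curr.specimens - prev.specimens
        rows.append((prev.date, curr.date, added, 100.0 * added / prev.specimens))
    return rows


for start, end, added, pct in specimen_growth(SNAPSHOTS):
    print(f"{start} to {end}: +{added:,} specimens ({pct:.2f}%)")
```

By these figures, most of the year's growth falls between the March and July 2024 snapshots.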
Project Data:
CBG.R1.21-Mar-2024
This is the initial data release from the Centre for Biodiversity Genomics (CBG). It marks CBG's adoption of a strict data release policy and signals its commitment to open science. This dataset, likely the largest and most diverse DNA barcode dataset released, encompasses records generated over a 15-year period, with the majority produced in the past three years. The records originate from over 180 countries and represent 351K species. All records underwent validation, though errors may persist. Efforts were made to ensure validity at least to the family level. By releasing this data, CBG aims to support global biodiversity research and advance collaborative efforts across the biodiversity science community.
Specimens: 4,194,294
Sequences: 4,194,294
iBOLD.31-Dec-2016
The BARCODE 500K program was the inaugural program of the International Barcode of Life (iBOL) consortium. It delivered DNA barcodes for five hundred thousand species sourced from a global network of collections. The program was initiated in 2009 and concluded in 2015. This dataset includes barcodes from reference museum specimens as well as new collections generated from environmental samples.
Specimens: 2,787,799
Sequences: 2,799,047
CBN.31-Dec-2008
The goal of the Canadian Barcode of Life Network project was to dramatically advance the inventory of Canadian biodiversity. The project was initiated in 2005 and concluded in 2009. This dataset includes barcodes from groups of particular economic and social interest in Canada as well as samples from a wide range of other species.
Specimens: 94,318
Sequences: 102,707