Warning: These documents are under active
development and subject to change (version 2.1.0-beta).
The latest release documents are at:
https://purl.dataone.org/architecture
The replication policy of Member Nodes (MN) indicates factors such as the amount of storage space available, bandwidth constraints, the types of data and metadata that can be managed, and perhaps access control restrictions. This information is used by Coordinating Nodes (CN) to balance the distribution of data packages throughout the DataONE system to achieve the goals of data package persistence and accessibility.
Initial implementation of the infrastructure emphasizes the preservation goals of replication, i.e. ensuring sufficient copies of data and metadata objects are available to ensure ongoing access to the content. The replication system offers the possibility of more dynamic control over the flow of information between systems to support science research. For example, the replication services may be exploited as a mechanism for staging data in preparation for experiments at HPC nodes or centers.
Figure 1. Use case 08.
Figure 2. Communication of Replication Policy Metadata
Notes