ORE Studio 0.0.4
A coherent catalog of synthetic data with proper relationships.
#include <synthetic_catalog.hpp>

Public Attributes
| Type | Name | Description |
| --- | --- | --- |
| std::uint64_t | seed = 0 | The seed used to generate this catalog. |
| std::vector< iam::domain::account > | accounts | IAM accounts for user references. |
| std::vector< dq::domain::catalog > | catalogs | DQ catalogs for organizing datasets. |
| std::vector< dq::domain::data_domain > | data_domains | Data domains for categorization. |
| std::vector< dq::domain::subject_area > | subject_areas | Subject areas within data domains. |
| std::vector< dq::domain::origin_dimension > | origin_dimensions | Origin dimensions for data lineage. |
| std::vector< dq::domain::nature_dimension > | nature_dimensions | Nature dimensions for data classification. |
| std::vector< dq::domain::treatment_dimension > | treatment_dimensions | Treatment dimensions for processing classification. |
| std::vector< dq::domain::dataset > | datasets | DQ datasets with proper references to other entities. |
| std::vector< std::string > | dependencies | Names of catalogs this synthetic catalog depends on. |
This structure contains all related entities that form a complete synthetic data catalog. All entities are properly linked: datasets reference their catalogs, subject areas, domains, and dimensions; all entities have consistent recorded_by values from the accounts collection.
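The catalog includes accounts, catalogs, data domains, subject areas, origin, nature, and treatment dimensions, and datasets.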
Note: Methodologies are not generated; datasets should reference the well-known "Synthetic Data Generation" methodology by ID.
| std::uint64_t seed = 0 |
The seed used to generate this catalog.
Passing the same seed to the generator service recreates the exact same catalog.
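The reproducibility property can be illustrated with a minimal sketch. The generator service's interface is not shown on this page, so `generate_values` below is a hypothetical stand-in, and the use of `std::mt19937_64` is an assumption rather than the library's actual engine. The sketch only shows why storing the seed is enough to replay a generation run.

```cpp
#include <cstddef>
#include <cstdint>
#include <random>
#include <vector>

// Hypothetical stand-in for the generator service (not the ORE Studio API):
// draws `count` raw values from an engine initialised with `seed`.
std::vector<std::uint64_t> generate_values(std::uint64_t seed, std::size_t count) {
    // Assumption: a standard 64-bit Mersenne Twister. Its output sequence is
    // fully determined by the seed, so equal seeds yield equal sequences.
    std::mt19937_64 engine(seed);
    std::vector<std::uint64_t> values;
    values.reserve(count);
    for (std::size_t i = 0; i < count; ++i) {
        values.push_back(engine());
    }
    return values;
}
```

Calling `generate_values(catalog_seed, n)` twice with the same arguments returns identical vectors. The sketch reads the engine output directly because the standard `<random>` distributions are not guaranteed to produce the same results across standard library implementations.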
| std::vector<iam::domain::account> accounts |
IAM accounts for user references.
These accounts are used as recorded_by values across all entities.
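A consistency check along these lines might look like the sketch below. The fields of `iam::domain::account` and of the DQ entities are not shown on this page, so `account_stub`, `entity_stub`, and the `username` field are hypothetical stand-ins; only the `recorded_by` name comes from the documentation.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical simplified types; the real domain types carry more fields.
struct account_stub {
    std::string username;  // assumed identifier used in recorded_by
};

struct entity_stub {
    std::string recorded_by;
};

// Returns true when every entity's recorded_by matches some account.
bool recorded_by_is_known(const std::vector<account_stub>& accounts,
                          const std::vector<entity_stub>& entities) {
    return std::all_of(entities.begin(), entities.end(),
        [&accounts](const entity_stub& e) {
            return std::any_of(accounts.begin(), accounts.end(),
                [&e](const account_stub& a) {
                    return a.username == e.recorded_by;
                });
        });
}
```

The same check could be run over each entity collection in turn after generation to confirm that no entity refers to an account outside the catalog.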
| std::vector<std::string> dependencies |
Names of catalogs this synthetic catalog depends on.
When injecting this catalog, these dependencies must be satisfied first. Each string is a catalog name that will be resolved at injection time.
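As a rough sketch of the pre-injection check described above, the helper below reports which dependency names are not yet available. The injection interface is not documented here, so `missing_dependencies` and the `available` set of already injected catalog names are assumptions made for illustration.

```cpp
#include <set>
#include <string>
#include <vector>

// Hypothetical helper (not part of ORE Studio): returns the dependency names
// that are absent from `available`, in their original order.
std::vector<std::string> missing_dependencies(
        const std::vector<std::string>& dependencies,
        const std::set<std::string>& available) {
    std::vector<std::string> missing;
    for (const auto& name : dependencies) {
        if (available.count(name) == 0) {
            missing.push_back(name);
        }
    }
    return missing;
}
```

An empty result means all dependencies are satisfied and the catalog could be injected; a non-empty result lists the catalogs that would need to be injected first.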