ORE Studio 0.0.4
synthetic_catalog Struct Reference (final)

A coherent catalog of synthetic data with proper relationships.

#include <synthetic_catalog.hpp>


Public Attributes

std::uint64_t seed = 0
 The seed used to generate this catalog.
 
std::vector< iam::domain::account > accounts
 IAM accounts for user references.
 
std::vector< dq::domain::catalog > catalogs
 DQ catalogs for organizing datasets.
 
std::vector< dq::domain::data_domain > data_domains
 Data domains for categorization.
 
std::vector< dq::domain::subject_area > subject_areas
 Subject areas within data domains.
 
std::vector< dq::domain::origin_dimension > origin_dimensions
 Origin dimensions for data lineage.
 
std::vector< dq::domain::nature_dimension > nature_dimensions
 Nature dimensions for data classification.
 
std::vector< dq::domain::treatment_dimension > treatment_dimensions
 Treatment dimensions for processing classification.
 
std::vector< dq::domain::dataset > datasets
 DQ datasets with proper references to other entities.
 
std::vector< std::string > dependencies
 Names of catalogs this synthetic catalog depends on.
 

Detailed Description

A coherent catalog of synthetic data with proper relationships.

This structure contains all related entities that form a complete synthetic data catalog. All entities are properly linked: datasets reference their catalogs, subject areas, domains, and dimensions; all entities have consistent recorded_by values from the accounts collection.

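The catalog includes IAM accounts, DQ catalogs, data domains, subject areas, origin, nature, and treatment dimensions, and datasets.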

Note: Methodologies are not generated; datasets should reference the well-known "Synthetic Data Generation" methodology by ID.
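
The linking described above can be pictured as a referential-integrity check. The following is a minimal sketch only: account_stub, catalog_stub, dataset_stub, catalog_view, and is_coherent are hypothetical stand-ins invented for illustration, and the field names (username, catalog_name, recorded_by as a plain string) are assumptions. The real iam::domain::account and dq::domain types may look quite different.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical, simplified stand-ins for the real domain types. The actual
// iam::domain::account and dq::domain::* classes may use different fields.
struct account_stub { std::string username; };
struct catalog_stub { std::string name; std::string recorded_by; };
struct dataset_stub {
    std::string name;
    std::string catalog_name;  // assumed reference to a catalog by name
    std::string recorded_by;   // assumed to hold an account username
};

// Hypothetical reduced view of a synthetic catalog (three collections only).
struct catalog_view {
    std::vector<account_stub> accounts;
    std::vector<catalog_stub> catalogs;
    std::vector<dataset_stub> datasets;
};

// Returns true when every dataset points at a known catalog and every
// recorded_by value names an account in the accounts collection.
inline bool is_coherent(const catalog_view& c) {
    const auto known_account = [&](const std::string& user) {
        return std::any_of(c.accounts.begin(), c.accounts.end(),
            [&](const account_stub& a) { return a.username == user; });
    };
    const auto known_catalog = [&](const std::string& name) {
        return std::any_of(c.catalogs.begin(), c.catalogs.end(),
            [&](const catalog_stub& k) { return k.name == name; });
    };
    for (const auto& k : c.catalogs)
        if (!known_account(k.recorded_by)) return false;
    for (const auto& d : c.datasets)
        if (!known_catalog(d.catalog_name) || !known_account(d.recorded_by))
            return false;
    return true;
}
```

A check of this shape could be extended to subject areas, data domains, and the three dimension collections in the same way, assuming they are referenced by a comparable key.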

Member Data Documentation

◆ seed

std::uint64_t seed = 0

The seed used to generate this catalog.

Passing the same seed to the generator service recreates the exact same catalog.
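
Why a stored seed is enough to reproduce a catalog can be illustrated with a small sketch. This is not the ORE Studio generator service, whose interface is not shown on this page; make_names is a hypothetical function, and the choice of std::mt19937_64 is an assumption made purely for the example.

```cpp
#include <cstddef>
#include <cstdint>
#include <random>
#include <string>
#include <vector>

// Hypothetical generator: derives `count` dataset names from a seed.
// It only illustrates seeding and is unrelated to the real generator service.
inline std::vector<std::string> make_names(std::uint64_t seed, std::size_t count) {
    std::mt19937_64 engine(seed);  // same seed => same output sequence
    std::vector<std::string> names;
    names.reserve(count);
    for (std::size_t i = 0; i < count; ++i) {
        // Use the raw engine output: it is fixed by the C++ standard,
        // whereas the <random> distributions may differ between libraries.
        names.push_back("dataset_" + std::to_string(engine() % 10000));
    }
    return names;
}
```

Calling make_names twice with the same seed and count returns equal vectors, which is the property the seed member relies on: keeping the seed alongside the generated data is sufficient to regenerate it later.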

◆ accounts

std::vector<iam::domain::account> accounts

IAM accounts for user references.

These accounts are used as recorded_by values across all entities.

◆ dependencies

std::vector<std::string> dependencies

Names of catalogs this synthetic catalog depends on.

When injecting this catalog, these dependencies must be satisfied first. Each string is a catalog name that will be resolved at injection time.
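
A pre-injection check along these lines might look as follows. This is a sketch under stated assumptions: missing_dependencies is a hypothetical helper, and the list of already available catalog names is assumed to be obtainable as a plain vector of strings. How the real injection step resolves names is not described on this page.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Returns the dependency names that are not present in `available`.
// `available` is a hypothetical list of catalog names already injected.
inline std::vector<std::string> missing_dependencies(
    const std::vector<std::string>& dependencies,
    const std::vector<std::string>& available) {
    std::vector<std::string> missing;
    for (const auto& name : dependencies) {
        if (std::find(available.begin(), available.end(), name) == available.end())
            missing.push_back(name);
    }
    return missing;
}
```

An empty result would mean every dependency is satisfied and injection can proceed; otherwise the returned names indicate which catalogs need to be injected first.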