academic_observatory_workflows.pubmed_telescope.release

Classes

PubMedRelease

Module Contents

class academic_observatory_workflows.pubmed_telescope.release.PubMedRelease(*, dag_id: str, run_id: str, cloud_workspace: observatory_platform.airflow.workflow.CloudWorkspace, bq_dataset_id: str, start_date: pendulum.DateTime, end_date: pendulum.DateTime, year_first_run: bool, datafile_list: List[academic_observatory_workflows.pubmed_telescope.datafile.Datafile], baseline_upload_date: pendulum.DateTime)

Bases: observatory_platform.airflow.release.ChangefileRelease

cloud_workspace
bq_dataset_id
year_first_run
datafile_list
baseline_upload_date
datafile_release
schema_file_path(record_type: str) → str
transfer_blob_pattern(table_type: str) → str

Create a blob pattern for importing the transformed, unmerged records from GCS into BigQuery.

Parameters:

table_type – Type of the record.

Returns:

URI pattern for transformed files.
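As an illustration of what such a pattern can look like, the sketch below builds a wildcard blob pattern for one record type and checks which blob names it would select. BigQuery load jobs accept a `*` wildcard in a Cloud Storage URI, which is why a single pattern can stand in for many transformed files. The folder prefix, file naming, and the helper name `example_blob_pattern` are assumptions made for this example; they are not taken from the telescope's implementation.

```python
from fnmatch import fnmatchcase


def example_blob_pattern(prefix: str, table_type: str) -> str:
    """Hypothetical helper: build a wildcard pattern for one record type."""
    # Assumed layout (illustrative only): <prefix>/<table_type>_*.jsonl.gz
    return f"{prefix}/{table_type}_*.jsonl.gz"


pattern = example_blob_pattern("transform/pubmed", "article")

# Made-up blob names, used only to show which ones the pattern selects.
blobs = [
    "transform/pubmed/article_0001.jsonl.gz",
    "transform/pubmed/article_0002.jsonl.gz",
    "transform/pubmed/delete_0001.jsonl.gz",
]
matched = [name for name in blobs if fnmatchcase(name, pattern)]
```

Here `matched` holds only the two `article_` blobs, so one pattern per record type is enough to load every file of that type.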

property baseline_files: List[academic_observatory_workflows.pubmed_telescope.datafile.Datafile]

Return a list of the “baseline” datafiles for this release.

property updatefiles: List[academic_observatory_workflows.pubmed_telescope.datafile.Datafile]

Return a list of “updatefile” datafiles for this release.
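Both properties can be thought of as filters over `datafile_list`. The sketch below shows that idea with a stand-in class; `ExampleDatafile`, its `file_type` field, and the type labels are assumptions for illustration and may not match the real `Datafile` attributes.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ExampleDatafile:
    """Hypothetical stand-in for Datafile; only the fields needed here."""

    filename: str
    file_type: str  # assumed labels: "baseline" or "updatefile"


def select_by_type(datafiles: List[ExampleDatafile], file_type: str) -> List[ExampleDatafile]:
    """Return the datafiles whose type matches, preserving input order."""
    return [d for d in datafiles if d.file_type == file_type]


# Illustrative file names only.
datafile_list = [
    ExampleDatafile("pubmed24n0001.xml.gz", "baseline"),
    ExampleDatafile("pubmed24n0002.xml.gz", "baseline"),
    ExampleDatafile("pubmed24n1220.xml.gz", "updatefile"),
]

baseline = select_by_type(datafile_list, "baseline")
updates = select_by_type(datafile_list, "updatefile")
```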

property merged_upsert_uri_blob_pattern: str

Create a URI blob pattern for importing the transformed merged upserts from GCS into BigQuery.

Returns:

URI pattern for merged transform files.

property merged_delete_transfer_uri: str

Create a URI for importing the transformed merged deletes from GCS into BigQuery.

Returns:

URI for merged transform files.

property merged_delete_file_path
static from_dict(dict_: dict) → PubMedRelease
to_dict() → dict
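A `to_dict()` / `from_dict()` pair like this is typically used to pass a release between workflow tasks as plain, JSON-friendly data. The sketch below shows the round trip on a reduced stand-in class. `ExampleRelease` and its field subset are hypothetical, and it uses the standard library `datetime` rather than `pendulum` to stay self-contained; the real `PubMedRelease` may serialise its fields differently.

```python
import json
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class ExampleRelease:
    """Hypothetical, reduced stand-in for PubMedRelease."""

    dag_id: str
    run_id: str
    start_date: datetime
    year_first_run: bool

    def to_dict(self) -> dict:
        # Dates are stored as ISO 8601 strings so the dict is JSON-serialisable.
        return {
            "dag_id": self.dag_id,
            "run_id": self.run_id,
            "start_date": self.start_date.isoformat(),
            "year_first_run": self.year_first_run,
        }

    @staticmethod
    def from_dict(dict_: dict) -> "ExampleRelease":
        return ExampleRelease(
            dag_id=dict_["dag_id"],
            run_id=dict_["run_id"],
            start_date=datetime.fromisoformat(dict_["start_date"]),
            year_first_run=dict_["year_first_run"],
        )


release = ExampleRelease(
    dag_id="pubmed",
    run_id="scheduled__2024-01-01",
    start_date=datetime(2024, 1, 1, tzinfo=timezone.utc),
    year_first_run=True,
)

# Serialise to JSON and back, then rebuild the object.
payload = json.loads(json.dumps(release.to_dict()))
restored = ExampleRelease.from_dict(payload)
```

Converting dates to strings in `to_dict()` and parsing them in `from_dict()` keeps the intermediate dict free of non-serialisable objects, which is the main design constraint for this kind of pair.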