academic_observatory_workflows.openalex_telescope.release
Classes
Functions
|
Extracts the S3 bucket name and object key from the given S3 URI. |
Module Contents
- class academic_observatory_workflows.openalex_telescope.release.OpenAlexEntity(*, dag_id: str, run_id: str, cloud_workspace: observatory_platform.airflow.workflow.CloudWorkspace, entity_name: str, bq_dataset_id: str, schema_folder: str, snapshot_date: pendulum.DateTime, manifest: Manifest, merged_ids: List[MergedId], is_first_run: bool)[source]
Bases:
observatory_platform.airflow.release.SnapshotRelease- static from_dict(dict_: dict) OpenAlexEntity[source]
- academic_observatory_workflows.openalex_telescope.release.s3_uri_parts(s3_uri: str) Tuple[str, str][source]
Extracts the S3 bucket name and object key from the given S3 URI.
- Parameters:
s3_uri – str, S3 URI in format s3://mybucketname/path/to/object
- Returns:
tuple, (bucket_name, object_key)
- class academic_observatory_workflows.openalex_telescope.release.Manifest(entries: List[ManifestEntry], meta: Meta)[source]
- class academic_observatory_workflows.openalex_telescope.release.Meta(content_length, record_count)[source]
- class academic_observatory_workflows.openalex_telescope.release.ManifestEntry(url: str, meta: Meta)[source]
-
- static from_dict(dict_: Dict) ManifestEntry[source]