GuestlineCrawlerTask
The GuestlineCrawlerTask parses the files in the ingest bucket for Guestline chains and writes Iceberg/raw tables for further processing. It maps the physical ingest file layout to model classes and standardizes the table naming convention.
Overview
This task:
- Maps S3 ingest paths to RawRoompicksModel and RawPersonprofilesModel
- Reads JSON payloads from the ingest bucket
- Writes them into Iceberg/raw tables for downstream cleaning tasks
- Uses get_files_path to locate per-property vs global payloads
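The read step above can be sketched as follows. This is a minimal illustration, not the task's actual code: parse_payload, crawl, and the source_path column are hypothetical names, the files are passed in as an in-memory mapping instead of being fetched from S3, and the Iceberg write is left out entirely.

```python
import json
from typing import Any, Dict, List


def parse_payload(text: str, source_path: str) -> List[Dict[str, Any]]:
    """Turn one JSON payload into a list of raw rows.

    Assumption: a payload is either a single JSON object or an array of
    objects. Each row keeps the path it came from (hypothetical column).
    """
    data = json.loads(text)
    records = data if isinstance(data, list) else [data]
    return [{"source_path": source_path, "payload": record} for record in records]


def crawl(files: Dict[str, str]) -> List[Dict[str, Any]]:
    """Parse every file (path -> JSON text) in path order and collect the rows."""
    rows: List[Dict[str, Any]] = []
    for path, text in sorted(files.items()):
        rows.extend(parse_payload(text, path))
    return rows
```

In a real run the collected rows would be handed to whatever writer populates the raw tables; that part depends on the project's Iceberg setup and is not shown here.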
Implemented Logic
- get_files_path(table_name, property_id=None) returns the ingest path for the given chain and, when property_id is set, the given property
- get_model_for_table(table_name) maps roompicks to RawRoompicksModel and personprofiles to RawPersonprofilesModel
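A rough sketch of how these two helpers could look, using the signatures listed above. The bucket prefix, chain identifier, path layout, and the stand-in model classes are all assumptions made for illustration; the real ingest layout and model fields are not described on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RawRoompicksModel:
    """Stand-in for the real model; its actual fields are not shown here."""
    payload: dict


@dataclass
class RawPersonprofilesModel:
    """Stand-in for the real model; its actual fields are not shown here."""
    payload: dict


# Hypothetical values, chosen only to make the example runnable.
INGEST_PREFIX = "s3://example-ingest-bucket/guestline"
CHAIN_ID = "example-chain"

_MODELS = {
    "roompicks": RawRoompicksModel,
    "personprofiles": RawPersonprofilesModel,
}


def get_model_for_table(table_name: str) -> type:
    """Return the model class registered for a table name."""
    try:
        return _MODELS[table_name]
    except KeyError:
        raise ValueError(f"Unknown Guestline table: {table_name!r}") from None


def get_files_path(table_name: str, property_id: Optional[str] = None) -> str:
    """Build the ingest path: global when property_id is None, else per-property.

    The directory layout used here is an assumed one.
    """
    base = f"{INGEST_PREFIX}/{CHAIN_ID}"
    if property_id is None:
        return f"{base}/{table_name}/"
    return f"{base}/{property_id}/{table_name}/"
```

Keeping the table-to-model mapping in a single dictionary means an unsupported table name fails with a clear error instead of silently producing a path with no matching model.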
Models
Requires
- None (ingester writes ingest files)
Provides
- RawRoompicksModel
- RawPersonprofilesModel
Related Tasks