Skip to Content
DevelopmentReferenceTasksChainsGuestlineGuestlineCrawlerTask — Developer Reference

GuestlineCrawlerTask — Developer Reference

Developer reference for GuestlineCrawlerTask.

File: etl_lib/tasks/chains/guestline/GuestlineCrawlerTask.py

Constructor Parameters

ParameterTypeDescription
nameOptional[str]Custom task name
job_contextJobContextJob context with Spark and config
**kwargsdictExtra Task kwargs

Example

from etl_lib.tasks.chains.guestline.GuestlineCrawlerTask import GuestlineCrawlerTask task = GuestlineCrawlerTask(job_context=job_context) task.run()

Implementation Details

  • Maps S3 ingest paths to models using get_files_path.
  • Maps table names to models via get_model_for_table (roompicks → RawRoompicksModel, personprofiles → RawPersonprofilesModel).
  • Reads ingest JSON and writes to Iceberg/raw tables.

Back to tasks: /processes/tasks/chains/guestline/guestline-crawler-task

Last updated on