GuestlineCrawlerTask — Developer Reference
Developer reference for GuestlineCrawlerTask.
File: etl_lib/tasks/chains/guestline/GuestlineCrawlerTask.py
Constructor Parameters
| Parameter | Type | Description |
|---|---|---|
name | Optional[str] | Custom task name |
job_context | JobContext | Job context with Spark and config |
**kwargs | dict | Extra Task kwargs |
Example
from etl_lib.tasks.chains.guestline.GuestlineCrawlerTask import GuestlineCrawlerTask
task = GuestlineCrawlerTask(job_context=job_context)
task.run()Implementation Details
- Maps S3 ingest paths to models using
get_files_path. - Maps table names to models via
get_model_for_table(roompicks →RawRoompicksModel, personprofiles →RawPersonprofilesModel). - Reads ingest JSON and writes to Iceberg/raw tables.
Back to tasks: /processes/tasks/chains/guestline/guestline-crawler-task
Last updated on