GuestlineRawTask — Developer Reference
Developer reference for GuestlineRawTask with constructor parameters, examples and implementation details.
Constructor Parameters
| Parameter | Type | Description |
|---|---|---|
name | Optional[str] | Custom name for the task |
job_context | JobContext | Context with Spark session and configuration |
sync_dates | Optional[List[str]] | Specific dates to sync (YYYY-MM-DD format) |
skip_ingestion | bool | Skip ingestion and only run crawler (default: False) |
write_to_catalog | bool | Write outputs to the catalog (default: True) |
Example Usage
from etl_lib.tasks.chains.guestline.GuestlineRawTask import GuestlineRawTask
task = GuestlineRawTask(
job_context=job_context,
write_to_catalog=True
)
task.run()Sync Specific Dates
task = GuestlineRawTask(
job_context=job_context,
sync_dates=["2025-09-15"],
)Skip Ingestion
task = GuestlineRawTask(
job_context=job_context,
skip_ingestion=True
)Implementation Details
GuestlineRawTask conditionally creates an ingester subtask followed by the crawler subtask, and writes RawModels.
subtasks = []
if not skip_ingestion:
subtasks.append(GuestlineIngesterTask(job_context=job_context, sync_dates=sync_dates))
subtasks.append(GuestlineCrawlerTask(job_context=job_context))
super().__init__(name=name, job_context=job_context, subtasks=subtasks)Back to tasks: /processes/tasks/chains/guestline/guestline-raw-task
Last updated on