
GuestlineRawTask — Developer Reference

Developer reference for GuestlineRawTask with constructor parameters, examples and implementation details.

Constructor Parameters

| Parameter | Type | Description |
|---|---|---|
| name | Optional[str] | Custom name for the task |
| job_context | JobContext | Context with Spark session and configuration |
| sync_dates | Optional[List[str]] | Specific dates to sync (YYYY-MM-DD format) |
| skip_ingestion | bool | Skip ingestion and only run crawler (default: False) |
| write_to_catalog | bool | Write outputs to the catalog (default: True) |
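Because sync_dates expects YYYY-MM-DD strings, it can help to check the list before constructing the task. The following is a minimal sketch only: validate_sync_dates is a hypothetical helper introduced here for illustration, it is not part of etl_lib, and nothing in this reference says whether GuestlineRawTask performs any such validation itself.

```python
from datetime import datetime
from typing import List, Optional


def validate_sync_dates(sync_dates: Optional[List[str]]) -> Optional[List[str]]:
    """Hypothetical helper (not part of etl_lib): check YYYY-MM-DD strings."""
    if sync_dates is None:
        # No explicit dates were requested; nothing to validate.
        return None
    for value in sync_dates:
        # strptime raises ValueError for malformed or impossible dates.
        parsed = datetime.strptime(value, "%Y-%m-%d")
        # strptime tolerates non-padded forms such as "2025-9-5",
        # so compare against the canonical ISO form to reject them.
        if parsed.date().isoformat() != value:
            raise ValueError(f"expected YYYY-MM-DD, got {value!r}")
    return sync_dates
```

The validated list can then be passed as sync_dates=validate_sync_dates(["2025-09-15"]).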

Example Usage

    from etl_lib.tasks.chains.guestline.GuestlineRawTask import GuestlineRawTask

    task = GuestlineRawTask(
        job_context=job_context,
        write_to_catalog=True
    )
    task.run()

Sync Specific Dates

    task = GuestlineRawTask(
        job_context=job_context,
        sync_dates=["2025-09-15"],
    )

Skip Ingestion

    task = GuestlineRawTask(
        job_context=job_context,
        skip_ingestion=True
    )

Implementation Details

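GuestlineRawTask builds its subtask list in the constructor. Unless skip_ingestion is True, it first adds a GuestlineIngesterTask (passing sync_dates through); it then always adds a GuestlineCrawlerTask. The resulting chain writes RawModels.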

    subtasks = []
    if not skip_ingestion:
        subtasks.append(GuestlineIngesterTask(job_context=job_context, sync_dates=sync_dates))
    subtasks.append(GuestlineCrawlerTask(job_context=job_context))
    super().__init__(name=name, job_context=job_context, subtasks=subtasks)
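The conditional composition above can be tried without Spark or etl_lib. This is a sketch under stated assumptions: StubTask and build_subtasks are hypothetical stand-ins written for this example, and they only mirror the ordering logic of the constructor, not the behavior of the real ingester or crawler tasks.

```python
from typing import List, Optional


class StubTask:
    """Hypothetical stand-in for an etl_lib task; it only records its inputs."""

    def __init__(self, name: str, sync_dates: Optional[List[str]] = None):
        self.name = name
        self.sync_dates = sync_dates


def build_subtasks(skip_ingestion: bool = False,
                   sync_dates: Optional[List[str]] = None) -> List[StubTask]:
    """Mirror the constructor's ordering: optional ingester, then crawler."""
    subtasks: List[StubTask] = []
    if not skip_ingestion:
        # The ingester is the only subtask that receives sync_dates.
        subtasks.append(StubTask("ingester", sync_dates=sync_dates))
    # The crawler is always appended, so it runs last in the chain.
    subtasks.append(StubTask("crawler"))
    return subtasks


print([t.name for t in build_subtasks()])                     # ['ingester', 'crawler']
print([t.name for t in build_subtasks(skip_ingestion=True)])  # ['crawler']
```

One consequence visible in the sketch: when skip_ingestion is True, sync_dates has no effect, because it is only ever handed to the ingester.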

