RawProfileModel — Developer Reference
Developer reference for RawProfileModel used for the Opera chain (guest profiles).
File: etl_lib/models/raw/RawProfileModel.py
Constructor Parameters
| Parameter | Type | Description |
|---|---|---|
job_context | JobContext | Job context with Spark, config and catalog |
database | GlueDatabases | Default: GlueDatabases.RAW |
table | RawTables | Default: RawTables.GUEST_PROFILES |
df | DataFrame | Optional DataFrame to use instead of reading from the catalog |
overwrite_partitions | bool | Whether to overwrite partitions when writing |
Purpose & Usage
RawProfileModel represents the guest profile payloads fetched by the Opera ingester and stored in the raw database. These records are used by CleanOperaTask to enrich guest information and derive nationality, addresses, and other profile details.
Example
from etl_lib.models.raw.RawProfileModel import RawProfileModel
model = RawProfileModel(job_context=job_context)
df = model.get()Back to models: [/processes/models](/processes/models)
Last updated on