Opera Crawler Task
The OperaCrawlerTask parses Opera raw files (JSON / XML / CSV) and converts them into the raw models consumed by the cleaning task:
RawReservationModelRawRateModelRawProfileModel
Notes
- The crawler should be able to run incrementally when
incremental_crawl=True. - The crawler extracts nested reservation structures and ensures consistent keys for downstream cleaning.
Related Documentation
Last updated on