Overview
run_data_extraction is the first step of the pipeline. It triggers the job-data-extraction-worker pipeline job, which downloads historical OHLCV (Open, High, Low, Close, Volume) market data from the configured data source and writes the result to blob storage.
The tool blocks until the job completes and returns the output blob URL.
Parameters
Data extraction configuration.
Returns
| Field | Description |
|---|---|
status | Job terminal status (Succeeded) |
output_url | Full HTTPS URL to the output blob — pass to run_feature_worker |
output_name | Blob filename |
execution_name | Job execution ID for audit/debugging |
Example
Resources
| Resource | Value |
|---|---|
| Container Apps Job | job-data-extraction-worker |
| Container name | job-data-extraction-worker |
| Env var injected | CONFIG (JSON-serialized config) |
| Output blob prefix | data_extractor_ |
| Timeout | 600 seconds |
Next Step
Passoutput_url to run_feature_worker as input_url.