Skip to main content

Overview

run_ml_job is Stage 3a of the pipeline (alternative to run_dl_job). It triggers the ml-worker-job pipeline job, which trains a classical machine learning model using scikit-learn on the feature-engineered dataset and generates per-ticker price predictions. The tool blocks until the job completes and returns the output blob URL.

Parameters

feature_url
string
required
Blob storage URL pointing to a feature_engine_*.json file.This is the output_url returned by run_feature_worker.
data_extractor_url
string
required
Blob storage URL pointing to a data_extractor_*.json file.This is the output_url returned by run_data_extraction. Required alongside feature_url for proper train/test date alignment.
config
object
required
ML model configuration.

Returns

{
  "status": "Succeeded",
  "output_url": "https://stmcpfabricdev.blob.core.windows.net/data/ml_engine_20260307_125841.json",
  "output_name": "ml_engine_20260307_125841.json",
  "execution_name": "ml-worker-job-abc123xyz"
}
FieldDescription
statusJob terminal status (Succeeded)
output_urlFull HTTPS URL to the output blob — pass to run_po_job
output_nameBlob filename
execution_nameJob execution ID for audit/debugging

Example

{
  "feature_url": "https://stmcpfabricdev.blob.core.windows.net/data/feature_engine_20260307_125103.json",
  "data_extractor_url": "https://stmcpfabricdev.blob.core.windows.net/data/data_extractor_20260307_124937.json",
  "config": {
    "ML params": {
      "model": "random_forest",
      "n_estimators": 100,
      "test_size": 15,
      "n_splits": 15
    }
  }
}

Resources

ResourceValue
Container Apps Jobml-worker-job
Container nameml-worker-job
Env vars injectedFEATURE_URL, DATA_EXTRACTOR_URL, CONFIG
Output blob prefixml_engine_
Timeout600 seconds

Next Step

Pass output_url to run_po_job as input_url.
If you prefer a neural network approach, use run_dl_job instead — it produces an nn_engine_*.json blob that is equally compatible with run_po_job.