Skip to content

DatabricksRunNowOperator: Repair run doesnt pass job parameters and a retry on a job with an idempotency token results always results in succes. #64828

@roydobbe

Description

@roydobbe

Under which category would you file this issue?

Providers

Apache Airflow version

2.11.1

What happened and how to reproduce it?

Repair run:

Whenever a repair run is triggered, the job parameters need to be passed to the api endpoint for it to register the params for the tasks again.

I tried replicating the operator logic locally, but only when i explicitly passed the job parameters to the repair endpoint, I saw the parameters for the task it was repairing resolved.

Currently the operator only updates the job parameters as an intermediate step before triggering the repair endpoint, but doesn't pass it along in the repair_json object.

Idempotency token

This may be expected behavior but, when we retry an airflow task with an idempotency token, it results in a successful state, even if the task its retrying has failed initially.

We would expect it to trigger a repair run if repair run is true, else we would expect a failed state.

What you think should happen instead?

No response

Operating System

No response

Deployment

Astronomer

Apache Airflow Provider(s)

databricks

Versions of Apache Airflow Providers

databricks - 7.12

Official Helm Chart version

Not Applicable

Kubernetes Version

No response

Helm Chart configuration

No response

Docker Image customizations

No response

Anything else?

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions