Skip to content

Instantly share code, notes, and snippets.

@seregasheypak
Last active April 18, 2016 19:24
Show Gist options
  • Save seregasheypak/2843f995ad6292d29bf42a1995628106 to your computer and use it in GitHub Desktop.
Save seregasheypak/2843f995ad6292d29bf42a1995628106 to your computer and use it in GitHub Desktop.
one_day_ago = datetime.combine(datetime.today() - timedelta(1), datetime.min.time())
default_args = {
'owner': 'airflow',
'start_date': one_day_ago
}
dag = DAG(dag_id='my_pipeline', default_args=default_args)
generate_build_id = BuildIdOperator(dag=dag)
# What is the right way to pass input_path, output_path as parameters
# when starting DAG?
scalding_operator = ScaldingOperator(input_path="{{ dag_run.conf['input_path'] }}",
output_path="{{ dag_run.conf['output_path'] }}",
dag=dag, task_id='my-job-scalding',
artifact="com.company:my-job:{{ dag_run.conf['my_job_version'] }}")
scalding_operator.set_upstream(generate_build_id)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment