Job request: 4616

Organisation: Bennett Institute
Workspace: hba1c-levels-report
ID: eulhwa742h7x5kpd

This page shows the technical details of what happened when the authorised researcher Robin Park requested one or more actions to be run against real patient data within a secure environment.

By cross-referencing the requested actions with the pipeline section below, you can infer which security level each output was written to.

The output security levels are:

  • highly_sensitive
    • Researchers can never directly view these outputs
    • Researchers can only request that code be run against them
  • moderately_sensitive
    • Can be viewed by an approved researcher by logging into a highly secure environment
    • These are the only outputs that can be requested for public release via a controlled output review service.
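
The cross-referencing described above can be sketched in a few lines. This is a minimal illustration, not part of the job server: the dictionary is a hand-copied excerpt of three actions' `outputs` sections from the project.yaml on this page, and the names `PIPELINE_OUTPUTS`, `security_levels` and `reviewable_actions` are hypothetical.

```python
# Hand-copied excerpt of the `outputs` sections from the project.yaml on this
# page, written as a plain dict so the sketch needs no YAML parser.
PIPELINE_OUTPUTS = {
    "generate_study_population": {
        "highly_sensitive": {"cohort": "output/data/input_all_patients_*.csv"},
    },
    "calculate_measures": {
        "moderately_sensitive": {"measure_csv": "output/data/measure_*.csv"},
    },
    "generate_charts": {
        "moderately_sensitive": {"notebook": "output/charts.html"},
    },
}


def security_levels(outputs_by_action):
    """Map each action name to the sorted list of security levels it writes to."""
    return {action: sorted(levels) for action, levels in outputs_by_action.items()}


def reviewable_actions(outputs_by_action):
    """Return actions with at least one moderately_sensitive output.

    Per the list above, only these outputs can be requested for release.
    """
    return sorted(
        action
        for action, levels in outputs_by_action.items()
        if "moderately_sensitive" in levels
    )


levels_by_action = security_levels(PIPELINE_OUTPUTS)
reviewable = reviewable_actions(PIPELINE_OUTPUTS)
```

Here `levels_by_action["generate_charts"]` is `["moderately_sensitive"]`, so the requested action in this job request would write an output that an approved researcher could view.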

Pipeline

project.yaml
version: '3.0'

expectations:
  population_size: 10000

actions:

  generate_study_population:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_all_patients --index-date-range "2019-01-01 to 2021-06-01 by month" --output-dir=output/data 
    outputs:
      highly_sensitive:
        cohort: output/data/input_all_patients_*.csv

  generate_study_population_median:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_median --index-date-range "2019-01-01 to 2021-06-01 by month" --output-dir=output/data 
    outputs:
      highly_sensitive:
        cohort: output/data/input_median_*.csv
        
  generate_study_population_ethnicity:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_ethnicity --output-dir=output/data 
    outputs:
      highly_sensitive:
        cohort: output/data/input_ethnicity.csv
        
  join_ethnicity_all_patients:
    run: python:latest python analysis/join_ethnicity.py "input_all_patients"
    needs: [generate_study_population, generate_study_population_ethnicity]
    outputs:
      highly_sensitive:
        cohort: output/data/input_all_patients*.csv

  join_ethnicity_median:
    run: python:latest python analysis/join_ethnicity.py "input_median"
    needs: [generate_study_population_median, generate_study_population_ethnicity]
    outputs:
      highly_sensitive:
        cohort: output/data/input_median*.csv

  calculate_measures:
    run: cohortextractor:latest generate_measures --study-definition study_definition_all_patients --output-dir=output/data
    needs: [join_ethnicity_all_patients]
    outputs:
      moderately_sensitive:
        measure_csv: output/data/measure_*.csv

  redact_measures:
    run: python:latest python analysis/redact_measures.py
    needs: [calculate_measures]
    outputs:
      moderately_sensitive:
        measure_csv: output/data/measure*.csv

  generate_data_description:
    run: jupyter:latest jupyter nbconvert /workspace/notebooks/data_description.ipynb --execute --to html --template basic --output-dir=/workspace/output --ExecutePreprocessor.timeout=86400 --no-input
    needs: [join_ethnicity_all_patients]
    outputs:
      moderately_sensitive:
        notebook: output/data_description.html

  generate_change_inputs: 
    run: python:latest python analysis/mean_change_input.py
    needs: [join_ethnicity_median]
    outputs:
      moderately_sensitive:
        cohort: output/data/calc_chg_t2dm*.csv

  generate_median_inputs: 
    run: python:latest python analysis/median_input.py
    needs: [join_ethnicity_median]
    outputs:
      moderately_sensitive:
        cohort: output/data/calc_med_t2dm*.csv

  generate_charts:
    run: jupyter:latest jupyter nbconvert /workspace/notebooks/charts.ipynb --execute --to html --template basic --output-dir=/workspace/output --ExecutePreprocessor.timeout=86400 --no-input
    needs: [redact_measures, generate_change_inputs, generate_median_inputs]
    outputs:
      moderately_sensitive:
        notebook: output/charts.html
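
The `--index-date-range "2019-01-01 to 2021-06-01 by month"` argument used by the first two actions implies one extract per month, which is why their outputs are declared with a `*` wildcard. The sketch below expands that range. It assumes each file is named by appending the ISO index date to the `input_all_patients_` prefix, which is consistent with the `output/data/input_all_patients_*.csv` pattern above but is not confirmed on this page. The helper `monthly_index_dates` is hypothetical and assumes the start day exists in every month (true for the 1st).

```python
from datetime import date


def monthly_index_dates(start: date, end: date) -> list:
    """Return dates from start to end inclusive, stepping one calendar month."""
    dates = []
    year, month = start.year, start.month
    while True:
        current = date(year, month, start.day)
        if current > end:
            break
        dates.append(current)
        # Advance one month, rolling over into the next year after December.
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return dates


index_dates = monthly_index_dates(date(2019, 1, 1), date(2021, 6, 1))

# Assumed naming scheme: prefix from the output pattern plus the ISO index date.
expected_files = [
    f"output/data/input_all_patients_{d.isoformat()}.csv" for d in index_dates
]
```

Under these assumptions the range covers 30 monthly index dates, from 2019-01-01 to 2021-06-01.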

Timeline

  • Created:
  • Started:
  • Finished:
  • Runtime:

These timestamps are generated and stored using the UTC timezone on the TPP backend.

Job request

Status: Failed
  JobRequestError: redact_measures failed on a previous run and must be re-run
Backend: TPP
Requested by: Robin Park
Branch: master
Force run dependencies: No
Git commit hash: f0af5d4
Requested actions:
  • generate_charts
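
The error names `redact_measures`, which `generate_charts` lists under `needs`. The sketch below walks the `needs` graph from the project.yaml on this page to show which actions sit upstream of the requested action, then flags any that previously failed. It illustrates the dependency relationship only and is not the job runner's actual logic. The helpers `upstream` and `blocking_failures` are hypothetical, and `PREVIOUS_STATUS` contains only the one status this page reports.

```python
# `needs` relationships copied from the project.yaml on this page.
NEEDS = {
    "generate_study_population": [],
    "generate_study_population_median": [],
    "generate_study_population_ethnicity": [],
    "join_ethnicity_all_patients": [
        "generate_study_population",
        "generate_study_population_ethnicity",
    ],
    "join_ethnicity_median": [
        "generate_study_population_median",
        "generate_study_population_ethnicity",
    ],
    "calculate_measures": ["join_ethnicity_all_patients"],
    "redact_measures": ["calculate_measures"],
    "generate_data_description": ["join_ethnicity_all_patients"],
    "generate_change_inputs": ["join_ethnicity_median"],
    "generate_median_inputs": ["join_ethnicity_median"],
    "generate_charts": [
        "redact_measures",
        "generate_change_inputs",
        "generate_median_inputs",
    ],
}

# Only status stated on this page; every other action's status is unknown.
PREVIOUS_STATUS = {"redact_measures": "failed"}


def upstream(action, needs):
    """Return every action that `action` depends on, directly or indirectly."""
    seen = set()
    stack = list(needs[action])
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(needs[current])
    return seen


def blocking_failures(action, needs, previous_status):
    """Return upstream actions whose last known status is 'failed'."""
    return sorted(
        dep for dep in upstream(action, needs)
        if previous_status.get(dep) == "failed"
    )


dependencies = upstream("generate_charts", NEEDS)
blockers = blocking_failures("generate_charts", NEEDS, PREVIOUS_STATUS)
```

With these inputs, `blockers` contains only `redact_measures`, matching the action named in the error. `generate_data_description` is not upstream of `generate_charts`, so it plays no part in this request.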
