Skip to content

Job request: 11453

Organisation:
Bennett Institute
Workspace:
asthma_indicators
ID:
ix4hndqbjfuu4jqn

This page shows the technical details of what happened when the authorised researcher Rose Higgins requested one or more actions to be run against real patient data within a secure environment.

By cross-referencing the list of jobs with the pipeline section below, you can infer what security level the outputs were written to.

The output security levels are:

  • highly_sensitive
    • Researchers can never directly view these outputs
    • Researchers can only request code is run against them
  • moderately_sensitive
    • Can be viewed by an approved researcher by logging into a highly secure environment
    • These are the only outputs that can be requested for public release via a controlled output review service.

Jobs

Pipeline

Show project.yaml
version: '3.0'

expectations:
  population_size: 1000

actions:

# Asthma indicator (AST007) actions

  generate_study_population_ast007:
    run: cohortextractor:latest generate_cohort 
      --study-definition study_definition_ast007 
      --index-date-range "2019-01-01 to 2022-03-31 by month" 
      --output-dir=output
    outputs:
      highly_sensitive:
        cohort: output/input_ast007*.csv
 
  generate_study_population_ethnicity:
    run: cohortextractor:latest generate_cohort 
      --study-definition study_definition_ethnicity 
      --output-dir=output
    outputs:
      highly_sensitive:
        cohort: output/input_ethnicity.csv

  join_ethnicity_ast007:
    run: >
      cohort-joiner:v0.0.46 
        --lhs output/input_ast007*.csv
        --rhs output/input_ethnicity.csv
        --output-dir output/joined
    needs: [generate_study_population_ethnicity, generate_study_population_ast007]
    outputs:
      highly_sensitive:
        cohort: output/joined/input_ast007*.csv

  generate_measures_ast007:
      run: cohortextractor:latest generate_measures 
        --study-definition study_definition_ast007 
        --output-dir=output/joined
      needs: [generate_study_population_ast007, generate_study_population_ethnicity, join_ethnicity_ast007]
      outputs:
        moderately_sensitive:
          measure_csv: output/joined/measure_*_rate.csv

  join_measures_ast007:
      run: python:latest python analysis/join_and_round.py
           --input-list output/joined/measure_ast007_total_rate.csv
           --input-list output/joined/measure_ast007_age_band_rate.csv
           --input-list output/joined/measure_ast007_care_home_rate.csv
           --input-list output/joined/measure_ast007_ethnicity_rate.csv
           --input-list output/joined/measure_ast007_imd_rate.csv
           --input-list output/joined/measure_ast007_learning_disability_rate.csv
           --input-list output/joined/measure_ast007_region_rate.csv
           --input-list output/joined/measure_ast007_sex_rate.csv
           --output-dir output/joined/summary
           --output-name "measure_ast007.csv"
      needs: [generate_measures_ast007]
      outputs:
        moderately_sensitive:
          # Only output the single summary file
          measure_csv: output/joined/summary/measure_ast007.csv     

  # calculate_rates_ast007:
  #     run: python:latest python analysis/rate_calculations_ast007.py
  #     needs: [join_ethnicity_ast007, generate_measures_ast007]
  #     outputs:
  #       moderately_sensitive:
  #         tables: output/rate_table_*.csv
  #         plots: output/plot_*.png
  #         decile_chart: output/decile_chart.png

  #############################
  # Plotting
  #############################
  generate_qof_deciles_charts:
    run: >
            deciles-charts:v0.0.15
            --input-files output/joined/measure_*_practice_rate.csv
            --output-dir output/joined
    config:
      show_outer_percentiles: false
      tables:
        output: true
      charts:
        output: true
    needs: [generate_measures_ast007]
    outputs:
      moderately_sensitive:
        cohort: output/joined/deciles_*_*.*

  generate_qof_groups:
    run: >
            python:latest python analysis/group_charts.py
            --input-files output/joined/measure_ast007_*.csv
            --output-dir output/joined
            --date-lines "2019-03-31" "2020-03-31" "2021-03-31"
            --scale "percentage"
    needs: [generate_measures_ast007]
    outputs:
      moderately_sensitive:
        cohort: output/joined/group_chart_*.png

  generate_all_breakdowns:
    run: >
            python:latest python analysis/panel_plots.py
            --input-file output/joined/summary/measure_ast007.csv
            --measures-pattern "*_rate"
            --output-dir output/joined/summary
            --output-name "asthma_ast007_by_demographic_group"
            --date-lines "2019-03-31" "2020-03-31" "2021-03-31"
            --scale "percentage"
    needs: [join_measures_ast007]
    outputs:
      moderately_sensitive:
        cohort: output/joined/summary/asthma_ast007_by_demographic_group.png

  generate_notebook_ast007:
    run: jupyter:latest jupyter nbconvert /workspace/analysis/Table1_notebook.ipynb --execute --to html --output-dir=/workspace/output --ExecutePreprocessor.timeout=86400 --no-input
    needs: [join_measures_ast007]
    outputs:
      moderately_sensitive:
        notebook: output/Table1_notebook.html


# Testing Asthma variables
  # generate_study_population_ast007_test_variables:
  #   run: cohortextractor:latest generate_cohort 
  #       --study-definition study_definition_ast007_test_variables 
  #       --index-date-range "2018-03-01 to 2020-03-31 by month" 
  #       --output-dir=output/test
  #   outputs:
  #     highly_sensitive:
  #       cohort: output/test/input_ast007_test*.csv

  # generate_measures_ast007_test:
  #     run: cohortextractor:latest generate_measures 
  #       --study-definition study_definition_ast007_test_variables 
  #       --output-dir=output/test
  #     needs: [generate_study_population_ast007_test_variables]
  #     outputs:
  #       moderately_sensitive:
  #         measure_csv: output/test/measure_*_rate.csv
  
 # Check with separate analysis script 
  # check_data:
  #   run: r:latest analysis/indicator_dates_testing.py
  #   outputs:
  #     moderately_sensitive:
  #       table_csv: output/test/test.csv

Timeline

  • Created:

  • Started:

  • Finished:

  • Runtime: 22:03:37

These timestamps are generated and stored using the UTC timezone on the TPP backend.

Job request

Status
Succeeded
Backend
TPP
Workspace
asthma_indicators
Requested by
Rose Higgins
Branch
main
Force run dependencies
Yes
Git commit hash
dd213d2
Requested actions
  • run_all

Code comparison

Compare the code used in this job request