Job request: 11388

Organisation: University of Manchester
Workspace: broad-spectrum-its
ID: faw2ykgp5clp5ffe

This page shows the technical details of what happened when the authorised researcher Billy Zhong requested one or more actions to be run against real patient data within a secure environment.

By cross-referencing the list of jobs with the pipeline section below, you can infer the security level at which each output was written.

The output security levels are:

  • highly_sensitive
    • Researchers can never directly view these outputs
    • Researchers can only request that code be run against them
  • moderately_sensitive
    • Approved researchers can view these outputs by logging into a highly secure environment
    • These are the only outputs that can be requested for public release via a controlled output review service.
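
The cross-referencing described above can be sketched in Python. This is only an illustration: the `PIPELINE` dictionary is a hand-copied subset of the actions in the pipeline on this page, and the `output_levels` helper is a hypothetical name, not part of any OpenSAFELY tool.

```python
# Hand-copied subset of the pipeline's actions and their declared outputs
# (an assumption for illustration; the full pipeline has more actions).
PIPELINE = {
    "generate_study_population": {
        "highly_sensitive": {"cohort": "output/measures/input_*.csv.gz"},
    },
    "table_1": {
        "moderately_sensitive": {
            "table1": "output/table1_2019_overallcount.csv",
            "table2": "output/table1_2019.csv",
        },
    },
}


def output_levels(action):
    """Return {output path pattern: security level} for one action."""
    outputs = PIPELINE[action]
    return {
        path: level
        for level, named_paths in outputs.items()
        for path in named_paths.values()
    }


# Look up the security level of every output of the requested action.
for action in ["generate_study_population"]:
    for path, level in output_levels(action).items():
        print(f"{action}: {path} -> {level}")
```

For this job request, the only requested action is `generate_study_population`, so every output it declares falls under `highly_sensitive`.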

Jobs

Pipeline

project.yaml
version: '3.0'

expectations:
  population_size: 30000

actions:

  generate_study_population_patient_characteristics_2019:
    run: cohortextractor:latest generate_cohort
      --with-end-date-fix
      --study-definition study_definition_patient_characteristics_2019
      --output-format=csv.gz
    outputs:
      highly_sensitive:
        cohort: output/input_patient_characteristics_2019.csv.gz

  generate_study_population_patient_characteristics_2020:
    run: cohortextractor:latest generate_cohort
      --with-end-date-fix
      --study-definition study_definition_patient_characteristics_2020
      --output-format=csv.gz
    outputs:
      highly_sensitive:
        cohort: output/input_patient_characteristics_2020.csv.gz

  generate_study_population_patient_characteristics_2021:
    run: cohortextractor:latest generate_cohort
      --with-end-date-fix
      --study-definition study_definition_patient_characteristics_2021
      --output-format=csv.gz
    outputs:
      highly_sensitive:
        cohort: output/input_patient_characteristics_2021.csv.gz

  table_1:
    run: r:latest analysis/table_1.R
    needs: [generate_study_population_patient_characteristics_2019]
    outputs:
      moderately_sensitive:
        table1: output/table1_2019_overallcount.csv
        table2: output/table1_2019.csv

  table_1_2020:
    run: r:latest analysis/table_1_2020.R
    needs: [generate_study_population_patient_characteristics_2020]
    outputs:
      moderately_sensitive:
        table1: output/table1_2020_overallcount.csv
        table2: output/table1_2020.csv

  table_1_2021:
    run: r:latest analysis/table_1_2021.R
    needs: [generate_study_population_patient_characteristics_2021]
    outputs:
      moderately_sensitive:
        table1: output/table1_2021_overallcount.csv
        table2: output/table1_2021.csv

  generate_study_population:
    run: cohortextractor:latest generate_cohort
      --with-end-date-fix
      --study-definition study_definition 
      --index-date-range "2019-01-01 to 2021-12-31 by month" 
      --skip-existing 
      --output-dir=output/measures 
      --output-format=csv.gz
    outputs:
      highly_sensitive:
        cohort: output/measures/input_*.csv.gz
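
The `--index-date-range "2019-01-01 to 2021-12-31 by month"` option in the `generate_study_population` action asks for one extract per monthly index date. As a rough sketch of what that range covers, the Python below enumerates the dates. The `monthly_index_dates` helper is hypothetical and is not cohortextractor's own code, and the `input_<date>.csv.gz` file names are an assumption based on the `input_*.csv.gz` output pattern.

```python
from datetime import date


def monthly_index_dates(start, end):
    """Yield dates one month apart from start up to end, inclusive.

    Assumes start falls on a day that exists in every month (here the 1st).
    """
    current = start
    while current <= end:
        yield current
        # Step forward one calendar month, rolling over the year after December.
        year = current.year + current.month // 12
        month = current.month % 12 + 1
        current = date(year, month, current.day)


dates = list(monthly_index_dates(date(2019, 1, 1), date(2021, 12, 31)))

# Assumed naming: one file per index date under output/measures.
filenames = [f"output/measures/input_{d.isoformat()}.csv.gz" for d in dates]

print(len(dates), dates[0], dates[-1])
```

Under these assumptions the range covers 36 index dates, from 2019-01-01 through 2021-12-01.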

Timeline

  • Created:

  • Started:

  • Finished:

  • Runtime: 19:13:11

These timestamps are generated and stored using the UTC timezone on the TPP backend.

Job request

Status: Succeeded
Backend: TPP
Workspace: broad-spectrum-its
Requested by: Billy Zhong
Branch: broad_ITS
Force run dependencies: No
Git commit hash: a50ef88
Requested actions:
  • generate_study_population
