Skip to content

Job request: 15156

Organisation:
University of Bristol
Workspace:
post-covid-respiratory
ID:
6pkfmiyeky6laiqo

This page shows the technical details of what happened when the authorised researcher Rachel Denholm requested one or more actions to be run against real patient data within a secure environment.

By cross-referencing the list of jobs with the pipeline section below, you can infer what security level the outputs were written to.

The output security levels are:

  • highly_sensitive
    • Researchers can never directly view these outputs
    • Researchers can only request code is run against them
  • moderately_sensitive
    • Can be viewed by an approved researcher by logging into a highly secure environment
    • These are the only outputs that can be requested for public release via a controlled output review service.

Pipeline

Show project.yaml
version: '3.0'

expectations:

  population_size: 10000

actions:

  ## # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # 
  ## DO NOT EDIT project.yaml DIRECTLY 
  ## This file is created by create_project_actions.R 
  ## Edit and run create_project_actions.R to update the project.yaml 
  ## # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # # 

  vax_eligibility_inputs:
    run: r:latest analysis/metadates.R
    outputs:
      highly_sensitive:
        study_dates_json: output/study_dates.json
        vax_jcvi_groups: output/vax_jcvi_groups.csv
        vax_eligible_dates: output/vax_eligible_dates.csv

  generate_study_population_prelim:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_prelim
      --output-format feather
    needs:
    - vax_eligibility_inputs
    outputs:
      highly_sensitive:
        cohort: output/input_prelim.feather

  generate_index_dates:
    run: r:latest analysis/prelim.R
    needs:
    - vax_eligibility_inputs
    - generate_study_population_prelim
    outputs:
      highly_sensitive:
        index_dates: output/index_dates.csv

  generate_study_population_prevax:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_prevax
      --output-format csv.gz
    needs:
    - vax_eligibility_inputs
    - generate_index_dates
    outputs:
      highly_sensitive:
        cohort: output/input_prevax.csv.gz

  generate_study_population_vax:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_vax
      --output-format csv.gz
    needs:
    - generate_index_dates
    - vax_eligibility_inputs
    outputs:
      highly_sensitive:
        cohort: output/input_vax.csv.gz

  generate_study_population_unvax:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_unvax
      --output-format csv.gz
    needs:
    - vax_eligibility_inputs
    - generate_index_dates
    outputs:
      highly_sensitive:
        cohort: output/input_unvax.csv.gz

  preprocess_data_prevax:
    run: r:latest analysis/preprocess/preprocess_data.R prevax
    needs:
    - generate_index_dates
    - generate_study_population_prevax
    outputs:
      moderately_sensitive:
        describe: output/not-for-review/describe_input_prevax_stage0.txt
        describe_venn: output/not-for-review/describe_venn_prevax.txt
      highly_sensitive:
        cohort: output/input_prevax.rds
        venn: output/venn_prevax.rds

  preprocess_data_vax:
    run: r:latest analysis/preprocess/preprocess_data.R vax
    needs:
    - generate_index_dates
    - generate_study_population_vax
    outputs:
      moderately_sensitive:
        describe: output/not-for-review/describe_input_vax_stage0.txt
        descrive_venn: output/not-for-review/describe_venn_vax.txt
      highly_sensitive:
        cohort: output/input_vax.rds
        venn: output/venn_vax.rds

  preprocess_data_unvax:
    run: r:latest analysis/preprocess/preprocess_data.R unvax
    needs:
    - generate_index_dates
    - generate_study_population_unvax
    outputs:
      moderately_sensitive:
        describe: output/not-for-review/describe_input_unvax_stage0.txt
        describe_venn: output/not-for-review/describe_venn_unvax.txt
      highly_sensitive:
        cohort: output/input_unvax.rds
        venn: output/venn_unvax.rds

  stage1_data_cleaning_all:
    run: r:latest analysis/preprocess/Stage1_data_cleaning.R all
    needs:
    - preprocess_data_prevax
    - preprocess_data_vax
    - preprocess_data_unvax
    - vax_eligibility_inputs
    outputs:
      moderately_sensitive:
        refactoring: output/not-for-review/meta_data_factors_*.csv
        QA_rules: output/review/descriptives/QA_summary_*.csv
        IE_criteria: output/review/descriptives/Cohort_flow_*.csv
        histograms: output/not-for-review/numeric_histograms_*.svg
      highly_sensitive:
        cohort: output/input_*.rds

Timeline

  • Created:

  • Started:

  • Finished:

  • Runtime:

These timestamps are generated and stored using the UTC timezone on the TPP backend.

Job request

Status
Failed
JobRequestError: stage1_data_cleaning_all failed on a previous run and must be re-run
Backend
TPP
Requested by
Rachel Denholm
Branch
main
Force run dependencies
No
Git commit hash
709a72b
Requested actions
  • run_all

Code comparison

Compare the code used in this job request