Skip to content

Job request: 1069

Organisation:
The London School of Hygiene & Tropical Medicine
Workspace:
post-covid-research
ID:
pgu2gyvcsvf4js5w

This page shows the technical details of what happened when the authorised researcher John Tazare requested one or more actions to be run against real patient data within a secure environment.

By cross-referencing the list of jobs with the pipeline section below, you can infer what security level the outputs were written to.

The output security levels are:

  • highly_sensitive
    • Researchers can never directly view these outputs
    • Researchers can only request code is run against them
  • moderately_sensitive
    • Can be viewed by an approved researcher by logging into a highly secure environment
    • These are the only outputs that can be requested for public release via a controlled output review service.

Jobs

  • Action:
    generate_covid_cohort
    Status:
    Succeeded
    Job identifier:
    vjak5wbj5hex4sqx
  • Action:
    generate_covid_community_cohort
    Status:
    Succeeded
    Job identifier:
    n3nlpmn6324lbp57
  • Action:
    generate_covid_general_population_cohort
    Status:
    Succeeded
    Job identifier:
    at2pw5wizjdzgv5b
  • Action:
    generate_pneumonia_cohort
    Status:
    Succeeded
    Job identifier:
    xbu7a2wu6nngqjsr
  • Action:
    covid_rates_cohort
    Status:
    Succeeded
    Job identifier:
    eaqs2viyerijacod
  • Action:
    covid_community_rates_cohort
    Status:
    Succeeded
    Job identifier:
    qvyuxj2aesyh5jh7
  • Action:
    covid_rates
    Status:
    Succeeded
    Job identifier:
    wi6hjosr5ubmgfpl
  • Action:
    pneumonia_rates_cohort
    Status:
    Succeeded
    Job identifier:
    x5gjl6vnrguv6to7
  • Action:
    covid_comm_rates
    Status:
    Succeeded
    Job identifier:
    pwwfwbjaku6fxhs3
  • Action:
    pneumonia_rates
    Status:
    Succeeded
    Job identifier:
    e7ijqty6tckfi3to
  • Action:
    matching
    Status:
    Failed
    Job identifier:
    o5n5ldztixptghas
    Error:
    nonzero_exit: Job exited with an error code
  • Action:
    baseline_characteristics
    Status:
    Failed
    Job identifier:
    xsnvcrynbyyentsn
    Error:
    dependency_failed: Not starting as dependency failed
  • Action:
    gen_pop_rates_cohort
    Status:
    Failed
    Job identifier:
    p6fuxp3m6bxtqihj
    Error:
    dependency_failed: Not starting as dependency failed
  • Action:
    gen_pop_rates
    Status:
    Failed
    Job identifier:
    pklhutwcj7b5hote
    Error:
    dependency_failed: Not starting as dependency failed
  • Action:
    append_cohorts
    Status:
    Failed
    Job identifier:
    lt4hhtksjxs3ycxv
    Error:
    dependency_failed: Not starting as dependency failed
  • Action:
    cox_models
    Status:
    Failed
    Job identifier:
    fwgmjsh2kkjwtnsn
    Error:
    dependency_failed: Not starting as dependency failed

Pipeline

Show project.yaml
version: "3.0"

expectations:
  population_size: 20000

actions:
  generate_covid_cohort:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_covid
    outputs:
      highly_sensitive:
        cohort: output/input_covid.csv

  generate_covid_community_cohort:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_covid_community
    outputs:
      highly_sensitive:
        cohort: output/input_covid_community.csv

  generate_covid_general_population_cohort:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_general_population
    outputs:
      highly_sensitive:
        cohort: output/input_general_population.csv

  generate_pneumonia_cohort:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_pneumonia
    outputs:
      highly_sensitive:
        cohort: output/input_pneumonia.csv

  matching:
    run: python:latest python analysis/match_running.py
    needs: [generate_covid_cohort, generate_covid_general_population_cohort]
    outputs:
      moderately_sensitive: 
        matching_report: output/matching_report_general_population.txt  
      highly_sensitive: 
        combined: output/matched_combined_general_population.csv

  covid_rates_cohort:
    run: stata-mp:latest analysis/000_cr_define_covariates_simple_rates.do "covid"
    needs: [generate_covid_cohort]
    outputs:
      highly_sensitive:
        analysis_dataset: output/cohort_rates_covid.dta

  covid_community_rates_cohort:
    run: stata-mp:latest analysis/000_cr_define_covariates_simple_rates.do "covid_community"
    needs: [generate_covid_community_cohort]
    outputs:
      highly_sensitive:
        analysis_dataset: output/cohort_rates_covid_community.dta

  pneumonia_rates_cohort:
    run: stata-mp:latest analysis/000_cr_define_covariates_simple_rates.do "pneumonia"
    needs: [generate_pneumonia_cohort]
    outputs:
      highly_sensitive:
        analysis_dataset: output/cohort_rates_pneumonia.dta

  gen_pop_rates_cohort:
    run: stata-mp:latest analysis/000_cr_define_covariates_simple_rates.do "matched_combined_general_population"
    needs: [matching]
    outputs:
      highly_sensitive:
        analysis_dataset: output/cohort_rates_gen_population.dta

  covid_rates:
    run: stata-mp:latest analysis/201_cr_simple_rates.do "covid"
    needs: [covid_rates_cohort]
    outputs:
      moderately_sensitive:
        rates: output/tabfig/rates_summary_covid.csv

  covid_comm_rates:
    run: stata-mp:latest analysis/201_cr_simple_rates.do "covid_community"
    needs: [covid_community_rates_cohort]
    outputs:
      moderately_sensitive:
        rates: output/tabfig/rates_summary_covid_community.csv

  pneumonia_rates:
    run: stata-mp:latest analysis/201_cr_simple_rates.do "pneumonia"
    needs: [pneumonia_rates_cohort]
    outputs:
      moderately_sensitive:
        rates: output/tabfig/rates_summary_pneumonia.csv

  gen_pop_rates:
    run: stata-mp:latest analysis/201_cr_simple_rates.do "gen_population"
    needs: [gen_pop_rates_cohort]
    outputs:
      moderately_sensitive:
        rates: output/tabfig/rates_summary_gen_population.csv

  baseline_characteristics:
    run: stata-mp:latest analysis/400_baseline_characteristics.do
    needs: [covid_rates_cohort, covid_community_rates_cohort, pneumonia_rates_cohort, gen_pop_rates_cohort]
    outputs:
      moderately_sensitive:
        tables: output/tabfig/an_descriptiveTable_*.txt

  append_cohorts:
    run: stata-mp:latest analysis/300_cr_data_management_matching.do
    needs: [covid_rates_cohort, pneumonia_rates_cohort, gen_pop_rates_cohort]
    outputs:
      moderately_sensitive:
        log: output/append_cohorts.txt
      highly_sensitive: 
        dataset: output/combined_covid_pneumonia.dta
        dataset2: output/combined_covid_gen_population.dta

  cox_models:
    run: stata-mp:latest analysis/302_cox_models.do
    needs: [append_cohorts]
    outputs:
      moderately_sensitive:
        log: output/cox_models.txt
        dataset: output/tabfig/cox_model_summary.csv

Timeline

  • Created:

  • Started:

  • Finished:

  • Runtime: 04:13:41

These timestamps are generated and stored using the UTC timezone on the TPP backend.

Job request

Status
Failed
Backend
TPP
Requested by
John Tazare
Branch
master
Force run dependencies
Yes
Git commit hash
63c6da5
Requested actions
  • run_all

Code comparison

Compare the code used in this job request

  • No previous job request available for comparison