Job request: 10760

Organisation: The London School of Hygiene & Tropical Medicine
Workspace: long-covid-symptoms
ID: kig5fz5nrdtu2ipr

This page shows the technical details of what happened when the authorised researcher Kevin Wing requested one or more actions to be run against real patient data within a secure environment.

By cross-referencing the list of jobs with the pipeline section below, you can infer which security level each job's outputs were written to.

The output security levels are:

highly_sensitive
- Researchers can never directly view these outputs
- Researchers can only request that code be run against them
moderately_sensitive
- Can be viewed by an approved researcher by logging into a highly secure environment
- These are the only outputs that can be requested for public release via a controlled output review service
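
The cross-referencing described above can be sketched in Python. This is only an illustration, not part of the job server: the `PIPELINE_OUTPUTS` dictionary is hand-copied from the `outputs` blocks of the project.yaml shown in the pipeline section, and the helper name `security_levels` is hypothetical.

```python
# Output paths per action and security level, hand-copied from the
# `outputs` blocks of the project.yaml in the pipeline section.
# (Assumption: the copy matches the file at the requested commit.)
PIPELINE_OUTPUTS = {
    "generate_community_cases_cohort": {
        "highly_sensitive": ["output/input_covid_communitycases.csv"],
    },
    "generate_potential_controls_contemporary": {
        "highly_sensitive": ["output/input_controls_contemporary.csv"],
    },
    "01_longCovidSymp_check_case_control_source": {
        "moderately_sensitive": [
            "logs/01_longCovidSymp_check_case_control_source.log"
        ],
    },
    "match_contemporary": {
        "moderately_sensitive": ["output/matching_report_contemporary.txt"],
        "highly_sensitive": [
            "output/matched_cases_contemporary.csv",
            "output/matched_matches_contemporary.csv",
            "output/matched_combined_contemporary.csv",
        ],
    },
    "add_allVars_contemporary_controls": {
        "highly_sensitive": ["output/input_complete_controls_contemporary.csv"],
    },
    "add_allVars_cases": {
        "highly_sensitive": ["output/input_complete_covid_communitycases.csv"],
    },
}


def security_levels(action):
    """Return the sorted security levels an action declares (hypothetical helper)."""
    return sorted(PIPELINE_OUTPUTS[action])


# Print the declared levels for every action in the pipeline.
for action in PIPELINE_OUTPUTS:
    print(f"{action}: {', '.join(security_levels(action))}")
```

The lookup only reports the levels an action declares. A job that failed or was cancelled may not have written any of its declared outputs.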

Jobs

Action: generate_potential_controls_contemporary
Status: Failed
Job identifier: cwsycwr4pay5yalr
Error: cancelled_by_user: Cancelled by user

Action: generate_community_cases_cohort
Status: Failed
Job identifier: rozujdfh64fii2j4
Error: cancelled_by_user: Cancelled by user

Action: 01_longCovidSymp_check_case_control_source
Status: Failed
Job identifier: bmgc2cs465sewagu
Error: cancelled_by_user: Cancelled by user

Action: match_contemporary
Status: Failed
Job identifier: bsfuatqoibrbuxvm
Error: cancelled_by_user: Cancelled by user

Action: add_allVars_contemporary_controls
Status: Failed
Job identifier: 7jqrdiabdicg3p25
Error: dependency_failed: Not starting as dependency failed

Action: add_allVars_cases
Status: Failed
Job identifier: jgv5gqcwrnxced74
Error: dependency_failed: Not starting as dependency failed

Pipeline

project.yaml

version: '3.0'

expectations:
  population_size: 10000

actions:

# 1. EXTRACT DATA FOR EXPOSED
  generate_community_cases_cohort:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_covid_communitycases
    outputs:
      highly_sensitive:
        cohort: output/input_covid_communitycases.csv


# 2. EXTRACT DATA FOR POTENTIAL COMPARATORS
# new extract potential comparators code... I guess I'll have to do this twice for each control group
# or potentially 3 times: 2018 matches, 2019 matches, contemporary matches?

 # *** contemporary potential comparators
  generate_potential_controls_contemporary:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_controls_contemporary
    outputs:
      highly_sensitive:
        cohort: output/input_controls_contemporary.csv


# 3. CHECKS ON SEPARATE CASE AND CONTROL FILES - this is for looking to see if they contain per region
  01_longCovidSymp_check_case_control_source:
    run: stata-mp:latest analysis/01_longCovidSymp_check_case_control_source.do
    needs: [generate_community_cases_cohort, generate_potential_controls_contemporary]
    outputs:
      moderately_sensitive:
        log: logs/01_longCovidSymp_check_case_control_source.log



# 4. MATCH CASES TO COMPARATORS
  # *** contemporary potential comparators
  match_contemporary:
    run: python:latest python analysis/match_contemporary.py
    needs: [generate_community_cases_cohort, generate_potential_controls_contemporary]
    outputs:
      moderately_sensitive:
        matching_report: output/matching_report_contemporary.txt
      highly_sensitive:
        matched_cases: output/matched_cases_contemporary.csv
        matched_matches: output/matched_matches_contemporary.csv
        matched_all: output/matched_combined_contemporary.csv



# 5. ADD FURTHER VARIABLES TO CONTROLS FOLLOWING MATCHING
  add_allVars_contemporary_controls:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_complete_controls_contemporary
    needs: [match_contemporary]
    outputs:
      highly_sensitive:
        cohort: output/input_complete_controls_contemporary.csv



# 6. ADD FURTHER VARIABLES TO CASES FOLLOWING MATCHING
  add_allVars_cases:
    run: cohortextractor:latest generate_cohort --study-definition study_definition_complete_covid_communitycases
    needs: [match_contemporary]
    outputs:
      highly_sensitive:
        cohort: output/input_complete_covid_communitycases.csv
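
The `needs` entries above explain the two `dependency_failed` errors in the job list: both `add_allVars_*` actions depend on `match_contemporary`, which was cancelled. The Python sketch below illustrates that propagation. It is not the job runner's actual logic, which this page does not show. The `NEEDS` dictionary is hand-copied from the project.yaml above and the function name `blocked_by` is hypothetical.

```python
# `needs` edges hand-copied from the project.yaml above.
NEEDS = {
    "generate_community_cases_cohort": [],
    "generate_potential_controls_contemporary": [],
    "01_longCovidSymp_check_case_control_source": [
        "generate_community_cases_cohort",
        "generate_potential_controls_contemporary",
    ],
    "match_contemporary": [
        "generate_community_cases_cohort",
        "generate_potential_controls_contemporary",
    ],
    "add_allVars_contemporary_controls": ["match_contemporary"],
    "add_allVars_cases": ["match_contemporary"],
}


def blocked_by(failed, needs):
    """Return actions that cannot start because a direct or
    indirect dependency is in `failed` (hypothetical helper)."""
    blocked = set()
    changed = True
    # Repeat until no new action becomes blocked (fixed point).
    while changed:
        changed = False
        for action, deps in needs.items():
            if action in failed or action in blocked:
                continue
            if any(dep in failed or dep in blocked for dep in deps):
                blocked.add(action)
                changed = True
    return blocked


# Actions that cannot start once match_contemporary has failed.
print(sorted(blocked_by({"match_contemporary"}, NEEDS)))
```

Passing `{"match_contemporary"}` as the failed set yields exactly the two actions that the job list reports as `dependency_failed`.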

Timeline

Created: 09 Aug 2022 13:08:06 UTC
Started: 09 Aug 2022 13:07:21 UTC
Finished: 09 Aug 2022 13:13:27 UTC
Runtime: 00:12:08

These timestamps are generated and stored using the UTC timezone on the TPP backend.
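
Because the timestamps are in UTC, they can be parsed as timezone-aware values and subtracted directly. The sketch below is a minimal illustration: the format string is inferred from the timestamps on this page, and the helper name `parse_utc` is hypothetical. The interval it computes need not equal the Runtime field, since this page does not describe how Runtime is calculated.

```python
from datetime import datetime, timezone

# Format inferred from the timestamps on this page, e.g. "09 Aug 2022 13:07:21".
# Note: %b depends on the locale; English month abbreviations are assumed.
FMT = "%d %b %Y %H:%M:%S"


def parse_utc(text):
    """Parse a page timestamp into a timezone-aware UTC datetime (hypothetical helper)."""
    naive = datetime.strptime(text.replace(" UTC", ""), FMT)
    return naive.replace(tzinfo=timezone.utc)


started = parse_utc("09 Aug 2022 13:07:21 UTC")
finished = parse_utc("09 Aug 2022 13:13:27 UTC")

# Wall-clock interval between the Started and Finished timestamps.
elapsed = finished - started
print(elapsed.total_seconds())
```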

Job request

Status: Failed
Backend: TPP
Workspace: long-covid-symptoms
Requested by: Kevin Wing
Branch: master
Force run dependencies: Yes
Git commit hash: 295713e
Requested actions: run_all
