Job request: 20292
- Organisation:
 - The London School of Hygiene & Tropical Medicine
 - Workspace:
 - covid_risk_immunocompromised
 - ID:
 - lisdtn26i3p3tvo2
 
This page shows the technical details of what happened when the authorised researcher Ed Parker requested one or more actions to be run against real patient data within a secure environment.
By cross-referencing the list of jobs with the pipeline section below, you can infer what security level the outputs were written to.
The output security levels are:
- 
                highly_sensitive
                
- Researchers can never directly view these outputs
 - Researchers can only request code is run against them
 
 - 
                moderately_sensitive
                
- Can be viewed by an approved researcher by logging into a highly secure environment
 - These are the only outputs that can be requested for public release via a controlled output review service.
 
 
Jobs
- 
                
- Job identifier:
 - 
                    
                    
ukxp5dekz46pzr74 
 - 
                
- Job identifier:
 - 
                    
                    
yuswg2n4ex3shrlc 
 - 
                
- Job identifier:
 - 
                    
                    
652tgy7hmzg35k3a 
 - 
                
- Job identifier:
 - 
                    
                    
iu37rmr7ukulroya - Error:
 - nonzero_exit: Job exited with an error
 
 
Pipeline
Show project.yaml
version: '3.0'
expectations:
  population_size: 10000
actions:
  # Extract data
  generate_study_population_wave4:
    run: >
      cohortextractor:latest generate_cohort
        --study-definition study_definition_wave4
        --skip-existing
        --output-format=csv.gz
    outputs:
      highly_sensitive:
        cohort: output/input_wave4.csv.gz
  # Process data
  process_data_wave4:
    run: r:latest analysis/data_process.R wave4
    needs: [generate_study_population_wave4]
    outputs:
      highly_sensitive:
        rds: output/processed/input_wave4.rds
        
  # Skim data
  skim_data_wave4:
    run: r:latest analysis/data_skim.R output/processed/input_wave4.rds output/data_properties
    needs: [process_data_wave4]
    outputs:
      moderately_sensitive:
        txt1: output/data_properties/input_wave4_skim.txt
        txt2: output/data_properties/input_wave4_coltypes.txt
        txt3: output/data_properties/input_wave4_tabulate.txt
        
  # Filter data
  filter_data_wave4:
    run: r:latest analysis/data_selection.R wave4
    needs: [process_data_wave4]
    outputs:
      highly_sensitive:
        rds: output/filtered/input_wave4.rds
      moderately_sensitive:
        csv: output/flowchart/flowchart_wave4.csv
  # Table 1
  table_1_wave4:
    run: r:latest analysis/table_1.R wave4
    needs: [filter_data_wave4]
    outputs:
      highly_sensitive:
        data: output/table_1/table_1_wave4.rds
      moderately_sensitive:
        table: output/tables/table_1_wave4.html
Timeline
- 
  
    
  
  
Created:
 - 
  
    
  
  
Started:
 - 
  
    
  
  
Finished:
 - 
  
  
Runtime: 00:03:27
 
These timestamps are generated and stored using the UTC timezone on the TPP backend.
Job request
- Status
 - 
            Failed
 - Backend
 - TPP
 - Workspace
 - covid_risk_immunocompromised
 - Requested by
 - Ed Parker
 - Branch
 - main
 - Force run dependencies
 - No
 - Git commit hash
 - 0a22775
 - Requested actions
 - 
            
- 
                  
process_data_wave4 - 
                  
skim_data_wave4 - 
                  
filter_data_wave4 - 
                  
table_1_wave4 
 - 
                  
 
Code comparison
Compare the code used in this job request