Job request: 10080
- Organisation:
- Bennett Institute
- Workspace:
- mab-av-non-user-main
- ID:
- c65554324ufas7mw
This page shows the technical details of what happened when the authorised researcher John Tazare requested one or more actions to be run against real patient data within a secure environment.
By cross-referencing the list of jobs with the pipeline section below, you can infer what security level the outputs were written to.
The output security levels are:
-
highly_sensitive
- Researchers can never directly view these outputs
- Researchers can only request code is run against them
-
moderately_sensitive
- Can be viewed by an approved researcher by logging into a highly secure environment
- These are the only outputs that can be requested for public release via a controlled output review service.
Jobs
-
- Job identifier:
-
kdkvd2yqvac4gsv2
-
- Job identifier:
-
hnsbpxjyrctlnr5w
-
- Job identifier:
-
zrmpyqrlgspvvll7
-
- Job identifier:
-
edxsbd4m727gn5qd
-
- Job identifier:
-
c4iu4bfwau2jj7jd
-
- Job identifier:
-
fna6qjrzpwemeiyh
Pipeline
Show project.yaml
version: '3.0'
expectations:
population_size: 100000
actions:
generate_study_population:
run: cohortextractor:latest generate_cohort --study-definition study_definition --output-format=csv.gz
outputs:
highly_sensitive:
cohort: output/input.csv.gz
data_process:
run: r:latest analysis/data_process.R
needs: [generate_study_population]
outputs:
highly_sensitive:
data1: output/data/data_processed_day0.rds
data2: output/data/data_processed_day5.rds
data_properties_process_day0:
run: r:latest analysis/data_properties.R output/data/data_processed_day0.rds output/data_properties
needs: [data_process]
outputs:
moderately_sensitive:
txt1: output/data_properties/data_processed_day0_skim.txt
txt2: output/data_properties/data_processed_day0_coltypes.txt
txt3: output/data_properties/data_processed_day0_tabulate.txt
data_properties_process_day5:
run: r:latest analysis/data_properties.R output/data/data_processed_day5.rds output/data_properties
needs: [data_process]
outputs:
moderately_sensitive:
txt1: output/data_properties/data_processed_day5_skim.txt
txt2: output/data_properties/data_processed_day5_coltypes.txt
txt3: output/data_properties/data_processed_day5_tabulate.txt
table_1_day5:
run: r:latest analysis/table_1.R day5
needs: [data_process]
outputs:
highly_sensitive:
data: output/tables/table1_redacted_day5.rds
moderately_sensitive:
table: output/tables/table1_redacted_day5.html
cox_day5:
run: r:latest analysis/cox_models.R day5
needs: [data_process]
outputs:
highly_sensitive:
data: output/tables/cox_models_day5.rds
moderately_sensitive:
table1: output/tables/cox_models_day5.csv
table2: output/tables/log_cox_models_day5.csv
Timeline
-
Created:
-
Started:
-
Finished:
-
Runtime: 04:17:03
These timestamps are generated and stored using the UTC timezone on the TPP backend.
Job request
- Status
-
Succeeded
- Backend
- TPP
- Workspace
- mab-av-non-user-main
- Requested by
- John Tazare
- Branch
- main
- Force run dependencies
- Yes
- Git commit hash
- 6917fff
- Requested actions
-
-
generate_study_population -
data_process -
data_properties_process_day0 -
data_properties_process_day5 -
table_1_day5 -
cox_day5 -
run_all
-
Code comparison
Compare the code used in this job request