Job request: 24047
- Organisation:
- Bennett Institute
- Workspace:
- covid-vaccine-history
- ID:
- 3mdcm4ij4gvvnh2g
This page shows the technical details of what happened when the authorised researcher Will Hulme requested one or more actions to be run against real patient data within a secure environment.
By cross-referencing the list of jobs with the pipeline section below, you can infer what security level the outputs were written to.
The output security levels are:
-
highly_sensitive
- Researchers can never directly view these outputs
- Researchers can only request code is run against them
-
moderately_sensitive
- Can be viewed by an approved researcher by logging into a highly secure environment
- These are the only outputs that can be requested for public release via a controlled output review service.
Jobs
-
- Job identifier:
-
xhskelwjy35cdhfm
-
- Job identifier:
-
qhx4dwnamtpy3duz
-
- Job identifier:
-
fd63v4ezsfnilpzj
-
- Job identifier:
-
caholijcxtc52iyd
-
- Job identifier:
-
qlss5lgt2u5qwy76
-
- Job identifier:
-
xhun2sdjcncye3il
-
- Job identifier:
-
tvfe4yh2xbp5ft6m
-
- Job identifier:
-
472ksl7nt645qwep
Pipeline
Show project.yaml
version: '3.0'
expectations:
population_size: 10000
actions:
extract_fixed:
run: ehrql:v1 generate-dataset analysis/dataset_definition_fixed.py
--output output/extracts/extract_fixed.arrow
--dummy-data-file lib/dummydata/dummyinput_fixed.arrow
outputs:
highly_sensitive:
cohort: output/extracts/extract_fixed.arrow
extract_varying:
run: ehrql:v1 generate-dataset analysis/dataset_definition_varying.py
--output output/extracts/extract_varying.arrow
#--dummy-data-file lib/dummydata/dummyinput_varying.arrow
outputs:
highly_sensitive:
cohort: output/extracts/extract_varying.arrow
process:
run: r:latest analysis/process.R
needs: [extract_fixed, extract_varying]
outputs:
highly_sensitive:
rds: output/process/*.rds
moderately_sensitive:
csv: output/process/*.csv
txt: output/process/*.txt
# report vaccine history over entire observational period ---
report_history:
run: r:latest analysis/report_history.R
needs: [process]
outputs:
moderately_sensitive:
csv: output/report_history/*.csv
png: output/report_history/*.png
# report detailed vaccine history at specific snapshots in time ----
# 2020-12-08
extract_snapshot_20201208:
run: ehrql:v1 generate-dataset analysis/dataset_definition_snapshot.py
--output output/extracts/extract_snapshot_20201208.arrow
-- 20201208
outputs:
highly_sensitive:
cohort: output/extracts/extract_snapshot_20201208.arrow
report_snapshot_20201208:
run: r:latest analysis/report_snapshot.R 20201208
needs: [extract_snapshot_20201208, process]
outputs:
moderately_sensitive:
csv: output/report_snapshot_20201208/*.csv
png: output/report_snapshot_20201208/*.png
txt: output/report_snapshot_20201208/*.txt
# 2021-09-06
extract_snapshot_20210906:
run: ehrql:v1 generate-dataset analysis/dataset_definition_snapshot.py
--output output/extracts/extract_snapshot_20210906.arrow
-- 20210906
outputs:
highly_sensitive:
cohort: output/extracts/extract_snapshot_20210906.arrow
report_snapshot_20210906:
run: r:latest analysis/report_snapshot.R 20210906
needs: [extract_snapshot_20210906, process]
outputs:
moderately_sensitive:
csv: output/report_snapshot_20210906/*.csv
png: output/report_snapshot_20210906/*.png
txt: output/report_snapshot_20210906/*.txt
# 2022-04-01
extract_snapshot_20220401:
run: ehrql:v1 generate-dataset analysis/dataset_definition_snapshot.py
--output output/extracts/extract_snapshot_20220401.arrow
-- 20220401
outputs:
highly_sensitive:
cohort: output/extracts/extract_snapshot_20220401.arrow
report_snapshot_20220401:
run: r:latest analysis/report_snapshot.R 20220401
needs: [extract_snapshot_20220401, process]
outputs:
moderately_sensitive:
csv: output/report_snapshot_20220401/*.csv
png: output/report_snapshot_20220401/*.png
txt: output/report_snapshot_20220401/*.txt
# 2022-09-12
extract_snapshot_20220912:
run: ehrql:v1 generate-dataset analysis/dataset_definition_snapshot.py
--output output/extracts/extract_snapshot_20220912.arrow
-- 20220912
outputs:
highly_sensitive:
cohort: output/extracts/extract_snapshot_20220912.arrow
report_snapshot_20220912:
run: r:latest analysis/report_snapshot.R 20220912
needs: [extract_snapshot_20220912, process]
outputs:
moderately_sensitive:
csv: output/report_snapshot_20220912/*.csv
png: output/report_snapshot_20220912/*.png
txt: output/report_snapshot_20220912/*.txt
# 2023-04-03
extract_snapshot_20230403:
run: ehrql:v1 generate-dataset analysis/dataset_definition_snapshot.py
--output output/extracts/extract_snapshot_20230403.arrow
-- 20230403
outputs:
highly_sensitive:
cohort: output/extracts/extract_snapshot_20230403.arrow
report_snapshot_20230403:
run: r:latest analysis/report_snapshot.R 20230403
needs: [extract_snapshot_20230403, process]
outputs:
moderately_sensitive:
csv: output/report_snapshot_20230403/*.csv
png: output/report_snapshot_20230403/*.png
txt: output/report_snapshot_20230403/*.txt
# 2023-09-11
extract_snapshot_20230911:
run: ehrql:v1 generate-dataset analysis/dataset_definition_snapshot.py
--output output/extracts/extract_snapshot_20230911.arrow
-- 20230911
outputs:
highly_sensitive:
cohort: output/extracts/extract_snapshot_20230911.arrow
report_snapshot_20230911:
run: r:latest analysis/report_snapshot.R 20230911
needs: [extract_snapshot_20230911, process]
outputs:
moderately_sensitive:
csv: output/report_snapshot_20230911/*.csv
png: output/report_snapshot_20230911/*.png
txt: output/report_snapshot_20230911/*.txt
# 2024-04-01
extract_snapshot_20240401:
run: ehrql:v1 generate-dataset analysis/dataset_definition_snapshot.py
--output output/extracts/extract_snapshot_20240401.arrow
-- 20240401
outputs:
highly_sensitive:
cohort: output/extracts/extract_snapshot_20240401.arrow
report_snapshot_20240401:
run: r:latest analysis/report_snapshot.R 20240401
needs: [extract_snapshot_20240401, process]
outputs:
moderately_sensitive:
csv: output/report_snapshot_20240401/*.csv
png: output/report_snapshot_20240401/*.png
txt: output/report_snapshot_20240401/*.txt
# 2024-10-03
extract_snapshot_20241003:
run: ehrql:v1 generate-dataset analysis/dataset_definition_snapshot.py
--output output/extracts/extract_snapshot_20241003.arrow
-- 20241003
outputs:
highly_sensitive:
cohort: output/extracts/extract_snapshot_20241003.arrow
report_snapshot_20241003:
run: r:latest analysis/report_snapshot.R 20241003
needs: [extract_snapshot_20241003, process]
outputs:
moderately_sensitive:
csv: output/report_snapshot_20241003/*.csv
png: output/report_snapshot_20241003/*.png
txt: output/report_snapshot_20241003/*.txt
Timeline
-
Created:
-
Started:
-
Finished:
-
Runtime: 133:20:40
These timestamps are generated and stored using the UTC timezone on the TPP backend.
Job request
- Status
-
Succeeded
- Backend
- TPP
- Workspace
- covid-vaccine-history
- Requested by
- Will Hulme
- Branch
- main
- Force run dependencies
- No
- Git commit hash
- bb1fdb9
- Requested actions
-
-
report_snapshot_20201208 -
report_snapshot_20210906 -
report_snapshot_20220401 -
report_snapshot_20220912 -
report_snapshot_20230403 -
report_snapshot_20230911 -
report_snapshot_20240401 -
report_snapshot_20241003
-
Code comparison
Compare the code used in this job request