Job request: 20538

Organisation:: University of Bristol
Workspace:: vaccine-counts
ID:: bqu74drz6yfsiych

This page shows the technical details of what happened when the authorised researcher Ed Parker requested one or more actions to be run against real patient data within a secure environment.

By cross-referencing the list of jobs with the pipeline section below, you can infer what security level the outputs were written to.

The output security levels are:

highly_sensitive
- Researchers can never directly view these outputs
- Researchers can only request code is run against them
moderately_sensitive
- Can be viewed by an approved researcher by logging into a highly secure environment
- These are the only outputs that can be requested for public release via a controlled output review service.

Jobs

Action:

report_snapshot

Status:

Status: Succeeded

Job identifier:

k7d5sx3icuzumxom
Action:

html_snapshot

Status:

Status: Succeeded

Job identifier:

2juv3kb7e2n3k2vs

Pipeline

Show project.yaml

version: '3.0'

expectations:
  population_size: 10000

actions:

  extract_fixed:
    run: ehrql:v0 generate-dataset analysis/dataset_definition_fixed.py
      --output output/extracts/extract_fixed.arrow
      --dummy-data-file lib/dummydata/dummyinput_fixed.arrow
    outputs:
      highly_sensitive:
        cohort: output/extracts/extract_fixed.arrow

  extract_varying:
    run: ehrql:v0 generate-dataset analysis/dataset_definition_varying.py
      --output output/extracts/extract_varying.arrow
      --dummy-data-file lib/dummydata/dummyinput_varying.arrow
    outputs:
      highly_sensitive:
        cohort: output/extracts/extract_varying.arrow


  process:
    run: r:latest analysis/process.R
    needs: [extract_fixed, extract_varying]
    outputs:
      highly_sensitive:
        rds: output/process/*.rds

  report:
    run: r:latest analysis/report.R
    needs: [process]
    outputs:
      moderately_sensitive:
        csv: output/report/*.csv
        png: output/report/*.png

  # Additional actions for snapshot
  extract_snapshot:
    run: >
      cohortextractor:latest generate_cohort
        --study-definition study_definition_snapshot
        --skip-existing
        --output-format=csv.gz
    outputs:
      highly_sensitive:
        cohort: output/input_snapshot.csv.gz

  process_snapshot:
    run: r:latest analysis/snapshot_process.R
    needs: [extract_snapshot]
    outputs:
      highly_sensitive:
        rds: output/snapshot/processed_snapshot.rds

  report_snapshot:
    run: r:latest analysis/snapshot_report.R
    needs: [process_snapshot]
    outputs:
      highly_sensitive:
        rds: output/snapshot_report/snapshot_summary.rds
      moderately_sensitive:
        csv: output/snapshot_report/snapshot_summary.csv
    
  html_snapshot:
    run: r:latest -e 'rmarkdown::render("analysis/vaccine_snapshot_report.Rmd", knit_root_dir = "/workspace", output_dir="/workspace/output/snapshot_report")'
    needs: [process_snapshot, report_snapshot]
    outputs:
      moderately_sensitive:
        html: output/snapshot_report/vaccine_snapshot_report.html

Timeline

Created: 2 years, 5 months ago 27 Sep 2023 16:22:31 UTC
Started: 2 years, 5 months ago 27 Sep 2023 16:22:03 UTC
Finished: 2 years, 5 months ago 27 Sep 2023 16:31:24 UTC
Runtime: 00:09:18

These timestamps are generated and stored using the UTC timezone on the TPP backend.

Job request

Status: Succeeded
Backend: TPP
Workspace: vaccine-counts
Requested by: Ed Parker
Branch: main
Force run dependencies: No
Git commit hash: fa1b45a
Requested actions: report_snapshot

html_snapshot

Code comparison

Compare the code used in this job request