Skip to content

Job request: 6883

Workspace:
pancreatic_cancer
ID:
f62k5dcfgjiymuar

This page shows the technical details of what happened when authorised researcher AgzLeman requested one or more actions to be run against real patient data in the project, within a secure environment.

By cross-referencing the indicated Requested Actions with the Pipeline section below, you can infer what security level various outputs were written to. Outputs marked as highly_sensitive can never be viewed directly by a researcher; they can only request that code runs against them. Outputs marked as moderately_sensitive can be viewed by an approved researcher by logging into a highly secure environment. Only outputs marked as moderately_sensitive can be requested for release to the public, via a controlled output review service.

Jobs

Pipeline

Show project.yaml
version: '3.0'

expectations:
  population_size: 1000

actions:

  generate_study_population:
    run: cohortextractor:latest generate_cohort --study-definition study_definition --output-format=csv
    outputs:
      highly_sensitive:
        cohort: output/input.csv

  generate_denominator_1:    
    run: cohortextractor:latest generate_cohort --study-definition study_definition_denominator --index-date-range "2015-01-01 to 2019-01-01 by month" --skip-existing --output-dir=output --output-format=feather
    outputs:      
      highly_sensitive:
        cohort: output/measures/input*.feather
  
  generate_denominator_2:    
    run: cohortextractor:latest generate_cohort --study-definition study_definition_denominator --index-date-range "2019-02-01 to 2022-03-01 by month" --skip-existing --output-dir=output --output-format=feather
    outputs:      
      highly_sensitive:
        cohort: output/measures/inpu*.feather

  generate_measures:
    run: cohortextractor:latest generate_measures --study-definition study_definition_denominator --skip-existing --output-dir=output/measures
    needs: 
      [
        generate_denominator_1,
        generate_denominator_2,
      ]
    outputs:
      moderately_sensitive:
        measure_csv: output/measures/measure_registered*_rate.csv

  describe_data:
    run: r:latest analysis/analysis_counts.R
    needs: 
      [
        generate_study_population
      ]
    outputs:
      moderately_sensitive:
        Tab1: output/monthly_count.csv
        Tab2: output/demographics.csv
        Tab3: output/monthly_count_resect.csv
        Tab4: output/monthly_count_died.csv
        Tab5: output/monthly_count_GPconsult.csv

Timeline

  • Created:

  • Started:

  • Finished:

  • Runtime:

These timestamps are generated and stored using the UTC timezone on the backend.

Job information

Status
Succeeded
Backend
TPP
Workspace
pancreatic_cancer
Requested by
AgzLeman
Branch
main
Force run dependencies
Yes
Git commit hash
7c5a2b3
Requested actions
  • generate_study_population
  • describe_data