Version: 17.0.0 | Published: 30 Oct 2023 | Updated: 572 days ago
Documentation
Associated Media:
Description:
Contains tables related to long-reads sequencing data for 100,000 Genomes Project participants.
- lrs_laboratory_sample: Data describing the characteristics and processing methods (DNA to library preparation) of samples from participants in the 100,000 Genomes Project for which long-reads sequencing has been carried out.
- lrs_sequencing_data: This table includes data describing long-read sequencing of a subset of 100,000 Genomes Project participants and associated output, including paths to raw and BAM files.
- cancer_ont_cohorts: Table listing participant ids, sample data, file paths and sequencing statistics for Oxford Nanopore cancer cohorts available in the Research Environment, along with corresponding matched germline and Illumina short reads files where available
- rare_disease_pacbio_pilot: This is a dataset of 91 rare disease samples from the 100k genome project re-sequenced with Pacific Biosciences (PacBio) as an example dataset to to demonstrate the utility of their HiFi technology.
Is Part Of:
100K Primary Data
Coverage
Spatial:
UK
Typical Age Range:
0-150
Follow Up:
OTHER
Physical Sample Availability:
DNA
Pathway:
Linked datasets cover secondary care.
Provenance
Origin
Purposes:
- CARE
- DISEASE REGISTRY
- OTHER
Sources:
- ELECTRONIC SURVEY
- EPR
- LIMS
- MACHINE GENERATED
Collection Situations:
- CLINIC
- IN-PATIENTS
- OUTPATIENTS
Temporal
Accrual Periodicity:
QUARTERLY
Distribution Release Date:
30 March 2023
Start Date:
01 January 2012
End Date:
31 December 2022
Time Lag:
2-6 MONTHS
Accessibility
Access
Access Service:
More information about the Genomics England Research Environment can be found
here:
https://www.genomicsengland.co.uk/about-genomics-england/research-environment/
https://research-help.genomicsengland.co.uk/display/GERE/1.+The+Genomics+England+Research+Environment
Genomics England 100k participants have consented to longitudinal lifetime
followup and recontact safely through our clinical network. BRST (Bioinformatics
Research Services) are a team of bioinformatics who know the dataset inside out
and provide consultancy projects on a case by case basis. Our network of
clinical and medical experts can be made available on case by case basis.
Researchers have the opportunity to work with our and access the GeCIP network
who are a community of world-leading experts in specific cancers and rare
diseases.
Access Request Cost:
Fees will be dependent on the type of access that is necessary. Raw data is not
eligible for export. Summary-level data may be exported provided that it is
approved through the Genomics England Airlock Process
Delivery Lead Time:
2-6 MONTHS
Jurisdictions:
GB-GBN
Data Controller:
GENOMICS ENGLAND
Data Processor:
GENOMICS ENGLAND
Usage
Data Use Limitations:
GENERAL RESEARCH USE
Data Use Requirements:
- ETHICS APPROVAL REQUIRED
- PROJECT SPECIFIC RESTRICTIONS
- PUBLICATION MORATORIUM
Resource Creators:
- The 100
- 000 Genomes Project Protocol v3
- Genomics England. doi:10.6084/m9.figshare.4530893.v3. 2017. Publications that use the Genomics England Database should include an author as: Genomics England Research Consortium. Please see publication policy.
Is Referenced By:
Format and Standards
Vocabulary Encoding Schemes:
- LOCAL
- ICD10
- NHS NATIONAL CODES
- ODS
- OPCS4
- READ
- SNOMED CT
- OTHER
Languages:
en
Formats:
Multiple formats available
Enrichment and Linkage
Qualified Relations:
- HES Accident and Emergency
- HES Outpatient Care
- Diagnostic Imaging Dataset (DID)
- Patient Reported Outcome Measures (PROMs)
- Cancer Registration (AV) tables
- HES Admitted Patient Care
- Cancer waiting times (CWT)
- Lung Cancer Data Audit (LUCADA)
- PHE Diagnostic Imaging Dataset (NCRAS_DID)
- Systemic Anti-Cancer Therapy Data Set (SACT)
- Office for National Statistics - Death details data (ONS)
- National Radiotherapy Dataset (RTDS)
- Mental Health Minimum Data Set (MHMDS)
Derivations:
Not Known
Observations
Statistical Population
Population Description
Population Size
Measured Property
Observation Date
Findings
Rare Disease - Number of genomes
73,517
Count
30 March 2023
Findings
Cancer Germline - Number of genomes
32,753
Count
30 March 2023
Findings
Cancer Tumour - Number of genomes
17,003
Count
30 March 2023
Persons
Cancer Participants
15,624
Count
30 March 2023
Persons
Rare Disease Participants
72,874
Count
30 March 2023
Origin
Name:
Data Catalogue