
Centre For Cancer Genetic Epidemiology

 

The CIMBA database holds data on around 90,000 individuals, including:

  • Core demographic data
  • Data on diagnosis, pathology, treatment and outcome for breast cancer patients
  • Data on breast cancer risk factors
  • Genetic data

Genotype Data

The CIMBA database currently includes genotype data from two large genotyping arrays, iCOGS and OncoArray, together with data from pre-iCOGS. Data from the Confluence array is currently being generated.

 

Data Access

 

CIMBA data is available at two levels:

1. Summary data

Summary results from the GWAS (iCOGS & OncoArray) projects are now publicly available.

 

2. Individual level CIMBA data

Access to individual level CIMBA data is governed by a Data Access Coordinating Committee (DACC). If you require access to individual level CIMBA data, you will need to follow the procedure below:

(a) Download the Concept Analysis Application Form (Sep 2025) and return the completed form to the CIMBA Data Access Coordinating Committee (DACC). The DACC reviews all submitted applications on an ad hoc basis and will contact the concept proposer once the review is complete.

The relevant CIMBA data dictionaries are provided below:

- Female phenotypic

- iCOGS PRS

- OncoArray PRS

(b) If the DACC approves the concept analysis: (i) there is a one-month period for CIMBA PIs to opt out of any analysis for their study samples, (ii) a non-negotiable DTA must be signed by the concept proposer and their institution, and (iii) a fee is payable for the data access. Please also refer to the concept analysis application form for more details.

(c) Once the above criteria have been fulfilled, the requested data will be sent to the concept proposer, with details of the CIMBA authorship/funding/acknowledgment requirements for any publications resulting from the data analyses.

If you have any queries, please contact the CIMBA Coordinator.

 
