Terms of reference
- To agree and define standard pipelines for the analysis of FAANG data, with the aim that data sets are comparable because they have been analysed using the same pipeline.
- These pipelines to include software names, versions and parameter sets and the order in which they should be applied. Software chosen should be "industry standard"
- The group should work together to investigate methods to integrate the primary analysis results to high level summaries of functional state
- To communicate with the "Metadata and Data Sharing" group to ensure samples come with all sample and experimental metadata needed to run said pipelines.
- To communicate with the "Samples, Assays and Animals group to ensure we know the number and type of samples that will arrive.
- To communicate with the "Samples, Assays and Animals" group to define metrics which we will feed back regarding the quality of the data we receive and the quality of the analysis that has been carried out.
- To define computational data standards and make recommendations to the Samples, Assays and Animals" group about minimal experimental standards in terms of depth, read length, pairedness etc
- The group will be responsible for defining standard datasets such as reference genome, gene sets, mappability sets/black lists
- Executive: Mick Watson, James Reecey
- Members of the faang-analysis mailing list
- The group will meet as required via Zoom and will engage in discussion over email via the faang-analysis mailing list.
- Regular monthly meetings will be scheduled
- The group is accountable to the international FAANG steering committee
- Leads for each data type/analysis task (e.g. RNA-Seq, ChIP-Seq etc) will be identified and the lead will work with others in the group to create a proposal for a standard pipeline for that data type. The lead will report back to the group for approval of the proposal
- The wider group will then be asked to agree to the adoption of that pipeline.
- The pipeline should include software names and versions and all parameter sets. The software chosen should be "industry standard". The pipeline should include any filtering, pre- and post- processing steps
- Periodic reviews of each pipeline will be carried out via this group. Any member can propose a change.
- Before re-analysis of large scale datasets is started (e.g when triggered by genome assembly updates) methodology will be reviewed and updated based on newer methods or better understanding of the data
- The performance and relevance of the group will be the international FAANG steering committee
- You have to sign up to become a
FAANG member, accepting the principles and guidelines as set forth in the
"FAANG Data Sharing Statement", in order to participate.
- After login as a member, you will have options to join a working group.