Contamination score is based on the fraction of single-copy genes that are observed more than once in a query genome. The following scores are acceptable for; High Quality Draft: < 5%, Medium Quality Draft: < 10%, Low Quality Draft: < 10%. Contamination must be below 5% for a SAG or MAG to be deposited into any of the public databases
Schema Source
from schema: https://github.com/turbomam/mixs-subset-examples-first/tree/main/src/mixs_subset_examples_first
LinkML Source
name: contam_score
annotations:
global_raw_definition:
tag: global_raw_definition
value: 'Contamination score is based on the fraction of single-copy genes that
are observed more than once in a query genome. The following scores are acceptable
for; High Quality Draft: < 5%, Medium Quality Draft: < 10%, Low Quality Draft:
< 10%. Contamination must be below 5% for a SAG or MAG to be deposited into
any of the public databases'
global_value_syntax:
tag: global_value_syntax
value: '{float} percentage'
occurrence:
tag: occurrence
value: '1'
description: placeholder description; linter will ignore this
title: contamination score
examples:
- value: '0.01'
from_schema: https://github.com/turbomam/mixs-subset-examples-first/tree/main/src/mixs_subset_examples_first
rank: 1000
slot_uri: MIXS:0000072
alias: contam_score
domain_of:
- Checklist
- Mimag
- Misag
range: string