malariagen_data.ag3.Ag3.genome_features#
- Ag3.genome_features(region: str | Region | Mapping | List[str | Region | Mapping] | Tuple[str | Region | Mapping, ...] | None = None, attributes: Sequence[str] | str | None = 'default') DataFrame #
Access genome feature annotations.
Parameters#
- regionstr or Region or Mapping or list of str or Region or Mapping or tuple of str or Region or Mapping or None, optional
Region of the reference genome. Can be a contig name, region string (formatted like “{contig}:{start}-{end}”), or identifier of a genome feature such as a gene or transcript. Can also be a sequence (e.g., list) of regions.
- attributessequence of str or str or None, optional, default: ‘default’
GFF attribute keys to unpack into dataframe columns. Provide “*” to unpack all attributes.
Returns#
- DataFrame
A dataframe of genome annotations, one row per feature. The dataframe follows the GFF3 format (The-Sequence- Ontology/Specifications/blob/master/gff3.md), including extra attributes ID, Parent, Name and description depending on the dataset.