VA Profiles
VA Profiles extend generic Core Model classes with specializations to support a particular type of knowledge (e.g. a variant pathogenicity), and/or align with established community terminology and curation conventions (e.g. the ACMG-2015 Guidelines)
The VA-Spec provides a Profile authoring approach that specifies how profiles are defined as YAML-based specifications, from which machine-readable JSON Schema are derived and used by implementers to structure, validate, and exchange variant data in their systems.
The VA-Spec currently supports profiles of four Core Model classes:
Statement Profiles support assertions of general knowledge about a variant (e.g. classification of the PTEN:c.35A>T(p.Asn12Ile) variant as likely pathogenic in the ClinVar knowledgebase).
Evidence Line Profiles describe how information is interpreted as evidence supporting or disputing a proposition that may ultimately be asserted as true in a Statement (e.g. how data from an experimental functional impact analysis are interpreted to provide strong evidence supporting an assertion that PTEN:c.35A>T(p.Asn12Ile) is pathogenic).
Study Result Profiles capture collections of data items about a specific variant from a particular study or analysis, which often represent evidence for higher order Statements about the variant (e.g. functional impact data about PTEN:c.35A>T(p.Asn12Ile) from the MAVE dataset).
Proposition Profiles are used exclusively within Statement and Evidence Line objects - where they encapsulate the semantics of the possible fact that may be asserted in a Statement, or against which evidence may be assessed in an Evidence Line (e.g. a proposition that “PTEN:c.35A>T(p.Asn12Ile) is causal for Hamartoma Tumor Syndrome”).
The data example here illustrates how profiles of these different types can be used together to represent a variant pathogenicity classification and the evidence supporting it.
In version 1.0 of the VA-Spec, we distinguish between two categories of profiles:
Base Profiles:
Specialize generic VA core classes for a particular type of knowledge, through formal definition of concrete subclasses.
This Base Profiling approach is used to create Proposition Profiles and Study Result Profiles, which can be used/referenced within Statement and Evidence Line profiles.
Community Profiles:
Layer additional constraints on top of VA core classes to enforce alignment with terminology conventions of a specific community guideline (e.g. ACMG 2015).
These constraints are defined using a schema composition approach, which does not result in creation of concrete subclasses for each profile.
This approach is used to define Statement and Evidence Line profiles - which incorporate Propositions to specify the possible fact they assert to be true or evaluate evidence against, respectively.
Notably, it is possible to define more than one Community Profile for a given knowledge type, each of which aligns with a different community guideline. For example, separate Pathogenicity community profiles may end up being defined to align with current ACMG 2015 Guidelines, and a forthcoming updated version of these guidelines.
For technical guidance around how these types of profiles are authored and implemented, see the Developer Guide section.
Index: