Proposition

Note

This data class is at a trial use maturity level and may change in future releases. Maturity levels are described in the maturity-model.

Computational Definition

An abstract entity representing a possible fact that may be true or false. As abstract entities, Propositions capture a ‘sharable’ piece of meaning whose identify and existence is independent of space and time, or whether it is ever asserted to be true by some agent.

Information Model

Some Proposition attributes are inherited from Entity.

Field

Flags

Type

Limits

Description

id

string

0..1

The ‘logical’ identifier of the Entity in the system of record, e.g. a UUID. This ‘id’ is unique within a given system, but may or may not be globally unique outside the system. It is used within a system to reference an object from another.

type

string

1..1

The name of the class that is instantiated by a data object representing the Entity.

name

string

0..1

A primary name for the entity.

description

string

0..1

A free-text description of the Entity.

aliases

string

0..m

Alternative name(s) for the Entity.

extensions

Extension

0..m

A list of extensions to the Entity, that allow for capture of information not directly supported by elements defined in the model.

subject

object

1..1

The Entity or concept about which the Proposition is made.

predicate

string

1..1

The relationship declared to hold between the subject and the object of the Proposition.

object

object

1..1

An Entity or concept that is related to the subject of a Proposition via its predicate.


Implementation Guidance:

1. Structuring Proposition Semantics:

Propositions are abstract representations of possible facts about a domain of discourse, e.g. “NM_005343.4:c.173C>T causes Costello Syndrome”. The semantics of a Proposition are captured in subject, predicate, object, and optional qualifier attributes (SPOQ). Proposition profiles constrain the types of values that can be captured in these SPO attributes, and may define any number of specialized qualifier attributes that extend the SPO “triple” with additional detail or context. For example, if an SPO triple asserts that “NM_005343.4:c.173C>T is causal for Costello Syndrome”, a geneContextQualifier can be used to indicate that its effect on the HRAS gene mediates this causal relationship, and an alleleOriginQualifier can be used to indicate that the fact holds specifically for variants of ‘germline’ origin:

# Note that values in this example are reported in shorthand form for human readability.
# In actual VA-Spec data, many values would be wrapped in complex data type structures such as MappableConcepts.

subject: NM_005343.4:c.173C>T
predicate: isCausalFor
object: Costello Syndrome
geneContextQualifier: HRAS
alleleOriginQualifier: germline

2. Proposition Utility:

A proposition itself makes no claim as to whether the sentiment it expresses is true or not - its job is simply to convey the sharable meaning of a possible fact in a structured data object. Such propositions can then be referenced and reused by Statements and Evidence Lines, which make assertions about them. Specifically:

  • Statements may report that a proposition was asserted by a particular agent to be true or false, or may report the overall strength of confidence or evidence supporting or disputing a proposition for which a definitive assertion cannot yet be made. Such Statements are based on the agent’s interpretation of evidence as providing discrete argument(s) for or against the proposition.

  • Evidence Lines are used to represent each such discrete evidence-based argument. They report that a particular collection of information (evidence items) was assessed and scored as evidence to support or dispute some target proposition. It is typically through the assessment of several distinct Evidence Lines that a particular Proposition is ultimately asserted to be true or false in a Statement.


3. Proposition Example

Variant pathogenicity classifications based on the 2015 ACMG Interpretation guidelines are a nice example of how a single proposition such as “NM_005343.4:c.173C>T is causal for Costello Syndrome” can be used in Evidence Lines and Statements. This proposition may first be used as a target against which data is interpreted to build Evidence Lines, according to specific evaluation criteria (e.g. PM2 for allele frequency data, PM1 for functional impact data). As Evidence Lines supporting this proposition accumulate, it may subsequently be used in a Statement where it is asserted as true to classify the variant as ‘pathogenic’.

The example below illustrates how such a scenario may be represented using the VA-Spec - illustrating the use of the same Proposition001 in two Evidence Lines and a Variant Pathogenicity Statement.

# Note that values in this example are reported in shorthand form for human readability.
# In actual VA-Spec data, many values would be wrapped in complex data type structures such as MappableConcepts.

  # As a target proposition in an EvidenceLine based on functional impact data, created at t0 by Curator 1
  id: EvidenceLine001
  type: EvidenceLine
  targetProposition:
    - id: VarPathProposition001
      type: VariantPathogenicityProposition
      subjectVariant: NM_005343.4:c.173C>T:c.173C>T
      predicate: isCausalFor
      objectCondition: Costello Syndrome
      geneContextQualifier: HRAS
  evidenceItems: FunctionalImpactStudyResult001       # full StudyResult object omitted for space
  directionOfEvidenceProvided: supports
  strengthOfEvidenceProvided: moderate
  specifiedBy: PM1

  # As a target proposition in an EvidenceLine based on cohort allele frequency data, created at t1 by Curator 2
  id: EvidenceLine002
  type: EvidenceLine
  targetProposition: VarPathProposition001    # no need to duplicate an inlined representation, as this Proposition is already defined in the message.
  evidenceItems: AlleleCohortFrequencyStudyResult001   # full StudyResult object omitted for space
  directionOfEvidenceProvided: supports
  strengthOfEvidenceProvided: moderate
  specifiedBy: PM2

  # As an asserted proposition in a VariantPathogenicityStatement, created at t2 by Curator 3 who puts forth the proposition as true and classifies the variant as 'pathogenic' based on the Evidence Lines above
  id: Statement001
  type: Statement
  proposition: VarPathProposition001          # no need to duplicate an inlined representation, as this Proposition is already defined in the message.
  direction: supports
  strength: definitive
  classification: pathogenic
  hasEvidenceLines
    - EvidenceLine001
    - EvidenceLine002
  specifiedBy: 2015 ACMG Variant Interpretation Guidelines