Genotype and Phenotype Probabilities by Ternary Code based Line-dot Method and Polygonal Illustrations
Genotype and Phenotype Probabilities by Ternary Code based Line-dot Method and Polygonal Illustrations
Muhammad Imran Shabbir
Department of Bioinformatics and Biotechnology, Faculty of Basic and Applied Sciences, International Islamic University, H-10, Islamabad, Pakistan
The heart of learning genetics is in realizing that how the likelihood of inheriting a particular trait can be predicted. This fundamental approach is the basis for plant and animal breeding to get desired varieties as well as performing genetic analysis including prediction of patterns of inheritance in family lines and to calculate the recurrence risk for relatives etc. Punnett square is used to describe the possible combinations of paternal and maternal alleles for a particular cross. Described here is a line-dot method using ternary genotype codes to determine genotype probabilities in a very quick and simple way. Also ternary genotype codes along with gender symbols and affection status are used to modify the Punnett squares. On applying these modified Punnett squares along polygonal shapes covers all possible parental combinations and their genotypic and phenotypic outcomes in offspring. In this way, simple illustrations incorporate a considerable amount of information that provides the opportunity to determine genotype and phenotype probabilities for monogenic Mendelian traits without the need to draw Punnett square for every parental genotype combination. These illustrations provide information for both forward (phenotype to genotype) as well as reverse genetics (genotype to phenotype) approaches.
Received 15 July 2015
Revised 21 December 2015
Accepted 03 May 2017
Available online 11 August 2017
Ternary genotype code, Line-dot method, Modified Punnett squares, Ternary coded genotype probability groups, Line-dot hexagon, Genotype hexagon, Genotype to phenotype hexagon, Phenotype to genotype square.
* Corresponding author: email@example.com
0030-9923/2017/0005-1603 $ 9.00/0
Copyright 2017 Zoological Society of Pakistan
Genetic conditions caused by a mutation in a single gene follow predictable patterns known as Mendelian inheritance. This inheritance is classified based on whether it is autosomal or sex-linked and whether it is following a dominant or recessive pattern. Punnett square approach is most commonly used to predict an outcome of a particular cross or breeding experiment (; ). It provides a rundown of every possible arrangement of one maternal allele with one paternal allele for any locus/gene being studied in the cross. With reference to genotype on a single locus, there are twenty five different parent unions possible in nuclear pedigrees, six for each of autosomal dominant, autosomal recessive, X-linked dominant, X-linked recessive (6+6+6+6 = 24) and one for Y-linked inheritance pattern. Genetic literature including books generally describes only one or two combinations whereas other combinations can be drawn as required (). To knowledge, no approach has been found in literature that provides a quick prediction of genotypic and phenotypic outcome of a particular cross for Mendelian inheritance without the need to draw genotype symbols from both parents through Punnett squares or alternative forked-line method.
This communication attempts to incorporate a good deal of genotype and phenotype probability information in the form of different illustrations. For this purpose, a number of approaches are employed which include i) Numeric coding of parental and offspring genotypes to simplify the symbolic representation, ii) A very quick “line-dot method” is described to draw genotype outcomes which is then elaborated into “line-dot hexagon” (Ld-hex), iii) Modifying Punnett squares with numeric coding and incorporating symbols to describe gender and affection status, iv) “Ternary coded genotype probability groups” that are also described in the form of “basic genotype hexagon” (bG-hex) (, ). Finally, three polygonal illustrations are described including “detailed genotype hexagon” (dG-hex) (), “genotype-to-phenotype hexagon” (G2P-hex) () and “phenotype-to-genotype square” (P2G-sq) () that describe the genotype and phenotype probabilities for monogenic Mendelian inheritance.
Materials and Methods
In order to provide illustrations with comprehensive information in the form of outcomes of all the possible parent combinations, following approaches were used:
To draw offspring outcomes from a parental cross using ternary numeric code, a method is described which uses line(s) drawn on four dots. For this purpose, line(s) are drawn vertically (for the first crossing parent) and horizontally (for the second crossing parent) according to their ternary genotype code written left to right. The genotype can be determined by counting number of lines passing through any dot ().
Punnett square modifications
Punnett square representations were modified to contain more information with the help of ternary genotype codes written outside an appropriately sized square along its four corners. Also four gender symbols were incorporated inside the square to illustrate the appearance of affection status in a particular gender wherever apply. These gender symbols may be empty (normal), having a dot inside (carrier), dark filled (affected), lightly filled (affected for dominant mode and carrier for recessive mode) to represent the occurrence of a respective phenotype ().
Numeric genotype probability groups
The genotype from the ternary coded Punnett square can also be written in straight line in numeric form by representing parents’ genotype (Pr and Ps) on the sides and possible genotype outcomes in four offspring (O) in the middle as [1—2110—1] (). As there are six different cross combinations possible between parents on the basis of genotype, six distinct genotype probability groups are named G0, G1, G2b, G3, G4 and G2s representing [0—0000—0], [0—0011—1], [1—2110—1], [1—1122—2], [2—2222—2] and [0—1111—2], respectively (). The genotype groups are named based on number of affected alleles shared by both parents (r + s) (). Group G2 involve two affected alleles either each one of them coming from one parent so ‘both’ parents are involved (G2b) or two affected alleles coming from a ‘single’ parent (G2s) ().
Avoiding diagonal shift of genotypes
The order of genotype of parents when written for a particular cross (e.g., F x M), if reversed as (M x F) cause diagonal shift of genotypes of offspring between the upper right side and lower left side of the Punnett square (). This may lead to wrong interpretation of results, so this is avoided in the polygonal illustrations by using arrows, arrow heads and/or considering only one type of writing combination.
In order to provide information of all possible parental genotype combinations and their genotypic outcomes in offspring, different polygonal illustrations are described. These illustrations include line-dot hexagon (LD-hex) (), basic genotype hexagon (bG-hex) (), detailed genotype hexagon (dG-hex) (), genotype-to-phenotype hexagon (G2P-hex) () and phenotype-to-genotype square (P2G-sq) () that describe the genotype and phenotype probabilities for Mendelian inheritance.
Results and Discussion
Ternary numeric coding, line-dot approach and modified Punnett squares
Ternary numeric coding described in this communication works on the basis of number of affected allele(s) that constitute a genotype. Codes ‘0’ and ‘2’ represent homozygotes for normal and affected alleles, respectively. However, code ‘1’ represents a heterozygote with one affected allele. By this notion, it simplifies the genotype representation for autosomal as well as sex-linked inheritance by only using three numerals 0, 1 and 2 instead of two letter or other complex genotype symbols (). Using these codes, the line-dot method describes a simple and quick way to find genotypic outcomes for offspring. Each dot represents a 25% (or ¼) chance of genotypic outcome in offspring. The number of lines passing through any dot represents the respective genotype (). It can provide a quick numerical way of solving a Punnett square which was further modified to incorporate ternary codes and gender symbols with affection status ().
As described in genotype probability groups (), there are six possible parent unions on the basis of genotype as [0—0], [0—1], [1—1], [1—2], [2—2] and [0—2]. However, when writing three more combinations [1—0], [2—1], and [2—0] of the underlined groups are possible making them total of nine. In order to describe all nine possible parental crosses, hexagonal shapes are used with parent symbols along with respective genotype code connected by a marriage line. The offspring outcomes are described along the marriage line (, , ). Also, in writing a particular cross, the order of parent genotype is important and reversing the order leads to diagonal genotype shift () that can mislead especially with ternary coded genotypes. To solve this problem, in the “hexagonal illustrations”, arrows or arrow heads are used to differentiate e.g., [0—1] as[ 0 → 1] and [1—0] as [1 → 0] (, , ). However, in the “square illustration”, genotype shift is also avoided by always considering parent’s genotypes in the only order of (smaller genotype code x larger genotype code) or (n x N) for autosomal and (Father’s genotype code x Mother’s genotype code) or (F x M) for sex-linked inheritance (). Arrow heads are also added in order to further ease this consideration, so the direction of the arrow describes which parent is considered to be written first for a particular cross ().
For line-dot method, only one cross is elaborated in and rest of all possible combinations are presented along a line-dot hexagon (Ld-hex) () and their equivalent ternary coded Punnett square outcomes are described in simple genotype hexagon (sG-hex) (). sG-hex is then further improved to detailed genotype hexagon (dG-hex) to incorporate autosomal (external hexagon) as well as sex-linked (internal hexagonal space) parent combinations and offspring outcomes (). Offspring genotype outcome is presented along the respective arrow head on the marriage line between two parent symbols when both parents have different genotype to avoid wrong interpretation due to diagonal shift of genotypes in offspring.
In genotype-to-phenotype hexagon (G2P-hex), along with genotypes, phenotypes are presented in the form of symbols depending upon dominant or recessive form of autosomal and sex-linked inheritance. The phenotypes described here assumes dominant-recessive model with full penetrance, however any other outcome including non-penetrance, co-dominance or incomplete dominance etc. can be easily imagined ().
G2P-hex also effectively describes why dominant mode does not skip any generation (because all kinds of parent unions involving at least one affected allele result in affected offspring) whereas recessive mode tend to skip generations (because of possibility of producing all normal offspring though carriers of disease in 0—1 and 0—2 parent unions in autosomal recessive and F1—M0 union in X-linked recessive modes). However, in dominant inheritance, generation skipping may occur due to non-penetrance ().
Genotype probability groups described in also represent all six sides of the hexagonal illustrations. All marriage lines with similar genotype combination from both parents that are parallel to each other belong to one group that lead to overall same genotypic outcome in offspring irrespective of gender, mode of inheritance or phenotypic outcomes. However, the group G4 [2—2222—2] exist only for autosomal inheritance and is not possible for sex-linked inheritance because male cannot be homozygous for either X or Y chromosome except if there is a chromosomal abnormality (, ).
Another polygonal illustration, Phenotype-to-genotype square (P2G-sq) initiates with a square in the center of the illustration representing four marriage lines among normal and/or affected parents and incorporates all possible four parent phenotype combinations. Then each parent union is represented with all possible combinations of having a particular genotype in them and possibility of having a respective genotype and phenotype in their offspring ().
The probability calculation from these illustrations is straight forward as each cross is described with four offspring each one of them with ¼ or 25% risk of having any genotype and thus associated phenotype. For example, in the genotype group G3 [1—1122—2], half (¼ + ¼ = ½) of offspring will be heterozygous (1) and remaining half (¼ + ¼ = ½) will be homozygous (2) for affected allele. However on the phenotype account, considering dominant trait, all (100%) offspring will be affected being either (1) or (2) whereas for recessive trait, there will be 50% chance of being affected (2) and 50% chance of being normal but carrier of affected allele (1) (, , , ).
Forward and reverse genetics approaches
The difference between forward and reverse genetic approaches is the order of studying genotype and phenotype. If the Phenotype is known and responsible genotype is explored, it is forward genetics (Phenotype → Genotype = P2G). On the other hand, Reverse genetics is when genotype is known and resulting phenotype is determined (Genotype → Phenotype = G2P). In this work, P2G-sq follow the forward approach describing the possible phenotypes of both parents first and then all possibilities of having a causative genotype in them as well as respective genotype and phenotype probabilities in offspring. However, G-hex follows the reverse genetic approach providing all the possibilities of parent combinations with known genotype and predicts the offspring genotype outcomes. Further, G2P-hex describes the possible phenotypes with respect to known genotypes in parents and offspring. Determining genotype and phenotype probabilities is important in genetic counseling, linkage analysis as well as in genetic evaluation programs. The probability determination described here will simplify the observation of genotype and phenotype for a particular monohybrid cross and assessment of risk of each offspring carrying a trait under study. These polygonal illustrations will be a useful tool for genetic counselors, researchers, teachers and students of genetics.
In essence, the methods and illustrations described in this communication tend to simplify the observation of genotype and phenotype probabilities for monogenic Mendelian inheritance. Described here are some of the potential benefits and applications:
Simple and easy to use
Line dot method and genotype probability grouping simplifies the Punnett squares and are self explanatory. With proper understanding and practice, monohybrid crosses can be solved in mind without need to draw on the paper. For example, any cross between two heterozygotes (Aa x Aa) can be coded to (1 x 1) and using line-dot method or following the genotype probability group G2b = [1—0112—1] will effectively tell that with each parent sharing genotype 1, ¼ or 25% offspring will be of genotype 0, ¼ + ¼ = ½ or 50% offspring will be of genotype 1 and remaining ¼ or 25% will be of genotype 2 ().
As described earlier, with reference to genotype on a single locus, there are twenty five different parent unions possible in nuclear pedigrees, six for each of autosomal dominant, autosomal recessive, X-linked dominant, X-linked recessive (6+6+6+6 = 24) and one for Y-linked inheritance pattern. No effort has been found in literature that provides the outcomes of all possible parent unions for all Mendelian modes of inheritance. However, these illustrations provide ready-to-use set of information incorporating all possible parental and offspring outcomes for genotype and phenotype. This eliminates the need for drawing any particular cross outcomes.
Reverse genetics-genotype to phenotype prediction
Genotype information can be effectively translated into phenotype by using G2P-hex (). For example, for an autosomal disorder, one parent is homozygous with normal alleles (0) and other parent is homozygous with affected alleles (2). One can find a cross (0 x 2) in the outer hexagon (for autosomal inheritance) where parent with genotype 0 will be normal and that of 2 will be affected. In this cross all the offspring will carry one affected allele with genotype 1. Therefore all the offspring will be affected or normal carriers considering autosomal dominant and autosomal recessive modes of inheritance, respectively ().
Forward genetics-phenotype to genotype prediction
Starting from parent’s phenotype, one can easily predict possible genotype and phenotype probabilities in offspring. Suppose a parental cross where father is affected and mother is normal for X-linked recessive disorder. In the P2G-sq (), this kind of parental cross lie on the right side of the central phenotype square showing normal mother on upper side and affected father on lower side joined vertically by marriage line. Along this cross on the right side are provided all possible genotypes that conform to the parental phenotype as well as genotype and phenotype outcomes possible in offspring for different possible modes of inheritance. For this example (X-linked recessive disorder), one can see two kinds of possible crosses under the label XR (). In both of these crosses, father is having genotype 1 having its only X chromosome mutated for the responsible gene. However, mother being normal can be either 0 or 1 on genotype account. If mother’s genotype is 0, offspring outcomes will be (all daughters = carriers, all sons = normal) and if mother’s genotype is 1, offspring outcomes will be (half daughters and sons = affected, remaining half of daughters and sons = normal with daughters carriers of disease).
These illustrations are particularly useful for genetic counselors because all possible parent unions with respect to genotype and phenotype and their outcomes in offspring are already provided in a self explanatory way. So these illustrations can be helpful in calculating the genotypic and phenotypic probability of a particular disorder following any Mendelian pattern of inheritance.
Pedigree analysis and inheritance prediction algorithm
These methods and illustrations provide basis for pedigree analysis/inheritance prediction and can be translated into such algorithm using bioinformatics’ approaches. The ternary genotype coding can be used to simplify the genotype labels.
In conclusion, a straight forward and self explanatory way to illustrate genotype and phenotype probabilities is presented in the form of polygonal illustration. The purpose of this communication is not to provide just another alternative to Punnett squares or forked line method but to provide a way by which significant amount of information can be composed in a limited space in the form of simple illustrations. These illustrations provide a quick reference to determine outcome of any parental cross. It has also been observed that with basic understanding and a little practice of line-dot method, the monohybrid crosses can be solved in mind without even writing on paper. Currently, the line-dot method has been provided for monogenic Mendelian traits; however, it can be extended for multigene traits. This method and polygonal illustrations are simple and easy as well as provide ready-to-use reference for genetic counselors, researchers, teachers and students of genetics. With the help of polygonal illustrations, prediction can be done in either direction as reverse genetics (genotype to phenotype prediction) or forward genetics (phenotype to genotype). Also, ternary coding can be used to simplify the genotype labels in bioinformatics based approaches. Based on these illustrations, pedigree analysis and/or inheritance prediction algorithm can also be designed. Further studies are required to elaborate the full potential of these methods and polygonal illustrations.
The author wishes to thank Shabbir Ahmed, Bushra Shabbir and Faiza Imran for their continuous support and helpful suggestions.
Statement of conflict of interest
Authors have declared no conflict of interest.