Gene duplication is the key mechanism for evolutionary change. To infer the timing and nature of gene duplication, the 'data' used are the end result of various pipelines. In this talk, I will summarize how the 'data' are obtained, explore the shortcomings of analyses in the literature, and end with current work on overcoming these shortcomings. The interesting statistical problems are that the 'data' are maximum likelihood estimates, and that the biological process (saturation effects) present complications in data modeling.
As part of my SAMSI affiliation, I will also give an overview of SAMSI programs for the coming years.
More information about Snehalata Huzurbazar may be found at http://www.samsi.info/people/snehalata-huzurbazar