Statistics for measuring how we do with real data #41

hyanwong · 2017-07-03T14:33:59Z

We want to run tsinfer on real data and see how well we do compared to what we might expect. This issue collects some ideas for how to do that.

hyanwong · 2017-07-03T14:35:46Z

On average, individuals from the same demographic area should share the vast majority of their most recent coalescences with each other. Another way to measure this would be to use Schiffels & Durbin's 'cross coalescence rate'.
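As a rough illustration of the first idea, here is a minimal sketch that scores a single tree by how often each sample's most recent coalescence is with samples from its own population. It uses a plain child-to-parent dictionary rather than any real tsinfer or tskit data structure, and the function names (`leaves_below`, `same_population_score`) are made up for this example. On real data the per-tree score would presumably need to be averaged along the genome, weighted by the span of each tree.

```python
def leaves_below(children, node):
    """Return the set of leaves (samples) that descend from `node`."""
    stack, leaves = [node], set()
    while stack:
        u = stack.pop()
        kids = children.get(u, [])
        if kids:
            stack.extend(kids)
        else:
            leaves.add(u)
    return leaves


def same_population_score(parent, population):
    """
    Score one tree by how population-specific its most recent coalescences are.

    parent:     dict mapping each non-root node to its parent node (hypothetical format)
    population: dict mapping each sample (leaf) to a population label

    For every sample, take the other samples under its parent node (the
    samples it coalesces with first) and compute the fraction that carry
    the same population label. Return the mean of that fraction over samples.
    """
    # Invert the parent map so we can walk down the tree.
    children = {}
    for child, par in parent.items():
        children.setdefault(par, []).append(child)

    fractions = []
    for sample, pop in population.items():
        if sample not in parent:
            continue  # an isolated sample has no coalescence to score
        neighbours = leaves_below(children, parent[sample]) - {sample}
        same = sum(1 for n in neighbours if population[n] == pop)
        fractions.append(same / len(neighbours))
    return sum(fractions) / len(fractions)


# Toy example: samples 0 and 1 are from population "A", 2 and 3 from "B".
pops = {0: "A", 1: "A", 2: "B", 3: "B"}
clustered = {0: 4, 1: 4, 2: 5, 3: 5, 4: 6, 5: 6}  # ((0,1),(2,3))
mixed = {0: 4, 2: 4, 1: 5, 3: 5, 4: 6, 5: 6}      # ((0,2),(1,3))
print(same_population_score(clustered, pops))
print(same_population_score(mixed, pops))
```

A score near 1 means samples mostly coalesce first within their own population; a score near the background population frequency would suggest the inferred trees carry little population structure at that position.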

hyanwong · 2017-11-27T13:42:05Z

Chatted to George Busby about 1000G data: his project with Ryan Christ involved chromosome painting with some of the 1000G data plus a focal population of Africans with an introgressed lactose tolerance haplotype, and looking for areas of shared ancestry within the focal African population (ancestry was estimated by splitting the 1000G data into e.g. 6 populations and basing ancestry measures on haplotype prevalence within each population). We might be able to do this sort of thing with ancestors on trees instead.

Another suggestion would be to look across the Duffy locus, which we know to have been under selection. One issue is that this is on Chromosome 1, which is the largest human chromosome.

hyanwong · 2018-09-26T14:10:48Z

Just chatting to Wilder: I wonder if we can plot a "DensiTree" using a random subsample of the data (both haplotypes and genomic positions).

hyanwong · 2018-09-26T14:26:01Z

Also, if there are any individuals in 1000G who have admixed parents (e.g. one maternal grandparent African, the other European), then we might be able to see large chunks where the genome shows a closer relationship with Africans, and other chunks where it is closer to Europeans.
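One way this could be summarised, sketched here under the assumption that each tree along the genome has already been given a single "closest population" label for the focal haplotype: merge adjacent intervals with the same label into chunks and look at their lengths. The input format and the function name `merge_ancestry_chunks` are hypothetical, chosen only for illustration.

```python
def merge_ancestry_chunks(intervals):
    """
    Merge adjacent genomic intervals that carry the same ancestry label.

    intervals: list of (left, right, label) tuples, sorted by `left`,
               e.g. one entry per tree along the genome (assumed format).
    Returns a list of (left, right, label) chunks.
    """
    chunks = []
    for left, right, label in intervals:
        # Extend the previous chunk only if it has the same label and abuts this interval.
        if chunks and chunks[-1][2] == label and chunks[-1][1] == left:
            chunks[-1] = (chunks[-1][0], right, label)
        else:
            chunks.append((left, right, label))
    return chunks


# Toy per-tree labels for one haplotype of a hypothetical admixed individual.
per_tree = [
    (0, 10, "AFR"),
    (10, 25, "AFR"),
    (25, 40, "EUR"),
    (40, 60, "EUR"),
    (60, 70, "AFR"),
]
for left, right, label in merge_ancestry_chunks(per_tree):
    print(label, right - left)
```

For a recently admixed individual we would expect a small number of long chunks; many short alternating chunks would point to noise in the inferred trees rather than real ancestry switches.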
