comparing DNA match data - redux #dna
Bob Smiley
I did my own test on the validity of large segments of DNA matches on MyHeritage. I had a personal pile-up on chromosome 16. There were 30 matches which were in the range of 25 to 30 cM in length which overlapped each other. I tested each set to determine which ones were from my Paternal chromosome, and which were from my Maternal chromosome.
I had two known paternal cousins which had matches in that region, and non of the other 28 triangulated with both of them. So I only had two valid Paternal matches. There were five who matched each other (triangulated) and they were presumed Maternal matches. None of them were known relatives and the total match was too small to expect to find the relationship through genealogical trees. The other 23 large matches did not triangulate with anyone. These were all false matches, or what I prefer calling, pseudomatches. The final determination is that in MyHeritage, over 76% of the large (bigger than 25 cM) matches are false matches. They are not real. This is undeniable data. -- Bob Smiley Kirkland, Washington USA |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Endogamy is a challenge but for those of us who have only one Ashkenazi relationship, in my case my father’s mother, I find my small Ashkenazi DNA (27%) is more accurate and much easier to connect. I am connected to almost everyone from Raudnitz an der Elbe, N Bohemia. So do not overlook these small matches for those who are not 100% Jewish DNA.
Dorann Jacobs Cafaro Charleston, SC 732 687-5318 Researching Grauer, Dray, Marcus, May, Weisbach, Hecht, Cohen, Kahn (Dray being a brick wall) |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Steven Usdansky
My matches with HM on MyHeritage, FTDNA, and GEDmatch. All I know for sure is that 60+ cM segment didn't come from my mother, who doesn't match HM at all on chromosome 7 (with a 3 cM minimum)
Yellow - Largest estimate for a given segment Blue - Segment only reported by one site
-- Steven Usdansky usdanskys@... USDANSKY (Узданский): Turec, Kapyl, Klyetsk, Nyasvizh, Slutsk, Grosovo SINIENSKI: Karelichy, Lyubcha, Navahrudak NAMENWIRTH: Bobowa, Rzepiennik SIGLER: "Minsk" |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Carolyn Lea
Re: comparing DNA match data - redux #dna
Check out Kitty Cooper's blog on Ashkenazi DNA matches. Due to endogamy within the Ashkenazi population we need to look for a much larger threshold. "Finding relatives is difficult because all of us Ashkenazim are related multiple times, both way back when and more recently. Most AJs look like 4th or 5th cousins to each other even when that is not the case. Cousin marriages, uncle-niece marriages, and other close family marriages abound in our trees. In my own family, on my Jewish line, my great grandmother fixed up her sister with her husband’s brother to get that dowry for the family business so I have double third cousins. Click here for my article from back in 2014 that suggested that we are all descended from 350 people in the 1300s." https://blog. She also has an older one you can check out at her blog site. Carolyn Lea (nee Schwarzbaum) OKC, OK Schwarzbaum(Posen/NY/Georgia, US) Lewisohn/Levison Rothschild Wittkowski Basch |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
As a follow-up to my post and the suggestion of several posts, I was able to compare the results of match data for my mother and another person (Mary) at MyHeritage, FTDNA and GEDMatch.
MyHeritage
FamilyTree DNA
GEDMatch
Some idea about where the discrepancy occurs came from a note in the FTDNA report: the section on Chr 1 where MyHeritage reported a 25.2cM segment match was marked by FTDNA as a "SNP poor region: not tested for Family Finder." The bottom line for my purposes is that the MH report in this case was misleading and a more realistic reading (3 out of 4 test sites) suggests this match falls below the threshold where I could reasonably expect to find a family tree connection. I do want to point out that in the other matches I reviewed, MyHeritage was often within range of the other services' results and my experience in this case does not mean that MH testing should be deprecated generally. I also want to note that I was fortunate to have the cooperation of the other test taker, who engaged in an exchange to explore a possible connection, shared pointers to other sites where she was tested and provided access to her family tree. Thanks also to those who responded to my post. Lee David Jaffe |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Jill Whitehead
Following the reference to my friend Debbie Kennet's Cruwys blogspot from 2018, I have contacted her for an update on the relative merits of Ancestry and My Heritage for making DNA comparisons. Here is her reply:
"MyHeritage have updated their matching algorithms since I wrote that post. However, there is still a very high false positive rate because they accept uploads in different file formats and there is very little overlap in the SNPs used for matching on the different chips. See:
https://isogg.org/wiki/Autosomal_SNP_comparison_chart
I don’t trust matches sharing below about 40 cM at MyHeritage and I’ve also found some problematic matches sharing between 40 and 50 cM. The matches at AncestryDNA are much more reliable.
I have two kits at MyHeritage. I did a new test on the Global Screening Array and with that kit 47% of my matches don’t match either of my parents. My other kit is a transfer from Ancestry and with that kit 38% of my matches don’t match either of my parents. I also get over a thousand more matches with the transfer kit.
With Jewish matches at MyHeritage I would suggest you ignore all the low and medium confidence matches and only work with matches where the largest segment is 30 cM or more and where are there are preferably at least two reasonable-sized segments. If you look in the chromosome browser you’ll probably find that a lot of the matches where you appear to share high cM totals are actually made up of lots of really tiny segments that are more likely to be false. You may want to recalibrate the total cM shared and remove all the segments under 10 cM to get a more realistic estimate of the total cM shared." |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
stoub@...
The dramatic differences between My Heritage and Ancestry are most noticeable when Ancestry thinks the total is <90cM. Above that threshold, TIMBER is not in effect. For Nancy's match to Mary, Ancestry calculates this as 67cM so TIMBER is in effect and they downweight this to 21cM. [Most of the matches you list above are well above 90cM.] Ancestry is basically saying that 2/3 of the DNA shared is too common to be genealogically meaningful. Some folks don't like TIMBER but my experience is that it is very useful, and Ancestry numbers are far more accurate than My Heritage. I doubt this is a traceable match. The main question is why Ancestry says the total is 67cM and My Heritage says 124cM. If Mary doesn't transfer to GEDmatch, we're left guessing, but given the problems My Heritage has with imputing data, I wouldn't be surprised if that accounts for the bulk of the difference. One researcher who tested both her parents found that 32% of all matches had no match with either of her parents. https://cruwys.blogspot.com/2018/01/a-chromosome-browser-and-new-matching.html That's a lot of bad data! |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
I decided to try to reframe my earlier post, with the initial responses and new data I've gathered, to place the question in context.
My original post featured two DNA match reports, one from MyHeritage and another from Ancestry, for my mother Nancy and an unknown person Mary. MyHeritage reported total shared DNA of 123.8 cM (1.8%), in 10 segments, the largest being 25.2cM (and 3 other segments larger than 10cM). In turn, Ancestry reported 21cM (66cM unweighted) across 7 segments with the longest 15cM. I asked the list for help understanding such a wide discrepancy in these reports. I receive several responses, here and privately, most sharing a consistent view that Ancestry's algorithms are more conservative while MyHeritage often inflates its reports. I don't want to go into a lot of detail here (you can see my original post and the full exchange at https://groups.jewishgen.org/g/main/message/673482) but here is one representative quote: MH has some imputation problems where they "connect" some small segments that may be close. They also include segments under 7cMs. So they tend to inflate their numbers. Ancestry excludes the segments under 7 which would bring down the match size and then TIMBR takes out pile up areas.
Most responses recommended comparing tests at GEDMatch for a more realistic reading. I should also mention that there were at least a couple of responses warning not to dismiss MyHeritage results, though still acknowledging that their numbers were not as rigorous. These responses were alarming. Anyone who has spent any time trying to use DNA to further their research – especially in Ashkenazi inheritance where endogamy muddies the waters – will recognize that a DNA match of 123.8 with a 25.2 segment is considered significant while a match of 21cM (or even 66cM) and a 15cM longest segment is to be given lower priority. Processing what to do with this information, I wondered whether other matches I'd been exploring also exhibited such variation and were correspondingly questionable. I reviewed a table of my mother's DNA matches collected from Ancestry, MyHeritage, GEDMatch and FTDNA (see below). For the 22 instances where I found matches from two or more testing sites, the average (mean) difference between the highest and lowest reported total was 18cM (when the 102.8 discrepancy reported for Mary was removed from the calculation). Seven of the 21 (again, excluding Mary) posted above average variations: the largest was 71.6cM and the smallest 1.2cM. In all but a few cases the variation was not significant and would not affect a determination about which matches to pursue. At the same time, a review of the data strongly suggests that assertions that MyHeritage inflates its reports and Ancestry is more conservative don't hold up. In fact, Ancestry reported a higher total cM than MyHeritage did in 7 of the 9 instances where a head-to-head comparison was available. And Ancestry posted the highest total cM, compared to any other test site, in half (11 of 22) cases. For my part, this leaves me still looking for an answer to my original question: What can I make of two tests between my mother and Mary reporting such extremely different results? If I can't explain it away with the algorithms used by the two testing services, what is the explanation? And, on the practical side, which test do I trust? I understand that a 3rd party comparison is most likely to answer that question, if Mary is willing to copy her results to GEDMatch. I've asked her but she hasn't responded so far. Even if this individual case is resolved eventually what do we make of such discrepancies? This reminds me of something I found in a fortune cookie: "A person with one watch is certain of the time. A person with two watches isn't sure." Testing with more than one service is supposed to help with our family history research, not create more uncertainty. Lee David Jaffe
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|