My Basal-rich K7 Results

David Wesolowski of the Eurogenes Blog, has created a new ancient admixture calculator, the Basal-rich K7.

In his blog, he states: "The Basal-rich K7 is the best ancient ancestry test that I've been able to come up with. It correlates strongly with latest research reported in scientific literature. And, in fact, in some instances it probably trumps latest scientific literature.

For instance, Broushaki et al. 2016 characterized Early Neolithic farmers from the Zagros Mountains, Iran, as 62% Basal Eurasian and 38% Ancient North Eurasian-related (Figure S52). This, considering formal statistics like the D-stat below, with AfontovaGora3 (AG3) as the ANE proxy, is unlikely to be correct, despite the fact that AG3 is a relatively low quality sample.".

Villabruna-related

The Villabruna cluster represents the DNA found in 13 individuals in Europe from after 14,000 years ago.  They were Late Ice Age hunter-gatherers.  They appear to have links with the Near East.  The current thought is that they replaced earlier groups of hunter-gatherers in Europe.  The DNA of people in the Middle East and Europe pulled together at this time, and they may represent an expansion from the South-East.  Much of the Aegean Sea would have been dry, with low sea levels (glaciation), so the migration may have been easy.  It is believed that they had dark skin, and blue eyes.  They were possibly, the last hunter-gatherers of Europe and the Middle East.  They may have contributed to our DNA both through or either, later Asian or European admixtures.

David gives the English average as 56.7%.  My result is 57.1%

Basal-rich

The Basal Eurasians are a hypothetical "ghost" population derived from DNA studies.  It is suggested that they splintered from other modern humans 45,000 years ago, presumably outside of Africa, somewhere around the Middle East.  They significantly contributed DNA to the Early Neolithic Farmers of the Fertile Crescent and Anatolia, and consequently, on to all of us modern West Eurasians.  

 David gives the English average as 26.5%.  My result is 28.8%

Ancient North Eurasian

Another Ice Age hunter-gatherer "Ghost" population, but this one has been associated with human remains and an Upper Palaeolithic culture (Mal'ta-Buret') at Lake Baikal, Siberia.  We know that it significantly contributes to modern West Eurasians, through earlier admixture on the Eurasian Steppes.  Copper Age pastoralists then carried it westwards into Europe with their later expansion.

David gives the English average as 16.6%.  My result is 14.0%

Others

David gives the English averages as SE Asian 0.15, Oceanian 0.07, East Eurasian 0.00% and Sub Saharan 0.00

My results are SE Asian 0.00, Oceanian 0.01, and Sub Saharan 0.05

Comparison with other testers

A remarkable similarity has been observed between many of my East Anglian atDNA results and a Norman tester.  On K7, we are almost identical.  Indeed, we are often closer to each other in results, than I am to other British, and he is to other French.

I'm increasingly recognising that although my East Anglian heritage should in theory bring me closer to North German and Scandinavian results, in practice, compared with other Britons, I am pulled more to the south - to France, and even to Southern European.  Hence, I tend to receive lower ANE than many British, Irish, or Scandinavian, and more Early Neolithic Farmer in ancient admixture tests, than would be expected.

Other than Norman admixture, I struggle to explain this with either known paper recorded ancestry (252 direct ancestors from East Anglia and SE England - 100% English), or with known regional history.

DNA.land - raw file comparison

Comparing the ancestry results of two raw files from the same tester (myself) uploaded to DNA.land.

Myself.
Paper trail and family history 100% SE English, mainly East Anglian. 249 direct ancestors named in documentary research.

23andMe result before phasing (spec mode):
100% European broken into
94% Northwestern Europe
3% Southern Europe
3% unassigned European

Broken down further to:
32% British & Irish
27% French & German
7% Scandinavian
29% Broadly NW European
0.5% Iberian
2.4% Broadly South European

23andMe result after phasing with one parent (spec mode):
100% European
96% Northwestern European
1.8% Southern European
2.2% Broadly European

Broken down further to:
37% British & Irish
22% French & German
1% Scandinavian
36% Broadly NW European
1.8% Broadly Southern European

FT-DNA Family Finder My Origins.
100% European

Broken down further to:
36% British Isles
32% Southern Europe
26% Scandinavia
6% Eastern Europe

Now I am comparing the two raw files for the same person, uploaded to, and analysed for ancestry, by DNA.land:

23andMe V4 raw file for myself on DNA.land:

100% West Eurasian.
77% North West European
19% South European (broken into 13% Balkan / 6.1% South/Central European
2.4% Finnish
1.3% Ambiguous

FT-DNA FF raw file for myself on DNA.land:
100% West Eurasian
75% North West European
25% Balkan

Just for more information:

My mother's 23andMe raw file on on DNA.land:
100% West Eurasian
80% North West European
10% South European (broken into 7.7% South/Central Europe / 2.4% Balkan)
6.4% Finnish
2.3% Sardinian
1.5% Ambiguous 

Conclusion

Phasing on 23andme suggested that I inherit (in spec mode) nearly 1% Southern European from each parent. That each of my very East Anglian parents had a Southern European ancestor within the past 300 - 500 years is highly unlikely, considering 1) the paper trail, and 2) local history in this rural area. Therefore I feel that this reflects much older background ancestry for the local SE English population. Ancient DNA calculators also predict that I have higher than average levels of ENF/EEF than other local populations such as the Irish and Scottish, and lower levels of ANE. This appears associated with my Southern European flavour that some tests suggest as a minority percentage. FT-DNA suggested 32% Southern European! Some commentators have suggested that this might indicate significant French admixture to the SE English population, perhaps during the Norman and Medieval periods, carrying a southern signal higher into lowland Britain. Earlier admixture into Lowland Britain from the south, is also possible during late prehistory and the Roman period.

DNA.land has been noted for a bias to predicting both Balkan, and Finnish ancestry for testers, and my results are no exception. I feel that as with all current autosomal DNA test/analysis for ancestry, that DNA.land has a way to go. As with the other predictors, it is very successful at recognising me as 100% European (although ironically my Y-DNA is Western Asian). It is fair at spotting me as NW European, but NOT as successful as 23andMe. Below that level, once again it falls down - but I feel that this is understandable, as most predictors fail down for anciently admixed populations such as the English. They are far more successful at spotting for example, Irish/Scottish. For the English, we tend to be ripped across different European populations. The Southern European element is a particular surprise - but all of the testers so far have been confused by this background signal. Dienekes has himself, suggested Southern European DNA coming into England with the Normans:

http://dienekes.blogspot.co.uk/2016/...-ancestry.html

I'm starting to settle with this hypothesis, although I still have some interest in possible Southern European admixture earlier.

Finally... The two raw files for one person, have produced slightly different results. The FT-DNA raw file has I believe, more tested (but different?) SNPs than the 23andMe file. It would be interesting to know the differences. DNA.land, using the FT-DNA FF file, does not see Finnish, or South/Central European, but enhances the Balkan.

FT-DNA My Ancient Origins

Family Tree DNA (FTDNA) have released a new, unexpected feature to their autosomal DNA Family Finder package.  It is clearly aimed at their customers (both new and existing), of mainly European heritage.  It uses ancient DNA references to plot our ancient ancestry.  It breaks European's ancient Eurasian ancestry down into four groups:

  • Hunter-Gatherer (Western Hunter-Gatherer)
  • Farmer (Early Neolithic Farmer)
  • Metal Age Invader (Yamnaya / Bronze Age Steppe immigration)
  • Non European (Other)

First of all, I welcome this new analysis.  Combined with the latest cutting edge research into the origin of the Eurasians, and with other open source calculators of ancient origin available via GedMatch - I feel that it can help us get personal with our ancient Eurasian roots.

However... unfortunately it has faults, as the online community quickly picked up.  In particular, with the Metal Age Invader component.  FT-DNA suggests that it represents the Yamnaya admixture event - where Copper or Early Bronze Age pasturalists, mounted on their horses, expanded from the Pontic and Caspian Steppes of Eurasia, into Europe around 5,000 years ago.  But 1) it doesn't include any ANE (Ancient North Eurasian) component from the Mal'ta-Buret reference, and 2) it of course cannot distinguish it's Western Hunter-Gatherer reference from that inherited directly within Europe or elsewhere.

All that the FT-DNA Metal Age Invader reference appears to represent, is the population known as Caucasus Hunter-Gatherer.  A minority component of Yamnaya DNA as we currently see it.

For the record, as the screendump above shows, my FT-DNA Ancient Origins are:

9% Metal Age Invader

47% Farmer

44% Hunter-Gatherer

0% Non European

Now that I've got that covered, I can move onto my next blog post, which I find more interesting - how I use My Ancient Origins to try to reconstruct my ancestry from 11,000 to 4,000 years ago.


Gedrosia and our DNA

Attribution: Fielding Lucas, Jr. [Public domain], via Wikimedia Commons

This post is partly an excuse to upload and store the above creative commons image.  My Y-DNA terminal SNP (L-SK1414) twin was a Balochi speaker in Makran, SW Pakistan.  In classical times, Makran was located in the Kingdom of Gedrosia.  It's almost ironic that an open source range of autosomal DNA testers on GEDmatch have been named after Gedrosia.

Gedrosia was a dry, mountainous country along the northwestern shores of the Indian Ocean.  The indigenous name for Gedrosia is thought to have been Gwadar.  It was conquered by the Persian king Cyrus the Great (559-530 BC). The capital of Gedrosia was Pura, which may survive today as modern Bampûr.  In 326 BC, The Macedonian king, Alexander the Great disastrously crossed the Gedrosian Desert, on the return from his campaign in India, and lost 12,000 of his men to the savage conditions.

Image of Gwadar Bay by wetlandsofpakistan (Gwadar - West Bay) [CC BY-SA 2.0 (http://creativecommons.org/licenses/by-sa/2.0)], via Wikimedia Commons.

So, although the GedrosiaDNA GEDmatch heritage calculators may have little to do with the legendary land that may have been host to my Y ancestors, out of interest, how do our atDNA test results tally with the GedrosiaDNA calculators?  These calculators are designed to measure Ancient Eurasian Admixture.

My results (using an FT-DNA raw file) against those of my mother (23andMe raw file).

My Eurasia K9 ASI Oracle:

  • 39% Western Hunter-Gatherer
  • 27% Early Neolithic Farmer
  • 15% Eastern Hunter-Gatherer
  • 12% Caucasus Hunter-Gatherer
  • 7% SW Asian
  • 1% Siberian East Asian

Mother's Eurasia K9 ASI Oracle:

  • 40% Western Hunter-Gatherer
  • 26% Early Neolithic Farmer
  • 14% Eastern Hunter-Gatherer
  • 12% Caucasus Hunter-Gatherer
  • 6% SW Asian
  • 1% Siberian East Asian

My Gedrosia K3 Oracle:

  • 97.5% West Eurasian
  • 2.5% East Eurasian

Mother's Gedrosia K3 Oracle:

  • 96% West Eurasian
  • 4% East Eurasian

My Gedrosia K15 Oracle:

  • 40% Western Hunter-Gatherer
  • 25% Early European Farmer
  • 21% Caucasus
  • 5% Burusho
  • 5% SW Asian
  • 3% Balochi
  • 1% Siberian

Mother's Gedrosia K15 Oracle:

  • 40% Western Hunter-Gatherer
  • 24% Early European Farmer
  • 18% Caucasus
  • 4% Burusho
  • 3% Kalash
  • 2% Siberian
  • 1% Balochi

My Ancient Eurasia K6 Oracle:

  • 40% West European Hunter-Gatherer
  • 39% Natufian
  • 21% Ancient North Eurasian
  • 1% East Asian

Mother's Ancient Eurasia K6 Oracle:

  • 41% West European Hunter-Gatherer
  • 38% Natufian
  • 19% Ancient North Eurasian
  • 2% Ancient South Eurasian
  • 1% East Asian

Conclusions

We both appear to have inherited around 40% of our DNA from ancient Western (European) Hunter-Gatherer populations, nothing unexpected there.  Western Hunter-Gatherers not only lived in Europe, but appear to have contributed to some later Eurasian populations such as the Yamnaya and Early European Farmers.

We both have low counts of Ancient North Eurasian - particularly my mother, who scores only 19% ANE (Upper-Paleolithic genomes from the Lake Baikal region of Siberia, identified as Malta, Afontogora 2, and Afontogora 3, dated to 17 to 24 kya).  It has been noted during online discussions, that the English appear to have slightly lower percentages of ANE than do their close neighbours.  ANE is sometimes used to indicate Yamnaya ancestry (ANE was a component), that spread from the Eurasian Steppes into Western Europe during the Early Bronze Age.

I have 3% Balochi compared to 1% Balochi for my mother.  It may mean nothing, but it could just perhaps indicate something in the autosomes that associates on my paternal side with my Y-DNA story.  The indicators (particularly K15) suggest that I have more SW Asian ancestry, presumably from my paternal side - again, it just could associate with what we know about my Y haplogroup L-SK1414.

We have around 24-25% Early European Farmer ancestry, representing early Neolithic descendants of an admix of WHG and "Basal Eurasians".  This signal apparently peaks around 80% in modern Sardinians.  However, the "Natufian" reference are higher - 38-39%.  According to GEDmatch: "Natufian was an Epipaleolithic culture that existed from 12,500 to 9,500 BC in the area of Israel. They were derived about 50% from an original Out-of-Africa population, referred to as Basal Eurasians. If you are a European and show Natufian admixture, this does not imply that Natufians interacted with your ancestors. All it means is that Natufian like admixture was mediated to you via intermediaries, such as the early European Farmers from the Near East".  I'm not sure what to make of that.



Counting the SNPs - 23andMe V FT-DNA

Comparing 23andMe V4 kit raw file to FT-DNA raw file.

Both tests were taken by myself this year (2016).  I am here comparing the quality of two separate atDNA tests from the same person, by two different DNA for Ancestry companies.  As will be seen, the quality varies considerably, at least in terms of the number of SNPs that are tokenized once forwarded to GEDmatch.com.  This is NOT a test of how well both companies ascertain our DNA ancestry from these files.  Both use their own reference populations and analysis programs.  I've reviewed that elsewhere.  This test simply weighs how many SNPs are registered from the autosomes and X chromosome of one person.

Using the GEDmatch DNA file diagnostic utility, I received the following SNP counts:

Kit M551698 (23andMe V4)

Token File data:
Chr Token SNP Count
1 40974
2 42110
3 34199
4 31020
5 30421
6 36383
7 26352
8 27900
9 23644
10 27888
11 25363
12 25395
13 19880
14 15957
15 15529
16 16551
17 13745
18 16775
19 9006
20 13530
21 7324
22 7386
X 15359

Processed in batch 5355
Number of SNPs utilized by GEDmatch template = 523997
Number of regular SNPs = 517780
Heterozygosity index = 0.302721 (fraction of total SNPs that are heterozygous)
No-calls = 4911 = 0.93956084952678 percent.
Kit M551698 has approximately 19959 total matches with other kits. Of these matches there are 4982 >= 7cM and 14977 < 7cM.


Kit T444495 (FT-DNA file):

Chr Token SNP Count
1 57931
2 59602
3 47094
4 41772
5 39314
6 47546
7 36567
8 36753
9 30643
10 36889
11 35941
12 35850
13 26763
14 22650
15 20899
16 21935
17 18379
18 22586
19 12773
20 19587
21 10001
22 9750
X 19176

Processed in batch 5914
Number of SNPs utilized by GEDmatch template = 709242
Number of regular SNPs = 694324
Heterozygosity index = 0.281384 (fraction of total SNPs that are heterozygous)

No-calls = 16077 = 2.263088030563 percent.

Kit T444495 has approximately 48755 total matches with other kits. Of these matches there are 9351 >= 7cM and 39404 < 7cM.

Conclusion

If the quality of a raw atDNA file is merely down to the number of SNPs that are tested, then FT-DNA clearly wins hands down, when compared with the 23andMe file, following tokenization for GEDmatch use.  The FT-DNA file utilises 709,209 SNPs compared with 23andMe's 523,997 SNPs

I thought that it might be interesting to compare how these files, of the same person, might compare on the same GEDmatch heritage admixture program.

On Eurogenes K13 Oracle, my 23andMe kit gets as top ten closest GD's:

1 South_Dutch 3.89
2 Southeast_English 4.35
3 West_German 5.22
4 Southwest_English 6.24
5 Orcadian 6.97
6 French 7.63
7 North_Dutch 7.76
8 Danish 7.95
9 North_German 8.17
10 Irish 8.22

On the same, using my FT-DNA kit (with many more SNPs tested as demonstrated above:

1 Southeast_English 3.75
2 South_Dutch 4.03
3 West_German 5.42
4 Southwest_English 5.68
5 Orcadian 6.33
6 North_Dutch 7.15
7 Danish 7.36
8 Irish 7.59
9 West_Scottish 7.62
10 North_German 7.7

Based on the numbers of SNPs tokenized, I will in future regard the FT-DNA (Family Tree DNA) file as superior in quality, over the 23andMe file, despite my disappointment in the FT-DNA My Origins ancestry analysis.

Y Haplotype L1b2c

By Hellerick (Own work) [CC BY-SA 4.0], via Wikimedia Commons.  Modified by Paul Brooker.

I've created this distribution map of known Y haplogroup L, L1b2c or L-SK1414. This is my Y-DNA haplotype.  Not a lot of dots there are there?  This is how rare that this clade is.  L1a and L1b most likely (in my opinion) originated during the last Ice Age circa 18,000 years ago, south of the Caucasus, and west of the Caspian Sea in Western Asia.  In other words, in the area of present day Armenia, Azerbaijan, and North-west Iran.  Again, I emphasise, that is just my opinion, looking at present-time evidence.

Y haplogroup L itself may have diverged between L1 and L2, not so much earlier, or so far away from this region.  Again, just my present opinion.

My sub clade of L1b, is so rare, that it is impossible to say.  As can be seen from the map.  However, this is my blog, so I'm going to push out on this one.  My very best guess would be further to the East than it's parent.  I suspect South East of the Caspian Sea, in what is now Eastern Iran.  I could well be wrong.  We have so few tests from nearby Afghanistan for example.  So far, the SNP SK1414 has only been reported twice.  1) in Makran, SW Pakistan, in a Balochi speaking man.  Balochi is an Iranian language, closely related to North-West Iranian languages.  Researchers suggest that the Balochi people of Makran, largely migrated from south west of the Caspian.

The only other guy in the world so far confirmed is little old me, an Englishman.  I trace my surname (direct paternal) line back to the Thames Valley of Oxfordshire / Berkshire 270 years ago.  If my biological line follows that.  A number of STR testers of English descent appear connected to me by STR analysis.  They all descend from Thomas Chandler, who lived around the same time as my earliest recorded ancestor - only 32 miles away at Basingstoke.

From all of the evidence, I conclude that my Y ancestral line moved, probably in one generation, from Western Asia, perhaps from he edge of Persia, to Southern England conservatively between 2,000 and 400 years ago.  Although I would speculate between 1,600 and 600 years ago - during the Medieval or close by.

Bunwell, Norfolk - ancestral parish

I took a little local bicycle ride today, along the extremes of my recorded maternal line - that which should carry my mt-DNA.  23andMe tested it as H6a1.  WeGene and mthap analyser both suggest H6a1a8.  I'm looking forward to see what Living DNA make of it.  The haplogroup, based on current evidence, most likely originated on the Pontic and Caspian Steppes, before spreading into Western Europe during the Early Bronze Age.  However, on documentary record, I've traced it back to Generation 9 - my maternal G.G.G.G.G.G Grandmother, Susannah Briting (Brighten, Brighton) who married my ancestor John Hardyman (Hardiment, Hardimend, Hardiman), in the Norfolk parish of Bunwell in 1747.  According to the Bunwell Parish Registers, between 1748 and 1754, John and Susannah had four children baptised at the local church of Bunwell St Michael & Angels: John, Martha, Elizabeth, and Thomas Hardyman.  My G.G.G.G.G Grandmother Elizabeth Hardyman, went on to marry my ancestor Robert Page at nearby Wymondham in 1779, and continued my maternal line down to my Mother.

Bunwell, Norfolk, East Anglia

Bunwell is a parish of scattered settlement and hamlets, located above the Tas Valley, on the high boulder-clay soils of South Norfolk.  These heavy soils encouraged a pattern of dispersed settlement during the Late Medieval, with occupation often taking place along the edges of common land.  This could suggest limited manorial control.

I took this photo of the local landscape.  Large medieval open fields were divided into smaller enclosed fields during the 17th to 19th centuries.  These small parceled and enclosed fields were then opened up again into larger fields, with the removal of many hedgerows, during the 20th century.  Main land uses today are arable agriculture - modern crops include sugar beet, wheat, oil seed rape, etc.

Counting the number of plant species within a designated length of hedgerow has been used as a dating process.  The number of species increasing across the centuries.

Vernacular tradition includes many classic South Norfolk farmhouses, of which the following example is a striking example:

The owner has been renovating it from many years, having inherited it from his father.  Aside from the chimney (which was replaced following a lightening strike), the newest sections of the house date to the 1740's.  The eldest have not been dated, but perhaps extend to the Late Medieval.

The Church building is of the Perpendicular tradition, and dates to circa the 1450's, although it was most likely built on the site of an earlier church.  It is dedicated to St Michael and All Angels.

It is very much still an active church.  A knitting club were busy at work on my visit.  The church warden also called in.  The locals told me that there were, or still are Hardiment and Britings living in the parish.  I had a look around the surrounding graves.

This headstone is a good example of 18th century headstones in East Anglia.  Sadly, not one of my known ancestors.  Note the extra information Late of Starston.

I didn't spot any Britings, but I did find a cluster of Hardyman graves, including this example:

Not one of my direct ancestors, but most likely, a cousin.

My mtDNA ancestor Elizabeth, moved away from Bunwell, marrying nearby at Wymondham. From there, my mtDNA line moved through other nearby villages, including Bestthorpe.  Another generation on, it made an unusual leap (my Norfolk ancestors rarely moved far) to the opposite side of Norwich, to the parish of Rackheath in Broadland.  It then moved further East, to the parishes of Tunstall and Reedham.  On to Hassingham and my mother carried it back west to the Norwich area. We've ironically both  carried it back to the Wymondham area.