-
Notifications
You must be signed in to change notification settings - Fork 5
Description
I am very new CoreHunter or germplasm data analyses. For a new project, I want to use historical climate data in lieu of genotypic / allelic data with CoreHunter
There are 26 climactic / bioclimactic data variables, plus elevation (which usually does not vary significantly over time).
As two examples - maximum temperature, precipitation etc.
For each of these 26 variables, the data will be tabulated for ~ 500 plant accessions and for each month of the year.
So for each variable, there will be ~ 500 rows and 12 columns (1 for each month of the year)
At http://www.corehunter.org/measures it says
"Alternatively, a precomputed distance matrix can be provided by the user."
How can I convert my 500 * 12 table into a distance matrix for use by CoreHunter?
But this will be for just 1 of 26 different climate data variables.
Therefore, would it be possible to concatenate ALL these tables, one for each of these 26 variables, and then create 1 distance matrix for combined use with CoreHunter3?
Climate for successive months are related to the preceding months (isn't this known as auto-correlation?)
and also in some pairwise cases, 2 climactic variables could be strongly (anti)correlated to one another, right? Would any of these statistical behaviors in my dataset require any special methods to create / use the distance matrix(ces)?
My final goal of course, using your CoreHunter3, is to leverage the climactic and bioclimactic data to subset the 500 accessions to 100 accessions (core set) and further condense that to a set of 25 (mini core set) while maximizing diversity across these climactic variables.
Thank you in advance.