Phylogenetic Tools for Comparative Biology: Phylogenetic signal with measurement error

Friday, October 21, 2011

Phylogenetic signal with measurement error

I just posted a new version of the function phylosig() that computes phylogenetic signal using Blomberg's K (Blomberg et al. 2003) and Pagel's λ methods.

The main improvement of this version is that I have added estimation of K with known measurement error, following Ives et al. (2007). This was in response to a user's email request.

This was not too difficult. Basically, without measurement error the values for the tip states on the tree under BM are distributed as a multivariate normal with variance-covariances given by σ²C, where C is an n×n matrix (for n species) containing the height above the root node of the common ancestor of each i,jth species pair on the tree. (For diagonal elements, these are just the heights of each tip node.)

With measurement error, the picture complicates slightly. Now, say the VCV matrix for the measurement error is given by M, then the multivariate distribution of the tip values is given by σ²C+M. The only problem with this is that although we have an analytic solution for σ² conditioned on no measurement error, we do not for known M. Instead, we have to find σ² by other means - say, by maximizing the likelihood. This is what I have done in the latest version of phylosig() (v0.4), available here.

To run this, first download the source and load "phytools":

> require(phytools)
> source("phylosig.R")

Now, just for fun, let's simulate data with measurement error.

> tree<-drop.tip(birthdeath.tree(b=1,d=0,taxa.stop=301),"301")
> x<-fastBM(tree) # simulate data
> se<-rchisq(n=300,df=1) # simulate SEs
> names(se)<-names(x)
> e<-rnorm(n=300,sd=se)
> xe<-x+e

We have data generated without error (in x) and with known error (in xe). First, let's compute phylogenetic signal for x:

> phylosig(tree,x)
[1] 1.013599

This is more or less what we expect. Now let's try xe, but with measurement error ignored:

> phylosig(tree,xe)
[1] 0.2640041

Finally, we can try our new function incorporating measurement error:

> phylosig(tree,xe,se=se)
$K
[1] 1.121445

$sig2
[1] 0.9772118

$logL
[1] -543.0133

Very cool! Note that the logL reported here is the log-likelihood for the value of sig2 conditioned on the known measurement error we've provided.

Of course, this is not magic - so the more measurement error we have, the less power we will have to measure phylogenetic signal. That said, incorporating measurement error has the nice property of making our estimates of signal unbiased (whereas they are biased downwards when signal is ignored).

4 comments:

Liam RevellOctober 21, 2011 at 5:47 PM
I will also add this to method="lambda" when time permits. This should actually be easier, I think, although it now requires a multidimensional ML optimization.
ReplyDelete
Replies
Liam RevellOctober 22, 2011 at 11:53 PM
Note, v0.4 has a bug. Users should download v0.5.
ReplyDelete
Replies
Elisa Fernández FueyoOctober 7, 2020 at 2:49 PM
Hi Liam

Thank you so much for this new version of the phylosig()function.
How can I download it? Do I copy and paste the code that comes up when I click on the 'v0.5' link and run it before running the code for my analysis?

Thanks in advance.

All the best,

Elisa
ReplyDelete
Replies

Note: due to the very large amount of spam, all comments are now automatically submitted for moderation.

Pages

Friday, October 21, 2011

Phylogenetic signal with measurement error

4 comments: