Package 'secsse' reference manual

Title:	Several Examined and Concealed States-Dependent Speciation and Extinction
Description:	Simultaneously infers state-dependent diversification across two or more states of a single or multiple traits while accounting for the role of a possible concealed trait. See Herrera-Alsina et al. (2019) <doi:10.1093/sysbio/syy057>.
Authors:	Leonel Herrera Alsina [aut] , Paul van Els [aut] , Thijs Janzen [ctb] , Hanno Hildenbrandt [ctb] , Pedro Santos Neves [ctb] , Rampal S. Etienne [cre, aut]
Maintainer:	Rampal S. Etienne <[email protected]>
License:	GPL (>= 3) \| file LICENSE
Version:	3.1.0
Built:	2025-02-24 05:05:26 UTC
Source:	https://github.com/rsetienne/secsse

Parameter structure setting for cla_secsse It sets the parameters (speciation, extinction and transition) IDs. Needed for ML calculation with cladogenetic options (cla_secsse_ml)

Description

Parameter structure setting for cla_secsse It sets the parameters (speciation, extinction and transition) IDs. Needed for ML calculation with cladogenetic options (cla_secsse_ml)

Usage

cla_id_paramPos(traits, num_concealed_states)
cla_id_paramPos(traits, num_concealed_states)

Arguments

`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.

Value

A list that includes the ids of the parameters for ML analysis.

Examples

traits <- sample(c(0,1,2), 45,replace = TRUE) #get some traits
num_concealed_states <- 3
param_posit <- cla_id_paramPos(traits, num_concealed_states)
traits <- sample(c(0,1,2), 45,replace = TRUE) #get some traits
num_concealed_states <- 3
param_posit <- cla_id_paramPos(traits, num_concealed_states)

Likelihood for SecSSE model, using Rcpp Loglikelihood calculation for the cla_SecSSE model given a set of parameters and data using Rcpp

Description

Likelihood for SecSSE model, using Rcpp Loglikelihood calculation for the cla_SecSSE model given a set of parameters and data using Rcpp

Usage

cla_secsse_loglik(
  parameter,
  phy,
  traits,
  num_concealed_states,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  setting_calculation = NULL,
  see_ancestral_states = FALSE,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  num_threads = 1,
  method = "odeint::bulirsch_stoer",
  atol = 1e-08,
  rtol = 1e-07
)
cla_secsse_loglik(
  parameter,
  phy,
  traits,
  num_concealed_states,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  setting_calculation = NULL,
  see_ancestral_states = FALSE,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  num_threads = 1,
  method = "odeint::bulirsch_stoer",
  atol = 1e-08,
  rtol = 1e-07
)

Arguments

`parameter`	list where first vector represents lambdas, the second mus and the third transition rates.
`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.
`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`cond`	condition on the existence of a node root: `"maddison_cond"`, `"proper_cond"` (default). For details, see vignette.
`root_state_weight`	the method to weigh the states: `"maddison_weights"`, `"proper_weights"` (default) or `"equal_weights"`. It can also be specified for the root state: the vector `c(1, 0, 0)` indicates state 1 was the root state.
`sampling_fraction`	vector that states the sampling proportion per trait state. It must have as many elements as there are trait states.
`setting_calculation`	argument used internally to speed up calculation. It should be left blank (default : `setting_calculation = NULL`).
`see_ancestral_states`	Boolean for whether the ancestral states should be shown? Defaults to `FALSE`.
`loglik_penalty`	the size of the penalty for all parameters; default is 0 (no penalty).
`is_complete_tree`	logical specifying whether or not a tree with all its extinct species is provided. If set to `TRUE`, it also assumes that all all extinct lineages are present on the tree. Defaults to `FALSE`.
`num_threads`	number of threads to be used. Default is one thread.
`method`	integration method used, available are: `"odeint::runge_kutta_cash_karp54"`, `"odeint::runge_kutta_fehlberg78"`, `"odeint::runge_kutta_dopri5"`, `"odeint::bulirsch_stoer"` and `"odeint::runge_kutta4"`. Default method is: `"odeint::bulirsch_stoer"`.
`atol`	A numeric specifying the absolute tolerance of integration.
`rtol`	A numeric specifying the relative tolerance of integration.

Value

The loglikelihood of the data given the parameters

Examples

rm(list=ls(all=TRUE))
library(secsse)
set.seed(13)
phylotree <- ape::rcoal(12, tip.label = 1:12)
traits <- sample(c(0,1,2),ape::Ntip(phylotree),replace=TRUE)
num_concealed_states <- 3
sampling_fraction <- c(1,1,1)
phy <- phylotree
# the idparlist for a ETD model (dual state inheritance model of evolution)
# would be set like this:
idparlist <- cla_id_paramPos(traits,num_concealed_states)
lambd_and_modeSpe <- idparlist$lambdas
lambd_and_modeSpe[1,] <- c(1,1,1,2,2,2,3,3,3)
idparlist[[1]] <- lambd_and_modeSpe
idparlist[[2]][] <- 0
masterBlock <- matrix(4,ncol=3,nrow=3,byrow=TRUE)
diag(masterBlock) <- NA
idparlist [[3]] <- q_doubletrans(traits,masterBlock,diff.conceal = FALSE)
# Now, internally, clasecsse sorts the lambda matrices, so they look like:
prepare_full_lambdas(traits,num_concealed_states,idparlist[[1]])
# which is a list with 9 matrices, corresponding to the 9 states
# (0A,1A,2A,0B,etc)
# if we want to calculate a single likelihood:
parameter <- idparlist
lambda_and_modeSpe <- parameter$lambdas
lambda_and_modeSpe[1,] <- c(0.2,0.2,0.2,0.4,0.4,0.4,0.01,0.01,0.01)
parameter[[1]] <- prepare_full_lambdas(traits,num_concealed_states,
lambda_and_modeSpe)
parameter[[2]] <- rep(0,9)
masterBlock <- matrix(0.07, ncol=3, nrow=3, byrow=TRUE)
diag(masterBlock) <- NA
parameter [[3]] <- q_doubletrans(traits,masterBlock,diff.conceal = FALSE)
cla_secsse_loglik(parameter, phy, traits, num_concealed_states,
                 cond = 'maddison_cond',
                 root_state_weight = 'maddison_weights', sampling_fraction,
                 setting_calculation = NULL,
                 see_ancestral_states = FALSE,
                 loglik_penalty = 0)
# LL = -42.18407
rm(list=ls(all=TRUE))
library(secsse)
set.seed(13)
phylotree <- ape::rcoal(12, tip.label = 1:12)
traits <- sample(c(0,1,2),ape::Ntip(phylotree),replace=TRUE)
num_concealed_states <- 3
sampling_fraction <- c(1,1,1)
phy <- phylotree
# the idparlist for a ETD model (dual state inheritance model of evolution)
# would be set like this:
idparlist <- cla_id_paramPos(traits,num_concealed_states)
lambd_and_modeSpe <- idparlist$lambdas
lambd_and_modeSpe[1,] <- c(1,1,1,2,2,2,3,3,3)
idparlist[[1]] <- lambd_and_modeSpe
idparlist[[2]][] <- 0
masterBlock <- matrix(4,ncol=3,nrow=3,byrow=TRUE)
diag(masterBlock) <- NA
idparlist [[3]] <- q_doubletrans(traits,masterBlock,diff.conceal = FALSE)
# Now, internally, clasecsse sorts the lambda matrices, so they look like:
prepare_full_lambdas(traits,num_concealed_states,idparlist[[1]])
# which is a list with 9 matrices, corresponding to the 9 states
# (0A,1A,2A,0B,etc)
# if we want to calculate a single likelihood:
parameter <- idparlist
lambda_and_modeSpe <- parameter$lambdas
lambda_and_modeSpe[1,] <- c(0.2,0.2,0.2,0.4,0.4,0.4,0.01,0.01,0.01)
parameter[[1]] <- prepare_full_lambdas(traits,num_concealed_states,
lambda_and_modeSpe)
parameter[[2]] <- rep(0,9)
masterBlock <- matrix(0.07, ncol=3, nrow=3, byrow=TRUE)
diag(masterBlock) <- NA
parameter [[3]] <- q_doubletrans(traits,masterBlock,diff.conceal = FALSE)
cla_secsse_loglik(parameter, phy, traits, num_concealed_states,
                 cond = 'maddison_cond',
                 root_state_weight = 'maddison_weights', sampling_fraction,
                 setting_calculation = NULL,
                 see_ancestral_states = FALSE,
                 loglik_penalty = 0)
# LL = -42.18407

Maximum likehood estimation for (SecSSE)

Description

Maximum likehood estimation under Several examined and concealed States-dependent Speciation and Extinction (SecSSE) with cladogenetic option

Usage

cla_secsse_ml(
  phy,
  traits,
  num_concealed_states,
  idparslist,
  idparsopt,
  initparsopt,
  idparsfix,
  parsfix,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  tol = c(1e-04, 1e-05, 1e-07),
  maxiter = 1000 * round((1.25)^length(idparsopt)),
  optimmethod = "subplex",
  num_cycles = 1,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  verbose = (optimmethod == "simplex"),
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-07,
  method = "odeint::bulirsch_stoer"
)
cla_secsse_ml(
  phy,
  traits,
  num_concealed_states,
  idparslist,
  idparsopt,
  initparsopt,
  idparsfix,
  parsfix,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  tol = c(1e-04, 1e-05, 1e-07),
  maxiter = 1000 * round((1.25)^length(idparsopt)),
  optimmethod = "subplex",
  num_cycles = 1,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  verbose = (optimmethod == "simplex"),
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-07,
  method = "odeint::bulirsch_stoer"
)

Arguments

`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.
`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`idparslist`	overview of parameters and their values.
`idparsopt`	a numeric vector with the ID of parameters to be estimated.
`initparsopt`	a numeric vector with the initial guess of the parameters to be estimated.
`idparsfix`	a numeric vector with the ID of the fixed parameters.
`parsfix`	a numeric vector with the value of the fixed parameters.
`cond`	condition on the existence of a node root: `"maddison_cond"`, `"proper_cond"` (default). For details, see vignette.
`root_state_weight`	the method to weigh the states: `"maddison_weights"`, `"proper_weights"` (default) or `"equal_weights"`. It can also be specified for the root state: the vector `c(1, 0, 0)` indicates state 1 was the root state.
`sampling_fraction`	vector that states the sampling proportion per trait state. It must have as many elements as there are trait states.
`tol`	A numeric vector with the maximum tolerance of the optimization algorithm. Default is `c(1e-04, 1e-05, 1e-05)`.
`maxiter`	max number of iterations. Default is `1000 * round((1.25) ^ length(idparsopt))`.
`optimmethod`	A string with method used for optimization. Default is `"subplex"`. Alternative is `"simplex"` and it shouldn't be used in normal conditions (only for debugging). Both are called from `DDD::optimizer()`, simplex is implemented natively in DDD, while subplex is ultimately called from `subplex::subplex()`.
`num_cycles`	Number of cycles of the optimization. When set to `Inf`, the optimization will be repeated until the result is, within the tolerance, equal to the starting values, with a maximum of 10 cycles.
`loglik_penalty`	the size of the penalty for all parameters; default is 0 (no penalty).
`is_complete_tree`	logical specifying whether or not a tree with all its extinct species is provided. If set to `TRUE`, it also assumes that all all extinct lineages are present on the tree. Defaults to `FALSE`.
`verbose`	sets verbose output; default is `TRUE` when `optimmethod` is `"simplex"`. If `optimmethod` is set to `"simplex"`, then even if set to `FALSE`, optimizer output will be shown.
`num_threads`	number of threads to be used. Default is one thread.
`atol`	A numeric specifying the absolute tolerance of integration.
`rtol`	A numeric specifying the relative tolerance of integration.
`method`	integration method used, available are: `"odeint::runge_kutta_cash_karp54"`, `"odeint::runge_kutta_fehlberg78"`, `"odeint::runge_kutta_dopri5"`, `"odeint::bulirsch_stoer"` and `"odeint::runge_kutta4"`. Default method is: `"odeint::bulirsch_stoer"`.

Value

Parameter estimated and maximum likelihood

Examples

# Example of how to set the arguments for a ML search.
library(secsse)
library(DDD)
set.seed(13)
# Check the vignette for a better working exercise.
# lambdas for 0A and 1A and 2A are the same but need to be estimated
# (CTD model, see Syst Biol paper)
# mus are fixed to zero,
# the transition rates are constrained to be equal and fixed 0.01
phylotree <- ape::rcoal(31, tip.label = 1:31)
#get some traits
traits <-  sample(c(0,1,2), ape::Ntip(phylotree), replace = TRUE)
num_concealed_states <- 3
idparslist <- cla_id_paramPos(traits,num_concealed_states)
idparslist$lambdas[1,] <- c(1,1,1,2,2,2,3,3,3)
idparslist[[2]][] <- 4
masterBlock <- matrix(5,ncol = 3,nrow = 3,byrow = TRUE)
diag(masterBlock) <- NA
diff.conceal <- FALSE
idparslist[[3]] <- q_doubletrans(traits,masterBlock,diff.conceal)
startingpoint <- bd_ML(brts = ape::branching.times(phylotree))
intGuessLamba <- startingpoint$lambda0
intGuessMu <- startingpoint$mu0
idparsopt <- c(1,2,3)
initparsopt <- c(rep(intGuessLamba,3))
idparsfix <- c(0,4,5)
parsfix <- c(0,0,0.01)
tol <- c(1e-04, 1e-05, 1e-07)
maxiter <- 1000 * round((1.25) ^ length(idparsopt))
optimmethod <- 'subplex'
cond <- 'proper_cond'
root_state_weight <- 'proper_weights'
sampling_fraction <- c(1,1,1)
model <- cla_secsse_ml(
 phylotree,
 traits,
 num_concealed_states,
 idparslist,
 idparsopt,
 initparsopt,
 idparsfix,
 parsfix,
 cond,
 root_state_weight,
 sampling_fraction,
 tol,
 maxiter,
 optimmethod,
 num_cycles = 1,
 num_threads = 1,
 verbose = FALSE)
# [1] -90.97626
# Example of how to set the arguments for a ML search.
library(secsse)
library(DDD)
set.seed(13)
# Check the vignette for a better working exercise.
# lambdas for 0A and 1A and 2A are the same but need to be estimated
# (CTD model, see Syst Biol paper)
# mus are fixed to zero,
# the transition rates are constrained to be equal and fixed 0.01
phylotree <- ape::rcoal(31, tip.label = 1:31)
#get some traits
traits <-  sample(c(0,1,2), ape::Ntip(phylotree), replace = TRUE)
num_concealed_states <- 3
idparslist <- cla_id_paramPos(traits,num_concealed_states)
idparslist$lambdas[1,] <- c(1,1,1,2,2,2,3,3,3)
idparslist[[2]][] <- 4
masterBlock <- matrix(5,ncol = 3,nrow = 3,byrow = TRUE)
diag(masterBlock) <- NA
diff.conceal <- FALSE
idparslist[[3]] <- q_doubletrans(traits,masterBlock,diff.conceal)
startingpoint <- bd_ML(brts = ape::branching.times(phylotree))
intGuessLamba <- startingpoint$lambda0
intGuessMu <- startingpoint$mu0
idparsopt <- c(1,2,3)
initparsopt <- c(rep(intGuessLamba,3))
idparsfix <- c(0,4,5)
parsfix <- c(0,0,0.01)
tol <- c(1e-04, 1e-05, 1e-07)
maxiter <- 1000 * round((1.25) ^ length(idparsopt))
optimmethod <- 'subplex'
cond <- 'proper_cond'
root_state_weight <- 'proper_weights'
sampling_fraction <- c(1,1,1)
model <- cla_secsse_ml(
 phylotree,
 traits,
 num_concealed_states,
 idparslist,
 idparsopt,
 initparsopt,
 idparsfix,
 parsfix,
 cond,
 root_state_weight,
 sampling_fraction,
 tol,
 maxiter,
 optimmethod,
 num_cycles = 1,
 num_threads = 1,
 verbose = FALSE)
# [1] -90.97626

Maximum likehood estimation for (SecSSE) with parameter as complex functions. Cladogenetic version

Description

Maximum likehood estimation under cla Several examined and concealed States-dependent Speciation and Extinction (SecSSE) where some paramaters are functions of other parameters and/or factors. Offers the option of cladogenesis

Usage

cla_secsse_ml_func_def_pars(
  phy,
  traits,
  num_concealed_states,
  idparslist,
  idparsopt,
  initparsopt,
  idfactorsopt,
  initfactors,
  idparsfix,
  parsfix,
  idparsfuncdefpar,
  functions_defining_params,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  tol = c(1e-04, 1e-05, 1e-07),
  maxiter = 1000 * round((1.25)^length(idparsopt)),
  optimmethod = "subplex",
  num_cycles = 1,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  verbose = (optimmethod == "simplex"),
  num_threads = 1,
  atol = 1e-12,
  rtol = 1e-12,
  method = "odeint::bulirsch_stoer"
)
cla_secsse_ml_func_def_pars(
  phy,
  traits,
  num_concealed_states,
  idparslist,
  idparsopt,
  initparsopt,
  idfactorsopt,
  initfactors,
  idparsfix,
  parsfix,
  idparsfuncdefpar,
  functions_defining_params,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  tol = c(1e-04, 1e-05, 1e-07),
  maxiter = 1000 * round((1.25)^length(idparsopt)),
  optimmethod = "subplex",
  num_cycles = 1,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  verbose = (optimmethod == "simplex"),
  num_threads = 1,
  atol = 1e-12,
  rtol = 1e-12,
  method = "odeint::bulirsch_stoer"
)

Arguments

`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.
`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`idparslist`	overview of parameters and their values.
`idparsopt`	a numeric vector with the ID of parameters to be estimated.
`initparsopt`	a numeric vector with the initial guess of the parameters to be estimated.
`idfactorsopt`	id of the factors that will be optimized. There are not fixed factors, so use a constant within `functions_defining_params`.
`initfactors`	the initial guess for a factor (it should be set to `NULL` when no factors).
`idparsfix`	a numeric vector with the ID of the fixed parameters.
`parsfix`	a numeric vector with the value of the fixed parameters.
`idparsfuncdefpar`	id of the parameters which will be a function of optimized and/or fixed parameters. The order of id should match `functions_defining_params`.
`functions_defining_params`	a list of functions. Each element will be a function which defines a parameter e.g. `id_3 <- (id_1 + id_2) / 2`. See example.
`cond`	condition on the existence of a node root: `"maddison_cond"`, `"proper_cond"` (default). For details, see vignette.
`root_state_weight`	the method to weigh the states: `"maddison_weights"`, `"proper_weights"` (default) or `"equal_weights"`. It can also be specified for the root state: the vector `c(1, 0, 0)` indicates state 1 was the root state.
`sampling_fraction`	vector that states the sampling proportion per trait state. It must have as many elements as there are trait states.
`tol`	A numeric vector with the maximum tolerance of the optimization algorithm. Default is `c(1e-04, 1e-05, 1e-05)`.
`maxiter`	max number of iterations. Default is `1000 * round((1.25) ^ length(idparsopt))`.
`optimmethod`	A string with method used for optimization. Default is `"subplex"`. Alternative is `"simplex"` and it shouldn't be used in normal conditions (only for debugging). Both are called from `DDD::optimizer()`, simplex is implemented natively in DDD, while subplex is ultimately called from `subplex::subplex()`.
`num_cycles`	Number of cycles of the optimization. When set to `Inf`, the optimization will be repeated until the result is, within the tolerance, equal to the starting values, with a maximum of 10 cycles.
`loglik_penalty`	the size of the penalty for all parameters; default is 0 (no penalty).
`is_complete_tree`	logical specifying whether or not a tree with all its extinct species is provided. If set to `TRUE`, it also assumes that all all extinct lineages are present on the tree. Defaults to `FALSE`.
`verbose`	sets verbose output; default is `TRUE` when `optimmethod` is `"simplex"`. If `optimmethod` is set to `"simplex"`, then even if set to `FALSE`, optimizer output will be shown.
`num_threads`	number of threads to be used. Default is one thread.
`atol`	A numeric specifying the absolute tolerance of integration.
`rtol`	A numeric specifying the relative tolerance of integration.
`method`	integration method used, available are: `"odeint::runge_kutta_cash_karp54"`, `"odeint::runge_kutta_fehlberg78"`, `"odeint::runge_kutta_dopri5"`, `"odeint::bulirsch_stoer"` and `"odeint::runge_kutta4"`. Default method is: `"odeint::bulirsch_stoer"`.

Value

Parameter estimated and maximum likelihood

Examples

# Example of how to set the arguments for a ML search.
rm(list=ls(all=TRUE))
library(secsse)
library(DDD)
set.seed(16)
phylotree <- ape::rbdtree(0.07,0.001,Tmax=50)
startingpoint <- bd_ML(brts = ape::branching.times(phylotree))
intGuessLamba <- startingpoint$lambda0
intGuessMu <- startingpoint$mu0
traits <-  sample(c(0,1,2),
                 ape::Ntip(phylotree), replace = TRUE) # get some traits
num_concealed_states <- 3
idparslist <- cla_id_paramPos(traits, num_concealed_states)
idparslist$lambdas[1,] <- c(1,2,3,1,2,3,1,2,3)
idparslist[[2]][] <- 4
masterBlock <- matrix(c(5,6,5,6,5,6,5,6,5),ncol = 3, nrow=3, byrow = TRUE)
diag(masterBlock) <- NA
diff.conceal <- FALSE
idparslist[[3]] <- q_doubletrans(traits,masterBlock,diff.conceal)
idparsfuncdefpar <- c(3,5,6)
idparsopt <- c(1,2)
idparsfix <- c(0,4)
initparsopt <- c(rep(intGuessLamba,2))
parsfix <- c(0,0)
idfactorsopt <- 1
initfactors <- 4
# functions_defining_params is a list of functions. Each function has no
# arguments and to refer
# to parameters ids should be indicated as 'par_' i.e. par_3 refers to
# parameter 3. When a
# function is defined, be sure that all the parameters involved are either
# estimated, fixed or
# defined by previous functions (i.e, a function that defines parameter in
# 'functions_defining_params'). The user is responsible for this. In this
# example, par_3
# (i.e., parameter 3) is needed to calculate par_6. This is correct because
# par_3 is defined
# in the first function of 'functions_defining_params'. Notice that factor_1
# indicates a value
# that will be estimated to satisfy the equation. The same factor can be
# shared to define several parameters.
functions_defining_params <- list()
functions_defining_params[[1]] <- function() {
 par_3 <- par_1 + par_2
}
functions_defining_params[[2]] <- function() {
 par_5 <- par_1 * factor_1
}
functions_defining_params[[3]] <- function() {
 par_6 <- par_3 * factor_1
}

tol = c(1e-02, 1e-03, 1e-04)
maxiter = 1000 * round((1.25)^length(idparsopt))
optimmethod = 'subplex'
cond <- 'proper_cond'
root_state_weight <- 'proper_weights'
sampling_fraction <- c(1,1,1)
model <- cla_secsse_ml_func_def_pars(phylotree,
traits,
num_concealed_states,
idparslist,
idparsopt,
initparsopt,
idfactorsopt,
initfactors,
idparsfix,
parsfix,
idparsfuncdefpar,
functions_defining_params,
cond,
root_state_weight,
sampling_fraction,
tol,
maxiter,
optimmethod,
num_cycles = 1)
# ML -136.5796
# Example of how to set the arguments for a ML search.
rm(list=ls(all=TRUE))
library(secsse)
library(DDD)
set.seed(16)
phylotree <- ape::rbdtree(0.07,0.001,Tmax=50)
startingpoint <- bd_ML(brts = ape::branching.times(phylotree))
intGuessLamba <- startingpoint$lambda0
intGuessMu <- startingpoint$mu0
traits <-  sample(c(0,1,2),
                 ape::Ntip(phylotree), replace = TRUE) # get some traits
num_concealed_states <- 3
idparslist <- cla_id_paramPos(traits, num_concealed_states)
idparslist$lambdas[1,] <- c(1,2,3,1,2,3,1,2,3)
idparslist[[2]][] <- 4
masterBlock <- matrix(c(5,6,5,6,5,6,5,6,5),ncol = 3, nrow=3, byrow = TRUE)
diag(masterBlock) <- NA
diff.conceal <- FALSE
idparslist[[3]] <- q_doubletrans(traits,masterBlock,diff.conceal)
idparsfuncdefpar <- c(3,5,6)
idparsopt <- c(1,2)
idparsfix <- c(0,4)
initparsopt <- c(rep(intGuessLamba,2))
parsfix <- c(0,0)
idfactorsopt <- 1
initfactors <- 4
# functions_defining_params is a list of functions. Each function has no
# arguments and to refer
# to parameters ids should be indicated as 'par_' i.e. par_3 refers to
# parameter 3. When a
# function is defined, be sure that all the parameters involved are either
# estimated, fixed or
# defined by previous functions (i.e, a function that defines parameter in
# 'functions_defining_params'). The user is responsible for this. In this
# example, par_3
# (i.e., parameter 3) is needed to calculate par_6. This is correct because
# par_3 is defined
# in the first function of 'functions_defining_params'. Notice that factor_1
# indicates a value
# that will be estimated to satisfy the equation. The same factor can be
# shared to define several parameters.
functions_defining_params <- list()
functions_defining_params[[1]] <- function() {
 par_3 <- par_1 + par_2
}
functions_defining_params[[2]] <- function() {
 par_5 <- par_1 * factor_1
}
functions_defining_params[[3]] <- function() {
 par_6 <- par_3 * factor_1
}

tol = c(1e-02, 1e-03, 1e-04)
maxiter = 1000 * round((1.25)^length(idparsopt))
optimmethod = 'subplex'
cond <- 'proper_cond'
root_state_weight <- 'proper_weights'
sampling_fraction <- c(1,1,1)
model <- cla_secsse_ml_func_def_pars(phylotree,
traits,
num_concealed_states,
idparslist,
idparsopt,
initparsopt,
idfactorsopt,
initfactors,
idparsfix,
parsfix,
idparsfuncdefpar,
functions_defining_params,
cond,
root_state_weight,
sampling_fraction,
tol,
maxiter,
optimmethod,
num_cycles = 1)
# ML -136.5796

Helper function to create a default lambda list

Description

This function generates a generic lambda list, assuming no transitions between states, e.g. a species of observed state 0 generates daughter species with state 0 as well.

Usage

create_default_lambda_transition_matrix(
  state_names = c("0", "1"),
  model = "ETD"
)
create_default_lambda_transition_matrix(
  state_names = c("0", "1"),
  model = "ETD"
)

Arguments

`state_names`	vector of names of all observed states.
`model`	used model, choice of `"ETD"` (Examined Traits Diversification), `"CTD"` (Concealed Traits Diversification) or `"CR"` (Constant Rate).

Examples

lambda_matrix <-
     create_default_lambda_transition_matrix(state_names = c(0, 1),
                                             model = "ETD")
lambda_list <- create_lambda_list(state_names = c(0, 1),
                                  num_concealed_states = 2,
                                  transition_matrix = lambda_matrix,
                                  model = "ETD")
lambda_matrix <-
     create_default_lambda_transition_matrix(state_names = c(0, 1),
                                             model = "ETD")
lambda_list <- create_lambda_list(state_names = c(0, 1),
                                  num_concealed_states = 2,
                                  transition_matrix = lambda_matrix,
                                  model = "ETD")

Helper function to create a default `shift_matrix` list

Description

This function generates a generic shift matrix to be used with the function create_q_matrix().

Usage

create_default_shift_matrix(
  state_names = c("0", "1"),
  num_concealed_states = 2,
  mu_vector = NULL
)
create_default_shift_matrix(
  state_names = c("0", "1"),
  num_concealed_states = 2,
  mu_vector = NULL
)

Arguments

`state_names`	vector of names of all observed states.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`mu_vector`	previously defined mus - used to choose indicator number.

Examples

shift_matrix <- create_default_shift_matrix(state_names = c(0, 1),
                                            num_concealed_states = 2,
                                            mu_vector = c(1, 2, 1, 2))
q_matrix <- create_q_matrix(state_names = c(0, 1),
                            num_concealed_states = 2,
                            shift_matrix = shift_matrix,
                            diff.conceal = FALSE)
shift_matrix <- create_default_shift_matrix(state_names = c(0, 1),
                                            num_concealed_states = 2,
                                            mu_vector = c(1, 2, 1, 2))
q_matrix <- create_q_matrix(state_names = c(0, 1),
                            num_concealed_states = 2,
                            shift_matrix = shift_matrix,
                            diff.conceal = FALSE)

Helper function to automatically create lambda matrices, based on input

Description

Helper function to automatically create lambda matrices, based on input

Usage

create_lambda_list(
  state_names = c(0, 1),
  num_concealed_states = 2,
  transition_matrix,
  model = "ETD",
  concealed_spec_rates = NULL
)
create_lambda_list(
  state_names = c(0, 1),
  num_concealed_states = 2,
  transition_matrix,
  model = "ETD",
  concealed_spec_rates = NULL
)

Arguments

`state_names`	vector of names of all observed states.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`transition_matrix`	a matrix containing a description of all speciation events, where the first column indicates the source state, the second and third column indicate the two daughter states, and the fourth column gives the rate indicator used. E.g.: `⁠["SA", "S", "A", 1]⁠` for a trait state `"SA"` which upon speciation generates two daughter species with traits `"S"` and `"A"`, where the number 1 is used as indicator for optimization of the likelihood.
`model`	used model, choice of `"ETD"` (Examined Traits Diversification), `"CTD"` (Concealed Traits Diversification) or `"CR"` (Constant Rate).
`concealed_spec_rates`	vector specifying the rate indicators for each concealed state, length should be identical to `num_concealed_states`. If left empty when using the CTD model, it is assumed that all available speciation rates are distributed uniformly over the concealed states.

Examples

trans_matrix <- c(0, 0, 0, 1)
trans_matrix <- rbind(trans_matrix, c(1, 1, 1, 2))
lambda_list <- create_lambda_list(state_names = c(0, 1),
                                  num_concealed_states = 2,
                                  transition_matrix = trans_matrix,
                                  model = "ETD")

trans_matrix <- c(0, 0, 0, 1)
trans_matrix <- rbind(trans_matrix, c(1, 1, 1, 2))
lambda_list <- create_lambda_list(state_names = c(0, 1),
                                  num_concealed_states = 2,
                                  transition_matrix = trans_matrix,
                                  model = "ETD")

Generate mus vector

Description

Generate mus vector

Usage

create_mu_vector(state_names, num_concealed_states, model = "CR", lambda_list)
create_mu_vector(state_names, num_concealed_states, model = "CR", lambda_list)

Arguments

`state_names`	vector of names of all observed states.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`model`	used model, choice of `"ETD"` (Examined Traits Diversification), `"CTD"` (Concealed Traits Diversification) or `"CR"` (Constant Rate).
`lambda_list`	previously generated list of lambda matrices, used to infer the rate number to start with.

Value

mu vector

Helper function to neatly setup a Q matrix, without transitions to concealed states (only observed transitions shown)

Description

Helper function to neatly setup a Q matrix, without transitions to concealed states (only observed transitions shown)

Usage

create_q_matrix(
  state_names,
  num_concealed_states,
  shift_matrix,
  diff.conceal = FALSE
)
create_q_matrix(
  state_names,
  num_concealed_states,
  shift_matrix,
  diff.conceal = FALSE
)

Arguments

`state_names`	vector of names of all observed states.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`shift_matrix`	matrix of shifts, indicating in order: starting state (typically the column in the transition matrix) ending state (typically the row in the transition matrix) associated rate indicator.
`diff.conceal`	Boolean stating if the concealed states should be different. E.g. that the transition rates for the concealed states are different from the transition rates for the examined states. Normally it should be `FALSE` in order to avoid having a huge number of parameters.

Value

transition matrix

Examples

shift_matrix <- c(0, 1, 5)
shift_matrix <- rbind(shift_matrix, c(1, 0, 6))
q_matrix <- secsse::create_q_matrix(state_names = c(0, 1),
                                    num_concealed_states = 2,
                                    shift_matrix = shift_matrix,
                                    diff.conceal = TRUE)
shift_matrix <- c(0, 1, 5)
shift_matrix <- rbind(shift_matrix, c(1, 0, 6))
q_matrix <- secsse::create_q_matrix(state_names = c(0, 1),
                                    num_concealed_states = 2,
                                    shift_matrix = shift_matrix,
                                    diff.conceal = TRUE)

Event times of a (possibly non-ultrametric) phylogenetic tree

Description

Times at which speciation or extinction occurs

Usage

event_times(phy)
event_times(phy)

Arguments

phy

phylogenetic tree of class phylo, without polytomies, rooted and with branch lengths. Need not be ultrametric.

Value

times at which speciation or extinction happens.

Note

This script has been modified from BAMMtools' internal function NU.branching.times

A phylogeny with traits at the tips

Description

An example phylogeny for testing purposes

Usage

example_phy_GeoSSE
example_phy_GeoSSE

Format

A phylogeny as created by GeoSSE (diversitree)

Function to expand an existing q_matrix to a number of concealed states

Description

Function to expand an existing q_matrix to a number of concealed states

Usage

expand_q_matrix(q_matrix, num_concealed_states, diff.conceal = FALSE)
expand_q_matrix(q_matrix, num_concealed_states, diff.conceal = FALSE)

Arguments

`q_matrix`	`q_matrix` with only transitions between observed states.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`diff.conceal`	Boolean stating if the concealed states should be different. E.g. that the transition rates for the concealed states are different from the transition rates for the examined states. Normally it should be `FALSE` in order to avoid having a huge number of parameters.

Value

updated q matrix

Note

This is highly similar to q_doubletrans().

Extract parameter values out of the result of a maximum likelihood inference run

Description

Extract parameter values out of the result of a maximum likelihood inference run

Usage

extract_par_vals(param_posit, ml_pars)
extract_par_vals(param_posit, ml_pars)

Arguments

param_posit

initial parameter structure, consisting of a list with three entries:

lambda matrices
mus
Q matrix

In each entry, integers numbers (1-n) indicate the parameter to be optimized.

ml_pars

resulting parameter estimates as returned by for instance cla_secsse_ml(), having the same structure as param_post.

Value

Vector of parameter estimates.

Helper function to enter parameter value on their right place

Description

Helper function to enter parameter value on their right place

Usage

fill_in(object, params)
fill_in(object, params)

Arguments

`object`	lambda matrices, `q_matrix` or mu vector.
`params`	parameters in order, where each value reflects the value of the parameter at that position, e.g. `c(0.3, 0.2, 0.1)` will fill out the value 0.3 for the parameter with rate identifier 1, 0.2 for the parameter with rate identifier 2 and 0.1 for the parameter with rate identifier 3.

Value

lambda matrices, q_matrix or mu vector with the correct values in their right place.

Parameter structure setting Sets the parameters (speciation, extinction and transition) ids. Needed for ML calculation (`secsse_ml()`).

Description

Parameter structure setting Sets the parameters (speciation, extinction and transition) ids. Needed for ML calculation (secsse_ml()).

Usage

id_paramPos(traits, num_concealed_states)
id_paramPos(traits, num_concealed_states)

Arguments

`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.

Value

A list that includes the ids of the parameters for ML analysis.

Examples

traits <- sample(c(0,1,2), 45,replace = TRUE) #get some traits
num_concealed_states <- 3
param_posit <- id_paramPos(traits,num_concealed_states)
traits <- sample(c(0,1,2), 45,replace = TRUE) #get some traits
num_concealed_states <- 3
param_posit <- id_paramPos(traits,num_concealed_states)

A phylogenetic reconstuction to run the vignette

Description

An example phylogeny in the right format for secsse

Usage

phylo_vignette
phylo_vignette

Format

Phylogenetic tree in phy format, rooted, including branch lengths

Plot the local probability along a tree

Description

Plot the local probability along the tree, including the branches

Usage

plot_state_exact(
  parameters,
  phy,
  traits,
  num_concealed_states,
  sampling_fraction,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  is_complete_tree = FALSE,
  method = "odeint::bulirsch_stoer",
  atol = 1e-16,
  rtol = 1e-16,
  num_steps = 100,
  prob_func = NULL,
  verbose = FALSE
)
plot_state_exact(
  parameters,
  phy,
  traits,
  num_concealed_states,
  sampling_fraction,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  is_complete_tree = FALSE,
  method = "odeint::bulirsch_stoer",
  atol = 1e-16,
  rtol = 1e-16,
  num_steps = 100,
  prob_func = NULL,
  verbose = FALSE
)

Arguments

`parameters`	list where first vector represents lambdas, the second mus and the third transition rates.
`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.
`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`sampling_fraction`	vector that states the sampling proportion per trait state. It must have as many elements as there are trait states.
`cond`	condition on the existence of a node root: `"maddison_cond"`, `"proper_cond"` (default). For details, see vignette.
`root_state_weight`	the method to weigh the states: `"maddison_weights"`, `"proper_weights"` (default) or `"equal_weights"`. It can also be specified for the root state: the vector `c(1, 0, 0)` indicates state 1 was the root state.
`is_complete_tree`	logical specifying whether or not a tree with all its extinct species is provided. If set to `TRUE`, it also assumes that all all extinct lineages are present on the tree. Defaults to `FALSE`.
`method`	integration method used, available are: `"odeint::runge_kutta_cash_karp54"`, `"odeint::runge_kutta_fehlberg78"`, `"odeint::runge_kutta_dopri5"`, `"odeint::bulirsch_stoer"` and `"odeint::runge_kutta4"`. Default method is: `"odeint::bulirsch_stoer"`.
`atol`	A numeric specifying the absolute tolerance of integration.
`rtol`	A numeric specifying the relative tolerance of integration.
`num_steps`	number of substeps to show intermediate likelihoods along a branch.
`prob_func`	a function to calculate the probability of interest, see description.
`verbose`	sets verbose output; default is `TRUE` when `optimmethod` is `"simplex"`. If `optimmethod` is set to `"simplex"`, then even if set to `FALSE`, optimizer output will be shown.

Details

This function will evaluate the log likelihood locally along all branches and plot the result. When num_steps is left to NULL, all likelihood evaluations during integration are used for plotting. This may work for not too large trees, but may become very memory heavy for larger trees. Instead, the user can indicate a number of steps, which causes the probabilities to be evaluated at a distinct amount of steps along each branch (and the probabilities to be properly integrated in between these steps). This provides an approximation, but generally results look very similar to using the full evaluation. The function used for prob_func will be highly dependent on your system. for instance, for a 3 observed, 2 hidden states model, the probability of state A is prob[1] + prob[2] + prob[3], normalized by the row sum. prob_func will be applied to each row of the 'states' matrix (you can thus test your function on the states matrix returned when 'see_ancestral_states = TRUE'). Please note that the first N columns of the states matrix are the extinction rates, and the ⁠(N+1):2N⁠ columns belong to the speciation rates, where N = num_obs_states * num_concealed_states. A typical prob_func function will look like:

my_prob_func <- function(x) {
  return(sum(x[5:8]) / sum(x))
}

Value

ggplot2 object

Examples

set.seed(5)
phy <- ape::rphylo(n = 4, birth = 1, death = 0)
traits <- c(0, 1, 1, 0)
params <- secsse::id_paramPos(c(0, 1), 2)
params[[1]][] <- c(0.2, 0.2, 0.1, 0.1)
params[[2]][] <- 0.0
params[[3]][, ] <- 0.1
diag(params[[3]]) <- NA
#  Thus, we have for both, rates
# 0A, 1A, 0B and 1B. If we are interested in the posterior probability of
# trait 0,we have to provide a helper function that sums the probabilities of
# 0A and 0B, e.g.:
helper_function <- function(x) {
  return(sum(x[c(5, 7)]) / sum(x)) # normalized by total sum, just in case.
}

out_plot <- plot_state_exact(parameters = params,
                             phy = phy,
                             traits = traits,
                             num_concealed_states = 2,
                             sampling_fraction = c(1, 1),
                             num_steps = 10,
                             prob_func = helper_function)
set.seed(5)
phy <- ape::rphylo(n = 4, birth = 1, death = 0)
traits <- c(0, 1, 1, 0)
params <- secsse::id_paramPos(c(0, 1), 2)
params[[1]][] <- c(0.2, 0.2, 0.1, 0.1)
params[[2]][] <- 0.0
params[[3]][, ] <- 0.1
diag(params[[3]]) <- NA
#  Thus, we have for both, rates
# 0A, 1A, 0B and 1B. If we are interested in the posterior probability of
# trait 0,we have to provide a helper function that sums the probabilities of
# 0A and 0B, e.g.:
helper_function <- function(x) {
  return(sum(x[c(5, 7)]) / sum(x)) # normalized by total sum, just in case.
}

out_plot <- plot_state_exact(parameters = params,
                             phy = phy,
                             traits = traits,
                             num_concealed_states = 2,
                             sampling_fraction = c(1, 1),
                             num_steps = 10,
                             prob_func = helper_function)

Prepares the entire set of lambda matrices for cla_secsse. It provides the set of matrices containing all the speciation rates

Description

Prepares the entire set of lambda matrices for cla_secsse. It provides the set of matrices containing all the speciation rates

Usage

prepare_full_lambdas(traits, num_concealed_states, lambd_and_modeSpe)
prepare_full_lambdas(traits, num_concealed_states, lambd_and_modeSpe)

Arguments

`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`lambd_and_modeSpe`	a matrix with the 4 models of speciation possible.

Value

A list of lambdas, its length would be the same than the number of trait states * num_concealed_states..

Examples

set.seed(13)
phylotree <- ape::rcoal(12, tip.label = 1:12)
traits <- sample(c(0, 1, 2),
                 ape::Ntip(phylotree), replace = TRUE)
num_concealed_states <- 3
# the idparlist for a ETD model (dual state inheritance model of evolution)
# would be set like this:
idparlist <- secsse::cla_id_paramPos(traits, num_concealed_states)
lambd_and_modeSpe <- idparlist$lambdas
lambd_and_modeSpe[1, ] <- c(1, 1, 1, 2, 2, 2, 3, 3, 3)
idparlist[[1]] <- lambd_and_modeSpe
idparlist[[2]][] <- 0
masterBlock <- matrix(4, ncol = 3, nrow = 3, byrow = TRUE)
diag(masterBlock) <- NA
idparlist[[3]] <- q_doubletrans(traits, masterBlock, diff.conceal = FALSE)
# Now, internally, clasecsse sorts the lambda matrices, so they look like
#  a list with 9 matrices, corresponding to the 9 states
# (0A,1A,2A,0B, etc)

parameter <- idparlist
lambda_and_modeSpe <- parameter$lambdas
lambda_and_modeSpe[1, ] <- c(0.2, 0.2, 0.2, 0.4, 0.4, 0.4, 0.01, 0.01, 0.01)
parameter[[1]] <- prepare_full_lambdas(traits, num_concealed_states,
                                       lambda_and_modeSpe)
set.seed(13)
phylotree <- ape::rcoal(12, tip.label = 1:12)
traits <- sample(c(0, 1, 2),
                 ape::Ntip(phylotree), replace = TRUE)
num_concealed_states <- 3
# the idparlist for a ETD model (dual state inheritance model of evolution)
# would be set like this:
idparlist <- secsse::cla_id_paramPos(traits, num_concealed_states)
lambd_and_modeSpe <- idparlist$lambdas
lambd_and_modeSpe[1, ] <- c(1, 1, 1, 2, 2, 2, 3, 3, 3)
idparlist[[1]] <- lambd_and_modeSpe
idparlist[[2]][] <- 0
masterBlock <- matrix(4, ncol = 3, nrow = 3, byrow = TRUE)
diag(masterBlock) <- NA
idparlist[[3]] <- q_doubletrans(traits, masterBlock, diff.conceal = FALSE)
# Now, internally, clasecsse sorts the lambda matrices, so they look like
#  a list with 9 matrices, corresponding to the 9 states
# (0A,1A,2A,0B, etc)

parameter <- idparlist
lambda_and_modeSpe <- parameter$lambdas
lambda_and_modeSpe[1, ] <- c(0.2, 0.2, 0.2, 0.4, 0.4, 0.4, 0.01, 0.01, 0.01)
parameter[[1]] <- prepare_full_lambdas(traits, num_concealed_states,
                                       lambda_and_modeSpe)

Basic Qmatrix Sets a Q matrix where double transitions are not allowed

Description

This function expands the Q_matrix, but it does so assuming that the number of concealed traits is equal to the number of examined traits, if you have a different number, you should consider looking at the function expand_q_matrix().

Usage

q_doubletrans(traits, masterBlock, diff.conceal)
q_doubletrans(traits, masterBlock, diff.conceal)

Arguments

`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`masterBlock`	matrix of transitions among only examined states, `NA` in the main diagonal, used to build the full transition rates matrix.
`diff.conceal`	Boolean stating if the concealed states should be different. E.g. that the transition rates for the concealed states are different from the transition rates for the examined states. Normally it should be `FALSE` in order to avoid having a huge number of parameters.

Value

Q matrix that includes both examined and concealed states, it should be declared as the third element of idparslist.

Examples

traits <- sample(c(0,1,2), 45,replace = TRUE) #get some traits
# For a three-state trait
masterBlock <- matrix(99,ncol = 3,nrow = 3,byrow = TRUE)
diag(masterBlock) <- NA
masterBlock[1,2] <- 6
masterBlock[1,3] <- 7
masterBlock[2,1] <- 8
masterBlock[2,3] <- 9
masterBlock[3,1] <- 10
masterBlock[3,2] <- 11
myQ <- q_doubletrans(traits,masterBlock,diff.conceal = FALSE)
# now, it can replace the Q matrix from id_paramPos
num_concealed_states <- 3
param_posit <- id_paramPos(traits,num_concealed_states)
param_posit[[3]] <- myQ
traits <- sample(c(0,1,2), 45,replace = TRUE) #get some traits
# For a three-state trait
masterBlock <- matrix(99,ncol = 3,nrow = 3,byrow = TRUE)
diag(masterBlock) <- NA
masterBlock[1,2] <- 6
masterBlock[1,3] <- 7
masterBlock[2,1] <- 8
masterBlock[2,3] <- 9
masterBlock[3,1] <- 10
masterBlock[3,2] <- 11
myQ <- q_doubletrans(traits,masterBlock,diff.conceal = FALSE)
# now, it can replace the Q matrix from id_paramPos
num_concealed_states <- 3
param_posit <- id_paramPos(traits,num_concealed_states)
param_posit[[3]] <- myQ

Likelihood for SecSSE model Loglikelihood calculation for the SecSSE model given a set of parameters and data

Description

Likelihood for SecSSE model Loglikelihood calculation for the SecSSE model given a set of parameters and data

Usage

secsse_loglik(
  parameter,
  phy,
  traits,
  num_concealed_states,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  setting_calculation = NULL,
  see_ancestral_states = FALSE,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-07,
  method = "odeint::bulirsch_stoer"
)
secsse_loglik(
  parameter,
  phy,
  traits,
  num_concealed_states,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  setting_calculation = NULL,
  see_ancestral_states = FALSE,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-07,
  method = "odeint::bulirsch_stoer"
)

Arguments

`parameter`	list where first vector represents lambdas, the second mus and the third transition rates.
`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.
`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`cond`	condition on the existence of a node root: `"maddison_cond"`, `"proper_cond"` (default). For details, see vignette.
`root_state_weight`	the method to weigh the states: `"maddison_weights"`, `"proper_weights"` (default) or `"equal_weights"`. It can also be specified for the root state: the vector `c(1, 0, 0)` indicates state 1 was the root state.
`sampling_fraction`	vector that states the sampling proportion per trait state. It must have as many elements as there are trait states.
`setting_calculation`	argument used internally to speed up calculation. It should be left blank (default : `setting_calculation = NULL`).
`see_ancestral_states`	Boolean for whether the ancestral states should be shown? Defaults to `FALSE`.
`loglik_penalty`	the size of the penalty for all parameters; default is 0 (no penalty).
`is_complete_tree`	logical specifying whether or not a tree with all its extinct species is provided. If set to `TRUE`, it also assumes that all all extinct lineages are present on the tree. Defaults to `FALSE`.
`num_threads`	number of threads to be used. Default is one thread.
`atol`	A numeric specifying the absolute tolerance of integration.
`rtol`	A numeric specifying the relative tolerance of integration.
`method`	integration method used, available are: `"odeint::runge_kutta_cash_karp54"`, `"odeint::runge_kutta_fehlberg78"`, `"odeint::runge_kutta_dopri5"`, `"odeint::bulirsch_stoer"` and `"odeint::runge_kutta4"`. Default method is: `"odeint::bulirsch_stoer"`.

Value

The loglikelihood of the data given the parameter.

Examples

rm(list = ls(all = TRUE))
library(secsse)
set.seed(13)
phylotree <- ape::rcoal(31, tip.label = 1:31)
traits <- sample(c(0,1,2),ape::Ntip(phylotree),replace = TRUE)
num_concealed_states <- 2
cond <- "proper_cond"
root_state_weight <- "proper_weights"
sampling_fraction <- c(1,1,1)
drill <- id_paramPos(traits,num_concealed_states)
drill[[1]][] <- c(0.12,0.01,0.2,0.21,0.31,0.23)
drill[[2]][] <- 0
drill[[3]][,] <- 0.1
diag(drill[[3]]) <- NA
secsse_loglik(parameter = drill,
phylotree,
traits,
num_concealed_states,
cond,
root_state_weight,
sampling_fraction,
see_ancestral_states = FALSE)

#[1] -113.1018
rm(list = ls(all = TRUE))
library(secsse)
set.seed(13)
phylotree <- ape::rcoal(31, tip.label = 1:31)
traits <- sample(c(0,1,2),ape::Ntip(phylotree),replace = TRUE)
num_concealed_states <- 2
cond <- "proper_cond"
root_state_weight <- "proper_weights"
sampling_fraction <- c(1,1,1)
drill <- id_paramPos(traits,num_concealed_states)
drill[[1]][] <- c(0.12,0.01,0.2,0.21,0.31,0.23)
drill[[2]][] <- 0
drill[[3]][,] <- 0.1
diag(drill[[3]]) <- NA
secsse_loglik(parameter = drill,
phylotree,
traits,
num_concealed_states,
cond,
root_state_weight,
sampling_fraction,
see_ancestral_states = FALSE)

#[1] -113.1018

Likelihood for SecSSE model Logikelihood calculation for the SecSSE model given a set of parameters and data, returning also the likelihoods along the branches

Description

Likelihood for SecSSE model Logikelihood calculation for the SecSSE model given a set of parameters and data, returning also the likelihoods along the branches

Usage

secsse_loglik_eval(
  parameter,
  phy,
  traits,
  num_concealed_states,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  setting_calculation = NULL,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-07,
  method = "odeint::bulirsch_stoer",
  num_steps = 100
)
secsse_loglik_eval(
  parameter,
  phy,
  traits,
  num_concealed_states,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  setting_calculation = NULL,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-07,
  method = "odeint::bulirsch_stoer",
  num_steps = 100
)

Arguments

`parameter`	list where first vector represents lambdas, the second mus and the third transition rates.
`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.
`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`cond`	condition on the existence of a node root: `"maddison_cond"`, `"proper_cond"` (default). For details, see vignette.
`root_state_weight`	the method to weigh the states: `"maddison_weights"`, `"proper_weights"` (default) or `"equal_weights"`. It can also be specified for the root state: the vector `c(1, 0, 0)` indicates state 1 was the root state.
`sampling_fraction`	vector that states the sampling proportion per trait state. It must have as many elements as there are trait states.
`setting_calculation`	argument used internally to speed up calculation. It should be left blank (default : `setting_calculation = NULL`).
`loglik_penalty`	the size of the penalty for all parameters; default is 0 (no penalty).
`is_complete_tree`	logical specifying whether or not a tree with all its extinct species is provided. If set to `TRUE`, it also assumes that all all extinct lineages are present on the tree. Defaults to `FALSE`.
`num_threads`	number of threads to be used. Default is one thread.
`atol`	A numeric specifying the absolute tolerance of integration.
`rtol`	A numeric specifying the relative tolerance of integration.
`method`	integration method used, available are: `"odeint::runge_kutta_cash_karp54"`, `"odeint::runge_kutta_fehlberg78"`, `"odeint::runge_kutta_dopri5"`, `"odeint::bulirsch_stoer"` and `"odeint::runge_kutta4"`. Default method is: `"odeint::bulirsch_stoer"`.
`num_steps`	number of substeps to show intermediate likelihoods along a branch.

Value

A list containing: "output", observed states along evaluated time points along all branches, used for plotting. "states" all ancestral states on the nodes and "duration", indicating the time taken for the total evaluation

Examples

set.seed(5)
phy <- ape::rphylo(n = 4, birth = 1, death = 0)
traits <- c(0, 1, 1, 0)
params <- secsse::id_paramPos(c(0, 1), 2)
params[[1]][] <- c(0.2, 0.2, 0.1, 0.1)
params[[2]][] <- 0.0
params[[3]][, ] <- 0.1
diag(params[[3]]) <- NA

secsse_loglik_eval(parameter = params,
                   phy = phy,
                   traits = traits,
                   num_concealed_states = 2,
                   sampling_fraction = c(1, 1),
                   num_steps = 10)
set.seed(5)
phy <- ape::rphylo(n = 4, birth = 1, death = 0)
traits <- c(0, 1, 1, 0)
params <- secsse::id_paramPos(c(0, 1), 2)
params[[1]][] <- c(0.2, 0.2, 0.1, 0.1)
params[[2]][] <- 0.0
params[[3]][, ] <- 0.1
diag(params[[3]]) <- NA

secsse_loglik_eval(parameter = params,
                   phy = phy,
                   traits = traits,
                   num_concealed_states = 2,
                   sampling_fraction = c(1, 1),
                   num_steps = 10)

Maximum likehood estimation for (SecSSE)

Description

Maximum likehood estimation under Several examined and concealed States-dependent Speciation and Extinction (SecSSE)

Usage

secsse_ml(
  phy,
  traits,
  num_concealed_states,
  idparslist,
  idparsopt,
  initparsopt,
  idparsfix,
  parsfix,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  tol = c(1e-04, 1e-05, 1e-07),
  maxiter = 1000 * round((1.25)^length(idparsopt)),
  optimmethod = "subplex",
  num_cycles = 1,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  verbose = (optimmethod == "simplex"),
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-07,
  method = "odeint::bulirsch_stoer"
)
secsse_ml(
  phy,
  traits,
  num_concealed_states,
  idparslist,
  idparsopt,
  initparsopt,
  idparsfix,
  parsfix,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  tol = c(1e-04, 1e-05, 1e-07),
  maxiter = 1000 * round((1.25)^length(idparsopt)),
  optimmethod = "subplex",
  num_cycles = 1,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  verbose = (optimmethod == "simplex"),
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-07,
  method = "odeint::bulirsch_stoer"
)

Arguments

`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.
`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`idparslist`	overview of parameters and their values.
`idparsopt`	a numeric vector with the ID of parameters to be estimated.
`initparsopt`	a numeric vector with the initial guess of the parameters to be estimated.
`idparsfix`	a numeric vector with the ID of the fixed parameters.
`parsfix`	a numeric vector with the value of the fixed parameters.
`cond`	condition on the existence of a node root: `"maddison_cond"`, `"proper_cond"` (default). For details, see vignette.
`root_state_weight`	the method to weigh the states: `"maddison_weights"`, `"proper_weights"` (default) or `"equal_weights"`. It can also be specified for the root state: the vector `c(1, 0, 0)` indicates state 1 was the root state.
`sampling_fraction`	vector that states the sampling proportion per trait state. It must have as many elements as there are trait states.
`tol`	A numeric vector with the maximum tolerance of the optimization algorithm. Default is `c(1e-04, 1e-05, 1e-05)`.
`maxiter`	max number of iterations. Default is `1000 * round((1.25) ^ length(idparsopt))`.
`optimmethod`	A string with method used for optimization. Default is `"subplex"`. Alternative is `"simplex"` and it shouldn't be used in normal conditions (only for debugging). Both are called from `DDD::optimizer()`, simplex is implemented natively in DDD, while subplex is ultimately called from `subplex::subplex()`.
`num_cycles`	Number of cycles of the optimization. When set to `Inf`, the optimization will be repeated until the result is, within the tolerance, equal to the starting values, with a maximum of 10 cycles.
`loglik_penalty`	the size of the penalty for all parameters; default is 0 (no penalty).
`is_complete_tree`	logical specifying whether or not a tree with all its extinct species is provided. If set to `TRUE`, it also assumes that all all extinct lineages are present on the tree. Defaults to `FALSE`.
`verbose`	sets verbose output; default is `TRUE` when `optimmethod` is `"simplex"`. If `optimmethod` is set to `"simplex"`, then even if set to `FALSE`, optimizer output will be shown.
`num_threads`	number of threads to be used. Default is one thread.
`atol`	A numeric specifying the absolute tolerance of integration.
`rtol`	A numeric specifying the relative tolerance of integration.
`method`	integration method used, available are: `"odeint::runge_kutta_cash_karp54"`, `"odeint::runge_kutta_fehlberg78"`, `"odeint::runge_kutta_dopri5"`, `"odeint::bulirsch_stoer"` and `"odeint::runge_kutta4"`. Default method is: `"odeint::bulirsch_stoer"`.

Value

Parameter estimated and maximum likelihood

Examples

# Example of how to set the arguments for a ML search.
library(secsse)
library(DDD)
set.seed(13)
# lambdas for 0A and 1A and 2A are the same but need to be estimated
# mus are fixed to
# the transition rates are constrained to be equal and fixed 0.01
phylotree <- ape::rcoal(31, tip.label = 1:31)
traits <-  sample(c(0,1,2), ape::Ntip(phylotree),replace=TRUE)#get some traits
num_concealed_states<-3
idparslist <- id_paramPos(traits, num_concealed_states)
idparslist[[1]][c(1,4,7)] <- 1
idparslist[[1]][c(2,5,8)] <- 2
idparslist[[1]][c(3,6,9)] <- 3
idparslist[[2]][]<-4
masterBlock <- matrix(5,ncol = 3,nrow = 3,byrow = TRUE)
diag(masterBlock) <- NA
diff.conceal <- FALSE
idparslist[[3]] <- q_doubletrans(traits,masterBlock,diff.conceal)
startingpoint <- DDD::bd_ML(brts = ape::branching.times(phylotree))
intGuessLamba <- startingpoint$lambda0
intGuessMu <- startingpoint$mu0
idparsopt <- c(1,2,3,5)
initparsopt <- c(rep(intGuessLamba,3),rep((intGuessLamba/5),1))
idparsfix <- c(0,4)
parsfix <- c(0,0)
tol <- c(1e-02, 1e-03, 1e-04)
maxiter <- 1000 * round((1.25)^length(idparsopt))
optimmethod <- 'subplex'
cond <- 'proper_cond'
root_state_weight <- 'proper_weights'
sampling_fraction <- c(1,1,1)
model<-secsse_ml(
phylotree,
traits,
num_concealed_states,
idparslist,
idparsopt,
initparsopt,
idparsfix,
parsfix,
cond,
root_state_weight,
sampling_fraction,
tol,
maxiter,
optimmethod,
num_cycles = 1,
verbose = FALSE)
# model$ML
# [1] -16.04127
# Example of how to set the arguments for a ML search.
library(secsse)
library(DDD)
set.seed(13)
# lambdas for 0A and 1A and 2A are the same but need to be estimated
# mus are fixed to
# the transition rates are constrained to be equal and fixed 0.01
phylotree <- ape::rcoal(31, tip.label = 1:31)
traits <-  sample(c(0,1,2), ape::Ntip(phylotree),replace=TRUE)#get some traits
num_concealed_states<-3
idparslist <- id_paramPos(traits, num_concealed_states)
idparslist[[1]][c(1,4,7)] <- 1
idparslist[[1]][c(2,5,8)] <- 2
idparslist[[1]][c(3,6,9)] <- 3
idparslist[[2]][]<-4
masterBlock <- matrix(5,ncol = 3,nrow = 3,byrow = TRUE)
diag(masterBlock) <- NA
diff.conceal <- FALSE
idparslist[[3]] <- q_doubletrans(traits,masterBlock,diff.conceal)
startingpoint <- DDD::bd_ML(brts = ape::branching.times(phylotree))
intGuessLamba <- startingpoint$lambda0
intGuessMu <- startingpoint$mu0
idparsopt <- c(1,2,3,5)
initparsopt <- c(rep(intGuessLamba,3),rep((intGuessLamba/5),1))
idparsfix <- c(0,4)
parsfix <- c(0,0)
tol <- c(1e-02, 1e-03, 1e-04)
maxiter <- 1000 * round((1.25)^length(idparsopt))
optimmethod <- 'subplex'
cond <- 'proper_cond'
root_state_weight <- 'proper_weights'
sampling_fraction <- c(1,1,1)
model<-secsse_ml(
phylotree,
traits,
num_concealed_states,
idparslist,
idparsopt,
initparsopt,
idparsfix,
parsfix,
cond,
root_state_weight,
sampling_fraction,
tol,
maxiter,
optimmethod,
num_cycles = 1,
verbose = FALSE)
# model$ML
# [1] -16.04127

Maximum likehood estimation for (SecSSE) with parameter as complex functions.

Description

Maximum likehood estimation under Several examined and concealed States-dependent Speciation and Extinction (SecSSE) where some paramaters are functions of other parameters and/or factors.

Usage

secsse_ml_func_def_pars(
  phy,
  traits,
  num_concealed_states,
  idparslist,
  idparsopt,
  initparsopt,
  idfactorsopt,
  initfactors,
  idparsfix,
  parsfix,
  idparsfuncdefpar,
  functions_defining_params = NULL,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  tol = c(1e-04, 1e-05, 1e-07),
  maxiter = 1000 * round((1.25)^length(idparsopt)),
  optimmethod = "subplex",
  num_cycles = 1,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-06,
  method = "odeint::bulirsch_stoer"
)
secsse_ml_func_def_pars(
  phy,
  traits,
  num_concealed_states,
  idparslist,
  idparsopt,
  initparsopt,
  idfactorsopt,
  initfactors,
  idparsfix,
  parsfix,
  idparsfuncdefpar,
  functions_defining_params = NULL,
  cond = "proper_cond",
  root_state_weight = "proper_weights",
  sampling_fraction,
  tol = c(1e-04, 1e-05, 1e-07),
  maxiter = 1000 * round((1.25)^length(idparsopt)),
  optimmethod = "subplex",
  num_cycles = 1,
  loglik_penalty = 0,
  is_complete_tree = FALSE,
  num_threads = 1,
  atol = 1e-08,
  rtol = 1e-06,
  method = "odeint::bulirsch_stoer"
)

Arguments

`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.
`traits`	vector with trait states for each tip in the phylogeny. The order of the states must be the same as the tree tips. For help, see `vignette("starting_secsse", package = "secsse")`.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`idparslist`	overview of parameters and their values.
`idparsopt`	a numeric vector with the ID of parameters to be estimated.
`initparsopt`	a numeric vector with the initial guess of the parameters to be estimated.
`idfactorsopt`	id of the factors that will be optimized. There are not fixed factors, so use a constant within `functions_defining_params`.
`initfactors`	the initial guess for a factor (it should be set to `NULL` when no factors).
`idparsfix`	a numeric vector with the ID of the fixed parameters.
`parsfix`	a numeric vector with the value of the fixed parameters.
`idparsfuncdefpar`	id of the parameters which will be a function of optimized and/or fixed parameters. The order of id should match `functions_defining_params`.
`functions_defining_params`	a list of functions. Each element will be a function which defines a parameter e.g. `id_3 <- (id_1 + id_2) / 2`. See example.
`cond`	condition on the existence of a node root: `"maddison_cond"`, `"proper_cond"` (default). For details, see vignette.
`root_state_weight`	the method to weigh the states: `"maddison_weights"`, `"proper_weights"` (default) or `"equal_weights"`. It can also be specified for the root state: the vector `c(1, 0, 0)` indicates state 1 was the root state.
`sampling_fraction`	vector that states the sampling proportion per trait state. It must have as many elements as there are trait states.
`tol`	A numeric vector with the maximum tolerance of the optimization algorithm. Default is `c(1e-04, 1e-05, 1e-05)`.
`maxiter`	max number of iterations. Default is `1000 * round((1.25) ^ length(idparsopt))`.
`optimmethod`	A string with method used for optimization. Default is `"subplex"`. Alternative is `"simplex"` and it shouldn't be used in normal conditions (only for debugging). Both are called from `DDD::optimizer()`, simplex is implemented natively in DDD, while subplex is ultimately called from `subplex::subplex()`.
`num_cycles`	Number of cycles of the optimization. When set to `Inf`, the optimization will be repeated until the result is, within the tolerance, equal to the starting values, with a maximum of 10 cycles.
`loglik_penalty`	the size of the penalty for all parameters; default is 0 (no penalty).
`is_complete_tree`	logical specifying whether or not a tree with all its extinct species is provided. If set to `TRUE`, it also assumes that all all extinct lineages are present on the tree. Defaults to `FALSE`.
`num_threads`	number of threads to be used. Default is one thread.
`atol`	A numeric specifying the absolute tolerance of integration.
`rtol`	A numeric specifying the relative tolerance of integration.
`method`	integration method used, available are: `"odeint::runge_kutta_cash_karp54"`, `"odeint::runge_kutta_fehlberg78"`, `"odeint::runge_kutta_dopri5"`, `"odeint::bulirsch_stoer"` and `"odeint::runge_kutta4"`. Default method is: `"odeint::bulirsch_stoer"`.

Value

Parameter estimated and maximum likelihood

Examples

# Example of how to set the arguments for a ML search.
rm(list=ls(all=TRUE))
library(secsse)
library(DDD)
set.seed(16)
phylotree <- ape::rbdtree(0.07,0.001,Tmax=50)
startingpoint<-bd_ML(brts = ape::branching.times(phylotree))
intGuessLamba <- startingpoint$lambda0
intGuessMu <- startingpoint$mu0
traits <- sample(c(0,1,2), ape::Ntip(phylotree),replace=TRUE) #get some traits
num_concealed_states<-3
idparslist<-id_paramPos(traits, num_concealed_states)
idparslist[[1]][c(1,4,7)] <- 1
idparslist[[1]][c(2,5,8)] <- 2
idparslist[[1]][c(3,6,9)] <- 3
idparslist[[2]][] <- 4
masterBlock <- matrix(c(5,6,5,6,5,6,5,6,5),ncol = 3,nrow = 3,byrow = TRUE)
diag(masterBlock) <- NA
diff.conceal <- FALSE
idparslist[[3]] <- q_doubletrans(traits,masterBlock,diff.conceal)
idparsfuncdefpar <- c(3,5,6)
idparsopt <- c(1,2)
idparsfix <- c(0,4)
initparsopt <- c(rep(intGuessLamba,2))
parsfix <- c(0,0)
idfactorsopt <- 1
initfactors <- 4
# functions_defining_params is a list of functions. Each function has no
# arguments and to refer
# to parameters ids should be indicated as "par_" i.e. par_3 refers to
# parameter 3. When a function is defined, be sure that all the parameters
# involved are either estimated, fixed or
# defined by previous functions (i.e, a function that defines parameter in
# 'functions_defining_params'). The user is responsible for this. In this
# exampl3, par_3 (i.e., parameter 3) is needed to calculate par_6. This is
# correct because par_3 is defined in
# the first function of 'functions_defining_params'. Notice that factor_1
# indicates a value that will be estimated to satisfy the equation. The same
# factor can be shared to define several parameters.
functions_defining_params <- list()
functions_defining_params[[1]] <- function(){
 par_3 <- par_1 + par_2
}
functions_defining_params[[2]] <- function(){
 par_5 <- par_1 * factor_1
}
functions_defining_params[[3]] <- function(){
 par_6 <- par_3 * factor_1
}

tol = c(1e-02, 1e-03, 1e-04)
maxiter = 1000 * round((1.25)^length(idparsopt))
optimmethod = "subplex"
cond<-"proper_cond"
root_state_weight <- "proper_weights"
sampling_fraction <- c(1,1,1)
model <- secsse_ml_func_def_pars(phylotree,
traits,
num_concealed_states,
idparslist,
idparsopt,
initparsopt,
idfactorsopt,
initfactors,
idparsfix,
parsfix,
idparsfuncdefpar,
functions_defining_params,
cond,
root_state_weight,
sampling_fraction,
tol,
maxiter,
optimmethod,
num_cycles = 1)
# ML -136.5796
# Example of how to set the arguments for a ML search.
rm(list=ls(all=TRUE))
library(secsse)
library(DDD)
set.seed(16)
phylotree <- ape::rbdtree(0.07,0.001,Tmax=50)
startingpoint<-bd_ML(brts = ape::branching.times(phylotree))
intGuessLamba <- startingpoint$lambda0
intGuessMu <- startingpoint$mu0
traits <- sample(c(0,1,2), ape::Ntip(phylotree),replace=TRUE) #get some traits
num_concealed_states<-3
idparslist<-id_paramPos(traits, num_concealed_states)
idparslist[[1]][c(1,4,7)] <- 1
idparslist[[1]][c(2,5,8)] <- 2
idparslist[[1]][c(3,6,9)] <- 3
idparslist[[2]][] <- 4
masterBlock <- matrix(c(5,6,5,6,5,6,5,6,5),ncol = 3,nrow = 3,byrow = TRUE)
diag(masterBlock) <- NA
diff.conceal <- FALSE
idparslist[[3]] <- q_doubletrans(traits,masterBlock,diff.conceal)
idparsfuncdefpar <- c(3,5,6)
idparsopt <- c(1,2)
idparsfix <- c(0,4)
initparsopt <- c(rep(intGuessLamba,2))
parsfix <- c(0,0)
idfactorsopt <- 1
initfactors <- 4
# functions_defining_params is a list of functions. Each function has no
# arguments and to refer
# to parameters ids should be indicated as "par_" i.e. par_3 refers to
# parameter 3. When a function is defined, be sure that all the parameters
# involved are either estimated, fixed or
# defined by previous functions (i.e, a function that defines parameter in
# 'functions_defining_params'). The user is responsible for this. In this
# exampl3, par_3 (i.e., parameter 3) is needed to calculate par_6. This is
# correct because par_3 is defined in
# the first function of 'functions_defining_params'. Notice that factor_1
# indicates a value that will be estimated to satisfy the equation. The same
# factor can be shared to define several parameters.
functions_defining_params <- list()
functions_defining_params[[1]] <- function(){
 par_3 <- par_1 + par_2
}
functions_defining_params[[2]] <- function(){
 par_5 <- par_1 * factor_1
}
functions_defining_params[[3]] <- function(){
 par_6 <- par_3 * factor_1
}

tol = c(1e-02, 1e-03, 1e-04)
maxiter = 1000 * round((1.25)^length(idparsopt))
optimmethod = "subplex"
cond<-"proper_cond"
root_state_weight <- "proper_weights"
sampling_fraction <- c(1,1,1)
model <- secsse_ml_func_def_pars(phylotree,
traits,
num_concealed_states,
idparslist,
idparsopt,
initparsopt,
idfactorsopt,
initfactors,
idparsfix,
parsfix,
idparsfuncdefpar,
functions_defining_params,
cond,
root_state_weight,
sampling_fraction,
tol,
maxiter,
optimmethod,
num_cycles = 1)
# ML -136.5796

Function to simulate a tree, conditional on observing all states.

Description

By default, secsse_sim assumes CLA-secsse simulation, e.g. inheritance of traits at speciation need not be symmetrical, and can be specified through usage of lambda-matrices. Hence, the input for lambdas is typically a list of matrices.

Simulation is performed with a randomly sampled initial trait at the crown - if you, however - want a specific, single, trait used at the crown, you can reduce the possible traits by modifying pool_init_states.

By default, the algorithm keeps simulating until it generates a tree where both crown lineages survive to the present - this is to ensure that the tree has a crown age that matches the used crown age. You can modify 'non-extinction' to deviate from this behaviour.

Usage

secsse_sim(
  lambdas,
  mus,
  qs,
  crown_age,
  num_concealed_states,
  pool_init_states = NULL,
  max_spec = 1e+05,
  min_spec = 2,
  max_species_extant = TRUE,
  tree_size_hist = FALSE,
  conditioning = "obs_states",
  non_extinction = TRUE,
  verbose = FALSE,
  max_tries = 1e+06,
  drop_extinct = TRUE,
  start_at_crown = TRUE,
  seed = NULL
)
secsse_sim(
  lambdas,
  mus,
  qs,
  crown_age,
  num_concealed_states,
  pool_init_states = NULL,
  max_spec = 1e+05,
  min_spec = 2,
  max_species_extant = TRUE,
  tree_size_hist = FALSE,
  conditioning = "obs_states",
  non_extinction = TRUE,
  verbose = FALSE,
  max_tries = 1e+06,
  drop_extinct = TRUE,
  start_at_crown = TRUE,
  seed = NULL
)

Arguments

`lambdas`	speciation rates, in the form of a list of matrices.
`mus`	extinction rates, in the form of a vector.
`qs`	The Q matrix, for example the result of function q_doubletrans, but generally in the form of a matrix.
`crown_age`	crown age of the tree, tree will be simulated conditional on non-extinction and this crown age.
`num_concealed_states`	number of concealed states, generally equivalent to the number of examined states in the dataset.
`pool_init_states`	pool of initial states at the crown, in case this is different from all available states, otherwise leave at NULL
`max_spec`	Maximum number of species in the tree (please note that the tree is not conditioned on this number, but that this is a safeguard against generating extremely large trees).
`min_spec`	Minimum number of species in the tree.
`max_species_extant`	Should the maximum number of species be counted in the reconstructed tree (if TRUE) or in the complete tree (if FALSE).
`tree_size_hist`	if TRUE, returns a vector of all found tree sizes.
`conditioning`	can be `"obs_states"`, `"true_states"` or `"none"`, the tree is simulated until one is generated that contains all observed states (`"obs_states"`), all true states (e.g. all combinations of obs and hidden states), or is always returned (`"none"`). Alternatively, a vector with the names of required observed states can be provided, e.g. c("S", "N").
`non_extinction`	boolean stating if the tree should be conditioned on non-extinction of the crown lineages. Defaults to `TRUE`.
`verbose`	sets verbose output; default is `TRUE` when `optimmethod` is `"simplex"`. If `optimmethod` is set to `"simplex"`, then even if set to `FALSE`, optimizer output will be shown.
`max_tries`	maximum number of simulations to try to obtain a tree.
`drop_extinct`	boolean stating if extinct species should be dropped from the tree. Defaults to `TRUE`.
`start_at_crown`	if FALSE, the simulation starts with one species instead of the two assumed by default by secsse (also in ML), and the resulting crown age will be lower than the set crown age. This allows for direct comparison with BiSSE and facilitates implementing speciation effects at the crown.
`seed`	pseudo-random number generator seed.

Value

a list with four properties: phy: reconstructed phylogeny, true_traits: the true traits in order of tip label, obs_traits: observed traits, ignoring hidden traits and lastly: initialState, delineating the initial state at the root used.

Data checking and trait sorting In preparation for likelihood calculation, it orders trait data according the tree tips

Description

Data checking and trait sorting In preparation for likelihood calculation, it orders trait data according the tree tips

Usage

sortingtraits(trait_info, phy)
sortingtraits(trait_info, phy)

Arguments

`trait_info`	data frame where first column has species ids and the second one is the trait associated information.
`phy`	phylogenetic tree of class `phylo`, rooted and with branch lengths.

Value

Vector of traits

Examples

# Some data we have prepared
data(traits)
data('phylo_vignette')
traits <- sortingtraits(traits, phylo_vignette)
# Some data we have prepared
data(traits)
data('phylo_vignette')
traits <- sortingtraits(traits, phylo_vignette)

A table with trait info to run the vignette

Description

An example of trait information in the right format for secsse

Usage

traits
traits

Format

A data frame where each species has a trait state associated

Package 'secsse'

Help Index

Parameter structure setting for cla_secsse It sets the parameters (speciation, extinction and transition) IDs. Needed for ML calculation with cladogenetic options (cla_secsse_ml)

Description

Usage

Arguments

Value

Examples

Likelihood for SecSSE model, using Rcpp Loglikelihood calculation for the cla_SecSSE model given a set of parameters and data using Rcpp

Description

Usage

Arguments

Value

Examples

Maximum likehood estimation for (SecSSE)

Description

Usage

Arguments

Value

Examples

Maximum likehood estimation for (SecSSE) with parameter as complex functions. Cladogenetic version

Description

Usage

Arguments

Value

Examples

Helper function to create a default lambda list

Description

Usage

Arguments

Examples

Helper function to create a default shift_matrix list

Description

Usage

Arguments

Examples

Helper function to automatically create lambda matrices, based on input

Description

Usage

Arguments

Examples

Generate mus vector

Description

Usage

Arguments

Value

Helper function to neatly setup a Q matrix, without transitions to concealed states (only observed transitions shown)

Description

Usage

Arguments

Value

Examples

Event times of a (possibly non-ultrametric) phylogenetic tree

Description

Usage

Arguments

Value

Note

A phylogeny with traits at the tips

Description

Usage

Format

Function to expand an existing q_matrix to a number of concealed states

Description

Usage

Arguments

Value

Note

Extract parameter values out of the result of a maximum likelihood inference run

Description

Usage

Arguments

Value

Helper function to enter parameter value on their right place

Description

Usage

Arguments

Value

Parameter structure setting Sets the parameters (speciation, extinction and transition) ids. Needed for ML calculation (secsse_ml()).

Description

Helper function to create a default `shift_matrix` list

Parameter structure setting Sets the parameters (speciation, extinction and transition) ids. Needed for ML calculation (`secsse_ml()`).