Last updated: 2022-01-24

Checks: 7 0

Knit directory: genes-to-foodweb-stability/

This reproducible R Markdown analysis was created with workflowr (version 1.6.2). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.

R Markdown file: up-to-date

Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.

Environment: empty

Great job! The global environment was empty. Objects defined in the global environment can affect the analysis in your R Markdown file in unknown ways. For reproduciblity it’s best to always run the code in an empty environment.

Seed: set.seed(20200205)

The command set.seed(20200205) was run prior to running the code in the R Markdown file. Setting a seed ensures that any results that rely on randomness, e.g. subsampling or permutations, are reproducible.

Session information: recorded

Great job! Recording the operating system, R version, and package versions is critical for reproducibility.

Cache: none

Nice! There were no cached chunks for this analysis, so you can be confident that you successfully produced the results during this run.

File paths: relative

Great job! Using relative paths to the files within your workflowr project makes it easier to run your code on other machines.

Repository version: 18d1722

Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility.

The results in this page were generated with repository version 18d1722. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.

Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish or wflow_git_commit). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:


Ignored files:
    Ignored:    .Rhistory
    Ignored:    .Rproj.user/
    Ignored:    code/.Rhistory
    Ignored:    output/.Rapp.history

Untracked files:
    Untracked:  output/all.mar1.brm.adj.rds
    Untracked:  output/all.mar1.brm.unadj.ar2.lag.rds
    Untracked:  output/all.mar1.brm.unadj.noBRBRonLYER.rds
    Untracked:  output/all.mar1.brm.unadj.rds
    Untracked:  output/all.mar1.brm.unadj.xAOP2.rds
    Untracked:  output/initial.mar1.brm.adj.rds
    Untracked:  output/initial.mar1.brm.unadj.rds

Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.

These are the previous versions of the repository in which changes were made to the R Markdown (analysis/foodweb-persistence.Rmd) and HTML (docs/foodweb-persistence.html) files. If you’ve configured a remote Git repository (see ?wflow_git_remote), click on the hyperlinks in the table below to view the files as they were in that past version.

File	Version	Author	Date	Message
Rmd	18d1722	mabarbour	2022-01-24	Add mark-recapture and omnibus tests, fix data point error, update analyses, and edit readme
html	c802852	mabarbour	2021-06-24	Build website
Rmd	a054471	mabarbour	2021-06-24	Update location of .rds files to Release v2.1
Rmd	f467085	mabarbour	2021-06-24	Revise analysis to address reviewer’s comments

Wrangle data

# Load and manage data
df <- read_csv("data/InsectAbundanceSurvival.csv") %>%
  # renaming for brevity
  rename(cage = Cage,
         com = Composition,
         week = Week,
         temp = Temperature,
         rich = Richness) %>%
  mutate(cage = as.character(cage),
         temp = ifelse(temp=="20 C", 0, 3), # put on scale such that coef corresponds to 1 C increase
         temp_com = paste(temp, com)) %>% # create indicator for genetic composition nested within temperature for random effect
  arrange(cage, week)

# create data for survival analysis
survival_df <- df %>%
  # counter information is not relevant for survival (because it is the same), so we summarise across it
  group_by(cage, week, temp, rich, Col, gsm1, AOP2, AOP2.gsoh, com, temp_com) %>%
  summarise_at(vars(BRBR_Survival, LYER_Survival, Mummy_Ptoids_Survival), list(mean)) %>%
  ungroup() %>%
  mutate(week_since = week - 2) %>% # week since the full community was added (at week 3)
  filter(week_since > 0) %>%
  mutate(interval_start = week_since - 1,
         interval_stop = week_since) %>%
  #select(-week_since) %>% # no longer needed, replaced by interval_start(stop)
  # contrasts for allelic effects at each gene
  mutate(aop2_vs_AOP2 = Col + gsm1 - AOP2 - AOP2.gsoh,
         mam1_vs_MAM1 = gsm1 - Col,
         gsoh_vs_GSOH = AOP2.gsoh - AOP2) 

## species specific data sets ----

# these will be recombined later to assess extinction across species
# these are in "person-period" format, which is most suitable for discrete-time survival analysis
# note that we transform them to make them appropriate for the continuous-time Cox model.

# BRBR 
BRBR_survival_df <- survival_df %>%
  select(-LYER_Survival, -Mummy_Ptoids_Survival) %>%
  na.omit() %>%
  mutate(Extinction = ifelse(BRBR_Survival == 1, 0, 1))

# LYER
LYER_survival_df <- survival_df %>%
  select(-BRBR_Survival, -Mummy_Ptoids_Survival) %>%
  na.omit() %>%
  mutate(Extinction = ifelse(LYER_Survival == 1, 0, 1))

# Ptoid
Ptoid_survival_df <- survival_df %>%
  select(-BRBR_Survival, -LYER_Survival) %>%
  na.omit() %>%
  mutate(Extinction = ifelse(Mummy_Ptoids_Survival == 1, 0, 1))

# merge species
all_survival_df <- bind_rows(
  mutate(BRBR_survival_df, species = "BRBR"),
  mutate(LYER_survival_df, species = "LYER"),
  mutate(Ptoid_survival_df, species = "Ptoid"))

## create data for cox analysis (continuous time) ----

# BRBR
BRBR_cox_df <- BRBR_survival_df %>%
 arrange(cage, week) %>% 
 group_by(cage) %>% 
 summarise(across(everything(), last))
sum(BRBR_cox_df$Extinction) # 60, as it should be

[1] 60

select(BRBR_cox_df, com, cage, week_since, Extinction) %>% data.frame() # good, no zeros

              com cage week_since Extinction
1  AOP2_AOP2.gsoh    1          5          1
2  gsm1_AOP2.gsoh   10          5          1
3       gsm1_AOP2   11          5          1
4            Poly   12          5          1
5            Poly   13          4          1
6             Col   14          6          1
7       AOP2.gsoh   15          5          1
8        Col_gsm1   16          5          1
9       gsm1_AOP2   17          6          1
10  Col_AOP2.gsoh   18          5          1
11 AOP2_AOP2.gsoh   19          5          1
12           Poly    2          7          1
13      gsm1_AOP2   20          5          1
14       Col_AOP2   21          5          1
15      AOP2.gsoh   22          5          1
16 AOP2_AOP2.gsoh   23          5          1
17           gsm1   24          6          1
18           AOP2   25          4          1
19            Col   26          5          1
20       Col_AOP2   27          5          1
21  Col_AOP2.gsoh   28          4          1
22 gsm1_AOP2.gsoh   29          4          1
23 gsm1_AOP2.gsoh    3          5          1
24       Col_gsm1   30          6          1
25      gsm1_AOP2   31          5          1
26            Col   32          3          1
27            Col   33          3          1
28  Col_AOP2.gsoh   34          4          1
29 AOP2_AOP2.gsoh   35          4          1
30 gsm1_AOP2.gsoh   36          2          1
31 gsm1_AOP2.gsoh   37          4          1
32           Poly   38          2          1
33           gsm1   39          5          1
34           AOP2    4          5          1
35 gsm1_AOP2.gsoh   40          2          1
36       Col_AOP2   41          6          1
37       Col_AOP2   42          4          1
38           Poly   43          4          1
39           AOP2   44          5          1
40       Col_gsm1   45          4          1
41           Poly   46          4          1
42      gsm1_AOP2   47          2          1
43 AOP2_AOP2.gsoh   48          4          1
44       Col_gsm1   49          5          1
45           gsm1    5          6          1
46       Col_AOP2   50          4          1
47           gsm1   51          3          1
48           AOP2   52          3          1
49           Poly   53          6          1
50      AOP2.gsoh   54          2          1
51  Col_AOP2.gsoh   55          5          1
52      AOP2.gsoh   56          4          1
53 AOP2_AOP2.gsoh   57          4          1
54  Col_AOP2.gsoh   58          3          1
55       Col_gsm1   59          3          1
56       Col_AOP2    6          5          1
57      gsm1_AOP2   60          3          1
58           Poly    7          5          1
59  Col_AOP2.gsoh    8          4          1
60       Col_gsm1    9          5          1

# LYER
LYER_cox_df <- LYER_survival_df %>%
 arrange(cage, week) %>% 
 group_by(cage) %>% 
 summarise(across(everything(), last))
sum(LYER_cox_df$Extinction) # 28, as it should be

[1] 28

select(LYER_cox_df, com, cage, week_since, Extinction) %>% data.frame() # good, all zeros correspond to week 15, the last week of the experiment in terms of 'week_since'

              com cage week_since Extinction
1  AOP2_AOP2.gsoh    1          5          1
2  gsm1_AOP2.gsoh   10          5          1
3       gsm1_AOP2   11          5          1
4            Poly   12         15          0
5            Poly   13         15          0
6             Col   14         15          0
7       AOP2.gsoh   15         13          1
8        Col_gsm1   16          5          1
9       gsm1_AOP2   17         15          0
10  Col_AOP2.gsoh   18         14          1
11 AOP2_AOP2.gsoh   19          5          1
12           Poly    2          7          1
13      gsm1_AOP2   20         15          0
14       Col_AOP2   21          5          1
15      AOP2.gsoh   22         15          0
16 AOP2_AOP2.gsoh   23         15          0
17           gsm1   24          6          1
18           AOP2   25          6          1
19            Col   26          6          1
20       Col_AOP2   27          6          1
21  Col_AOP2.gsoh   28          6          1
22 gsm1_AOP2.gsoh   29         13          1
23 gsm1_AOP2.gsoh    3          5          1
24       Col_gsm1   30         15          0
25      gsm1_AOP2   31         15          0
26            Col   32         15          0
27            Col   33         15          0
28  Col_AOP2.gsoh   34         15          0
29 AOP2_AOP2.gsoh   35         15          0
30 gsm1_AOP2.gsoh   36         13          1
31 gsm1_AOP2.gsoh   37         15          0
32           Poly   38         15          0
33           gsm1   39         15          0
34           AOP2    4         15          0
35 gsm1_AOP2.gsoh   40         15          0
36       Col_AOP2   41         15          0
37       Col_AOP2   42          5          1
38           Poly   43         15          0
39           AOP2   44          5          1
40       Col_gsm1   45         15          0
41           Poly   46         15          0
42      gsm1_AOP2   47         15          0
43 AOP2_AOP2.gsoh   48         15          0
44       Col_gsm1   49         15          0
45           gsm1    5         15          0
46       Col_AOP2   50         15          0
47           gsm1   51          5          1
48           AOP2   52         15          0
49           Poly   53         15          0
50      AOP2.gsoh   54         13          1
51  Col_AOP2.gsoh   55          6          1
52      AOP2.gsoh   56          5          1
53 AOP2_AOP2.gsoh   57          5          1
54  Col_AOP2.gsoh   58         15          0
55       Col_gsm1   59         15          0
56       Col_AOP2    6         14          1
57      gsm1_AOP2   60          5          1
58           Poly    7          7          1
59  Col_AOP2.gsoh    8          6          1
60       Col_gsm1    9         15          0

# Ptoid
Ptoid_cox_df <- Ptoid_survival_df %>%
 arrange(cage, week) %>% 
 group_by(cage) %>% 
 summarise(across(everything(), last))
sum(Ptoid_cox_df$Extinction) # 53, as it should be

[1] 52

select(Ptoid_cox_df, com, cage, week_since, Extinction) %>% data.frame() # good, all zeros correspond to week 15

              com cage week_since Extinction
1  AOP2_AOP2.gsoh    1          6          1
2  gsm1_AOP2.gsoh   10          6          1
3       gsm1_AOP2   11          6          1
4            Poly   12         15          0
5            Poly   13         15          0
6             Col   14          8          1
7       AOP2.gsoh   15         14          1
8        Col_gsm1   16          7          1
9       gsm1_AOP2   17          7          1
10  Col_AOP2.gsoh   18         15          1
11 AOP2_AOP2.gsoh   19          7          1
12           Poly    2          9          1
13      gsm1_AOP2   20         11          1
14       Col_AOP2   21          6          1
15      AOP2.gsoh   22          6          1
16 AOP2_AOP2.gsoh   23         14          1
17           gsm1   24          7          1
18           AOP2   25          8          1
19            Col   26          7          1
20       Col_AOP2   27          8          1
21  Col_AOP2.gsoh   28          7          1
22 gsm1_AOP2.gsoh   29         13          1
23 gsm1_AOP2.gsoh    3          6          1
24       Col_gsm1   30         12          1
25      gsm1_AOP2   31         12          1
26            Col   32         13          1
27            Col   33          8          1
28  Col_AOP2.gsoh   34          7          1
29 AOP2_AOP2.gsoh   35          5          1
30 gsm1_AOP2.gsoh   36         13          1
31 gsm1_AOP2.gsoh   37         15          0
32           Poly   38         12          1
33           gsm1   39          7          1
34           AOP2    4          7          1
35 gsm1_AOP2.gsoh   40         14          1
36       Col_AOP2   41          8          1
37       Col_AOP2   42          5          1
38           Poly   43         10          1
39           AOP2   44          6          1
40       Col_gsm1   45         15          0
41           Poly   46          6          1
42      gsm1_AOP2   47         15          0
43 AOP2_AOP2.gsoh   48          7          1
44       Col_gsm1   49         11          1
45           gsm1    5         14          1
46       Col_AOP2   50         15          0
47           gsm1   51          6          1
48           AOP2   52          9          1
49           Poly   53         15          0
50      AOP2.gsoh   54         14          1
51  Col_AOP2.gsoh   55          8          1
52      AOP2.gsoh   56          6          1
53 AOP2_AOP2.gsoh   57          7          1
54  Col_AOP2.gsoh   58         15          0
55       Col_gsm1   59         10          1
56       Col_AOP2    6         14          1
57      gsm1_AOP2   60          6          1
58           Poly    7          6          1
59  Col_AOP2.gsoh    8          6          1
60       Col_gsm1    9         13          1

# merge continuous data
all_cox_df <- bind_rows(
  mutate(BRBR_cox_df, species = "BRBR"), 
  mutate(LYER_cox_df, species = "LYER"),
  mutate(Ptoid_cox_df, species = "Ptoid")) %>%
  select(cage, temp:temp_com, week_since, aop2_vs_AOP2:gsoh_vs_GSOH, species, Extinction) %>%
  mutate(temp_species = paste(temp, species)) %>%
  arrange(cage)
# all looks good
select(all_cox_df, cage, species, week_since, Extinction) %>% data.frame()

    cage species week_since Extinction
1      1    BRBR          5          1
2      1    LYER          5          1
3      1   Ptoid          6          1
4     10    BRBR          5          1
5     10    LYER          5          1
6     10   Ptoid          6          1
7     11    BRBR          5          1
8     11    LYER          5          1
9     11   Ptoid          6          1
10    12    BRBR          5          1
11    12    LYER         15          0
12    12   Ptoid         15          0
13    13    BRBR          4          1
14    13    LYER         15          0
15    13   Ptoid         15          0
16    14    BRBR          6          1
17    14    LYER         15          0
18    14   Ptoid          8          1
19    15    BRBR          5          1
20    15    LYER         13          1
21    15   Ptoid         14          1
22    16    BRBR          5          1
23    16    LYER          5          1
24    16   Ptoid          7          1
25    17    BRBR          6          1
26    17    LYER         15          0
27    17   Ptoid          7          1
28    18    BRBR          5          1
29    18    LYER         14          1
30    18   Ptoid         15          1
31    19    BRBR          5          1
32    19    LYER          5          1
33    19   Ptoid          7          1
34     2    BRBR          7          1
35     2    LYER          7          1
36     2   Ptoid          9          1
37    20    BRBR          5          1
38    20    LYER         15          0
39    20   Ptoid         11          1
40    21    BRBR          5          1
41    21    LYER          5          1
42    21   Ptoid          6          1
43    22    BRBR          5          1
44    22    LYER         15          0
45    22   Ptoid          6          1
46    23    BRBR          5          1
47    23    LYER         15          0
48    23   Ptoid         14          1
49    24    BRBR          6          1
50    24    LYER          6          1
51    24   Ptoid          7          1
52    25    BRBR          4          1
53    25    LYER          6          1
54    25   Ptoid          8          1
55    26    BRBR          5          1
56    26    LYER          6          1
57    26   Ptoid          7          1
58    27    BRBR          5          1
59    27    LYER          6          1
60    27   Ptoid          8          1
61    28    BRBR          4          1
62    28    LYER          6          1
63    28   Ptoid          7          1
64    29    BRBR          4          1
65    29    LYER         13          1
66    29   Ptoid         13          1
67     3    BRBR          5          1
68     3    LYER          5          1
69     3   Ptoid          6          1
70    30    BRBR          6          1
71    30    LYER         15          0
72    30   Ptoid         12          1
73    31    BRBR          5          1
74    31    LYER         15          0
75    31   Ptoid         12          1
76    32    BRBR          3          1
77    32    LYER         15          0
78    32   Ptoid         13          1
79    33    BRBR          3          1
80    33    LYER         15          0
81    33   Ptoid          8          1
82    34    BRBR          4          1
83    34    LYER         15          0
84    34   Ptoid          7          1
85    35    BRBR          4          1
86    35    LYER         15          0
87    35   Ptoid          5          1
88    36    BRBR          2          1
89    36    LYER         13          1
90    36   Ptoid         13          1
91    37    BRBR          4          1
92    37    LYER         15          0
93    37   Ptoid         15          0
94    38    BRBR          2          1
95    38    LYER         15          0
96    38   Ptoid         12          1
97    39    BRBR          5          1
98    39    LYER         15          0
99    39   Ptoid          7          1
100    4    BRBR          5          1
101    4    LYER         15          0
102    4   Ptoid          7          1
103   40    BRBR          2          1
104   40    LYER         15          0
105   40   Ptoid         14          1
106   41    BRBR          6          1
107   41    LYER         15          0
108   41   Ptoid          8          1
109   42    BRBR          4          1
110   42    LYER          5          1
111   42   Ptoid          5          1
112   43    BRBR          4          1
113   43    LYER         15          0
114   43   Ptoid         10          1
115   44    BRBR          5          1
116   44    LYER          5          1
117   44   Ptoid          6          1
118   45    BRBR          4          1
119   45    LYER         15          0
120   45   Ptoid         15          0
121   46    BRBR          4          1
122   46    LYER         15          0
123   46   Ptoid          6          1
124   47    BRBR          2          1
125   47    LYER         15          0
126   47   Ptoid         15          0
127   48    BRBR          4          1
128   48    LYER         15          0
129   48   Ptoid          7          1
130   49    BRBR          5          1
131   49    LYER         15          0
132   49   Ptoid         11          1
133    5    BRBR          6          1
134    5    LYER         15          0
135    5   Ptoid         14          1
136   50    BRBR          4          1
137   50    LYER         15          0
138   50   Ptoid         15          0
139   51    BRBR          3          1
140   51    LYER          5          1
141   51   Ptoid          6          1
142   52    BRBR          3          1
143   52    LYER         15          0
144   52   Ptoid          9          1
145   53    BRBR          6          1
146   53    LYER         15          0
147   53   Ptoid         15          0
148   54    BRBR          2          1
149   54    LYER         13          1
150   54   Ptoid         14          1
151   55    BRBR          5          1
152   55    LYER          6          1
153   55   Ptoid          8          1
154   56    BRBR          4          1
155   56    LYER          5          1
156   56   Ptoid          6          1
157   57    BRBR          4          1
158   57    LYER          5          1
159   57   Ptoid          7          1
160   58    BRBR          3          1
161   58    LYER         15          0
162   58   Ptoid         15          0
163   59    BRBR          3          1
164   59    LYER         15          0
165   59   Ptoid         10          1
166    6    BRBR          5          1
167    6    LYER         14          1
168    6   Ptoid         14          1
169   60    BRBR          3          1
170   60    LYER          5          1
171   60   Ptoid          6          1
172    7    BRBR          5          1
173    7    LYER          7          1
174    7   Ptoid          6          1
175    8    BRBR          4          1
176    8    LYER          6          1
177    8   Ptoid          6          1
178    9    BRBR          5          1
179    9    LYER         15          0
180    9   Ptoid         13          1

# data is 180 rows as it should (60 cages * 3 species)

Stratified Cox proportional-hazards model

Fit initial model:

# initial model, proportional hazards assumption
cox.zph(coxph(Surv(week_since, Extinction) ~ strata(species) + temp + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH + cluster(cage), all_cox_df))

              chisq df       p
temp         18.406  1 1.8e-05
rich          1.562  1 0.21136
aop2_vs_AOP2  0.214  1 0.64364
mam1_vs_MAM1  0.208  1 0.64805
gsoh_vs_GSOH  0.215  1 0.64266
GLOBAL       21.053  5 0.00079

# clear evidence that temperature violates the proportional hazards assumption
cox.zph(coxph(Surv(week_since, Extinction) ~ strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH + cluster(cage), all_cox_df))

             chisq df    p
rich         1.484  1 0.22
aop2_vs_AOP2 0.393  1 0.53
mam1_vs_MAM1 0.228  1 0.63
gsoh_vs_GSOH 0.385  1 0.53
GLOBAL       2.704  4 0.61

# proportional hazards assumption is met for the global model and each covariate.

Since our temperature treatment violated the proportional hazards assumption, we stratified by species and temperature in subsequent models. Note that we don’t have to retest the cox proportional hazards assumption because the strata remain the same and clustering doesn’t affect this test.

# cluster at com level, for testing genetic effects
coxph_com <- coxph(Surv(week_since, Extinction) ~ strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH + cluster(com), all_cox_df)
summary(coxph_com)

Call:
coxph(formula = Surv(week_since, Extinction) ~ strata(species, 
    temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH, 
    data = all_cox_df, cluster = com)

  n= 180, number of events= 140 

                  coef exp(coef)  se(coef) robust se      z Pr(>|z|)    
rich         -0.214315  0.807094  0.098234  0.045985 -4.661 3.15e-06 ***
aop2_vs_AOP2 -0.179598  0.835606  0.084043  0.025718 -6.983 2.88e-12 ***
mam1_vs_MAM1 -0.006885  0.993139  0.118880  0.061874 -0.111    0.911    
gsoh_vs_GSOH  0.091267  1.095562  0.118729  0.066793  1.366    0.172    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

             exp(coef) exp(-coef) lower .95 upper .95
rich            0.8071     1.2390    0.7375    0.8832
aop2_vs_AOP2    0.8356     1.1967    0.7945    0.8788
mam1_vs_MAM1    0.9931     1.0069    0.8797    1.1212
gsoh_vs_GSOH    1.0956     0.9128    0.9611    1.2488

Concordance= 0.599  (se = 0.02 )
Likelihood ratio test= 10.34  on 4 df,   p=0.04
Wald test            = 95.88  on 4 df,   p=<2e-16
Score (logrank) test = 10.4  on 4 df,   p=0.03,   Robust = 6.34  p=0.2

  (Note: the likelihood ratio and score tests assume independence of
     observations within a cluster, the Wald and robust score tests do not).

# cluster at cage
coxph_cage <- coxph(Surv(week_since, Extinction) ~ strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH + cluster(cage), all_cox_df)
summary(coxph_cage) # output for Table S1

Call:
coxph(formula = Surv(week_since, Extinction) ~ strata(species, 
    temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH, 
    data = all_cox_df, cluster = cage)

  n= 180, number of events= 140 

                  coef exp(coef)  se(coef) robust se      z Pr(>|z|)  
rich         -0.214315  0.807094  0.098234  0.101822 -2.105   0.0353 *
aop2_vs_AOP2 -0.179598  0.835606  0.084043  0.077496 -2.318   0.0205 *
mam1_vs_MAM1 -0.006885  0.993139  0.118880  0.128348 -0.054   0.9572  
gsoh_vs_GSOH  0.091267  1.095562  0.118729  0.113650  0.803   0.4219  
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

             exp(coef) exp(-coef) lower .95 upper .95
rich            0.8071     1.2390    0.6611    0.9854
aop2_vs_AOP2    0.8356     1.1967    0.7179    0.9727
mam1_vs_MAM1    0.9931     1.0069    0.7723    1.2772
gsoh_vs_GSOH    1.0956     0.9128    0.8768    1.3689

Concordance= 0.599  (se = 0.034 )
Likelihood ratio test= 10.34  on 4 df,   p=0.04
Wald test            = 11.84  on 4 df,   p=0.02
Score (logrank) test = 10.4  on 4 df,   p=0.03,   Robust = 9.86  p=0.04

  (Note: the likelihood ratio and score tests assume independence of
     observations within a cluster, the Wald and robust score tests do not).

# show that AICc gives same inference to satisfy a reviewer
AICcmodavg::aictab(list(full = coxph_cage, 
                        reduced = update(coxph_cage, .~. -mam1_vs_MAM1 -gsoh_vs_GSOH), 
                        null = update(coxph_cage, .~. -mam1_vs_MAM1 -gsoh_vs_GSOH -rich -aop2_vs_AOP2)))


Model selection based on AICc:

        K   AICc Delta_AICc AICcWt Cum.Wt      LL
reduced 2 749.91       0.00   0.82   0.82 -372.92
full    4 753.48       3.56   0.14   0.95 -372.62
null    0 755.59       5.68   0.05   1.00 -377.79

# account for aop2 genotype effect first
summary(coxph(Surv(week_since, Extinction) ~ strata(species, temp) + I(Col+gsm1) + rich + cluster(cage), all_cox_df))

Call:
coxph(formula = Surv(week_since, Extinction) ~ strata(species, 
    temp) + I(Col + gsm1) + rich, data = all_cox_df, cluster = cage)

  n= 180, number of events= 140 

                  coef exp(coef) se(coef) robust se      z Pr(>|z|)  
I(Col + gsm1) -0.36139   0.69671  0.16809   0.15468 -2.336   0.0195 *
rich          -0.03614   0.96450  0.12993   0.13535 -0.267   0.7894  
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

              exp(coef) exp(-coef) lower .95 upper .95
I(Col + gsm1)    0.6967      1.435    0.5145    0.9434
rich             0.9645      1.037    0.7398    1.2575

Concordance= 0.577  (se = 0.034 )
Likelihood ratio test= 9.74  on 2 df,   p=0.008
Wald test            = 11.6  on 2 df,   p=0.003
Score (logrank) test = 9.7  on 2 df,   p=0.008,   Robust = 9.01  p=0.01

  (Note: the likelihood ratio and score tests assume independence of
     observations within a cluster, the Wald and robust score tests do not).

# cluster at cage, interactions with strata
coxph_cage_by_tempstrata <- coxph(Surv(week_since, Extinction) ~ strata(species) + strata(temp)*(rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH) + cluster(cage), all_cox_df)

# assume each row of data is independent (which of course isn't true, 3 species per cage)
coxph_indep <- coxph(Surv(week_since, Extinction) ~ strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH, all_cox_df)
summary(coxph_indep)

Call:
coxph(formula = Surv(week_since, Extinction) ~ strata(species, 
    temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH, 
    data = all_cox_df)

  n= 180, number of events= 140 

                  coef exp(coef)  se(coef)      z Pr(>|z|)  
rich         -0.214315  0.807094  0.098234 -2.182   0.0291 *
aop2_vs_AOP2 -0.179598  0.835606  0.084043 -2.137   0.0326 *
mam1_vs_MAM1 -0.006885  0.993139  0.118880 -0.058   0.9538  
gsoh_vs_GSOH  0.091267  1.095562  0.118729  0.769   0.4421  
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

             exp(coef) exp(-coef) lower .95 upper .95
rich            0.8071     1.2390    0.6657    0.9785
aop2_vs_AOP2    0.8356     1.1967    0.7087    0.9852
mam1_vs_MAM1    0.9931     1.0069    0.7867    1.2537
gsoh_vs_GSOH    1.0956     0.9128    0.8681    1.3826

Concordance= 0.599  (se = 0.033 )
Likelihood ratio test= 10.34  on 4 df,   p=0.04
Wald test            = 10.24  on 4 df,   p=0.04
Score (logrank) test = 10.4  on 4 df,   p=0.03

anova(coxph_indep)

Analysis of Deviance Table
 Cox model: response is Surv(week_since, Extinction)
Terms added sequentially (first to last)

              loglik  Chisq Df Pr(>|Chi|)  
NULL         -377.79                       
rich         -375.24 5.1031  1    0.02388 *
aop2_vs_AOP2 -372.92 4.6398  1    0.03124 *
mam1_vs_MAM1 -372.92 0.0050  1    0.94352  
gsoh_vs_GSOH -372.62 0.5923  1    0.44154  
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

# note that likelihood estimates are the same among all coxph models, clearly indicating that the likelihood ratio test doesn't account for clustered variables
coxph_com$loglik

[1] -377.7946 -372.6245

coxph_cage$loglik

[1] -377.7946 -372.6245

coxph_indep$loglik

[1] -377.7946 -372.6245

# random effect of cage
coxme_cage <- coxme(Surv(week_since, Extinction) ~ strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH + (1|cage), all_cox_df)
summary(coxme_cage)

Cox mixed-effects model fit by maximum likelihood
  Data: all_cox_df
  events, n = 140, 180
  Iterations= 52 212 
                    NULL Integrated    Fitted
Log-likelihood -377.7946  -372.5176 -365.9144

                  Chisq   df         p  AIC    BIC
Integrated loglik 10.55 5.00 0.0609770 0.55 -14.15
 Penalized loglik 23.76 9.87 0.0077103 4.02 -25.02

Model:  Surv(week_since, Extinction) ~ strata(species, temp) + rich +      aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH + (1 | cage) 
Fixed coefficients
                     coef exp(coef)   se(coef)     z     p
rich         -0.217070092 0.8048736 0.10570121 -2.05 0.040
aop2_vs_AOP2 -0.179799378 0.8354378 0.08986566 -2.00 0.045
mam1_vs_MAM1 -0.004221433 0.9957875 0.12762438 -0.03 0.970
gsoh_vs_GSOH  0.092409607 1.0968140 0.12675226  0.73 0.470

Random effects
 Group Variable  Std Dev    Variance  
 cage  Intercept 0.23828315 0.05677886

# note that if we formally compare models, there is not clear evidence to include cage as a random effect
anova(coxph_cage, coxme_cage)

Analysis of Deviance Table
 Cox model: response is  Surv(week_since, Extinction)
 Model 1: ~strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 +      gsoh_vs_GSOH
 Model 2: ~strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 +      gsoh_vs_GSOH + (1 | cage)
   loglik  Chisq Df P(>|Chi|)
1 -372.62                    
2 -372.52 0.2137  1    0.6438

# random effect of cage, com, and temp_com
coxme_allRE <- coxme(Surv(week_since, Extinction) ~ strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH + (1|com/temp/cage), all_cox_df)
summary(coxme_allRE)

Cox mixed-effects model fit by maximum likelihood
  Data: all_cox_df
  events, n = 140, 180
  Iterations= 50 204 
                    NULL Integrated    Fitted
Log-likelihood -377.7946  -372.5182 -365.9064

                  Chisq   df         p   AIC    BIC
Integrated loglik 10.55 7.00 0.1593500 -3.45 -24.04
 Penalized loglik 23.78 9.88 0.0076995  4.02 -25.05

Model:  Surv(week_since, Extinction) ~ strata(species, temp) + rich +      aop2_vs_AOP2 + mam1_vs_MAM1 + gsoh_vs_GSOH + (1 | com/temp/cage) 
Fixed coefficients
                     coef exp(coef)   se(coef)     z     p
rich         -0.217071855 0.8048721 0.10571966 -2.05 0.040
aop2_vs_AOP2 -0.179799630 0.8354376 0.08987895 -2.00 0.045
mam1_vs_MAM1 -0.004216846 0.9957920 0.12764318 -0.03 0.970
gsoh_vs_GSOH  0.092412399 1.0968171 0.12677050  0.73 0.470

Random effects
 Group         Variable    Std Dev      Variance    
 com/temp/cage (Intercept) 2.384245e-01 5.684623e-02
 com/temp      (Intercept) 3.920002e-03 1.536642e-05
 com           (Intercept) 2.607809e-03 6.800670e-06

# certainly not good evidence to support this more complex model
anova(coxme_cage, coxme_allRE)

Analysis of Deviance Table
 Cox model: response is  Surv(week_since, Extinction)
 Model 1: ~strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 +      gsoh_vs_GSOH + (1 | cage)
 Model 2: ~strata(species, temp) + rich + aop2_vs_AOP2 + mam1_vs_MAM1 +      gsoh_vs_GSOH + (1 | com/temp/cage)
   loglik  Chisq Df P(>|Chi|)
1 -372.52                    
2 -372.52 0.0012  2    0.9994

Reproduce Fig. 2

tidy_coxph_cage <- tidy(coxph_cage, exponentiate = T, conf.int = T, conf.level = 0.95) %>% 
  mutate(model = "cluster(cage)",
         SE = robust.se)

plot_keystone_gene <- tidy_coxph_cage %>%
  filter(term != "rich") %>%
  ggplot(aes(x = term, y = estimate)) +
  geom_point(size = 3) +
  geom_linerange(aes(ymin = estimate - SE, ymax = estimate + SE), size = 1.5) +
  geom_linerange(aes(ymin = conf.low, ymax = conf.high)) +
  geom_hline(yintercept = 1, linetype = "dotted") +
  scale_x_discrete(labels = c(expression(italic(AOP2)),expression(italic(MAM1)),expression(italic(GSOH)))) +
  scale_y_continuous(name = expression("Extinction rate ratio"), trans = "log", breaks = c(0.8, 1.0, 1.2, 1.4)) + 
  xlab("")

x11(); plot_keystone_gene
#ggsave(plot = plot_keystone_gene, filename = "figures/keystone-gene.pdf", height = 5, width = 6)

Version	Author	Date
c802852	mabarbour	2021-06-24

Reproduce Fig. S1

tidy_coxph_com <- tidy(coxph_com, exponentiate = T, conf.int = T, conf.level = 0.95) %>% 
  mutate(model = "cluster(com)",
         SE = robust.se)
tidy_coxph_indep <- tidy(coxph_indep, exponentiate = T, conf.int = T, conf.level = 0.95) %>% 
  mutate(model = "independent",
         SE = std.error)
tidy_coxme_cage <- tidy(coxme_cage, exponentiate = T, conf.level = 0.95) %>% 
  mutate(model = "(1|cage)",
         SE = std.error)
tidy_coxme_allRE <- tidy(coxme_allRE, exponentiate = T, conf.level = 0.95) %>% 
  mutate(model = "(1|com/temp/cage)",
         SE = std.error)

plot_fig_S1 <- bind_rows(tidy_coxph_com, tidy_coxph_cage, tidy_coxph_indep, tidy_coxme_cage, tidy_coxme_allRE) %>%
  mutate(model = factor(model, levels = c("cluster(com)","cluster(cage)","independent","(1|cage)","(1|com/temp/cage)"))) %>%
  filter(term != "rich") %>%
  ggplot(aes(x = term, y = estimate, group = model, color = model)) +
  geom_point(size = 3, position = position_dodge(width = 0.5)) +
  geom_linerange(aes(ymin = estimate - SE, ymax = estimate + SE), size = 1.5, position = position_dodge(width = 0.5)) +
  geom_linerange(aes(ymin = conf.low, ymax = conf.high), position = position_dodge(width = 0.5)) +
  geom_hline(yintercept = 1, linetype = "dotted") +
  scale_x_discrete(labels = c(expression(italic(AOP2)),expression(italic(MAM1)),expression(italic(GSOH)))) +
  #scale_y_continuous(name = expression("Effect on extinction rate "(Delta~"%")), trans = "log", breaks = c(0.8, 1.0, 1.2, 1.4), labels = c(-20,0,20,40)) +  # -percent.pdf
  scale_y_continuous(name = expression("Extinction rate ratio"), trans = "log", breaks = c(0.8, 1.0, 1.2, 1.4)) + # -ratio.pdf
  scale_color_viridis_d(name = "Model") +
  xlab("")

x11(); plot_fig_S1
#ggsave(plot = plot_fig_S1, filename = "figures/keystone-effect-model-comparison-ratio.pdf", height = 5, width = 6)

Version	Author	Date
c802852	mabarbour	2021-06-24

Reproduce Fig. S2 and S3

# need to convert to data frame from a tibble for plotting (known issue: https://github.com/kassambara/survminer/issues/501)
all_cox_df.df <- as.data.frame(all_cox_df) %>%
  mutate(aop2_genos = Col+gsm1)

# add aop2 information to genetic composition
com_aop2 <- select(all_cox_df.df, variable = com, rich, aop2_genos, aop2_vs_AOP2) %>% distinct()

# get adjusted survival curves for each genetic composition
com_surv_adj <- surv_adjustedcurves(coxph(Surv(week_since, Extinction) ~ strata(temp, species) + com + cluster(cage), data = all_cox_df.df), variable = "com", method = "conditional") %>%
  left_join(., com_aop2)

Warning in .get_data(fit, data): The `data` argument is not provided. Data will
be extracted from model fit.

aop2_surv_adj <- surv_adjustedcurves(coxph(Surv(week_since, Extinction) ~ strata(temp, species) + aop2_genos + cluster(cage), data = all_cox_df.df), variable = "aop2_genos", method = "conditional") %>%
  mutate(aop2_genos = as.numeric(variable)-1)

Warning in .get_data(fit, data): The `data` argument is not provided. Data will
be extracted from model fit.

## Reproduce Fig. S2
# without rich = 4 to emphasize keystone gene effect
plot_fig_S2 <- com_surv_adj %>%
  filter(rich != 4) %>% 
  mutate(frich = factor(rich, levels = c(1,2), labels = c("Richness = 1","Richness = 2"))) %>%
  ggplot(aes(x = time+2, y = surv)) + 
  geom_step(aes(group = variable, color = aop2_vs_AOP2), size = 1) +
  scale_color_viridis_c(breaks = c(-2,-1,0,1,2), name = "AOP2\u2013 vs. AOP2+") +
  ylab("Survival probability") +
  scale_x_continuous(name = "Week of experiment") + # , breaks = 2:17
  facet_wrap(~frich, nrow = 1)
x11(); plot_fig_S2
#ggsave(plot = plot_fig_S2, filename = "figures/keystone-coxadjcurve.pdf", width = 6, height = 5, device=cairo_pdf)

## Reproduce Fig. S3
# ideal thing about this plot is that it properly adjusts for strata and it displays uncertainty at the level of genetic composition.
# confidence intervals from the kaplan meir plot are incorrect because they don't account for strata (this was confirmed by looking at survdiff log-rank test)
plot_fig_S3 <- ggplot(com_surv_adj, aes(x = time+2, y = surv)) + 
  geom_step(aes(group = variable, color = aop2_genos), alpha = 0.5) +
  geom_step(data = aop2_surv_adj, aes(group = aop2_genos, color = aop2_genos), size = 1.5) +
  scale_color_viridis_c(breaks = c(0,1,2), name = "AOP2\u2013\ngenotypes") +
  ylab("Survival probability") +
  scale_x_continuous(name = "Week of experiment", breaks = 2:17)

Version	Author	Date
c802852	mabarbour	2021-06-24

x11(); plot_fig_S3
#ggsave(plot = plot_fig_S3, filename = "figures/aop2-genos-coxadjcurve.pdf", width = 6, height = 5, device=cairo_pdf)

Version	Author	Date
c802852	mabarbour	2021-06-24

sessionInfo()

R version 4.1.2 (2021-11-01)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 16.04.7 LTS

Matrix products: default
BLAS:   /usr/lib/libblas/libblas.so.3.6.0
LAPACK: /usr/lib/lapack/liblapack.so.3.6.0

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8    
 [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
 [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
 [9] LC_ADDRESS=C               LC_TELEPHONE=C            
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] survMisc_0.5.5     ehahelper_0.3.9999 broom_0.7.11       survminer_0.4.9   
 [5] ggpubr_0.4.0       coxme_2.2-16       bdsmatrix_1.3-4    survival_3.2-13   
 [9] cowplot_1.1.1      forcats_0.5.1      stringr_1.4.0      dplyr_1.0.7       
[13] purrr_0.3.4        readr_2.1.1        tidyr_1.1.4        tibble_3.1.6      
[17] ggplot2_3.3.5      tidyverse_1.3.1    workflowr_1.6.2   

loaded via a namespace (and not attached):
 [1] VGAM_1.1-5        colorspace_2.0-2  ggsignif_0.6.2    ellipsis_0.3.2   
 [5] rio_0.5.26        rprojroot_2.0.2   fs_1.5.2          eha_2.9.0        
 [9] rstudioapi_0.13   farver_2.1.0      bit64_4.0.5       fansi_1.0.0      
[13] lubridate_1.8.0   xml2_1.3.3        codetools_0.2-18  splines_4.1.2    
[17] knitr_1.37        jsonlite_1.7.2    km.ci_0.5-2       dbplyr_2.1.1     
[21] compiler_4.1.2    httr_1.4.2        backports_1.4.1   assertthat_0.2.1 
[25] Matrix_1.4-0      fastmap_1.1.0     cli_3.1.0         later_1.3.0      
[29] htmltools_0.5.2   tools_4.1.2       gtable_0.3.0      glue_1.6.0       
[33] Rcpp_1.0.7        carData_3.0-4     cellranger_1.1.0  jquerylib_0.1.4  
[37] raster_3.4-10     vctrs_0.3.8       nlme_3.1-152      xfun_0.29        
[41] openxlsx_4.2.4    rvest_1.0.2       lifecycle_1.0.1   rstatix_0.7.0    
[45] MASS_7.3-54       zoo_1.8-9         scales_1.1.1      vroom_1.5.7      
[49] hms_1.1.1         promises_1.2.0.1  parallel_4.1.2    yaml_2.2.1       
[53] curl_4.3.2        gridExtra_2.3     KMsurv_0.1-5      sass_0.4.0       
[57] stringi_1.7.3     highr_0.9         zip_2.2.0         rlang_0.4.12     
[61] pkgconfig_2.0.3   evaluate_0.14     lattice_0.20-45   labeling_0.4.2   
[65] bit_4.0.4         tidyselect_1.1.1  plyr_1.8.6        magrittr_2.0.1   
[69] R6_2.5.1          generics_0.1.1    DBI_1.1.2         pillar_1.6.4     
[73] haven_2.4.3       whisker_0.4       foreign_0.8-81    withr_2.4.3      
[77] abind_1.4-5       sp_1.4-5          modelr_0.1.8      crayon_1.4.2     
[81] car_3.0-10        utf8_1.2.2        tzdb_0.2.0        rmarkdown_2.11   
[85] grid_4.1.2        readxl_1.3.1      data.table_1.14.2 git2r_0.28.0     
[89] AICcmodavg_2.3-1  reprex_2.0.1      digest_0.6.29     xtable_1.8-3     
[93] httpuv_1.6.5      stats4_4.1.2      munsell_0.5.0     unmarked_1.1.1   
[97] viridisLite_0.4.0 bslib_0.3.1

Food-web persistence

Matthew Barbour

2022-01-24

Wrangle data

Stratified Cox proportional-hazards model

Reproduce Fig. 2

Reproduce Fig. S1

Reproduce Fig. S2 and S3