This website converted the following original .R scripts into .rmd files.

  • Rscript01DataFormat.R
  • RScriptSNADescriptives.R
  • Rscript02SienaVariableFormat.R
  • Rscript03SienaRunModel.R
  • Rscript04SienaBehaviour.R

Please visit GitHub for the latest .R files.


Data

All files (data, scripts, etc.) can also be found on Github


Contact

Specific questions with respect to the .rmd files can be addressed to: Jochem Tolsma.

For questions on RSiena please visit the designated GitHub page.


Introduction

Let us start by loading the data

library(RSiena)
friend.data.w1 <- s501
friend.data.w2 <- s502
friend.data.w3 <- s503
drink <- s50a
smoke <- s50s

A number of objects need to be created in R, as preparations to letting siena07 execute the estimation. This will be indicated by:

  • A: dependent variables;
  • B: explanatory variables;
  • C: combination of dependent and explanatory variables;
  • D: model specification.

A. dependent variables

First we have to create objects for the dependent variables.

sienaDependent() creates a sienaDependent object, here a network, from a matrix or array or list of sparse matrix of triples. This object will have the role of a dependent variable in the analysis.

The name of this network object (here: friendship) will be used in the output file.

friendship <- sienaDependent(array(c(friend.data.w1, friend.data.w2, friend.data.w3), dim = c(50, 50,
    3)))

The integers in the dim() here refer to the number of nodes (senders, receivers) and the number of waves. This object is an array of dimension 50 x 50 x 3, representing three adjacency matrices, with a number of attributes.

You can ask for a brief decription of this object simply by typing:

print(friendship)
#> Type         oneMode             
#> Observations 3                   
#> Nodeset      Actors (50 elements)
# for which the shorthand is

friendship
#> Type         oneMode             
#> Observations 3                   
#> Nodeset      Actors (50 elements)

Note that this is an object of class

class(friendship)
#> [1] "sienaDependent"

With specific attributes and methods associated with it. YYou can get the detailed information by requesting:

dim(friendship)
#> [1] 50 50  3
attributes(friendship)
#> $dim
#> [1] 50 50  3
#> 
#> $class
#> [1] "sienaDependent"
#> 
#> $type
#> [1] "oneMode"
#> 
#> $sparse
#> [1] FALSE
#> 
#> $nodeSet
#> [1] "Actors"
#> 
#> $netdims
#> [1] 50 50  3
#> 
#> $allowOnly
#> [1] TRUE

If you only are interested in the value of one particular attribute, you can request this by, e.g.,

attr(friendship, "type")
#> [1] "oneMode"

An extensive description of the friendship data is obtained by typing:

str(friendship)
#>  'sienaDependent' int [1:50, 1:50, 1:3] 0 0 0 0 0 0 0 0 0 0 ...
#>  - attr(*, "type")= chr "oneMode"
#>  - attr(*, "sparse")= logi FALSE
#>  - attr(*, "nodeSet")= chr "Actors"
#>  - attr(*, "netdims")= int [1:3] 50 50 3
#>  - attr(*, "allowOnly")= logi TRUE

The function sienaDependent() can also be used to create a behavior variable object with the extra argument type = "behavior".

Non-mentioned attributes get the default value, and in this case oneMode is the default; see below.

The drink data (created in RscriptDataFormat.R ) is made available as a dependent behavior variable by the function:

drinkingbeh <- sienaDependent(drink, type = "behavior")

# the class, class(drinkingbeh), is still sienaDependent.

Note: only use the variable in ONE role in a given model: behavior variable or changing covariate!

The options available for defining a sienaDependent object are displayed by typing: ?sienaDependent.

This shows that next to one-mode (unipartite) and behavior dependent variables, also two-mode (bipartite) dependent variables are possible.
You can infer that oneMode is the default type from the fact that it is mentioned first.

To create bipartite network objects you need two node sets and must create the node sets too.

Please See ?sienaNodeSet where the Examples section shows an example of the syntax.


B. explanatory variables

Second we construct objects for the explanatory (independent) variables.
From the help request: ?sienaDataCreate

We see that these can be of five kinds:

  1. coCovar: Constant actor covariates
  2. varCovar: Time-varying actor covariates
  3. coDyadCovar: Constant dyadic covariates
  4. varDyadCovar: Time-varying dyadic covariates
  5. compositionChange Composition change indicators

You can get help about this by the following requests:

  • ?coCovar
  • ?varCovar
  • ?coDyadCovar
  • ?varDyadCovar
  • ?sienaCompositionChange

The variables available for this data set all are changing actor covariates.
For illustrative purposes, we use smoking as observed at the first wave as a constant covariate:

smoke1 <- coCovar(smoke[, 1])

This selects the first column of smoke, which contains the first wave observations, and makes it available as a constant covariate. This is the pattern for for evey covariate file, e.g. Attr1 <- coCovar( Covariate1 ) where Covariate1 is a matrix with dim(Covariate1) equal to n x 1.

Note, if Covariates is a matrix with dim(Covariates) equal to n x p you can create constant covariates through:

  • Attr1 <- coCovar(Covariates[,1])

  • Attrp <- coCovar(Covariates[,p])

We use the drinking data as a changing covariate. The function `varCovar() creates a changing covariate object from a matrix; the name comes from ‘varying covariate’.

alcohol <- varCovar(drink)

You need at least three waves in the data set to define a varying covariate by the function varCovar(); the reason is that the previous wave is used as a predictor of the next wave.

The command…

attributes(alcohol)
#> $dim
#> [1] 50  3
#> 
#> $dimnames
#> $dimnames[[1]]
#>  [1] "1"  "2"  "3"  "4"  "5"  "6"  "7"  "8"  "9"  "10" "11" "12" "13" "14" "15" "16" "17" "18" "19"
#> [20] "20" "21" "22" "23" "24" "25" "26" "27" "28" "29" "30" "31" "32" "33" "34" "35" "36" "37" "38"
#> [39] "39" "40" "41" "42" "43" "44" "45" "46" "47" "48" "49" "50"
#> 
#> $dimnames[[2]]
#> [1] "V1" "V2" "V3"
#> 
#> 
#> $class
#> [1] "varCovar"
#> 
#> $centered
#> [1] TRUE
#> 
#> $nodeSet
#> [1] "Actors"

…will tell you the information that RSiena now has added to the drink data.


C. combination of dependent and explanatory variables

We now combine the dependent and independent variables.
The function sienaDataCreate() creates a Siena data object from input networks, covariates and composition change objects; the objects that earlier were created by sienaDependent() will have the role of dependent variables, and similarly the other roles are predetermined by creation by the functions coCovar(), varCovar(), coDyadCovar(), varDyadCovar(), and sienaCompositionChange().

mydata <- sienaDataCreate(friendship, smoke1, alcohol)

You may check the result by requesting

mydata
#> Dependent variables:  friendship 
#> Number of observations: 3 
#> 
#> Nodeset                  Actors 
#> Number of nodes              50 
#> 
#> Dependent variable friendship      
#> Type               oneMode         
#> Observations       3               
#> Nodeset            Actors          
#> Densities          0.046 0.047 0.05
#> 
#> Constant covariates:  smoke1 
#> Changing covariates:  alcohol

You should now understand how this differs from the result of:

mybehdata <- sienaDataCreate( friendship, smoke1, drinkingbeh)

If you would like to use different names, you could request this as follows:

mydata <- sienaDataCreate(nominations = friendship, smoke1, drinking = alcohol)

Another type of dependent network is a two-mode network, which here is also called a bipartite network.
To get advice for a first step in the use of two-mode networks, see the script RscriptSienaBipartite.R on the Siena scripts page.

This finishes the data specification. Now we have to specify the model.

D. model specification

Defining effects I

The data set as combined in mydata implies a certain set of effects that can be included in the specification of the model.
The function getEffects() creates a dataframe of effects with a number of extra properties for use in RSiena:

myeff <- getEffects(mydata)

mydata is needed as an argument as the effects depend on the number and types of covariates and dependent variables.
Before we explain the object myeff and how we shall be going to use it, we first produce a data description which is available now:

Data description

print01Report(mydata, modelname = "s50_3_init")

This writes a basic report of the data to the fil s50_3_init.txt in the current working directory. Locate and open it!
Inspecting this is important because it serves as a check and also contains a number of basic descriptives.
In this description you can see that the third wave for alcohol is not used. This is because changing covariates are assumed to be constant from one wave until immediately before the next wave, so that the values for the last wave are ignored.

Defining effects II

Let us now consider the myeff object, which is used to specify the model.
It is of the class sienaEffects, and contains the model specification.
You can inspect the current model specification by simply requesting…

myeff
#>   effectName                          include fix   test  initialValue parm
#> 1 constant friendship rate (period 1) TRUE    FALSE FALSE    4.69604   0   
#> 2 constant friendship rate (period 2) TRUE    FALSE FALSE    4.32885   0   
#> 3 outdegree (density)                 TRUE    FALSE FALSE   -1.46770   0   
#> 4 reciprocity                         TRUE    FALSE FALSE    0.00000   0

For starting, the model specification is just a very limited default (including rates of change, outdegree and reciprocity only). To make a meaningful analysis, you will need to add to it.

The rows of myeff correspond to the effects.
By requesting…

names(myeff)
#>  [1] "name"            "effectName"      "functionName"    "shortName"       "interaction1"   
#>  [6] "interaction2"    "type"            "basicRate"       "include"         "randomEffects"  
#> [11] "fix"             "test"            "timeDummy"       "initialValue"    "parm"           
#> [16] "functionType"    "period"          "rateType"        "untrimmedValue"  "effect1"        
#> [21] "effect2"         "effect3"         "interactionType" "local"           "setting"        
#> [26] "effectFn"        "statisticFn"     "netType"         "groupName"       "group"          
#> [31] "effectNumber"

You see the type of information that is stored about the effects, i.e., the columns (characteristics) defined for the effects. If desired, more information about these variables can be obtained from the help files: ?getEffects where the characteristics are described.

Some often used variables are effectName, shortName , type, and parameter.
The set of available effects and their most used columns can be inspected as follows:

effectsDocumentation(myeff)

This creates an html file with the list of all effects available in myeff; the effects are all defined in Section 12 of the RSiena manual, but only those that are meaningful for dataset mydata are used in myeff. The include column defines whether effects are included in the model.

myeff$include
#>   [1]  TRUE  TRUE FALSE FALSE FALSE FALSE FALSE FALSE FALSE  TRUE FALSE FALSE  TRUE FALSE FALSE
#>  [16] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#>  [31] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#>  [46] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#>  [61] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#>  [76] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#>  [91] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [106] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [121] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [136] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [151] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [166] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [181] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [196] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [211] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [226] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [241] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [256] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [271] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [286] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [301] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [316] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [331] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [346] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [361] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [376] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [391] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [406] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [421] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [436] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [451] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [466] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [481] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [496] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [511] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
#> [526] FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE

Here the TRUE values correspond to the default model specification which, however, is not meant as a serious model, being too limited. There are 3 main ways to operate on myeff:

  1. Using RSiena functions includeEffects(), `setEffects(), etc;
  2. Changing myeff in spreadsheet form by the function fix();
  3. Changing myeff directly by operating on its elements.

The first way is most in line with the design philosophy of R, and allows you to save scripts that also can be used when there will be new versions of RSiena. Therefore, we suggest that for starting you only study option 1, Adding/removing effects using includeEffects.
The other two options are treated only for special and more difficult occasions; and for looking ‘behind the screens’.

For identifying your effects you need the shortNames, which can be read in the manual (section “Mathematical definition of effects”), or obtained from the effectsDocumentation() function mentioned above.

1. Adding/removing effects using includeEffects()

structural effects

The best way of specifying the model is by the includeEffects() function. This function uses short names instead of full names. The short names are given by the effectsDocumentation() function mentioned above, and also are listed in the descriptions given in Section 12 of the manual.

For illustration, let us start from scratch with a new sienaEffects object, and add the transitive triples and 3-cycles effects.

myeff <- getEffects(mydata)
myeff <- includeEffects(myeff, transTrip, cycle3)
# To see the current model specification,
myeff
#>   effectName          include fix   test  initialValue parm
#> 1 transitive triplets TRUE    FALSE FALSE          0   0   
#> 2 3-cycles            TRUE    FALSE FALSE          0   0   
#>   effectName                          include fix   test  initialValue parm
#> 1 constant friendship rate (period 1) TRUE    FALSE FALSE    4.69604   0   
#> 2 constant friendship rate (period 2) TRUE    FALSE FALSE    4.32885   0   
#> 3 outdegree (density)                 TRUE    FALSE FALSE   -1.46770   0   
#> 4 reciprocity                         TRUE    FALSE FALSE    0.00000   0   
#> 5 transitive triplets                 TRUE    FALSE FALSE    0.00000   0   
#> 6 3-cycles                            TRUE    FALSE FALSE    0.00000   0

Note that we can set several effects in one go! To remove an effect, e.g., the 3-cycle effects…

myeff <- includeEffects(myeff, cycle3, include = FALSE)

# Check again which effects now are included in the model
myeff
#> [1] effectName   include      fix          test         initialValue parm        
#> <0 rows> (or 0-length row.names)
#>   effectName                          include fix   test  initialValue parm
#> 1 constant friendship rate (period 1) TRUE    FALSE FALSE    4.69604   0   
#> 2 constant friendship rate (period 2) TRUE    FALSE FALSE    4.32885   0   
#> 3 outdegree (density)                 TRUE    FALSE FALSE   -1.46770   0   
#> 4 reciprocity                         TRUE    FALSE FALSE    0.00000   0   
#> 5 transitive triplets                 TRUE    FALSE FALSE    0.00000   0

creating interaction effects

As a special topic, let us show how interaction effects are created.

A convenient method to include an interaction is offered by the includeInteraction() function. This can be used to interact two or three effects (if the interactions are allowed, which depends on their interactionType; see the manual for this).

The interaction between smoke1 ego and reciprocity, for instance, can be defined by the command:

myeff <- includeInteraction(myeff, egoX, recip, interaction1 = c("smoke1", ""))
myeff

This shows the interaction as an “unspecified interaction effect”; but when printing results of the estimation the names of the interacting effects will be mentioned.

An interaction between smoke1 ego and alcohol ego is defined by:

myeff <- includeInteraction(myeff, egoX, egoX, interaction1 = c("smoke1", "alcohol"))

Note that the keyword ‘interaction1’ used by RSiena is used for identifying the covariate for which the ego effect is selected, and does not refer to the interaction effect itself.
If at least one of the interacting effects requires the interaction1 parameter for it specification, then this parameter is also required for the includeInteraction() function.

Then the two or three interaction1 parameters must be combined using c(); the same goes for interaction2, if that also is necessary for the definition. As shown above for the recip effect, the interaction1 or interaction2 parameter is “” (i.e., an empty string) for effects where it is not needed.

accessing other characteristics of effects

A second special topic is how to access other characteristics of effects. This can be done by the setEffect() function. E.g., the dense triads effects counts the number of triplets with at least xx ties, where xx is the parameter of the effect, which can be 5 or 6 (note that 6 is the maximum number of ties in a triplet). The default is 5. This is changed to 6 by the command:

myeff <- setEffect(myeff, denseTriads, parameter = 6)
myeff

The ‘parameter’ keyword refers to the effect parameter, described in Section 12 of the manual.

2. Adding/removing effects using fix()

# fix calls a data editor internal to R, so we can manually edit the effects.

# fix( myeff )

# How to use fix() is presented merely for getting to know what myeff is.  In practical analysis it
# is more convenient to use routine 'includeEffects' instead, as explained above.  fix() may not be
# usable if you do not have tcl/tk available!  Note that the top of the dataframe shows the names
# of the columns: name, effectName, etc.  You can edit the 'include' column by changing the TRUE
# and FALSE values as required; when the editor is closed, the new values are stored.  When you
# make an error, however, the effects object may be corrupted.  Therefore, this way of adding and
# removing effects is more risky.

3. Adding/removing effects by direct manipulation of myeff

Alternatively we can edit the dataframe directly by using R functions. You are advised to skip this part (“3.”) at first and second reading, and read it only if you wish to get more understanding of the internal structure of the effects object. The commands below are used to set “include” to TRUE or FALSE, as an alternative to using the data editor. The “include” column with values TRUE or FALSE will always be located at the 9th column, but transitive triplets will not always be at the 16th row as this depends on the number of periods and variables; further, the list of available effects changes over different versions of RSiena.

Some examples are the following:

myeff[16, 9] <- TRUE  #transitive triples
myeff[34, 9] <- TRUE  #3 cycles
myeff[37, 9] <- TRUE  #transitive ties
myeff[77, 9] <- TRUE  #indegree popularity (sqrt)
myeff[86, 9] <- TRUE  #outdegree popularity (sqrt)
myeff[93, 9] <- TRUE  #indegree based activity (sqrt)
myeff[102, 9] <- TRUE  #outdegree based activity (sqrt)
myeff[155, 9] <- TRUE  #indegree-indegree assortativity
myeff[319, 9] <- TRUE  #alcohol alter
myeff[331, 9] <- TRUE  #alcohol ego
myeff[370, 9] <- TRUE  #alcohol similarity
myeff[439, 9] <- TRUE  #alcohol ego x alcohol alter

But in other choices of data, the effect numbers will change. This is a reason why this is not a convenient method.

Save your data if you whish.

save.image("WorkspaceRscript02.RData")

Next: SienaRunModel FOR PARAMETER ESTIMATION BY RSIENA


---
title: "VariableFormat"
author: '[Jochem Tolsma](https://www.jochemtolsma.nl) - Radboud University, the Netherlands'
bibliography: references.bib
date: "Last compiled on `r format(Sys.time(), '%B, %Y')`"
output: 
  html_document:
    toc:  true
    toc_float: true
    number_sections: false
    code_folding: show
    code_download: yes
---

```{r, globalsettings, echo=FALSE, warning=FALSE, message=FALSE, results='hide'}
library(knitr)
opts_chunk$set(tidy.opts=list(width.cutoff=100),tidy=TRUE, warning = FALSE, message = FALSE,comment = "#>", cache=TRUE, class.source=c("test"), class.output=c("test2"))
options(width = 100)
rgl::setupKnitr()


colorize <- function(x, color) {
  if (knitr::is_latex_output()) {
    sprintf("\\textcolor{%s}{%s}", color, x)
  } else if (knitr::is_html_output()) {
    sprintf("<span style='color: %s;'>%s</span>", color, 
      x)
  } else x
}

```

```{r klippy, echo=FALSE, include=TRUE}
klippy::klippy(position = c('top', 'right'))
#klippy::klippy(color = 'darkred')
#klippy::klippy(tooltip_message = 'Click to copy', tooltip_success = 'Done')
```

```{css, echo=FALSE}
pre.test {
  max-height: 300px;
  overflow-y: auto;
  overflow-x: auto;
  margin: 10px;
}

pre.test2 {
  max-height: 300px;
  overflow-y: auto;
  overflow-x: auto;
  margin: 10px;
  background-color: white
}


h1, .h1, h2, .h2, h3, .h3 {
    margin-top: 24px;
}


```

----

This website converted the following original .R scripts into .rmd files. 

- Rscript01DataFormat.R 
- RScriptSNADescriptives.R 
- Rscript02SienaVariableFormat.R  
- Rscript03SienaRunModel.R  
- Rscript04SienaBehaviour.R

Please visit [GitHub](https://github.com/snlab-nl/rsiena/tree/main/inst/scripts) for the latest .R files. 

----

## Data
All files (data, scripts, etc.) can also be found on [Github](https://github.com/JochemTolsma/Rsiena-scripts)

----

## Contact
Specific questions with respect to the .rmd files can be addressed to: [Jochem Tolsma](mailto:j.tolsma@ru.nl).  

For questions on RSiena please visit the designated [GitHub](https://github.com/snlab-nl/rsiena) page. 

----  

## Introduction 

Let us start by loading the data

```{r}
library(RSiena)
friend.data.w1 <- s501
friend.data.w2 <- s502
friend.data.w3 <- s503
drink <- s50a
smoke <- s50s
```

A number of objects need to be created in R, as preparations to letting siena07 execute the estimation. This will be indicated by:  

-  A: dependent variables;  
-  B: explanatory variables;  
-  C: combination of dependent and explanatory variables;  
-  D: model specification.

---  

## A. dependent variables  


 First we have to create objects for the dependent variables.

`sienaDependent()` creates a sienaDependent object, here a network, from a matrix or array or list of sparse matrix of triples. This object will have the role of a dependent variable in the analysis.  

The name of this network object (here: **friendship**) will be used in the output file.

```{r}
friendship <- sienaDependent(
  array( c( friend.data.w1, friend.data.w2, friend.data.w3 ),
         dim = c( 50, 50, 3 ) ) )
```

The integers in the `dim()` here refer to the number of nodes (senders, receivers) and the number of waves. This object is an array of dimension 50 x 50 x 3, representing three adjacency matrices, with a number of attributes.  

You can ask for a brief decription of this object simply by typing:  



```{r}
print(friendship)
# for which the shorthand is

friendship
```

Note that this is an object of class  
```{r}
class(friendship)
```

With specific attributes and methods associated with it.
YYou can get the detailed information by requesting: 

```{r}
dim( friendship )
attributes( friendship )
```


If you only are interested in the value of one particular attribute, you can request this by, e.g.,

```{r}
attr( friendship, "type")
```

An extensive description of the friendship data is obtained by typing: 
```{r}
str(friendship)
```

The function `sienaDependent()` can also be used to create a behavior variable object with the extra argument **`type = "behavior"`**.  

Non-mentioned attributes get the default value, and in this case oneMode is the default; see below.  

The `drink` data (created in RscriptDataFormat.R ) is made available as a dependent behavior variable by the function: 

```{r}
drinkingbeh <- sienaDependent( drink, type = "behavior" )

# the class, class(drinkingbeh), is still sienaDependent.
```

Note: only use the variable in ONE role in a given model: behavior variable or changing covariate!  

The options available for defining a sienaDependent object are displayed by typing:  `?sienaDependent`. 

This shows that next to one-mode (unipartite) and behavior dependent variables, also two-mode (bipartite) dependent variables are possible.  
You can infer that oneMode is the default type from the fact that it is mentioned first.  


To create bipartite network objects you need two node sets and must create the node sets too.

Please See `?sienaNodeSet` where the Examples section shows an example of the syntax.

---   

## B. explanatory variables  

Second we construct objects for the explanatory (independent) variables.  
From the help request: `?sienaDataCreate`
 
We see that these can be of five kinds:  

1. `coCovar`: Constant actor covariates  
2. `varCovar`: Time-varying actor covariates  
3. `coDyadCovar`: Constant dyadic covariates  
4. `varDyadCovar`: Time-varying dyadic covariates  
5. `compositionChange` Composition change indicators  



You can get help about this by the following requests:  

* `?coCovar`  
* `?varCovar`  
* `?coDyadCovar`  
* `?varDyadCovar`  
* `?sienaCompositionChange`  


The variables available for this data set all are changing actor covariates.  
For illustrative purposes, we use smoking as observed at the first wave as a constant covariate:


```{r}
smoke1 <- coCovar( smoke[ , 1 ] )
```

This selects the first column of smoke, which contains the first wave observations, and makes it available as a constant covariate. This is the pattern for for evey covariate file, e.g. `Attr1 <- coCovar( Covariate1 )` where Covariate1 is a matrix with `dim(Covariate1)` equal to n x 1.  

Note, if Covariates is a matrix with `dim(Covariates)` equal to n x p you can create constant covariates through:  

- `Attr1 <- coCovar(Covariates[,1])`  
- ...  
- `Attrp <- coCovar(Covariates[,p])`  

 We use the drinking data as a changing covariate. The function `varCovar() creates a changing covariate object from a matrix; the name comes from 'varying covariate'.  
 
```{r}
alcohol <- varCovar( drink )
```

You need at least three waves in the data set to define a varying covariate by the function `varCovar()`; the reason is that the previous wave is used as a predictor of the next wave.  

The command...

```{r}
attributes( alcohol )
```

...will tell you the information that RSiena now has added to the drink data.  

---  

## C. combination of dependent and explanatory variables  

We now combine the dependent and independent variables.  
The function `sienaDataCreate()` creates a Siena data object from input networks, covariates and composition change objects; the objects that earlier were created by `sienaDependent()` will have the role of dependent variables, and similarly the other roles are predetermined by creation by the functions `coCovar()`, `varCovar()`, `coDyadCovar()`, `varDyadCovar()`, and `sienaCompositionChange()`.

```{r}
mydata <- sienaDataCreate( friendship, smoke1, alcohol )
```

You may check the result by requesting
```{r}
mydata
```

You should now understand how this differs from the result of: 

`mybehdata <- sienaDataCreate( friendship, smoke1, drinkingbeh)` 

If you would like to use different names, you could request this as follows:

```{r, eval=FALSE}
mydata <- sienaDataCreate(nominations = friendship, smoke1, drinking = alcohol)
```

Another type of dependent network is a two-mode network, which here is also called a bipartite network.  
To get advice for a first step in the use of two-mode networks, see the script RscriptSienaBipartite.R on the Siena scripts page.  

This finishes the data specification. Now we have to specify the model.  

## D. model specification  

### Defining effects I 

The data set as combined in `mydata` implies a certain set of effects that can be included in the specification of the model.  
The function `getEffects()` creates a dataframe of effects with a number of extra properties for use in RSiena:

```{r}
myeff <- getEffects( mydata )
```

`mydata` is needed as an argument as the effects depend on the number and types of covariates and dependent variables.  
Before we explain the object `myeff` and how we shall be going to use it, we first produce a data description which is available now:

### Data description
```{r}
print01Report( mydata, modelname = 's50_3_init' )
```

This writes a basic report of the data to the fil *s50_3_init.txt*  in the current working directory. Locate and open it!  
Inspecting this is important because it serves as a check and also contains a number of basic descriptives.  
In this description you can see that the third wave for alcohol is not used. This is because changing covariates are assumed to be constant from one wave until immediately before the next wave, so that the values for the last wave are ignored.  

### Defining effects II   

Let us now consider the `myeff` object, which is used to specify the model.  
It is of the class `sienaEffects`, and contains the model specification.  
You can inspect the current model specification by simply requesting... 

```{r}
myeff
```

For starting, the model specification is just a very limited default (including rates of change, outdegree and reciprocity only).  To make a meaningful analysis, you will need to add to it.  

The rows of myeff correspond to the effects.  
By requesting...
```{r}
names(myeff)
```

You see the type of information that is stored about the effects, i.e., the columns (characteristics) defined for the effects. If desired, more information about these variables can be obtained from the help files: `?getEffects` where the characteristics are described.  

Some often used variables are *effectName*, *shortName* , *type*, and *parameter*.  
The set of available effects and their most used columns can be inspected as follows:

```{r, eval=FALSE}
effectsDocumentation(myeff)
```

This creates an html file with the list of all effects available in `myeff`; the effects are all defined in Section 12 of the RSiena manual, but only those that are meaningful for dataset `mydata` are used in `myeff`. The *include* column defines whether effects are included in the model.  

```{r}
myeff$include
```

Here the TRUE values correspond to the default model specification which, however, is not meant as a serious model, being too limited. There are 3 main ways to operate on `myeff`:  

1. Using RSiena functions `includeEffects()`, `setEffects(), etc;  
2. Changing `myeff` in spreadsheet form by the function `fix()`;  
3. Changing `myeff` directly by operating on its elements.  


The first way is most in line with the design philosophy of R, and allows you to save scripts that also can be used when there will be new versions of RSiena. Therefore, we suggest that for starting you only study option 1, *Adding/removing effects using includeEffects*.  
The other two options are treated only for special and more difficult occasions; and for looking 'behind the screens'.  

For identifying your effects you need the *shortName*s, which can be read in the manual (section "Mathematical definition of effects"), or obtained from the `effectsDocumentation()` function mentioned above.  

### 1. Adding/removing effects using `includeEffects()`  

#### **structural effects**  

The best way of specifying the model is by the `includeEffects()` function. This function uses short names instead of full names. The short names are given by the `effectsDocumentation()` function mentioned above, and also are listed in the descriptions given in Section 12 of the manual.  

For illustration, let us start from scratch with a new sienaEffects object, and add the transitive triples and 3-cycles effects. 


```{r, results='hold'}
myeff <- getEffects( mydata )
myeff <- includeEffects( myeff, transTrip, cycle3 )
# To see the current model specification,
myeff
```


Note that we can set several effects in one go! To remove an effect, e.g., the 3-cycle effects...

```{r, results='hold'}
myeff <- includeEffects( myeff, cycle3, include=FALSE )

# Check again which effects now are included in the model
myeff

```

#### **covariate related effects**  

Let us demonstrate how to add/remove covariate related effects.  
The short names do not differentiate between the covariates:  e.g., the effects *alcohol ego* and *smoke1 ego* both have shortName *egoX*, and the command `myeff <- includeEffects(myeff, egoX)` results in a message that does not (like the earlier one) confirm the newly included effect.  

The covariates are indicated by the variable *interaction1* in the sienaEffects object, listed as *inter1* in the result of `effectsDocumentation()`, and this has to be mentioned to include these effects:

```{r, results='hold'}
myeff <- includeEffects( myeff, egoX, altX, egoXaltX, interaction1 = "alcohol" )
myeff <- includeEffects( myeff, simX, interaction1 = "smoke1" )

# We check the results again:
myeff
```


By looking at the help offered by `?includeEffects`.  

you can see how to include endowment and creation effects. Effects that depend on other variables, such as egoX, altX, etc. above, need the specification of these variables to define them. This is done by the *interaction1* parameter when only one variable name is needed, and by *interaction2* if there is a second variable involved, such as AltsAvAlt (see the manual).  
Although the names of these parameters are *interaction1* and *interaction2*, this does not refer to the common concept of interaction in statistical modelling!


#### **creating interaction effects** 


As a special topic, let us show how interaction effects are created.  

A convenient method to include an interaction is offered by the `includeInteraction()` function. This can be used to interact two or three effects (if the interactions are allowed, which depends on their interactionType; see the manual for this).  

The interaction between **smoke1 ego** and **reciprocity**, for instance, can be defined by the command: 

```{r, eval=FALSE}
myeff <- includeInteraction( myeff, egoX, recip, interaction1 = c("smoke1","") )
myeff
```

This shows the interaction as an "unspecified interaction effect"; but when printing results of the estimation the names of the interacting effects will be mentioned. 

An interaction between smoke1 ego and alcohol ego is defined by:  

```{r, eval=FALSE}
myeff <- includeInteraction( myeff, egoX, egoX, interaction1 = c( "smoke1", "alcohol" ) )
```


Note that the keyword 'interaction1' used by RSiena is used for identifying the covariate for which the ego effect is selected, and does not refer to the interaction effect itself.  
If at least one of the interacting effects requires the interaction1 parameter for it specification, then this parameter is also required for the `includeInteraction()` function.  

Then the two or three interaction1 parameters must be combined using c(); the same goes for interaction2, if that also is necessary for the definition. As shown above for the recip effect, the interaction1 or interaction2 parameter is "" (i.e., an empty string) for effects where it is not needed.  

#### **accessing other characteristics of effects**  

A second special topic is how to access other characteristics of effects. This can be done by the `setEffect()` function. E.g., the dense triads effects counts the number of triplets with at least xx ties,  where xx is the parameter of the effect, which can be 5 or 6 (note that 6 is the maximum number of ties in a triplet). The default is 5. This is changed to 6 by the command:  

```{r, eval=FALSE}
myeff <- setEffect(myeff, denseTriads, parameter = 6)
myeff
```

The 'parameter' keyword refers to the effect parameter, described in Section 12 of the manual.

###  2. Adding/removing effects using `fix()`  

```{r, eval=FALSE}
# fix calls a data editor internal to R, so we can manually edit the effects.

#        fix( myeff )

# How to use fix() is presented merely for getting to know what myeff is.
# In practical analysis it is more convenient
# to use routine "includeEffects" instead, as explained above.
# fix() may not be usable if you do not have tcl/tk available!
# Note that the top of the dataframe shows the names of the columns:
# name, effectName, etc.
# You can edit the "include" column by changing the TRUE and FALSE values
# as required; when the editor is closed, the new values are stored.
# When you make an error, however, the effects object may be corrupted.
# Therefore, this way of adding and removing effects is more risky.
```

### 3. Adding/removing effects by direct manipulation of `myeff`  

Alternatively we can edit the dataframe directly by using R functions. You are advised to skip this part ("3.") at first and second reading, and read it only if you wish to get more understanding of the internal structure of the effects object. The commands below are used to set "include" to TRUE or FALSE, as an alternative to using the data editor. The "include" column with values TRUE or FALSE will always be located at the 9th column, but transitive triplets will not always be at the 16th row as this depends on the number of periods and variables; further, the list of available effects changes over different versions of RSiena.  

Some examples are the following: 

```{r, eval=FALSE}
myeff[16,9] <- TRUE   #transitive triples
myeff[34,9] <- TRUE   #3 cycles
myeff[37,9] <- TRUE   #transitive ties
myeff[77,9] <- TRUE   #indegree popularity (sqrt)
myeff[86,9] <- TRUE   #outdegree popularity (sqrt)
myeff[93,9] <- TRUE   #indegree based activity (sqrt)
myeff[102,9] <- TRUE  #outdegree based activity (sqrt)
myeff[155,9] <- TRUE   #indegree-indegree assortativity
myeff[319,9] <- TRUE   #alcohol alter
myeff[331,9] <- TRUE   #alcohol ego
myeff[370,9] <- TRUE   #alcohol similarity
myeff[439,9] <- TRUE   #alcohol ego x alcohol alter
```

But in other choices of data, the effect numbers will change. This is a reason why this is not a convenient method.

## Save your data if you whish.  
```{r, eval=FALSE}

save.image("WorkspaceRscript02.RData")

```


--- 

[**Next: SienaRunModel FOR PARAMETER ESTIMATION BY RSIENA**](/rscript03.html)

---




Copyright © 2020 Jochem Tolsma