Title: | Fit, Simulate and Diagnose Exponential-Family Models for Networks with Count Edges |
---|---|
Description: | A set of extensions for the 'ergm' package to fit weighted networks whose edge weights are counts. See Krivitsky (2012) <doi:10.1214/12-EJS696> and Krivitsky, Hunter, Morris, and Klumb (2023) <doi:10.18637/jss.v105.i06>. |
Authors: | Pavel N. Krivitsky [aut, cre] |
Maintainer: | Pavel N. Krivitsky <[email protected]> |
License: | GPL-3 + file LICENSE |
Version: | 4.1.2-1438 |
Built: | 2025-03-05 04:35:48 UTC |
Source: | https://github.com/statnet/ergm.count |
ergm.count
is a set of extensions to
package ergm
to fit and simulate from
exponential-family random graph models for networks whose edge weights are
counts (Krivitsky 2012).
Mainly, it implements Poisson, binomial, geometric, and discrete
uniform dyadwise reference measures for valued ERGMs (documented
here in ergmReference
), and provides some count-specific change
statistics (documented in ergmTerm
)
(Krivitsky 2012; Krivitsky et al. 2023), including
CMP
for the Conway–Maxwell–Poisson Distribution
(Shmueli et al. 2005).
For a complete list of the functions, use library(help="ergm")
and
library(help="ergm.count")
or read the rest of the manual.
When publishing results obtained using this package, please cite the
original authors as described in citation(package="ergm.count")
.
All programs derived from this package must cite it.
This package contains functions specific to using ergm
to
model networks whose dyad values are counts. Examples include counts of
conversations, messages, and other interactions.
For detailed information on how to download and install the software, go to the Statnet project website: https://statnet.org. A tutorial, support newsgroup, references and links to further resources are provided there.
Poisson- and geometric-reference ERGMs have an unbouded sample space. This
means that the parameter space may be constrained in complex ways that
depend on the terms used in the model. At this time ergm
has
no way to detect when a parameter configuration had strayed outside of the
parameter space, but it may be noticeable on a runtime trace plot (activated
via MCMC.runtime.traceplot
control parameter), when the simulated
values keep climbing upwards. (See Krivitsky (2012) for a further
discussion.)
A possible remedy if this appears to occur is to try lowering the
control.ergm()
parameter MCMLE.steplength
.
Maintainer: Pavel N. Krivitsky [email protected] (ORCID)
Other contributors:
Mark S. Handcock [email protected] [contributor]
David R. Hunter [email protected] [contributor]
Joyce Cheng [email protected] [contributor]
Krivitsky PN (2012).
“Exponential-family Random Graph Models for Valued Networks.”
Electronic Journal of Statistics, 6, 1100–1128.
doi:10.1214/12-EJS696.
Krivitsky PN, Hunter DR, Morris M, Klumb C (2023).
“ergm 4: New Features for Analyzing Exponential-Family Random Graph Models.”
Journal of Statistical Software, 105(6), 1–44.
doi:10.18637/jss.v105.i06.
Shmueli G, Minka TP, Kadane JB, Borle S, Boatwright P (2005).
“A Useful Distribution for Fitting Discrete Data: Revival of the Conway–Maxwell–Poisson Distribution.”
Journal of the Royal Statistical Society: Series C, 54(1), 127–142.
ISSN 1467-9876, doi:10.1111/j.1467-9876.2005.00474.x.
Specifies each dyad's baseline distribution to be binomial with
trials
trials and success probability of :
. Using
valued ERGM terms that are
"generalized" from their binary counterparts, with form
"sum"
(see previous link for the list) produces logistic
regression.
# Binomial(trials)
# Binomial(trials)
trails |
model parameter |
ergmReference
for index of reference distributions currently visible to the package.
None
If couple==TRUE
, this
term adds one statistic to the model, of the form
. This turns a Binomial- or a
discrete-uniform-reference ERGM into a Conway-Maxwell-Binomial-reference
ERGM, allowing it to represent a broad range of disperson
values. In particular, combined with a Binomial-reference ERGM, a
negative coefficient on this term induces underdispersion and a
positive coefficient induces overdispersion.
If coupled==FALSE
the two summands above are added as their own
statistic (each with its own free parameter).
# valued: CMB(trials, coupled = TRUE)
# valued: CMB(trials, coupled = TRUE)
trails |
model parameter |
coupled |
logical |
ergmTerm
for index of model terms currently visible to the package.
None
This term adds one statistic to the model, of the form
. This turns a Poisson- or a
geometric-reference ERGM into a Conway-Maxwell-Poisson-reference
ERGM, allowing it to represent a broad range of disperson
values. In particular, combined with a Poisson-reference ERGM, a
negative coefficient on this term induces underdispersion and a
positive coefficient induces overdispersion. (This behavior is
different from 3.1.1, when the negation of this value was used.)
# valued: CMP
# valued: CMP
Note that its current implementation may not perform well if the data are overdispersed relative to geometric.
ergmTerm
for index of model terms currently visible to the package.
None
Specifies
each dyad's baseline distribution to be uniform on the natural
numbers (and ):
. In itself, this
"distribution" is improper, but in the presence of
sum
, a geometric
distribution is induced. Using CMP
(in addition to
sum
) induces a
Conway-Maxwell-Poisson distribution that is geometric when its
coefficient is and Poisson when its coefficient is
.
# Geometric
# Geometric
ergmReference
for index of reference distributions currently visible to the package.
None
Specifies each
dyad's baseline distribution to be Poisson with mean 1:
, with the support of
being natural numbers (and
). Using
valued ERGM terms that are
"generalized" from their binary counterparts, with form
"sum"
(see previous link for the list) produces Poisson
regression. Using CMP
induces a
Conway-Maxwell-Poisson distribution that is Poisson when its
coefficient is and geometric when its coefficient is
.
@details Three proposal functions are currently implemented, two of them
designed to improve mixing for sparse networks. They can can be
selected via the MCMC.prop.weights=
control parameter. The
sparse proposals work by proposing a jump to 0. Both of them take
an optional proposal argument p0
(i.e.,
MCMC.prop.args=list(p0=...)
) specifying the probability of
such a jump. However, the way in which they implement it are
different:
"random"
: Select a dyad (i,j) at random, and draw the
proposal (a Poisson distribution with mean
slightly higher than the current value and conditional on not
proposing the current value).
"0inflated"
: As "random"
but, with
probability p0
, propose a jump to 0 instead of a
Poisson jump (if not already at 0). If p0
is not given,
defaults to the "surplus" of 0s in the observed network,
relative to Poisson.
"TNT"
: (the default) As "0inflated"
but
instead of selecting a dyad at random, select a tie with
probability p0
, and a random dyad otherwise, as with
the binary TNT. Currently, p0
defaults to 0.2.
# Poisson
# Poisson
ergmReference
for index of reference distributions currently visible to the package.
None
Zachary (1977) reported observations of social relations in a university karate club, with membership that varied between 50 and 100, of whom 34 individuals: 32 ordinary club members and officers, the club president ("John A."), and the part-time instructor ("Mr. Hi"); consistently interacted outside of the club. Over the course of the study, the club divided into two factions, and, ultimately, split into two clubs, one led by Hi and the other by John and the original club's officers. The split was driven by a disagreement over whether Hi could unilaterally change the level of compensation for his services.
The data are represented as a network
object, with an
edge attribute contexts
, giving the number of contexts of interaction
for that pair of actors. In addition, the following vertex attributes are
provided:
the club in which the actor ended up;
the club in which the actor ended up;
faction alignment of the actor as recorded by Zachary
faction alignment of the actor as recorded by Zachary
faction alignment coded numerically, as
(strongly Mr. Hi's),
(weakly Mr. Hi's),
(neutral),
(weakly John's), and
(strongly John's);
role of the actor in the network (Instructor, Member, or President)
Zachary identifies the faction with which each of the 34 actors was aligned and how strongly and reports, for each pair of actors, the count of social contexts in which they interacted. The 8 contexts recorded were
academic classes at the university;
Hi's private karate studio in his night classes;
Hi's private karate studio where he taught on weekends;
student-teaching at Hi's studio;
the university rathskeller (bar) located near the karate club;
a bar located near the university campus;
open karate tournaments in the area; and
intercollegiate karate tournaments.
The highest number of contexts of interaction for a pair of individuals that was observed was 7.
Zachary, WW (1977). An Information Flow Model for Conflict and Fission in Small Groups. Journal of Anthropological Research, 33(4), 452-473.
Sociomatrix in machine-readable format was retrieved from http://vlado.fmf.uni-lj.si/pub/networks/data/ucinet/ucidata.htm.
Zachary, WW (1977). An Information Flow Model for Conflict and Fission in Small Groups. Journal of Anthropological Research, 33(4), 452-473.
data(zach) oldpal <- palette() palette(gray((1:8)/8)) plot(zach, vertex.col="role", displaylabels=TRUE, edge.col="contexts") palette(oldpal) # Fit a binomial-reference ERGM. zach.fit1 <- ergm(zach~nonzero+sum+nodefactor("role",base=2)+absdiffcat("faction.id"), response="contexts", reference=~Binomial(8)) mcmc.diagnostics(zach.fit1) summary(zach.fit1) ## Not run: # This is much slower. zach.fit2 <- ergm(zach~nonzero+sum+nodefactor("role",base=2)+transitiveties, response="contexts", reference=~Binomial(8), eval.loglik=FALSE) mcmc.diagnostics(zach.fit2) summary(zach.fit2) ## End(Not run)
data(zach) oldpal <- palette() palette(gray((1:8)/8)) plot(zach, vertex.col="role", displaylabels=TRUE, edge.col="contexts") palette(oldpal) # Fit a binomial-reference ERGM. zach.fit1 <- ergm(zach~nonzero+sum+nodefactor("role",base=2)+absdiffcat("faction.id"), response="contexts", reference=~Binomial(8)) mcmc.diagnostics(zach.fit1) summary(zach.fit1) ## Not run: # This is much slower. zach.fit2 <- ergm(zach~nonzero+sum+nodefactor("role",base=2)+transitiveties, response="contexts", reference=~Binomial(8), eval.loglik=FALSE) mcmc.diagnostics(zach.fit2) summary(zach.fit2) ## End(Not run)