Links to key manuscripts, software etc from the Contingency Tables group:
The inaugural SAMSI program on Stochastic Computation focused on synthesis and
developmental research in four inter-related areas in which the problems of
searching for and fitting mathematical models to observational data are
characteristic of core challenges facing modern mathematical and computational
sciences: problems in which realistic applied models are mathematically complex;
involve many, and often very, very many uncertain parameters to be estimated; in
which the form and specification of the mathematical model itself is uncertain
so that scientists need tools for searching over vast spaces of candidate
models; and in which data is, relative to model and parameter dimensions,
relatively sparse.
The major themes of the StoCom research - stochastic computation - concerns
methods, with accompanying probability theory and computational algorithms, for
performing calculations to search effectively over large spaces of models and
potentially very high-dimensional parameters within models. Some key innovations
and results of StoCom include the following. In the area of large-scale variable
selection and regression model search -- perhaps the canonical problem facing
modern mathematical modelling in scientific investigation in every area,
exemplified by problems such as environmental risk assessment, for example --
StoCom introduced novel methods for rapid exploration of huge model spaces with
efficient simulation algorithms that can substantially improve regression model
search when faced with many candidate predictors, and also substantially improve
prediction performance as a result. In problems of discrete data analysis using
contingency tables subject to problems of high-dimension and missing data --
such as arise in applications in areas as diverse as government data-base
confidentiality and security, and in human population genetics, for example --
StoCom research produced significant advances in computational methods,
including quite novel and effective algorithms, as well as defining
interconnections between earlier poorly connected branches of core
mathematical/algebraic research and more applied statistical research. In the
area of large-scale graphical models -- a core theoretical framework for
studying and evaluating complex patterns of association among very many
variables, such as arise in applications in areas as diverse as genome-scale
gene expression studies in modern computational biology, and in large-scale
market research studies, for example -- StoCom led to substantial advances in
the capacity to explore very high-dimensional spaces of graphs, extending theory
and computational tools to develop radically more effective than earlier
available. This component also explored and generated studies of the approach in
exploratory analysis of large-scale genomic data via cluster computing, with
resulting software tools provided to the research community. In the fourth area,
StoCom brought
together statistical and mathematical researchers in financial modelling to
define new approaches to stochastic computation in some of the most challenging
financial volatility and pricing models, and defined new research directions
from this collaboration in studies of volatility at multiple scales, among other
topics.
Specific details of participants, areas of focus, papers and software produced
and that represent the full program are given below within each of the four
focus areas.
Leader: Mark Huber
SAMSI investigators: Ian Dinwoodie
Local participants: Yuguo Chen
SAMSI postdocs: Adrian Dobra
SAMSI grads: Mike Nicholas
Specific topics:
Yuguo Chen, Ian Dinwoodie, Adrian Dobra and
Mark Huber
Lattice points,
contingency tables, and sampling
Ian Dinwoodie and
Brenda MacGibbon
Exact Analysis of a Paired
Sibling Study
Ian Dinwoodie
Estimation of parameters
in a network reliability model with spatial dependence
Ian Dinwoodie, Laura
Felicia Matusevich and Ed Mosteig
Transform methods for the
hypergeometric distribution
Leader: | Mike West |
SAMSI investigators: | Chris Carter, Mark Huber |
SAMSI postdocs: | Beatrix Jones, Adrian Dobra |
SAMSI grads: | Chris Hans, Carlos Carvalho |
Links to key manuscripts, software etc from the Graphical Models group:
Experiments in Stochastic Computation for High-Dimensional Graphical Models
Sparse Graphical Models for Exploring Gene Expression Data
Cripps, E., Carter, C. K., and Kohn, R. (2003)
Variable selection and covariance selection in multivariate regression models
Leader: | Merlise Clyde |
SAMSI investigators: | Jim Berger, Joe Ibrahim, Bani Mallick |
Local participants: | Alan Gelfand |
SAMSI postdocs: | Rui Paulo |
SAMSI grads: | Christine Kohnen |
Links to key manuscripts, software etc from the Model Selection group:
Leader: | Jean-Pierre Fouque |
SAMSI investigators: | Robert Kohn |
Local participants: | Yuguo Chen, Chuanshu Ji, Sujit Ghosh, Tao Pang |
SAMSI grads: | Beom Lee, German Molina |
Links to key manuscripts, software etc from the Financial Models group: