Eleazar Eskin, University of
California, Los Angeles, USA.
Known and Unknown Confounding in Genetic Studies
Variation in human DNA sequences account for a significant amount of
genetic risk factors for common disease such as hypertension,
diabetes, Alzheimer's disease, and cancer. Identifying the human
sequence variation that makes up the genetic basis of common disease
will have a tremendous impact on medicine in many ways. Recent
efforts to identify these genetic factors through large scale
association studies which compare information on variation between a
set of healthy and diseased individuals have been remarkably
successful. However, despite the success of these initial studies,
many challenges and open questions remain on how to design and analyze
the results of association studies. As several recent studies have
demonstrated, confounding factors such as batch effects, population
structure, and measurement errors can complicate genetics analysis by
causing many spurious associations. Yet little is understood about how
these confounding factors affect analyses and how to correct for these
factors. In this talk I will discuss several recently developed
methods based on linear mixed models for correcting for both known and
unknown confounding factors in genetic studies.
Eleazar Eskin�s research focuses on developing computational methods
for analysis of genetic variation. He is currently an Associate
Professor in the Computer Science and Human Genetics departments at
the University of California Los Angeles. Previously, he was an
Assistant Professor in Residence in Computer Science Engineering at
the University of California, San Diego. Eleazar completed his Ph. D.
in the Computer Science Department of Columbia University in New York
City. After graduation, he spent one year in the Computer Science
Department at the Hebrew University in Jerusalem, Israel.
Magnus Rattray, The University
of Sheffield, Sheffield, UK.
Modeling Gene Expression Time-Series with Bayesian Non-Parametrics
Bayesian non-parametric methods are a natural approach to
fitting models with continuous parameters or unbounded parameter set
cardinality. We are applying these methods to diverse models of
time-series gene expression data. Example applications include
differential equation models of transcriptional regulation, clustering
data sampled at uneven times and phylogenetic models of gene
expression change over evolutionary time. We use continuous-time
Gaussian processes to model the time-evolution of gene expression and
protein activation/concentration in time. Dirichlet processes can also
be used to model an unbounded set of Gaussian process models. I will
present results where we apply these methods to gene expression
time-course data from embryonic development in Drosophila.
This is joint work with Neil Lawrence, Antti Honkela, Michalis Titsias
and James Hensman.
BSc in Mathematics and Physics (1992) and PhD in Computer Science
(1996) from the University of Manchester. Postdoctoral studies on
statistical mechanics theory of on-line learning in the Neural
Computing Research Group, Aston University (1996-1998). Lecturer and
Senior Lecturer in the School of Computer Science at the University of
Manchester (1998-2010) working on statistical inference with
applications to problems in computational molecular biology and
phylogenetic inference. Appointed Collaborative Chair in Neuro- and
Computer Science at the University of Sheffield in 2010. I co-lead
research groups in Machine Learning and Computational Biology and my
current work is on probabilistic modelling and statistical inference
with applications in functional genomics and systems biology.
Robert Küffner,
Ludwig-Maximilians-Universität, Munich, Germany.
Successful Strategies to Gene Regulatory Network Inference
The inference of gene regulatory networks from mRNA expression data is
characterized by the development of many different approaches with
their specific performances, data requirements, and inherent
biases.�Based on a recent community-wide challenge, the Dialogue on
Reverse Engineering Assessment and Methods (DREAM), the so far largest
assessment of inference approaches has been conducted. The accuracy of
predictions was evaluated against experimentally supported
interactions in the procaryote model organism E. coli, the eucaryote
model organism S. cerevisiae and in silico target systems.
Over thirty independently contributed methods were analyzed including
well-known approaches based on lasso, mutual information and Bayesian
networks but also a range of novel strategies. For instance, the novel
algorithms based on random forests and ANOVA outperformed established
tools significantly. Further analysis revealed not only which
inference strategies are particularly successful but also which kind
of specific information was utilized from the different types of
experimental measurements. At the same time, the performance of the
individual approaches varied in different network motifs or target
systems. By integrating individual predictions into community
predictions the performance improved overall and became markedly more
robust. This principle is known as the wisdom of crowds: a solution
derived from a community of independent decision makers will be
better, on average, than any individual solution. Based on community
predictions, we also constructed the first comprehensive gene
regulatory model for the human pathogen Staphylococcus aureus.
Dr. Robert Küffner habilitated in informatics in 2010 and is
currently a group leader for computer science and bioinformatics at
the Ludwig-Maximilians Universit�t M�nchen. He received his PhD in
molecular biology in 1998 at the Heinrich-Heine Universit�t in
D�sseldorf, Germany. His main interests include the analysis of
biological networks via Petri Nets as well as the areas of text
mining, expression analysis, gene regulation, alternative splicing and
systems biology.
Recently, Dr. Küffner's team was recognized as best performer in
two international community-wide challenges (DREAM4/2009 and
DREAM5/2010) where comprehensive blinded assessments of network
inference approaches have been conducted.
Preliminary Program
|
|
|
Wednesday, July 20, 2011
|
|
09:00 |
09:10 |
|
Registration and Welcome
|
|
09:10 |
10:00 |
|
Invited talk
|
|
|
|
|
Eleazar Eskin. Known and Unknown Confounding in Genetic Studies.
|
|
10:00 |
10:30 |
|
Session 1
|
|
10:00 |
10:30 |
|
Ryan Topping and John Pinney. An Evolutionary Measure for Studying the Re-wiring of Protein-Protein Interactions
|
|
10:30 |
11:00 |
|
Coffee break
|
|
11:00 |
12:20 |
|
Session 2: Theory / Applications
|
|
11:00 |
11:20 |
|
Johanna Mazur and Lars Kaderali. Bayesian Experimental Design for the Inference of Gene Regulatory Networks
|
|
11:20 |
11:40 |
|
Bettina Knapp and Lars Kaderali. Linear Model for Network Inference using RNA Interference Data
|
|
11:40 |
12:00 |
|
David Burstein, Sven Gould, Verena Zimorski, Thorsten Klösges, Fuat Kiosse, Peter Major, William Martin, Tal Pupko and Tal Dagan. A Machine-Learning Approach to Hydrogenosomal Protein Identification in Trichomonas Vaginalis
|
|
12:00 |
12:20 |
|
Juho Rousu, Daniel D. Agranoff, Delmiro Fernandez-Reyes and John Shawe-Taylor. Sparse Canonical Correlation Analysis for Biomarker Discovery: A Case Study in Tuberculosis
|
|
12:20 |
13:30 |
|
Lunch
|
|
13:30 |
15:00 |
|
Session 3: Gene Selection and Prioritization
|
|
13:30 |
14:00 |
|
Pierre Geurts and Yvan Saeys. Exploring Signature Multiplicity Using Ensembles of Randomized Trees
|
|
14:00 |
14:20 |
|
David Dernoncourt, Blaise Hanczar and Jean-Daniel Zucker. An Empirical Analysis of Markov Blanket Filters for Feature Selection on Microarray Data
|
|
14:20 |
14:40 |
|
Daniela Nitsch, Léon-Charles Tranchevent and Yves Moreau. Machine Learning Approaches for Network-Based Gene Prioritization from Expression Data
|
|
14:40 |
15:00 |
|
Shi Yu, Léon-Charles Tranchevent, Sonia M Leach, Bart De Moor and Yves Moreau. A Kernel Based Framework for Cross-Species Candidate Gene Prioritization
|
|
15:00 |
15:30 |
|
Coffee Break
|
|
15:30 |
16:30 |
|
Session 4: Applications of CCA
|
|
15:30 |
16:00 |
|
Yoshihiro Yamanishi, Edouard Pauwels, Hiroto Saigo and Veronique Stoven. Identification of Chemogenomic Features from Drug-Target Interaction Networks by Sparse Canonical Correspondence Analysis
|
|
16:00 |
16:30 |
|
Leo Lahti and Samuel Kaski. Probabilistic Dependency Models for Data Integration in Functional Genomics
|
|
19:00 (tentative time) |
|
|
Evening in a Viennese wine tavern (Heuriger) sponsored and organized by the City of Vienna
|
|
|
|
|
Thursday, July 21, 2011
|
|
09:10 |
10:00 |
|
Invited talk
|
|
|
|
|
Magnus Rattray.Modeling Gene Expression Time-Series with Bayesian Non-Parametrics.
|
|
10:00 |
10:30 |
|
Session 5
|
|
10:00 |
10:30 |
|
Panagiotis Achlioptas, Bernhard Schölkopf and Karsten Borgwardt. Epistasis Detection in Subquadratic Runtime
|
|
10:30 |
11:00 |
|
Coffee break
|
|
11:00 |
12:20 |
|
Session 6: Microarray Gene Expression Data / Data Integration
|
|
11:00 |
11:20 |
|
Alexandra Posekany, Klaus Felsenstein and Peter Sykacek. Assessing Noise Models for Microarray Data Analysis
|
|
11:20 |
11:40 |
|
Constanze Schmitt, Matthias Böck and Stefan Kramer. SOM Biclustering of Gene Expression Data
|
|
11:40 |
12:00 |
|
Hossein Rahmani, Hendrik Blockeel and Andreas Bender. Interaction-Based Feature Selection for Predicting Cancer-Related Proteins in Protein-Protein Interaction Networks
|
|
12:00 |
12:20 |
|
Ondřej Kuželka, Andrea Szabóová, Matěj Holec and Filip Zelezny. Gaussian Logic for Proteomics and Genomics
|
|
12:20 |
13:30 |
|
Lunch
|
|
13:30 |
14:20 |
|
Invited talk
|
|
|
|
|
Robert Küffner. Successful Strategies to Gene Regulatory Network Inference.
|
|
14:20 |
15:00 |
|
Session 7: Gene Expression Time Series
|
|
14:20 |
14:40 |
|
Matthias Böck, Constanze Schmitt and Stefan Kramer. A Study of Dynamic Time Warping for the Inference of Gene Regulatory Relationships
|
|
14:40 |
15:00 |
|
Alfredo A. Kalaitzis and Neil D. Lawrence. A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression
|
|
15:00 |
15:30 |
|
Coffee break
|
|
15:30 |
16:30 |
|
Session 8: Network Inference
|
|
15:30 |
16:00 |
|
Celine Brouard, Florence D'Alch�-Buc and Marie Szafranski. A New Theoretical Angle to Semi-Supervised Output Kernel Regression for Protein-Protein Interaction Network Inference
|
|
16:00 |
16:30 |
|
Steven M. Hill and Sach Mukherjee. An Exact Empirical Bayes Approach for Incorporating Biological Knowledge into Network Inference
|
|
16:30 |
|
|
Closing remarks
|
|