Eleazar Eskin, University of California, Los Angeles, USA.

Known and Unknown Confounding in Genetic Studies

Variation in human DNA sequences account for a significant amount of genetic risk factors for common disease such as hypertension, diabetes, Alzheimer's disease, and cancer. Identifying the human sequence variation that makes up the genetic basis of common disease will have a tremendous impact on medicine in many ways. Recent efforts to identify these genetic factors through large scale association studies which compare information on variation between a set of healthy and diseased individuals have been remarkably successful. However, despite the success of these initial studies, many challenges and open questions remain on how to design and analyze the results of association studies. As several recent studies have demonstrated, confounding factors such as batch effects, population structure, and measurement errors can complicate genetics analysis by causing many spurious associations. Yet little is understood about how these confounding factors affect analyses and how to correct for these factors. In this talk I will discuss several recently developed methods based on linear mixed models for correcting for both known and unknown confounding factors in genetic studies.

Eleazar Eskin�s research focuses on developing computational methods for analysis of genetic variation. He is currently an Associate Professor in the Computer Science and Human Genetics departments at the University of California Los Angeles. Previously, he was an Assistant Professor in Residence in Computer Science Engineering at the University of California, San Diego. Eleazar completed his Ph. D. in the Computer Science Department of Columbia University in New York City. After graduation, he spent one year in the Computer Science Department at the Hebrew University in Jerusalem, Israel.

Magnus Rattray, The University of Sheffield, Sheffield, UK.

Modeling Gene Expression Time-Series with Bayesian Non-Parametrics

Bayesian non-parametric methods are a natural approach to fitting models with continuous parameters or unbounded parameter set cardinality. We are applying these methods to diverse models of time-series gene expression data. Example applications include differential equation models of transcriptional regulation, clustering data sampled at uneven times and phylogenetic models of gene expression change over evolutionary time. We use continuous-time Gaussian processes to model the time-evolution of gene expression and protein activation/concentration in time. Dirichlet processes can also be used to model an unbounded set of Gaussian process models. I will present results where we apply these methods to gene expression time-course data from embryonic development in Drosophila.

This is joint work with Neil Lawrence, Antti Honkela, Michalis Titsias and James Hensman.

BSc in Mathematics and Physics (1992) and PhD in Computer Science (1996) from the University of Manchester. Postdoctoral studies on statistical mechanics theory of on-line learning in the Neural Computing Research Group, Aston University (1996-1998). Lecturer and Senior Lecturer in the School of Computer Science at the University of Manchester (1998-2010) working on statistical inference with applications to problems in computational molecular biology and phylogenetic inference. Appointed Collaborative Chair in Neuro- and Computer Science at the University of Sheffield in 2010. I co-lead research groups in Machine Learning and Computational Biology and my current work is on probabilistic modelling and statistical inference with applications in functional genomics and systems biology.

Robert Küffner, Ludwig-Maximilians-Universität, Munich, Germany.

Successful Strategies to Gene Regulatory Network Inference

The inference of gene regulatory networks from mRNA expression data is characterized by the development of many different approaches with their specific performances, data requirements, and inherent biases.�Based on a recent community-wide challenge, the Dialogue on Reverse Engineering Assessment and Methods (DREAM), the so far largest assessment of inference approaches has been conducted. The accuracy of predictions was evaluated against experimentally supported interactions in the procaryote model organism E. coli, the eucaryote model organism S. cerevisiae and in silico target systems. Over thirty independently contributed methods were analyzed including well-known approaches based on lasso, mutual information and Bayesian networks but also a range of novel strategies. For instance, the novel algorithms based on random forests and ANOVA outperformed established tools significantly. Further analysis revealed not only which inference strategies are particularly successful but also which kind of specific information was utilized from the different types of experimental measurements. At the same time, the performance of the individual approaches varied in different network motifs or target systems. By integrating individual predictions into community predictions the performance improved overall and became markedly more robust. This principle is known as the wisdom of crowds: a solution derived from a community of independent decision makers will be better, on average, than any individual solution. Based on community predictions, we also constructed the first comprehensive gene regulatory model for the human pathogen Staphylococcus aureus.

Dr. Robert Küffner habilitated in informatics in 2010 and is currently a group leader for computer science and bioinformatics at the Ludwig-Maximilians Universit�t M�nchen. He received his PhD in molecular biology in 1998 at the Heinrich-Heine Universit�t in D�sseldorf, Germany. His main interests include the analysis of biological networks via Petri Nets as well as the areas of text mining, expression analysis, gene regulation, alternative splicing and systems biology.
Recently, Dr. Küffner's team was recognized as best performer in two international community-wide challenges (DREAM4/2009 and DREAM5/2010) where comprehensive blinded assessments of network inference approaches have been conducted.

Poster presentations

Preliminary Program

Wednesday, July 20, 2011

09:00

09:10

Registration and Welcome

09:10

10:00

Invited talk

Eleazar Eskin. Known and Unknown Confounding in Genetic Studies.

10:00

10:30

Session 1

10:00

10:30

Ryan Topping and John Pinney. An Evolutionary Measure for Studying the Re-wiring of Protein-Protein Interactions

10:30

11:00

Coffee break

11:00

12:20

Session 2: Theory / Applications

11:00

11:20

Johanna Mazur and Lars Kaderali. Bayesian Experimental Design for the Inference of Gene Regulatory Networks

11:20

11:40

Bettina Knapp and Lars Kaderali. Linear Model for Network Inference using RNA Interference Data

11:40

12:00

David Burstein, Sven Gould, Verena Zimorski, Thorsten Klösges, Fuat Kiosse, Peter Major, William Martin, Tal Pupko and Tal Dagan. A Machine-Learning Approach to Hydrogenosomal Protein Identification in Trichomonas Vaginalis

12:00

12:20

Juho Rousu, Daniel D. Agranoff, Delmiro Fernandez-Reyes and John Shawe-Taylor. Sparse Canonical Correlation Analysis for Biomarker Discovery: A Case Study in Tuberculosis

12:20

13:30

Lunch

13:30

15:00

Session 3: Gene Selection and Prioritization

13:30

14:00

Pierre Geurts and Yvan Saeys. Exploring Signature Multiplicity Using Ensembles of Randomized Trees

14:00

14:20

David Dernoncourt, Blaise Hanczar and Jean-Daniel Zucker. An Empirical Analysis of Markov Blanket Filters for Feature Selection on Microarray Data

14:20

14:40

Daniela Nitsch, Léon-Charles Tranchevent and Yves Moreau. Machine Learning Approaches for Network-Based Gene Prioritization from Expression Data

14:40

15:00

Shi Yu, Léon-Charles Tranchevent, Sonia M Leach, Bart De Moor and Yves Moreau. A Kernel Based Framework for Cross-Species Candidate Gene Prioritization

15:00

15:30

Coffee Break

15:30

16:30

Session 4: Applications of CCA

15:30

16:00

Yoshihiro Yamanishi, Edouard Pauwels, Hiroto Saigo and Veronique Stoven. Identification of Chemogenomic Features from Drug-Target Interaction Networks by Sparse Canonical Correspondence Analysis

16:00

16:30

Leo Lahti and Samuel Kaski. Probabilistic Dependency Models for Data Integration in Functional Genomics

19:00 (tentative time)

Evening in a Viennese wine tavern (Heuriger) sponsored and organized by the City of Vienna

Thursday, July 21, 2011

09:10

10:00

Invited talk

Magnus Rattray.Modeling Gene Expression Time-Series with Bayesian Non-Parametrics.

10:00

10:30

Session 5

10:00

10:30

Panagiotis Achlioptas, Bernhard Schölkopf and Karsten Borgwardt. Epistasis Detection in Subquadratic Runtime

10:30

11:00

Coffee break

11:00

12:20

Session 6: Microarray Gene Expression Data / Data Integration

11:00

11:20

Alexandra Posekany, Klaus Felsenstein and Peter Sykacek. Assessing Noise Models for Microarray Data Analysis

11:20

11:40

Constanze Schmitt, Matthias Böck and Stefan Kramer. SOM Biclustering of Gene Expression Data

11:40

12:00

Hossein Rahmani, Hendrik Blockeel and Andreas Bender. Interaction-Based Feature Selection for Predicting Cancer-Related Proteins in Protein-Protein Interaction Networks

12:00

12:20

Ondřej Kuželka, Andrea Szabóová, Matěj Holec and Filip Zelezny. Gaussian Logic for Proteomics and Genomics

12:20

13:30

Lunch

13:30

14:20

Invited talk

Robert Küffner. Successful Strategies to Gene Regulatory Network Inference.

14:20

15:00

Session 7: Gene Expression Time Series

14:20

14:40

Matthias Böck, Constanze Schmitt and Stefan Kramer. A Study of Dynamic Time Warping for the Inference of Gene Regulatory Relationships

14:40

15:00

Alfredo A. Kalaitzis and Neil D. Lawrence. A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression

15:00

15:30

Coffee break

15:30

16:30

Session 8: Network Inference

15:30

16:00

Celine Brouard, Florence D'Alch�-Buc and Marie Szafranski. A New Theoretical Angle to Semi-Supervised Output Kernel Regression for Protein-Protein Interaction Network Inference

16:00

16:30

Steven M. Hill and Sach Mukherjee. An Exact Empirical Bayes Approach for Incorporating Biological Knowledge into Network Inference

16:30

Closing remarks

Program

Invited Speakers

Poster presentations

Preliminary Program