IBISC  GENOPOLE     

Vacant jobs

A 2-YEAR POSTDOC POSITION


AVAILABLE AT GENOPOLE-University of Evry-FRANCE in early 2010


in MACHINE LEARNING and REVERSE-MODELING OF SIGNALING and GENE REGULATORY NETWORKS


The Machine Learning, Modeling and Data Integration group at laboratoryIBISC at University of Evry - Genopole (30km Paris, France) is seeking postdoctoral candidates for a two-year position with a strong background in statistical learning (graphical models and kernel methods) and an interest for interdisciplinary research aimed towards reverse-modeling of biological networks.

The successful applicant will participate to an ANR project called ODESSA* (Ordinary Differential Equations and State-Space models for signAlling and regulatory networks inference) that will begin on January 1rst, 2010, in collaboration with the Laboratoire de Biologie Moléculaire de la Cellule (Ecole Normale Supérieure de Lyon, France). He/she will investigate new algorithms for structure and module learning in nonlinear state-space models. A sound knowledge in graphical models (dynamical models, mixture models, latent variable models) and kernel-based approaches is asked. The targeted application is the identification of signaling and gene regulatory networks involved in the cellular response to retinoic acid from large scale data (next generation sequencers). Experience in large scale data-mining of post-genomic data will be appreciated but is not required.


The candidate will have a Ph.D. in Computer Science, Mathematics or Bioinformatics with a strong publication record.

 

Applications for this role should include a statement of research experience and interests, a CV, two reference letter(s) from appropriate academic sources and   the two best publications.


Please send your applications in electronic form to Prof. Florence d’A lché-Buc : florence (dot) dalche (at) ibisc (dot) fr


Website of the team: http://amisbio.ibisc.fr


The position can start in 2010 and if possible not later than September 2010.


The deadline for application is July 10, 2010


ODESSA is described here:

ODESSA : Ordinary Differential Equation and State-Space models for signalling and gene regulatory networks inference

ANR Project

 

Soon (January 2010)

  www.odessa-project.fr

 

Partners:

- IBISC FRE CNRS 3190, Univeristé d’Evry-Val d’Essonne, Genopole, Evry,France

- Institut de Génomique Fonctionnelle de Lyon (IGFL)

 

Coordination:

Prof. Florence d’Alché-Buc : florence.dalche@ibisc.fr , amisbio.ibisc.fr

 

Keywords : signalling networks, gene regulation networks, next generation sequencers, network inference, feature selection, module extraction, constraints, parametric and non parametric ODEs, state-space models, structure learning, probabilistic graphical model, Bayesian inference, Population-based methods (Monte-Carlo, EDA, Gibbs sampling), kernels

 

Abstract

 

Complex signalling and regulation mechanisms at work in the cell involve different kinds of molecular components that interact through time. Identifying these interactions and understanding how the cell implements a response to a given input signal are undoubtedly the main challenge of systems biology. To address this issue is one of the most powerful way to build a mathematical model of the complex dynamical underlying system and to study how it gives support to biological hypotheses. Many modellers have significantly contributed to this research current by exploiting different modelling frameworks like Ordinary Differential Equations, Boolean and discrete-value networks with an analysis of the dynamics derived from bifurcation theory or formal test methods.

Meanwhile several researchers ask another question: how can we determine automatically the parameters and the structure of the dynamical system under observation ? While for a long time, this goal was supposed to be out of reach, the spread-out of new high-throughput technologies consecutively to the emergence of sequencing tools has completely changed the game: it is now possible to measure gene expression in a given tissue in a given organ of some organisms in given conditions and therefore, the dynamical system underlying the interactions can be partially observed. Therefore, reverse-modelling becomes in principle feasible although many technical problems have been raised these last years: non identifiability of some parameters, limited sample size compared to the length of available time series, difficulty of structure estimation in front of noisy and incomplete data.

 

This project, exploiting two concrete problems of reverse-modelling, is mainly focused on these technical issues.

 

First biological problem:

 

The first biological issue is related to the analysis of genetic networks controlled by transcription factors. We choose to focus our study on the analyses of gene network modulated by Retinoic Acid Receptors (RARs), which are members of the nuclear receptor (NR) superfamily. This family of ligand modulated transcription factors whose study has been instrumental in the understanding of transcription regulation mechanisms, gathers major transcription factors, sharing important structural and functional properties, and implicated in development, in adult physiology as well as in various pathologies.

 

Second biological problem:

 

The second biological issue is related to the study of oscillating biological systems and more specifically to the circadian clock. In the recent years, lots of attention was put on the elucidation of the molecular mechanisms supporting the generation of circadian oscillations in living organisms. Within this well-defined frame, we choose to focus our study on the modulation and cellular integration of the circadian oscillations with a particular emphasis on the role of the nuclear receptor rev-erb a .

 

 

Methods :

 

Starting from the wide class of Ordinary Differential Equations and given the fact that the signalling network (resp. regulatory) may be not fully observed, we propose to improve two frameworks for learning parameters and structure: the first one is based on the idea that the observed trajectory can be approximated using a non parametric or a semi-parametric model and that the derivative of the proxy can then be used in the cost function to be minimized. The second one encapsulates the ODE into a discrete time probabilistic model such as a state-space model in order to deal with hidden variables. The former framework opens the door to the incorporation of qualitative constraints on the nature of the observed trajectory whereas the latter framework benefits from the flexibility of graphical models and their dedicated leaning approaches. Among issues related to the proposed biological problems, the project will try to address the following ones:

 

-        feature selection, network inference from large scale data (next generation sequencer), subnetwork extraction

 

-        incorporation of various constraints   in a Bayesian or frequentist framework: dynamics, biological function etc…

 

-        new nonparametric models for state-space models and differential equations

 

 

-     statistical test of models

Some recent publications related to the project

 

[d’Alché-Buc & Brunel 2009] F. d’Alché-Buc, N., J.-B., Brunel. Estimation of parametric nonlinear ODEs for biological networks identification, to appear as a chapter in Computational Systems Biology: Data Driven Inference of Parameters and Network Structure, Neil Lawrence, Magnus Rattray, Guido Sanguinetti,Mark Girolami, MIT Press, to appear in late 2009.

 

[d’Alché-Buc & Wehenkel 2008] F. d’Alché-Buc, L. Wehenkel, Machine Learning in Systems Biology, special issue BMC proceedings, December 2008.

 

[Auliac et al. 2008] C. A uliac, V. Frouin, X. Gidrol, F. d’Alché-Buc, Evolutionary Approaches for the Reverse-Engineering of Gene Regulatory Networks: a study on a biologically realistic Dataset, BMC Bioinformatics 2008, 9:91.

[Bouchet et al 2007] P. Bouchet, N. Brunel, F. d'Alché-Buc, Parameter Estimation of ODEs using Support Vector Regression and Qualitative constraints, PASCAL Workshop, September 18 th  2007, Glasgow (communication).

 

[Brunel 2008] N.,J.-B.,Brunel, Parameter estimation of ODEs via nonparametric estimators, Electronic Journal of Statistics,   Vol. 2 (2008) 1242–1267.

[Cai et al. 2008] Cai W., Rambaud J., Teboul M., Masse I., Benoit G., Gustafsson J.A., Delaunay F., Laudet V., and Pongratz I. Expression levels of ERβ are modulated by components of the molecular clock. Mol Cell Biol: 28(2), 784-793. 2008.

[Geurts et al. 2007] P. Geurts, N. Touleimat, M. Dutreix, F. d'Alché-Buc, Inferring biological networks with output kernel trees, BMC Bioinformatics, 8(Suppl 2):S4, May 3, 2007.

[Geurts et al. 2007] P. Geurts, L. Wehenkel, F. d'Alché-Buc, Gradient Boosting for Kernelized
Output Spaces, Zoubin Ghahramani (Eds.), in Proceedings of theTwenty-Fourth International Conference on Machine Learning (ICML 2007),Corvallis, Oregon, USA, June 20-24, 2007, 227:289-296.

 

[Perrin et al. 2003] B.-E. Perrin, L. Ralaivola,A. Mazurie, S. Bottani, J. Mallet, F. d'Alché-Buc, Inference of gene regulatory network with Dynamic Bayesian Network, Bioinformatics (Oxford Press), vol. 19, pi38-49, (2003).

 

[Quach et al. 2007] M. Quach, N. Brunel, F. d'Alché-Buc, Estimating parameters and hidden variables in nonlinear state-space models based on ODEs for biological networks inference. Bioinformatics, 23, pp. 3209–3216, 2007.

[Rambaud et al. 2009] Rambaud J., Triqueneaux G., Masse I., Staels B., Laudet V. and Benoit G. Rev-erbα2 mRNA encodes a stable protein likely to play a role in circadian clock regulation. Mol Endocrinol. In press. 2009

[Veber et al. 2008] P. Veber, C. Guziolowski, M. Le Borgne, O. Radulescu, and A. Siegel. Inferring the role of transcription factors in regulatory networks. BMC Bioinformatics, 9 :228, 2008.

Master Internship/ Stages de Master 1, 2, Engineering school (grandes écoles)
Statistical learning for gene network inference read more...

Post-doctoral position in prediction in structured output spaces
We offer a 12 months post-doctoral position in prediction in structured output spaces. read more...

New Job Offer
 
a two-year
postdoc position
available in 2010
Network reverse-modeling
More details in Job Offers page


 

Machine Learning in Systems Biology,2009

This international workshop will take place at Ljubljana, Slovenia on September 5-6, 2009.
All details here


Top of page Recommend page Print version Contact  Accessible Version  Imprint