Title: Dynamic Programming in Secondary Structure Inference
Abstract: Given an RNA sequence, secondary structure inference is the problem of predicting that sequence's base pairs. A variety of methods for this problem exist; among the most popular are minimum free energy (MFE) methods, which assign each possible secondary structure an energy based on the presence or absence of various substructures, with negative energy structures being more likely to occur naturally. These methods then use dynamic programming to predict the lowest free energy structure(s) efficiently. We will give an introduction to dynamic programming, talk about why it is necessary for approaching this problem efficiently, and discuss some of the shortcomings of the method. If time permits, we will also talk about connections to machine learning methods for secondary structure prediction.
Abstract:Vector-borne diseases affects approximately 1 billion people and accounts for 17% of all infectious diseases. With travel becoming more frequent across the global, it is important to understand the spatial dynamics of vector-borne diseases. Host movement plays a key part on how a disease can be distributed as it enables a pathogen to invade a new environment, and helps the persistence of a disease in locations that would otherwise be isolated. In this talk, we will explore how spatial heterogeneity combines with mobility network structure to influence vector-borne disease dynamics
Abstract:Vector-borne diseases affects approximately 1 billion people and accounts for 17% of all infectious diseases. With travel becoming more frequent across the global, it is important to understand the spatial dynamics of vector-borne diseases. Host movement plays a key part on how a disease can be distributed as it enables a pathogen to invade a new environment, and helps the persistence of a disease in locations that would otherwise be isolated. In this talk, we will explore how spatial heterogeneity combines with mobility network structure to influence vector-borne disease dynamics
Title: Model-dependent and model-independent control of biological network models
Abstract: Network models of intracellular signaling and regulation are ubiquitous in systems biology research because of their ability to integrate the current knowledge of a biological process and test new findings and hypotheses. An often asked question is how to control a network model and drive it towards its dynamical attractors (which have been found to be identifiable with phenotypes or stable patterns of activity of the modeled system), and which nodes and interventions are required to do so. In this talk, we will introduce two recently developed network control methods -feedback vertex set control and stable motif control- that use the graph structure of a network model to identify nodes that drive the system towards an attractor of interest (i.e., nodes sufficient for attractor control). Feedback vertex set control makes predictions that apply to all network models with a given graph structure and stable motif control makes predictions for a specific model instance, and this allows us to compare the results of model-independent and model-dependent network control. We illustrate these methods with various examples and discuss the aspects of each method that makes its predictions dependent or independent of the model.
Abstract: Optical tomography is the process of reconstructing the optical parameters of the inside of an object from measurements taken on the boundary. This problem is hard if light inside the object is scattered -- if it bounces around a lot and refuses to travel in straight lines. To solve optical tomography problems, we need a mathematical model for light propagation inside a scattering medium. In this talk I'll give a brief introduction to one such model -- the radiative transport model -- and talk a little bit about its behavior and its implications for optical tomography.
Abstract: Optical tomography is the process of reconstructing the optical parameters of the inside of an object from measurements taken on the boundary. This problem is hard if light inside the object is scattered -- if it bounces around a lot and refuses to travel in straight lines. To solve optical tomography problems, we need a mathematical model for light propagation inside a scattering medium. In this talk I'll give a brief introduction to one such model -- the radiative transport model -- and talk a little bit about its behavior and its implications for optical tomography.
Title:Preconditioning for Accurate Solutions of Linear Systems and Eigenvalue Problems
Abstract:This paper develops the preconditioning technique as a method to address the accuracy issue caused by ill-conditioning. Given a preconditioner M for an ill-conditioned linear system Ax=b, we show that, if the inverse of the preconditioner can be applied to vectors accurately, then the linear system can be solved accurately. A stability concept called inverse-equivalent accuracy is introduced to describe higher accuracy that is achieved and an error analysis will be presented. As an application, we use the preconditioning approach to accurately compute a few smallest eigenvalues of certain ill-conditioned matrices. Numerical examples are presented to illustrate the error analysis and the performance of the methods.
Title: Algebraic Statistics Applications in Epidemiology
Abstract: Interactions between single nucleotide polymorphisms (SNPs) and complex diseases have been an important topic throughout epidemiological studies. Previous studies have mostly focused on gene variables at a single locus. In this talk, I will discuss a focused candidate gene study to test the interaction of multiple SNPs with the risk of different types of cancer.
We will exemplify the fact that traditional asympotic results in statistical analysis do not apply in our setting. This is due mainly to the fact that we have a relatively small fixed data set. In our work we develop a new statistical approach using techniques from the field of algebraic statistics. Algebraic statistics focuses on mathematical aspects of statistical models, where algebraic, geometric and combinatorial insights can be useful to study behavior of statistical procedures.
Using the R package algstat, developed by Kahle, Garcia Puente, and Yoshida, we implemented an algebraic statistics method that can test for independence between several variables and the desease. We applied our methods to the study of gene-gene interaction on cancer data obtained from the European case-control study Gen-Air extending previous work by Ricceri, Fassino, Matullo, Roggero, Torrente, Vineis, and Terracini.
Title: Structural and Functional Characterization of Expected and Aberrant Metal Ion Coordination in Proteins
Abstract: Metalloproteins bind and utilize metal ions for a variety of biological purposes. Due to the ubiquity of metalloprotein involvement throughout these processes across all domains of life, how proteins coordinate metal ions for different biochemical functions is of great relevance to understanding the implementation of these biological processes. Towards these ends, we have improved our methodology for structurally and functionally characterizing metal binding sites in metalloproteins. Our new ligand detection method is statistically much more robust, producing estimated false positive and false negative rates of ~0.11% and ~1.2%, respectively. Additional improvements expand both the range of metal ions and their coordination number that can be effectively analyzed. Also, the inclusion of many additional quality control filters has significantly improved structure-function Spearman correlations as demonstrated by rho values greater than 0.90 for several metal coordination analyses and even one rho value above 0.95. Also, improvements in bond-length distributions have revealed bond-length modes specific to chemical functional groups involved in multidentation. Using these improved methods, we analyzed all single metal ion binding sites with Zn, Mg, Ca, Fe, and Na ions in wwPDB, producing statistically rigorous results supporting the existence of both a significant number of unexpected compressed angles and subsequent aberrant metal ion coordination geometries (CGs) within structurally known metalloproteins. By recognizing these aberrant CGs in our clustering analyses, high correlations are achieved between structural and functional descriptions of metal ion coordination. Moreover, distinct biochemical functions are associated with aberrant CGs versus non-aberrant CGs.