Dr. Chen, Hung-Hsin

Assistant Research Fellow

  1. Human Genome
  2. Genetic Epidemiology
  3. RNA-sequencing
  4. Biobank analysis
  5. Longitudinal analysis

Education and Positions:
    • Ph.D., Vanderbilt University
    • Postdoc, Vanderbilt University Medical Center

Large-scale genetic data with linked health records represent a major untapped resource for understanding genetic mechanisms of human health and disease. Our lab takes three general directions, each leveraging large-scale and longitudinal data to improve human health:


Genomic segments shared due to relatedness in large-scale biobanks provide an opportunity to identify novel disease genes.


Most biobanks contain a substantial amount of cryptic relatedness, which can introduce bias or reduce statistical power in traditional GWAS and is often treated as a nuisance. However, these patterns of relatedness can also be used for gene mapping. Because shared segments are inherited from the same ancestor, rare pathogenic variants within a shared region are likely to be shared as well. Comparing the distribution of shared segments between cases and controls can therefore point to disease-associated regions. We focus on developing and implementing shared-segment approaches for identifying novel disease genes in large-scale biobank data.
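
As a rough illustration of the case-control comparison described above, the sketch below computes, at a single locus, the fraction of case-case pairs and control-control pairs that share a segment covering that locus. It is a toy example under stated assumptions and not the lab's actual method: the function name `sharing_rates`, the segment tuple layout `(id1, id2, start, end)`, and the data are all hypothetical, and a real analysis would also need a significance test and adjustment for population structure.

```python
from itertools import combinations

def sharing_rates(segments, cases, controls, position):
    """Return (case_rate, control_rate): the fraction of within-group pairs
    that share at least one segment covering `position`.

    `segments` is an iterable of (id1, id2, start, end) tuples (hypothetical layout).
    """
    # Unordered pairs of individuals with a shared segment spanning the locus.
    covering = {
        frozenset((a, b))
        for a, b, start, end in segments
        if start <= position <= end
    }

    def rate(group):
        # All unordered pairs within the group.
        pairs = [frozenset(p) for p in combinations(sorted(group), 2)]
        if not pairs:
            return 0.0
        return sum(p in covering for p in pairs) / len(pairs)

    return rate(cases), rate(controls)

# Toy data (hypothetical IDs; coordinates in base pairs).
segments = [
    ("c1", "c2", 100, 500),
    ("c1", "c3", 200, 450),
    ("k1", "k2", 50, 150),
    ("c2", "k1", 600, 900),
]
cases = {"c1", "c2", "c3"}
controls = {"k1", "k2", "k3"}

case_rate, control_rate = sharing_rates(segments, cases, controls, position=300)
print(f"case-case sharing: {case_rate:.2f}, control-control sharing: {control_rate:.2f}")
```

A locus where sharing among cases clearly exceeds sharing among controls would be a candidate region for follow-up.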


Longitudinal, repeated measurements in the EHR allow us to investigate genetic effects on a phenotype’s trajectory over time.


Traditional epidemiological studies typically have cross-sectional measurements or measurements from study visits over a short follow-up time, while biobanks' linked longitudinal health records can contain repeated measures spanning decades. Extracting the most informative values for analysis (e.g., those that are highly heritable or predictive of disease risk) is a major challenge in biobank studies. Longitudinal data can also be used to study phenotypic trajectories across the lifespan. Our goal is therefore to create algorithms that capture powerful features of repeated measurements and use them to further our understanding of the genetic mechanisms of disease.
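
To make the idea of summarizing repeated measurements concrete, here is a minimal sketch that reduces one individual's measurements to a few candidate features: the number of observations, the median value, and the least-squares slope of the value against age. The function name `trajectory_features`, the `(age_years, value)` record layout, and the choice of features are assumptions made for illustration only; they are not taken from the lab's algorithms.

```python
from statistics import mean, median

def trajectory_features(records):
    """Summarize one individual's repeated measurements.

    `records` is a non-empty list of (age_years, value) pairs (hypothetical layout).
    """
    ages = [age for age, _ in records]
    values = [value for _, value in records]
    features = {"n": len(records), "median": median(values)}

    # Ordinary least-squares slope of value on age (change per year).
    age_mean, value_mean = mean(ages), mean(values)
    sxx = sum((age - age_mean) ** 2 for age in ages)
    if sxx > 0:
        sxy = sum((age - age_mean) * (value - value_mean) for age, value in records)
        features["slope"] = sxy / sxx
    else:
        # Slope is undefined with a single time point or identical ages.
        features["slope"] = None
    return features

# Toy data (hypothetical): a lab value measured at four ages.
records = [(40, 100.0), (45, 110.0), (50, 120.0), (55, 130.0)]
features = trajectory_features(records)
print(features)
```

Each feature could then serve as a phenotype in a downstream genetic analysis, which is one simple way to compare how informative different summaries are.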


Longitudinal multi-omic measures enable characterization of changes in expression (or other omic) patterns associated with the development of disease.


Gene expression and other omics are the product of complex interactions between genetic regulation and environmental stimuli. Analyzing these longitudinal measures will improve our understanding of genetic effects and of responses to environmental changes during disease pathogenesis. Our lab works with existing cohorts to develop multi-omic profiling approaches for studying correlations between omic patterns and disease incidence.