Professional Summary

Summary Statement

As a computational scientist, I am deeply committed to advancing scientific discovery through innovative and rigorous computational approaches. My expertise spans statistical genetics, genomics, metagenomics, cancer genomics, and clinical data analysis, with a particular interest in the interaction between genetics, the microbiome, and environmental exposures. I have led and contributed to interdisciplinary research projects that integrate molecular, environmental, and clinical data, applying methodologies from statistical genetics, microbial ecology, biostatistics, epidemiology, and machine learning. I am experienced in all phases of large-scale human subject research, from study design and participant recruitment to data analysis and publication in high-impact journals. I have a strong track record of developing bioinformatic and statistical pipelines for diverse data types including NGS (genomic, metagenomic, transcriptomic), genetic variant, environmental exposure, and clinical subject data. I am passionate about mentoring, teaching, and consulting on computational research and methodologies, and skilled at communicating complex theory and findings to both expert and lay audiences. My work has contributed to both academic and industrial institutions and large-scale, multi-center research collaborations.

Skills

Skills
Data Analysis / Statistics
  • R (programming)
  • R and Quarto markdown
  • Python
  • GWAS
  • PheWAS
  • Statistical genetics
  • Microbial ecology
  • Epidemiology
  • Biostatistics
  • Cross-sectional data
  • Longitudinal data
  • Generalized linear models
  • Linear mixed models
  • Non-parametric methods
  • Network analysis
  • PCA
  • Mediation analysis
  • Machine learning
  • Data visualization
  • Clinical data analysis
Bioinformatics / Computing
  • HPC
  • Pipeline development
  • Metagenomic profiling
  • NGS quality control
  • Sequence alignment
  • Functional annotation
  • Big data
  • Shell scripting
  • Bash
  • Parallel computing
  • Linux
  • Git
  • GitHub
  • Visual Studio Code
  • Variant calling
  • Variant quality control
  • Software management
  • Data storage
Data Management
  • Data extraction
  • SQL
  • Quality control
  • Data cleaning
  • Harmonization
  • Database design
  • Database management
  • Database query
  • Public databases and datasets
  • Security
  • Electronic medical records
  • Microsoft Excel
Project Leadership and Management
  • Study design
  • Protocol design
  • Human subject research
  • Animal model research
  • Project leading
  • Project management
  • Documentation writing
  • Collaboration
  • Organizational skills
  • Communication skills
  • Time management
  • Conflict management
Interpersonal Skills
  • Attentiveness
  • Empathy
  • Supportive
  • Adaptability
  • Teamwork
  • Leadership
  • Responsibility
  • Dependability
  • Patience
Science Communication
  • Scientific writing
  • Technical reporting
  • Presentation skills
  • Networking
  • Public speaking
  • Scientific editing
  • Social media