Rice University 2004-present
Physical and Biological Computing Group
- Designed and implemented a publicly available web-based application for protein function prediction
- Coded an automated large-scale data mining pipeline to gather biological metadata from heterogeneous sources and produce graphics-rich summary reports
- Developed software to distribute computationally expensive protein analysis algorithms over large clusters of computers using MPI
- Constructed models to automatically detect statistically significant local structural features shared between proteins
- Synthesized unsupervised machine learning approaches with linear/non-linear dimensionality reduction for protein classification
- Created novel geometric models of protein catalytic sites to improve computational protein function prediction accuracy
University of Houston May-September, 2010
Center for Biomedical and Environmental Genomics
- Implemented statistical models for distinguishing low-frequency/rare-variant SNPs from sequencing anomalies
- Coded an automated pipeline for mapping NCBI and MITOMAP genomic annotations to newly sequenced genomes
Genentech, Inc. June-December, 2007
Internship with the Department of Early-Stage Purification
- Implemented a lab- and pilot-scale, non-chromatography process for monoclonal antibody (mAb) purification
- Optimized purification process parameters for improved mAb yield and time-efficiency
- Utilized a robotic wet lab station for automated, high-throughput parameter sweeping