Projects
Machine learning and deep learning
Deep learning model to forecast customer life time value
Developed a deep learning model to forecast customer life time value from historical time series time.
Used CNN/LSTM to extract time series features.
Combined time series features with time independent (geographical, business) features.
Model extendable to forecast different time horizons (1 month, 3 months, 12 months).
Deep Learning, Convolutional neural network, Long short-term memory, Life time value, Time series modeling
Convolutional neural network model for entity relation extraction
Developed a convolutional neural network model to extract entity relations from text corpus.
Used word embedding to transform tokens into numeric vectors, and applied convolutional kernels to extract semantic features to do prediction.
Deep Learning, Convolutional neural network, Word embedding, Relation extraction, Information retrieval
Deep learning model to predict anticancer drug response
Developed a deep learning model to predict anticancer drug response in lung cancer.
Used convolutional neural network framework to extract features from tens of thousands of genomic mutation locus.
Combined gene expression profile and drug information to predict drug sensitivity.
Convolutional neural network, Deep learning, Drug response
Statistical methodology and modeling
Bayesian hierarchical model to predict mRNA methylation sites
Developed a Bayesian hierarchical model to analyze methylated RNA immunoprecipitation sequencing (MeRIPseq) data.
Used zero-inflated negative binomial distribution to model sequencing read count. Modeled spatial dependency by hidden markov model.
Bayesian hierarchical model, Hidden markov model, Count data, Negative binomial distribution, Zero inflation
EM algorithm to identify genes that have tri-modal distribution
Developed an expectation maximization (EM) algorithm to deconvolute gene expression profile into three Gaussian components
(lower than normal, close to normal and higher than normal). Correlated breast cancer patients' survival with this stratification
to search for genes whose both high and lower expression are associated with worse survival outcome, which may potential have
dual role (both oncogene and tumor suppressor) in cancer development.
Expectation maximization, Gaussian mixture, Survival analysis, Breast Cancer
Bioinformatics software and web portal
Ensemble-based method to infer gene network structure
Developed an ensemble method to aggregate network constructed by multiple statistical methods (correlation-, mutual information-,
Bayesian-, likelihood-based methods); Developed a web server for online network inference and visualization.
Ensemble method, Hub node, Precision matrix, Network visualization, Online tool
DIGREM: an integrated web-based platform for detecting effective multi-drug combinations
Developed a computational algorithm to predict drug combination synergistic effect with transcriptomic profile,
drug dose response curve and gene regulatory network. Developed a user friendly web server to enable online prediction.
Implement this algorithm with an R package.
Drug-Induced Genomic Response models (DIGREM), Precision matrix, Drug synergy, Online tool