- Identified different protein species and concentration levels from 50K+ Raman spectrums with over 3,000 features in each using clustering algorithms
- Developed Python scripts with a PLS-DA and SVM combined machine learning pipeline for predicting protein aggregation in real-time during downstream purification; saved $20K annually in outsourcing analytical software fees