Data Analyst
Autonomous Professional Development Summer Project
• Developed course-job recommendation model for APD training company by helping users choose courses to match their career goal, and calculate the likelihood of getting a job based on users' educational background
• Used Selenium to crawl information from NYU CareerNet and NYU SPS school websites, reduced 90% processing time
• Processed 2, 000 jobs and 446 courses text files through Python for data preparation and prototypes design
• Quantified the similarity between course and job using different algorithm models tf-idf, LSI, etc. improved prediction accuracy from 30% to 67%.
• Determined the model to apply on the defining business problem by model selection technique KL Divergence
• Built and evaluated models to predict the probability of getting a job based on users' educational background