Transcript
Page 1: Xinyue Emma) Li · • Relevant Courses: Predictive Analytics, Data Mining, Data Visualization, Analytics for Big Data (Hadoop/Spark/Hive), Database Design and Information Retrieval,

Xinyue (Emma) Li Emeryville CA 94608 · (530) 564-2418 · [email protected]

EDUCATION Northwestern University Evanston, IL MS in Analytics, GPA: 3.80/4.00 Expected 12/2018 • Relevant Courses: Predictive Analytics, Data Mining, Data Visualization, Analytics for Big Data (Hadoop/Spark/Hive),

Database Design and Information Retrieval, A/B Testing University of California, Davis Davis, CA B.S. Applied Statistics, GPA: 3.91/4.00; B.S. Managerial Economics, GPA: 3.76/4.00 06/2017 SKILLS R, Python (Pandas, NumPy, Scikit-learn, Keras), SQL (MySQL, Hive, Netezza), Spark, Hadoop, Java, D3.js, HTML/CSS, Tableau, AWS, SAS, C, Stata, Matlab WORKING EXPERIENCE TransUnion Chicago, IL Data Science Intern 06/2018 – 09/2018 • Constructed risk score models to review personal loan applications using various methods such as tree-based models

(XGBoost, Gradient Boosting, C5.0, Random Forest), SVM and Artificial Neural Networks • Researched and implemented methods of variable interpretation in Neural Networks for adverse selection • Performed quantitative analysis on 1B+ trades records to identify the customers’ capacity to absorb ongoing credit products

as the interest rate increases, which improved its cycle readiness through early identification of a shift in consumers’ debt Graduate Analytics Consultant 09/2017 - 06/2018 • Created graph components and generated features on 500k+ credit accounts with shared identity information • Trained Boosting Trees with XGBoost to identify the fraudulent accounts and achieved 95.9% precision • Researched and applied a Convolutional Neural Network using Graph Kernels to improve the fraud detection performance • Detected undiscovered suspicious accounts and improved the previous model by 25% Agricultural Issues Center Davis, CA Undergraduate Researcher 07/2016 - 06/2017 • Analyzed the effect of the legalization on Cannabis price in California through Hypothesis Testing in R • Performed exploratory data analysis and analyzed and researched reasons for price change across different agricultural

commodities through Functional Principal Component Analysis on the 1995-2015 California agricultural exports Standard Chartered Bank Shanghai Intern 08/2016 - 09/2016 • Predicted key indicators (revenue growth, EBITDA, taxable profit, etc) of the client to ensure liquidity for debt issuance • Explored potential collaboration opportunities between Standard Chartered Bank and Alibaba through analyzing Alibaba’s

operational model and cash flow

PROJECTS Predictive Modeling on Clothing Sales 10/2017 - 12/2017 • Cleaned data inconsistencies and imputed missing values with MICE algorithm and KNN methods • Generated features measuring the recency and frequency of consumers’ purchasing behaviors • Evaluated the efficacy of catalog-driven marketing through predicting customers’ future purchases with stacking Logistic

Regression and Multiple Linear Regression models • Estimated the expected profit to assist the company’s marketing strategy decision Gaming Analytics on Destiny II 01/2018 - 06/2018 • Designed a Player versus Player recommendation system framework based on team play to improve team performance • Implemented clustering analysis through K-means, GMM, Archetype Analysis on 16M+ matches from Destiny II to create

player profiles • Produced team profiles accordingly and provided recommendations via K-Nearest Neighbor method • Submitted paper to AIIDE(Artificial Intelligence and Interactive Digital Entertainment Conference) 2018 Venmo Transaction Study 04/2018 - 06/2018 • Conducted quantitative analysis with effective visualizations on 7M+ transactions via PySpark and SparkSQL to summarize

Venmo's social network • Analyzed different emoji use patterns in various time frames to learn users' spending habits using RDD and Spark data frame • Clustered the transaction messages with PySpark using text-based attributes to improve the text classification algorithm in

each segmentation ACTIVITIES AND LEADERSHIP • Vice President of Career Development Department at CSSA – Davis, CA 01/2016 - 06/2017 • Academic Coordinator at UC Davis Statistics Club – Davis, CA 06/2015 - 12/2016 • Volunteer at NYBL Foundation of America – Sacramento, CA 01/2015 - 03/2016

Recommended