CakeResume 找人才

进阶搜寻
On
4 到 6 年
6 到 10 年
10 到 15 年
15 年以上
United States
Avatar of 姚璇亨.
Avatar of 姚璇亨.
商業分析師 @凱基商業銀行股份有限公司
2021 ~ 现在
Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst
一個月內
姚璇亨 Henry Yao 以人為本、技術為準;整合資源、解決問題;探索未知、建構知識。 Email: [email protected] |LINE: heinz.yao 工作經歷 凱基商業銀行股份有限公司:商業分析師 (DecNow) 第二波「鷹眼識詐聯盟」技術轉移,負責資料工程與模型導入之 Python 程式開發 運用 LightGBM 演算法建立預
Python
English as a Second Language (ESL)
Excel
就职中
目前会考虑了解新的机会
全职 / 对远端工作有兴趣
4 到 6 年
國立政治大學(National Chengchi University)
數位內容

最轻量、快速的招募方案,数百家企业的选择

搜寻简历,主动联系求职者,提升招募效率。

  • 浏览所有搜寻结果
  • 每日可无限次数开启陌生对话
  • 搜尋僅開放付費企業檢視的简历
  • 检视使用者信箱 & 电话
搜寻技巧
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
免费方案仅能搜寻公开简历。
升级至进阶方案,即可浏览所有搜寻结果(包含数万笔览仅在 CakeResume 平台上公开的简历)。

职场能力评价定义

专业技能
该领域中具备哪些专业能力(例如熟悉 SEO 操作,且会使用相关工具)。
问题解决能力
能洞察、分析问题,并拟定方案有效解决问题。
变通能力
遇到突发事件能冷静应对,并随时调整专案、客户、技术的相对优先序。
沟通能力
有效传达个人想法,且愿意倾听他人意见并给予反馈。
时间管理能力
了解工作项目的优先顺序,有效运用时间,准时完成工作内容。
团队合作能力
具有向心力与团队责任感,愿意倾听他人意见并主动沟通协调。
领导力
专注于团队发展,有效引领团队采取行动,达成共同目标。
一年內
Machine Learning Engineer
Logo of Taiwan AI Labs.
Taiwan AI Labs
2019 ~ 现在
Taipei, 台灣
专业背景
目前状态
就职中
求职阶段
目前没有兴趣寻找新的机会
专业
机器学习工程师, 数据科学家
产业
人工智能 / 机器学习, 大数据, 软件
工作年资
2 到 4 年
管理经历
我有管理 1~5 人的经验
技能
Natural Language Processing
Machine Learning
Deep Learning
Data Science
Python
Git
PyTorch
Keras
CI/CD
k8s
Docker
Data Analysis
CNN
语言能力
English
专业
求职偏好
希望获得的职位
Machine Learning Engineer
预期工作模式
全职
期望的工作地点
台灣台北, 台灣台南市, 美國加利福尼亞洛杉磯, 德國慕尼黑, 加拿大安大略多倫多
远端工作意愿
对远端工作有兴趣
接案服务
学历
学校
NCKU, Master Degree
主修科系
Computer Science
列印
User 9593 1475392706

Kai-Chou, Yang

As a Kaggle Competition Master and a winner of international data science challenges, I am experienced in machine learning, deep learning and related frameworks such as PyTorch.


My research focuses on natural language processing (NLP), where I have released 11 open-source projects such as MianBot (700+★ on Github) and presented certain academic papers on top conferences like ACL, AAAI, CIKM, and WSDM.

International Awards

For the following achievements, I am the first author as well as the team leader.

2nd Place, CIKM Cup: Cross-lingual Short-text Matching Challenge

  • Proposed two densely-connected architectures, CPRNN and DACNN, for sentence pair modeling.
  • Fused semantic features from different levels to create diversity intra-models.
  • The solution has been oral presented on CIKM 2018 in Turin, Italy.

3rd Place, WSDM Cup: Fake News Classification Challenge

  • Implemented various NLI networks like ESIM and injected world knowledge using BERT.
  • Proposed a disagreement-aware model based on the single-word attention.
  • The paper has been oral presented on WSDM 2019 in Melbourne, Australia.

4th place, Google AI: Gendered Pronoun Resolution Competition

  • Leveraged the information redundancy from BERT and extracted features from the optimal layer.
  • Proposed a multi-heads Siamese semantic scorer for answer selection.
  • The paper has been presented on ACL 2019 in Florence, Italy.

Kaggle Competition Master, Ranked top 0.2% (233/114,366)

  • Top 1% (4/838), Gendered Pronoun Resolution Competition.
  • Top 1% (27/4,550), Toxic Comment Classification Challenge.
  • Top 3% (30/1,449), CareerCon 2019 - Help Navigate Robots.
  • Top 4% (103/3,165), Jigsaw Unintended Bias in Toxicity Classification.
  • Top 6% (223/3,633), CommonLit Readability Prize. 
  • Top 10% (384/3,946), TalkingData AdTracking Fraud Detection Challenge.

Work Experience

Taiwan AI Labs, Machine Learning Engineer, Sep 2019 ~ Now

Question Answering System

  • Propose a conditional question generator with mT5 for controllable QA data augmentation and as the base of dense retrieval, which improves recall@50 from baseline model by 16%.
  • Build a generative pseudo labeling pipeline using a open-domain passage retriever and machine reader, which improve the nDCG@10 by 4.2 - 9.7, on various domains.
  • Build an efficient passage re-ranker based on tiny-bert with a time-series based clustering framework for effective negative passage sampling.
  • Leverage FinBERT on QA analysis and slot filling for fintech dialogue system.

Natural Language Understanding 

  • Implement a document encoder with self-contrastive learning and a document clustering algorithm, which is scalable for million scale of streaming data.
  • Implement a GROVER-like generator as the backbone for topic detection, article rewriting, and tag generation.
  • Propose a semi-automatic framework for fake-news identification, which gathers evidence from event properties, user behavior and textual features.
  • Propose a SOTA Chinese typo correction system based on a boosting loop of automatic speech recognition and text to speech for weak supervision.
  • Build a general-purpose NLP training pipeline for team use involving data augmentation, data regularization, and unsupervised domain adaptation.

Education

Master in Department of Computer Science, NCKU                                         GPA: 4.30

  • Honorary member of the Phi Tau Phi Scholastic Honor Society. (Ranked 1st among all graduates.)
  • As a teaching assistant for Introduction to Data Science, Data Mining and Discrete Mathematics.
  • As a speaker / teaching assistant for introduction lectures of machine learning.

Bachelor in Department of Computer Science, NCKU                                      GPA: 3.92






  • Academic excellence awards 2016.
  • Academic excellence awards 2015.
  • Honorable mention on the graduation exhibition.
  • Research assistant on a question answering system project for the Ministry of Science and Technology.

Side Projects

I list some of my project experiences. You can refer to my Github for the other interesting ideas.

Mianbot

  • Got 700+ stars and 200+ forks on Github.
  • Implemented the hierarchical keywords matching using word2vec.
  • Implemented the IR-based searching module to support chit-chat.
  • Allow user to define customized scenarios with JSON.
  • The extracted QA pairs were released in PTT-Gossiping-Dataset, a widely-used Chinese chit-chat corpus.
Paragraph image 00 00@2x

NCKU Smart-Life LineBot

  • A Linebot that helps solve trivial matters such as restaurant recommendation.
  • The dialogue system is based on LUIS for intent classification.
  • The backend was built with Django / Flask (new version) and host on Heroku.
  • The backend is connected with Line server using the web API.
Paragraph image 00 00@2x

Knowledge & Skills


  • General Machine Learning
    • Classification, Regression, Clustering, Boosting, Feature Engineering.
  • Natural Language Processing
    • Sentence Pair Modeling: Natural language Inference, Machine Reading Comprehension, Sentence Similarity
    • Text Classification / Regression / Clustering
    • Deep contextual representation (ELMO / BERT / XLNet / ELECTRA / RoBERTa / ERINE2.0 / BigBird / T5)
  • Recommendation System
    • Factorization: Matrix Factorization, Factorization Machine, DeepFM
    • Graph Embedding: DeepWalk, Node2Vec, item2Vec

Publication


  1. Fake News Detection as Natural Language Inference. Kai-Chou Yang; Timothy Niven; Hung-Yu Kao. WSDM Cup 2019
  2. Fill the GAP: Exploiting BERT for Pronoun Resolution. Kai-Chou Yang; Timothy Niven; Tzu Hsuan Chou; Hung-Yu Kao. ACLWS'19
  3. Generalize Sentence Representation with Self-Inference. Kai-Chou Yang; Hung-Yu Kao. AAAI 2020
  4. The Prevalence and Impact of Fake News on COVID-19 Vaccination in Taiwan: A Retrospective Study of Digital Media. Yen-Pin Chen; Yi-Ying Chen; Kai-Chou Yang; Feipei Lai; Chien-Hua Huang; Yun-Nung Chen; Yi-Chin Tu. JMIR
简历
个人档案
User 9593 1475392706

Kai-Chou, Yang

As a Kaggle Competition Master and a winner of international data science challenges, I am experienced in machine learning, deep learning and related frameworks such as PyTorch.


My research focuses on natural language processing (NLP), where I have released 11 open-source projects such as MianBot (700+★ on Github) and presented certain academic papers on top conferences like ACL, AAAI, CIKM, and WSDM.

International Awards

For the following achievements, I am the first author as well as the team leader.

2nd Place, CIKM Cup: Cross-lingual Short-text Matching Challenge

  • Proposed two densely-connected architectures, CPRNN and DACNN, for sentence pair modeling.
  • Fused semantic features from different levels to create diversity intra-models.
  • The solution has been oral presented on CIKM 2018 in Turin, Italy.

3rd Place, WSDM Cup: Fake News Classification Challenge

  • Implemented various NLI networks like ESIM and injected world knowledge using BERT.
  • Proposed a disagreement-aware model based on the single-word attention.
  • The paper has been oral presented on WSDM 2019 in Melbourne, Australia.

4th place, Google AI: Gendered Pronoun Resolution Competition

  • Leveraged the information redundancy from BERT and extracted features from the optimal layer.
  • Proposed a multi-heads Siamese semantic scorer for answer selection.
  • The paper has been presented on ACL 2019 in Florence, Italy.

Kaggle Competition Master, Ranked top 0.2% (233/114,366)

  • Top 1% (4/838), Gendered Pronoun Resolution Competition.
  • Top 1% (27/4,550), Toxic Comment Classification Challenge.
  • Top 3% (30/1,449), CareerCon 2019 - Help Navigate Robots.
  • Top 4% (103/3,165), Jigsaw Unintended Bias in Toxicity Classification.
  • Top 6% (223/3,633), CommonLit Readability Prize. 
  • Top 10% (384/3,946), TalkingData AdTracking Fraud Detection Challenge.

Work Experience

Taiwan AI Labs, Machine Learning Engineer, Sep 2019 ~ Now

Question Answering System

  • Propose a conditional question generator with mT5 for controllable QA data augmentation and as the base of dense retrieval, which improves recall@50 from baseline model by 16%.
  • Build a generative pseudo labeling pipeline using a open-domain passage retriever and machine reader, which improve the nDCG@10 by 4.2 - 9.7, on various domains.
  • Build an efficient passage re-ranker based on tiny-bert with a time-series based clustering framework for effective negative passage sampling.
  • Leverage FinBERT on QA analysis and slot filling for fintech dialogue system.

Natural Language Understanding 

  • Implement a document encoder with self-contrastive learning and a document clustering algorithm, which is scalable for million scale of streaming data.
  • Implement a GROVER-like generator as the backbone for topic detection, article rewriting, and tag generation.
  • Propose a semi-automatic framework for fake-news identification, which gathers evidence from event properties, user behavior and textual features.
  • Propose a SOTA Chinese typo correction system based on a boosting loop of automatic speech recognition and text to speech for weak supervision.
  • Build a general-purpose NLP training pipeline for team use involving data augmentation, data regularization, and unsupervised domain adaptation.

Education

Master in Department of Computer Science, NCKU                                         GPA: 4.30

  • Honorary member of the Phi Tau Phi Scholastic Honor Society. (Ranked 1st among all graduates.)
  • As a teaching assistant for Introduction to Data Science, Data Mining and Discrete Mathematics.
  • As a speaker / teaching assistant for introduction lectures of machine learning.

Bachelor in Department of Computer Science, NCKU                                      GPA: 3.92






  • Academic excellence awards 2016.
  • Academic excellence awards 2015.
  • Honorable mention on the graduation exhibition.
  • Research assistant on a question answering system project for the Ministry of Science and Technology.

Side Projects

I list some of my project experiences. You can refer to my Github for the other interesting ideas.

Mianbot

  • Got 700+ stars and 200+ forks on Github.
  • Implemented the hierarchical keywords matching using word2vec.
  • Implemented the IR-based searching module to support chit-chat.
  • Allow user to define customized scenarios with JSON.
  • The extracted QA pairs were released in PTT-Gossiping-Dataset, a widely-used Chinese chit-chat corpus.
Paragraph image 00 00@2x

NCKU Smart-Life LineBot

  • A Linebot that helps solve trivial matters such as restaurant recommendation.
  • The dialogue system is based on LUIS for intent classification.
  • The backend was built with Django / Flask (new version) and host on Heroku.
  • The backend is connected with Line server using the web API.
Paragraph image 00 00@2x

Knowledge & Skills


  • General Machine Learning
    • Classification, Regression, Clustering, Boosting, Feature Engineering.
  • Natural Language Processing
    • Sentence Pair Modeling: Natural language Inference, Machine Reading Comprehension, Sentence Similarity
    • Text Classification / Regression / Clustering
    • Deep contextual representation (ELMO / BERT / XLNet / ELECTRA / RoBERTa / ERINE2.0 / BigBird / T5)
  • Recommendation System
    • Factorization: Matrix Factorization, Factorization Machine, DeepFM
    • Graph Embedding: DeepWalk, Node2Vec, item2Vec

Publication


  1. Fake News Detection as Natural Language Inference. Kai-Chou Yang; Timothy Niven; Hung-Yu Kao. WSDM Cup 2019
  2. Fill the GAP: Exploiting BERT for Pronoun Resolution. Kai-Chou Yang; Timothy Niven; Tzu Hsuan Chou; Hung-Yu Kao. ACLWS'19
  3. Generalize Sentence Representation with Self-Inference. Kai-Chou Yang; Hung-Yu Kao. AAAI 2020
  4. The Prevalence and Impact of Fake News on COVID-19 Vaccination in Taiwan: A Retrospective Study of Digital Media. Yen-Pin Chen; Yi-Ying Chen; Kai-Chou Yang; Feipei Lai; Chien-Hua Huang; Yun-Nung Chen; Yi-Chin Tu. JMIR