Ying-Jhe, Tang

Natural Language Processing Engineer

A participant in Natural Language Processing.  

Interested in Machine Learning, Natural Language Processing, and eager to pursue new knowledge and technology.

  Taipei City, Taiwan

  [email protected]

0983034412







Experience



演算法工程師

Asus 華碩電腦股份有限公司

四月 2023 - Present
Taipei, Taiwan

  • Design Error Correction algorithm for speech recognition 

    • Correct the error word of speech recognition model output

      • Arpa model based Error Correction

      • Traverse the Arpa word graph which is constructed by the customer sentences in beam search manner.

    • Correct the error word in ChatBot conversation

      • Context Entity based Error Correction

      • Extract the entity by the NER model, then correct error word base on the extracted entities

  • Text-to-Speech technology application

  • Create the API for the Error Correction algorithm as service

  • Create the offline version for Speech Recognition System

  • Large Language Model application

  • Build web crawler and data cleaning to obtain the model training data



研究助理

中央研究院 資訊科學研究所

十一月 2021 - Present
Taipei, Taiwan

  • Multi-news abstractive summarization

    • Leverage discourse relationship tree to help the model learn the news discourse and structure in order to generate an accurate and coherent summary 

  • News knowledge update task

    • Given an existing article about a related news event, then generate an updated article according to the information from the news event

    • Leverage web crawler to collect original articles, news and updated articles triples from the website and generate the news event triggered knowledge update dataset 

    • Published in CIKM 2022 [1]:

學習型研究助理

KKBOX香港商科科科技股份有限公司台灣分公司

八月 2020 - 六月 2021
Taipei, Taiwan

T

學習型研究助理

國立政治大學(National Chengchi University

十一月 2019 - 六月 2021
Taipei, Taiwan

Education


2019 - 2021

國立政治大學(National Chengchi University)

資訊科學研究所

2015 - 2019

國立政治大學(National Chengchi University)

資訊管理學系

Publication


[1] A Multi-grained Dataset for News Event Triggered Knowledge Update (CIKM 2022)


[2] A Teacher-Student Approach to Cross-Domain Transfer Learning with Multi-level Attention  (WI-IAT 2021) 


[3] Position Analysis of Internet Public Opinions - A Case Study of Nuclear Energy Policy in Taiwan (中央研究院調查研究方法與應用學術研討會 2021) 


Skills


  • Natural Language Processing, Machine Learning, Deep Learning, LLM, Langchain, UNIX‑like Operating System, Google cloud platform, Github, Pandas

  • Framework: Pytorch, Tensorflow, Django

  • Programming Languages: PYTHON, SQL