in medical Q&A applications, improving model performance by 2.01% . Efficiency Optimization: Utilized Low-Rank Adaptation (LoRA) for training LLM, reducing parameter training volume to 2% , decreasing memory usage by 70%, and increasing training speed by 25%. Architecture Optimization: Used Direct Preference Optimization (DPO) to optimize Reinforcement Learning, improving model efficiency and decision quality. [ G itHub ] Graduate Research Assistant OctOct 2023 Institute of Information Science, Academia Sinica, Taiwan Natural Language and Knowledge Processing Lab (NLP Lab) Publication: Published in ACM CIKMSequential Text-based Knowledge Update with Self-Supervised Learning for Generative Language Models . [ Paper
Ted Li Senior Firmware Engineer Over 6 years of firmware/software development expertise as a Senior Firmware Engineer, specializing in embedded systems, cross- functional projects, and RL-optimizations. Driving global technical innovations and training. New Taipei City, Taiwan [email protected] https://github.com/armcortex https://www.linkedin.com/in/ted-li/ https://about.armcortex.cc/ Skill Programming C/C++ Python Bash SQL AI (PyTorch, TensorFlow, Keras) Tool RTOS Embedded System Git Docker/Docker Compose
C
Python
C/C++
Unemployed
・
Ready to interview
Full-time / Interested in working remotely
6-10 years
日本電氣通信大學 The University of Electro-Communications (UEC)
・
Robotics Engineering
The Most Lightweight and Effective Recruiting Plan
Search resumes and take the initiative to contact job applicants for higher recruiting efficiency. The Choice of Hundreds of Companies.