Data engineer working on ETL data flow design and on projects built around distributed full-text search engine technology.
Data engineer,
Taipei Special Municipality, TW
[email protected]
1. Maintenance, deployment, and tuning of a search engine cluster built on open-source projects, providing keyword search over more than 100 million documents.
2. Keyword query parser and tokenizer design and implementation based on Solr/Lucene.
3. ETL process design and implementation for text and image files.
4. Customized web crawler implementation.
1. Wet lab operation (immunohistochemistry, cell culture, lab maintenance).
2. Confocal microscopy operation.
3. Image processing, coding in MATLAB and using Imaris.
4. Bio-structure quantification/modeling.
Distributed full-text search engine development, maintenance, deployment, and schema design, based on Solr, Docker, Nginx, and Ansible. Implementation of tokenization, query parsing, and sharding optimization.
ETL process flow design and implementation in Java, integrated with Solr, PostgreSQL, and MongoDB.
Pre-processing of image series and image tracking, written in MATLAB or Java and using OpenCV.
Experimental physics: design and implementation of low-Reynolds-number fluid dynamics experiments. Image processing design and implementation, from data acquisition to data visualization.