CakeResume Talent Search

Advanced filters
On
4-6 tahun
6-10 tahun
10-15 tahun
Lebih dari 15 tahun
Avatar of the user.
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Dalam satu bulan
Python
R
Natural Language Processing (NLP)
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
國立政治大學(National Chengchi University)
資訊科學系
Avatar of the user.
Avatar of the user.
Past
Data Engineer @Rooit Inc. (XO App)
2023 ~ 2023
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Dalam satu bulan
Python
Data Analysis
Data Science
Tidak bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
6-10 tahun
中國醫藥大學(China Medical University)
臨床醫學研究所
Avatar of Chun-Jung Huang.
Avatar of Chun-Jung Huang.
OPC Chief Engineer @TSMC
2020 ~ Sekarang
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Dalam satu bulan
Chun-Jung Huang [email protected] Chiao-Tung University, Ph.D. - Photonics,2015 ~ 2020 Member of The Phi Tau Phi Scholastic Honor Society of the Republic of China. Work Experience TSMC, OPC Chief Engineer (MarPresent) ◆Introduced image anomaly detection techniques to identify and address defects in photomask manufacturing, significantly improving product quality and reducing turnaround time. ◆Managed large-scale data processing tasks, demonstrating expertise in analyzing and handling datasets of hundreds of millions, to bolster model development and optimization. ◆Excelled in distributed computing, optimizing code execution across thousands of systems to
Deep learning with TensorFlow
Translational Research
Clinical Research
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
National Chiao-Tung University
Ph.D. - Clinical Engineering
Avatar of the user.
Avatar of the user.
智慧製造全端開發工程師 @聯華電子股份有限公司
2022 ~ Sekarang
AI工程師、機器學習工程師、深度學習工程師、影像演算法工程師、資料科學家、Ai Application Engineer,Machine Learning Engineer,Deep Learning Engineer,Data Scientist
Dalam satu bulan
Python
Qt
Git
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
元智大學 Yuan Ze University
工業工程與管理學系所
Avatar of Nelson Chen.
Avatar of Nelson Chen.
Senior engineer @Chicony Electronics Co, Ltd.
2018 ~ Sekarang
全端工程師、後端工程師、前端工程師、軟體專案主管、AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Dalam satu bulan
Nelson Chen Senior engineer Dedicated Software Engineer with 6+ Years of Experience Senior software engineer specializing in web page development and deep learning. Proficient with machine learning technologies, such as TensorFlow, Numpy, etc. Experience Senior engineer • Chicony Electronics Co, Ltd. .Build an Auto-Encoder AI model for defective detection. .Build an object detection model for detecting car types. .Developed a Front-End and Back-End website for data analysis. .Manage the production process and make it automated production. NovPresent Software engineer • Teco image systems co. ltd .Developed and maintained MFP driver
Python
C
C++
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
6-10 tahun
National Taiwan Ocean University
Computer science and engineering
Avatar of 陳勤霖.
Avatar of 陳勤霖.
Past
博士後研究員 @洛桑大學神經發育疾病實驗室
2023 ~ 2023
Data Scientist, Data Analyst, Machine Learning Engineer
Dalam satu bulan
1. 神經電生理訊號分析、神經細胞追蹤分析,與藥理試驗。 2. 研究論文撰寫與國際研討會的舉辦。 技能 Data Science Data Analysis, Image Analysis, Machine Learning, Deep Learning, Statistical Analysis, Data visualization Programming Python, PyTorch, NumPy, Pandas, Matplotlib, Scikit-Learn, Git, PostgreSQL, Docker Biotechnology Neuroscience, Genetics, Imaging, Scientific Writing Soft skill Project Management, Probelm Solving, Team Player, Proactive Communication 語言 English — 專業 Chinese — 母語或雙語 French — 初階 學歷
Data Science
Data Analysis
Machine Learning
Tidak bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
洛桑聯邦理工學院(EPFL)
神經科學
Avatar of Wang Chunshan.
Avatar of Wang Chunshan.
Data Engineer @TSMC 台積電
2022 ~ Sekarang
資料分析師、演算法工程師、軟體工程師、軟體專案管理
Dalam satu bulan
Chun Shan, Wang [email protected] SUMMARY I'm a skilled software engineer, experienced in NLP and Data Engineering for over 4 years. I've delivered dependable solutions across commercial, educational, and psychological counseling domains. Expertise lies in deploying stable systems, ensuring valuable and trustworthy development. My background seamlessly integrates data and machine learning for comprehensive solutions. KEYWORDS: Python, NLP/NLU, Backend, Data, CI/CD, kubernetes, JAVA Spring, EXPERIENCE Data Engineer,now, TSMC I Build and improved the Python/JAVA services, including caching service with mongoDB and Redis, monitoring
Backend Development
NLP
Python
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
國立中央大學 National Central University
網路學習科技研究所
Avatar of Patrick Hsu.
Avatar of Patrick Hsu.
Algorithm Research & Development @適着三維科技股份有限公司 TG3D Studio Inc.
2021 ~ Sekarang
Software Engineer
Dalam satu bulan
Patrick Hsu AI Research & Development As a seasoned AI engineer with six years of experience, I specialize in computer vision, 3D body model reconstruction, generative AI, and possessing some knowledge in natural language processing (NLP). | New Taipei City, [email protected] Work Experience (6 years) Algorithm Research & Design• TG3D Studio MayPresent A skilled engineer specialized in computer vision and generative AI with experience in developing and training AI models for digital fashion applications. Body AI: Virtual Try On Integrated cutting-edge technologies such as Stable Diffusion, ControlNet, and Prompt Engineering to create a sophisticated system for
Python
AI & Machine Learning
Image Processing
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
國立台灣大學
生物產業機電工程所
Avatar of 陶俊良.
Avatar of 陶俊良.
資料分析師 Data Analyst @Portto 門戶科技| Blocto
2022 ~ 2024
Data Analyst、Data Engineer、Data Scientist、Customer Experience Analyst
Dalam satu bulan
陶俊良 (Tao,Chun-Liang) Taipei, Taiwan Email: [email protected] Phone:I am very sensitive to data and enjoy finding inspiration and ideas from them. I am proficient in machine learning, text analysis, and recommendation systems, EVM blockchain analytics, and currently use Python as my primary programming languages. I am always open to learning new things, such as learning new data structure from blockchain. I am currently very interested in blockchain data and on-chain user segamentation. I was working in digital media, advertising (DSP, SSP, DMP platforms), gaming user analyst, blockchain
python
R
MySQL
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
臺灣大學
流行病學與預防醫學所 生物統計組
Avatar of 王文祥.
Avatar of 王文祥.
經理 @鴻博資訊有限公司
2015 ~ Sekarang
軟體工程師、電玩程式設計師、後端工程師、APP開發工程師、演算法開發工程師
Dalam satu bulan
such as OpenCV, Scikit-Image, scikit-learn, NumPy, Matplotlib, PyQt5, etc., to implement various functionalities. Additionally, I possess the ability to develop mobile applications using Django and React Native. I am proficient in using testing frameworks, GitHub for version control, and Docker for deployment. . Machine Learning and Deep Learning Here is a summary of the relevant technologies in the field of machine learning that I have researched and become familiar with over the past year: Numpy (Numerical Computing Library) : Numpy is one of the core libraries for numerical computing in Python. It provides powerful
Python
AOI
MES
Sudah bekerja
Siap untuk wawancara
Full-time / Hanya bekerja jarak jauh
10-15 tahun
崑工科技大學
電子工程

Paket Perekrutan Paling Mudah dan Efektif, Pilihan Ratusan Perusahaan

Cari lebih dari 800 ribu CV dan ambil aksi menghubungi pelamar kerja untuk rekrutmen yang lebih efektif. Pilihan ratusan perusahaan.

  • Lihat semua hasil pencarian
  • Tanpa batas harian untuk memulai pesan baru
  • CV dapat diakses oleh perusahaan berbayar
  • Lihat email pengguna & nomor telepon
Tips pencarian
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
Hanya CV publik yang tersedia dengan paket gratis.
Upgrade ke paket lanjutan untuk melihat semua hasil pencarian, termasuk 10.000 lebih CV eksklusif di Cake Resume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and is willing to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Lebih dari satu tahun
AI & Embedded Systems Consultant @ Self Employed
Self Employed
2021 ~ Sekarang
Ahmedabad, Gujarat, India
Latar Belakang Profesional
Status sekarang
Sudah bekerja
Tahap pencarian kerja
Profesi
Lainnya
Bidang Pekerjaan
Software
Pengalaman Kerja
4-6 tahun
Management
Keterampilan
Deep Learning
machine learning
aws
Google cloud
Docker
Networking
Bahasa
English
Profesional
Preferensi Pencarian Pekerjaan
Jabatan
Deep Learning Engineer
Tipe Pekerjaan
Full-time
Lokasi
Pune, Maharashtra, India
Bekerja jarak jauh
Tertarik bekerja jarak jauh
Freelance
Tidak
Pendidikan
Institusi Pendidikan
Charotar University of Science & Technology
Jurusan
Electronics & Communication
Cetak

Kishan Gondaliya

Experienced embedded software engineer working on Embedded Systems and Deep Learning to enable vision and voice-based machine learning algorithms on low-power FPGA and edge embedded devices. ~8 years of experience consists in writing, debugging, and optimizing software/firmware for embedded devices.

+91 9409 24 93 94
[email protected]   Ahmedabad, Gujarat, India      

Skillset

Languages:

Frameworks:

Dev Tools:

HW Platform:

Cloud (GCP):

Cloud (AWS):

Other:


C, Python, C++

Tensorflow (TFlite, TFmicro), Keras, Caffe, Darknet

Anaconda, Git, Gerrit, Perforce, Pycharm, CVS, Jira, Confluence

Google Coral TPU, Lattice ECP5, U+, Crosslin-NX FPGA, Raspberry Pi, Intel Movidius, NVIDIA GPU

Compute Engine, App Engine, Vision API, Auto-ML, Container Registry, Kubernetes Engine

Sagemaker, DeepLens, Lambda, Rekognition API, Reko API custom labels

Docker, OpenCV, Machine Learning, Deep Learning, Computer Vision, Convolution Neural Nets (CNN), LSTM, Networking, Model Optimization, Quantization, Pruning, Linux Kernel, OpenWRT

Work Experience

Work Experience

AI & Embedded Systems Consultant

Self-Employed  •  February 2021 - Present

Working with companies to blend AI with embedded systems specifically to enable AI on edge devices, including the device ecosystem.

Staff Engineer

Softnautics  •  September 2016 - February 2021

  • Architectured a Dockerized ML training framework and led the team for bug-free releases
  • Led Machine Learning COE team and completed 9+ projects successfully based on edge devices and cloud services
  • Worked on different DL model architectures and customized them for small footprint edge FPGA devices with techniques like quantization and pruning
  • Worked on OpenWRT firmware customization for mobility solution, network utilization monitoring and controlling

Associate Engineer

Sibridge Technologies  •  May 2015 - August 2016

  • Worked as a developer in critical 32-bit Tensile core based audio processor firmware development
  • Implemented multi-radio feature for mesh networks in the Linux kernel and improved HWMP to get a 7% throughput increment
  • Contributed to several projects as an individual contributor

Projects

Omnivision Camera driver for OpenQ2500 platform and DL model integration

  • OpenQ2500 is a wearable SOC designed mainly for small devices like trackers, smart watches, smart eyewear etc.
  • Work involved camera driver development and fine-tuning the camera with parameters that can be changed from user space.
  • Later with a camera feed, DL model was developed to identify multiple custom objects based on wearable application of the client

Linux Driver for I2S on iMX8

  • Work involved developing an I2S driver to stream audio from/to the DSP core
  • Controlling parameters of of audio stream were controlled through I2C bus and part of driver work

Microchip WLSom1 WiFi support

  • Driver porting, specifically backporting, was done for Microchip's WLSOM1 target chip SAMA5D27 for OpenWRT operating system

802.11s mesh network for 802.11ac radios with multi-radio multi-channel support

  • The IEEE 802.11s Mesh standard has defined Hybrid Wireless Mesh Protocol (HWMP) as the default routing protocol and Airtime Link

    metric (ALM) as the default metric for path selection.

  • The project involves enhancing the existing HWMP routing protocol for more efficient working in different environmental conditions and considering other important wireless parameters other than ALM in link cost calculation for better path selection.

  • Add support for multiple Mesh Points with different channels MIMC (Multi Mesh Interface Multi Channel) for better n/w connectivity and performance by avoiding issues of interference due to the same channel in SISC (Single Mesh Interface Single Channel).

  • Define both user interfaces of command line and GUI for individual
    and central management of the Mesh network

  • All implementations are on the Linux-based open source code of 802.11s

  • Development includes understanding of mac80211, nl80211, and cfg80211 drivers as well as utilities like iw, iwconfig, ifconfig, and iwlist.

  • Integrate power-saving mechanism for multi-radio support in
    Linux kernel.

Audio processor firmware development for Tensilica-based DSP

  • This project was about the maintenance of voice processor firmware, which included bug fixing, feature enhancement, and functional testing.
  • The voice processor is based on a customized 32-bit Tensilica core running a single-threaded custom OS, which has various IO peripherals like I2C, PDM, I2S/PCM, SLIMBus etc

Dockerized ML training framework

  • Containerized Machine learning training framework by which users can create, train, debug and freeze the ML model
  • Architect whole framework from scratch and created plug and use components
  • Generated various docker images for the different training environments
  • Added generic base code component along with a detector which can support any object detection or classification model architecture
  • Enabled automated data augmentation, splitting, and performance matrix generation

Neural Network compiler development

  • Development/Enhancement of Neural network compiler tool written in Python for FPGA manufacturers
  • Tool code optimization for 2x speed of simulation
  • Dynamic fixed-point calculations implementation
  • Development of a part of a tool that handles debugging hardware through USB by reading and writing DRAM by doing bulk & control transfer
  • On top of the UMDF driver for windows and libusb for linux, wrapper library was developed.

Shoulder Surfing detection

  • Manually annotated OID v6 dataset of person class images with front and non-front looking classes
  • Automated class distribution and augmentation flow using python scripts
  • Customized SqeezeDet network architecture to fit into the small footprint of Lattice iCE40 FPGA
  • Developed C# windows GUI to communicate with FPGA through UART com port to display input images to the CNN engine and detection results

Intelligent parking slot allocation system

  • CNRPark-2 used as the base dataset
  • Used AWS rekognition custom label service at the POC stage
  • Automated pipeline on AWS to trigger training when a new dataset is added to the S3 bucket
  • Trained 2 different models due to available dataset, first to detect parking slots, second to detect if it is free or busy 
  • Generated dataset with augmentation operations like to fake weather conditions
  • Designed final model to accommodate both functionality and trained with custom dataset

Human Counting on low power FPGA

  • Developed human counting optimised model for FPGAs like Lattice ECP5, Crosslink-NX, Crosslink-NX Voice & Vision, iCE40
  • Customised training code based on SqueezeDet detector which can accommodate architectures like VGG, MobileNet V1 & V2, ResNet etc
  • Quantization and model pruning

Keyphrase detection

  • Develop a CNN that can recognize a keyword from its audio spectrum that runs on Lattice iCE40 FPGA.
  • Added support in NN compiler to generate filter binary to convert audio data into image like data
  • Audio data augmentation

Face Recognition

  • Developed face recognition model compatible with Lattice ECP5 FPGA

  • Cleaned VGGFace2 with the help of dlib to remove images that could confuse our network

  • The trained model with the VGGFace2 dataset and custom-added images to give a 128 feature map that can be used to recognize a person’s face

Analog gauge reader

  • Design a system for an industrial analog gauge reading
  • Synthetic dataset generation & augmentation for different gauges
  • Train model with Google AutoML and use TFLite model with Google Coral stick as POC
  • Design a custom VGG type model for speed and performance optimization with quantization techniques

Gesture Recognition

  • Lattice iCE40 FPGA with IR transmitter-based solution
  • Configured camera for enhanced IR sensitivity in RTL to mimic IR sensor-based input
  • Generated dataset by capturing actual images from the hardware itself for better accuracy and performance. Developed C# Windows app
  • Customized SqeezeDet network architecture to fit into the small footprint of Lattice iCE40 FPGA

AWS DeepLens

  • Deployed models based on Face analytics, clothing style detection, logo detection & scene detection
  • Developed lambda function for all the models for inference output processing
  • Developed ML IOT quiz based on pre-trained MobileNet SSD object detection model and node-red based service

POC Projects (Deep Learning)

  • Age & gender detection (Targeted advertisement)
  • Driver distraction alert
  • Face mask detection
  • Social distancing alert
  • Facial expression recognition

Education

Charotar University of Science & Technology

B.Tech (Electronics & Communication)  2011 – 2015

CV
Profil

Kishan Gondaliya

Experienced embedded software engineer working on Embedded Systems and Deep Learning to enable vision and voice-based machine learning algorithms on low-power FPGA and edge embedded devices. ~8 years of experience consists in writing, debugging, and optimizing software/firmware for embedded devices.

+91 9409 24 93 94
[email protected]   Ahmedabad, Gujarat, India      

Skillset

Languages:

Frameworks:

Dev Tools:

HW Platform:

Cloud (GCP):

Cloud (AWS):

Other:


C, Python, C++

Tensorflow (TFlite, TFmicro), Keras, Caffe, Darknet

Anaconda, Git, Gerrit, Perforce, Pycharm, CVS, Jira, Confluence

Google Coral TPU, Lattice ECP5, U+, Crosslin-NX FPGA, Raspberry Pi, Intel Movidius, NVIDIA GPU

Compute Engine, App Engine, Vision API, Auto-ML, Container Registry, Kubernetes Engine

Sagemaker, DeepLens, Lambda, Rekognition API, Reko API custom labels

Docker, OpenCV, Machine Learning, Deep Learning, Computer Vision, Convolution Neural Nets (CNN), LSTM, Networking, Model Optimization, Quantization, Pruning, Linux Kernel, OpenWRT

Work Experience

Work Experience

AI & Embedded Systems Consultant

Self-Employed  •  February 2021 - Present

Working with companies to blend AI with embedded systems specifically to enable AI on edge devices, including the device ecosystem.

Staff Engineer

Softnautics  •  September 2016 - February 2021

  • Architectured a Dockerized ML training framework and led the team for bug-free releases
  • Led Machine Learning COE team and completed 9+ projects successfully based on edge devices and cloud services
  • Worked on different DL model architectures and customized them for small footprint edge FPGA devices with techniques like quantization and pruning
  • Worked on OpenWRT firmware customization for mobility solution, network utilization monitoring and controlling

Associate Engineer

Sibridge Technologies  •  May 2015 - August 2016

  • Worked as a developer in critical 32-bit Tensile core based audio processor firmware development
  • Implemented multi-radio feature for mesh networks in the Linux kernel and improved HWMP to get a 7% throughput increment
  • Contributed to several projects as an individual contributor

Projects

Omnivision Camera driver for OpenQ2500 platform and DL model integration

  • OpenQ2500 is a wearable SOC designed mainly for small devices like trackers, smart watches, smart eyewear etc.
  • Work involved camera driver development and fine-tuning the camera with parameters that can be changed from user space.
  • Later with a camera feed, DL model was developed to identify multiple custom objects based on wearable application of the client

Linux Driver for I2S on iMX8

  • Work involved developing an I2S driver to stream audio from/to the DSP core
  • Controlling parameters of of audio stream were controlled through I2C bus and part of driver work

Microchip WLSom1 WiFi support

  • Driver porting, specifically backporting, was done for Microchip's WLSOM1 target chip SAMA5D27 for OpenWRT operating system

802.11s mesh network for 802.11ac radios with multi-radio multi-channel support

  • The IEEE 802.11s Mesh standard has defined Hybrid Wireless Mesh Protocol (HWMP) as the default routing protocol and Airtime Link

    metric (ALM) as the default metric for path selection.

  • The project involves enhancing the existing HWMP routing protocol for more efficient working in different environmental conditions and considering other important wireless parameters other than ALM in link cost calculation for better path selection.

  • Add support for multiple Mesh Points with different channels MIMC (Multi Mesh Interface Multi Channel) for better n/w connectivity and performance by avoiding issues of interference due to the same channel in SISC (Single Mesh Interface Single Channel).

  • Define both user interfaces of command line and GUI for individual
    and central management of the Mesh network

  • All implementations are on the Linux-based open source code of 802.11s

  • Development includes understanding of mac80211, nl80211, and cfg80211 drivers as well as utilities like iw, iwconfig, ifconfig, and iwlist.

  • Integrate power-saving mechanism for multi-radio support in
    Linux kernel.

Audio processor firmware development for Tensilica-based DSP

  • This project was about the maintenance of voice processor firmware, which included bug fixing, feature enhancement, and functional testing.
  • The voice processor is based on a customized 32-bit Tensilica core running a single-threaded custom OS, which has various IO peripherals like I2C, PDM, I2S/PCM, SLIMBus etc

Dockerized ML training framework

  • Containerized Machine learning training framework by which users can create, train, debug and freeze the ML model
  • Architect whole framework from scratch and created plug and use components
  • Generated various docker images for the different training environments
  • Added generic base code component along with a detector which can support any object detection or classification model architecture
  • Enabled automated data augmentation, splitting, and performance matrix generation

Neural Network compiler development

  • Development/Enhancement of Neural network compiler tool written in Python for FPGA manufacturers
  • Tool code optimization for 2x speed of simulation
  • Dynamic fixed-point calculations implementation
  • Development of a part of a tool that handles debugging hardware through USB by reading and writing DRAM by doing bulk & control transfer
  • On top of the UMDF driver for windows and libusb for linux, wrapper library was developed.

Shoulder Surfing detection

  • Manually annotated OID v6 dataset of person class images with front and non-front looking classes
  • Automated class distribution and augmentation flow using python scripts
  • Customized SqeezeDet network architecture to fit into the small footprint of Lattice iCE40 FPGA
  • Developed C# windows GUI to communicate with FPGA through UART com port to display input images to the CNN engine and detection results

Intelligent parking slot allocation system

  • CNRPark-2 used as the base dataset
  • Used AWS rekognition custom label service at the POC stage
  • Automated pipeline on AWS to trigger training when a new dataset is added to the S3 bucket
  • Trained 2 different models due to available dataset, first to detect parking slots, second to detect if it is free or busy 
  • Generated dataset with augmentation operations like to fake weather conditions
  • Designed final model to accommodate both functionality and trained with custom dataset

Human Counting on low power FPGA

  • Developed human counting optimised model for FPGAs like Lattice ECP5, Crosslink-NX, Crosslink-NX Voice & Vision, iCE40
  • Customised training code based on SqueezeDet detector which can accommodate architectures like VGG, MobileNet V1 & V2, ResNet etc
  • Quantization and model pruning

Keyphrase detection

  • Develop a CNN that can recognize a keyword from its audio spectrum that runs on Lattice iCE40 FPGA.
  • Added support in NN compiler to generate filter binary to convert audio data into image like data
  • Audio data augmentation

Face Recognition

  • Developed face recognition model compatible with Lattice ECP5 FPGA

  • Cleaned VGGFace2 with the help of dlib to remove images that could confuse our network

  • The trained model with the VGGFace2 dataset and custom-added images to give a 128 feature map that can be used to recognize a person’s face

Analog gauge reader

  • Design a system for an industrial analog gauge reading
  • Synthetic dataset generation & augmentation for different gauges
  • Train model with Google AutoML and use TFLite model with Google Coral stick as POC
  • Design a custom VGG type model for speed and performance optimization with quantization techniques

Gesture Recognition

  • Lattice iCE40 FPGA with IR transmitter-based solution
  • Configured camera for enhanced IR sensitivity in RTL to mimic IR sensor-based input
  • Generated dataset by capturing actual images from the hardware itself for better accuracy and performance. Developed C# Windows app
  • Customized SqeezeDet network architecture to fit into the small footprint of Lattice iCE40 FPGA

AWS DeepLens

  • Deployed models based on Face analytics, clothing style detection, logo detection & scene detection
  • Developed lambda function for all the models for inference output processing
  • Developed ML IOT quiz based on pre-trained MobileNet SSD object detection model and node-red based service

POC Projects (Deep Learning)

  • Age & gender detection (Targeted advertisement)
  • Driver distraction alert
  • Face mask detection
  • Social distancing alert
  • Facial expression recognition

Education

Charotar University of Science & Technology

B.Tech (Electronics & Communication)  2011 – 2015