CakeResume Talent Search

Advanced filters
On
4-6 tahun
6-10 tahun
10-15 tahun
Lebih dari 15 tahun
Avatar of Patrick Hsu.
Avatar of Patrick Hsu.
Algorithm Research & Development @適着三維科技股份有限公司 TG3D Studio Inc.
2021 ~ Sekarang
Software Engineer
Dalam satu bulan
level with a size error margin of less than 3 cm. Computer Vision Body Reconstruction Stable Diffusion DApp: Exercise Classification Implemented real-time activity recognition for specified exercises, including but not limited to push-ups and squats, enabling accurate tracking and analysis of workout routines. Skeleton Detection Motion Classification Body AI: 3D Body Data Analysis Engage in a variety of AI side projects focused on leveraging 3D body data, encompassing tasks such as Body ID Recognition, Body Measurements Prediction, and Body Shape Classification. Mesh Data Processing Body Modulization: SMPL Project Engineer • Acer Inc. JanMay 2021 Working
Python
AI & Machine Learning
Image Processing
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
國立台灣大學
生物產業機電工程所
Avatar of the user.
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Dalam satu bulan
Python
R
Natural Language Processing (NLP)
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
國立政治大學(National Chengchi University)
資訊科學系
Avatar of the user.
Avatar of the user.
Data Engineer @TSMC 台積電
2022 ~ Sekarang
資料分析師、演算法工程師、軟體工程師、軟體專案管理
Dalam satu bulan
Backend Development
NLP
Python
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
國立中央大學 National Central University
網路學習科技研究所
Avatar of Chun-Jung Huang.
Avatar of Chun-Jung Huang.
OPC Chief Engineer @TSMC
2020 ~ Sekarang
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Dalam satu bulan
in distributed computing, optimizing code execution across thousands of systems to significantly improve processing speed and efficiency. ◆Developed sophisticated data visualization tools to distill complex datasets into actionable insights, aiding strategic decision-making. The University of Tokyo, Foreign Researcher (OctSep◆Pioneered a neural network-based approach for cell image classification and data visualization, enhancing lab capabilities in biological research. ◆Designed a user-friendly GUI for neural network model training, democratizing access to advanced computational tools for non-programmers. Education National Chiao-Tung University, Ph.D. - Photonics, 2015 ~ 2020 Development of Intelligent Wearable Near Infrared Spectroscopy
Deep learning with TensorFlow
Translational Research
Clinical Research
Sudah bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
National Chiao-Tung University
Ph.D. - Clinical Engineering
Avatar of Ahmed Yousaf.
Avatar of Ahmed Yousaf.
Past
Electrical Section Head @Sayyed Engineers Limited
2014 ~ 2016
Electrical and Electronics Engineer
Dalam tiga bulan
based Project using SIMATIC SSignal conditioning of data received from the Aircraft Engine MFI-17 Super Mushshak Part Task Trainer (Simulator) Team lead of the Hardware installation and interfacing with soft instruments Using National Instruments NI-Daq 6343 Hardware designing of Trim Panel using Arduino Uno Foreign Object Debris Detection, Classification, and Localization using Deep Learning Python language based Project using YOLO-V7 (you only look once) Addressed the Class imbalance in FOD-A Dataset FOD detection/localization with improved accuracy Data Acquisition design and implementation of a Level-D Simulator (C130-H) NI-PXI Chassis-based DATA ACQUISITION
Microsoft Office
C++
C#
Tidak bekerja
Siap untuk wawancara
Full-time / Tertarik bekerja jarak jauh
6-10 tahun
University of Central Punjab
Electrical and Power Transmission Installation/Installer, General
Avatar of the user.
Avatar of the user.
設備專案工程師 @日月光半導體製造股份有限公司 ADVANCED SEMICONDUCTOR ENGINEERING, INC.
2020 ~ Sekarang
工程師
Dalam tiga bulan
Equipment repair and maintenance
Wire bonding process
Python
Sudah bekerja
Terbuka untuk peluang
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
國立高雄科技大學 National Kaohsiung University of Science and Technology
電腦與通訊工程系
Avatar of chiyun chao.
Avatar of chiyun chao.
Research & Development Engineer @三竹資訊股份有限公司
2023 ~ Sekarang
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Dalam satu bulan
and deployed it to the environment. Major Product and Project Experience AI training platform backend development with Label Studio/DVC/MLflow Developed training templates. AI-SaaS Implemented NLU tasks using BERT-based models to provide functions such as named entity recognition, entity relation extraction, and article classification . Implemented NLG tasks using T5 to provide article summarization functionality. NetProbe - DPI Use 1D-CNN and MLP models for encrypted network traffic identification . Use the Autoencoder model for unknown packet detection . Taiwan Ministry of the Interior Investigation Bureau Police Station - AI Crime Investigation Trace Analysis
Python
JAVA
Linux
Sudah bekerja
Terbuka untuk peluang
Full-time / Tidak tertarik bekerja jarak jauh
4-6 tahun
國立中央大學 National Central University
資訊工程
Avatar of 陳惠龍.
Avatar of 陳惠龍.
Data science lecturer @Ittraining
2020 ~ Sekarang
Data Scientist 資料科學家_數據分析師
Dalam satu bulan
氣道擴散偵測競賽 I:運用物體偵測作法於找尋STAS, 2022/06/02 - Bronze medal (team): (Kaggle) VinBigData Chest X-ray Abnormalities Detection: Automatically localize and classify thoracic abnormalities from chest radiographs, 2021/03/30 Classification (影像分類): - Bronze medal (solo): (Kaggle) Human Protein Atlas - Single Cell Classification: Find individual human cell differences in microscope images, 2021/05/12 - 5th place (team): (Aidea AI CUP) Mango grade classification, 2020/12/29. https://reurl.cc/
nlp-rasa
recommender system
pytorch tensorflow
Sudah bekerja
Terbuka untuk peluang
Paruh waktu / Tertarik bekerja jarak jauh
Lebih dari 15 tahun
Purdue University
School of civil engineering (Stochastic & statistical hydrology)
Avatar of Alex Yu.
Avatar of Alex Yu.
Product Manager @Linker Vision
2023 ~ Sekarang
PM/產品經理/專案管理
Dalam satu bulan
detection, segmentation, and classification AI scenario. Good communication skills with doctors' demands and collaboration with colleagues. Patent Disclosure: Ultrasound detect and notify system. (serial number: I學歷 SepJun 2 National Taiwan University of Science and Technology Masters in Electrical Engineering Thesis "Online Data Stream Analytics for Dynamic Environments Using Self-Regularized Learning Framework", IEEE journal SepJun 2020 Yuan Ze University Bachelor in Electrical Engineering Skills Customer/VC negotiation and customer services DL/ML/AI algorithm, keen problem solving 3D modeling (Blender) Python, Matlab, Tensorflow, Keras Object detection, Classification, [email protected]
Business Development
Deep Learning
PYTHON
Sudah bekerja
Terbuka untuk peluang
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
國立台灣科技大學 National Taiwan University of Science and Technology
電機工程
Avatar of Chin Ya Chang.
Avatar of Chin Ya Chang.
Senior Software Engineer @International Integrated Systems, Inc.(IISI)
2020 ~ Sekarang
AI工程師、機器學習工程師、深度學習工程師、資料科學家、Machine Learning Engineer、Deep Learning Engineer、Data Scientist
Dalam satu bulan
modules:Resblock, GhostBottleNeck, SE-layer, DarkBlock. Used Attention mechanism:StripPooling, MixedPoolingModule, SelectiveKernel . According to the input data, use convolutional layers of different dimensions (1D~3D) to learn information. Used weight standardization to assign weights to improve model training effect. Built a composite model of regression and classification. AI development environment management Used docker or Anaconda to establish and maintain the development environment with GPU. Set up the environment to use the LLM (LLAMA2, Taiwan-LLaMa, Codellama, Llama2-chinese-13b, etc.) Model Training and Tuning Tips Adjusted the data batch size according to the
Python
PyTorch
Machine Learning
Sudah bekerja
Terbuka untuk peluang
Full-time / Tertarik bekerja jarak jauh
4-6 tahun
私立中原大學 Chung Yuan Christian University
環境工程

Paket Perekrutan Paling Mudah dan Efektif, Pilihan Ratusan Perusahaan

Cari lebih dari 800 ribu CV dan ambil aksi menghubungi pelamar kerja untuk rekrutmen yang lebih efektif. Pilihan ratusan perusahaan.

  • Lihat semua hasil pencarian
  • Tanpa batas harian untuk memulai pesan baru
  • CV dapat diakses oleh perusahaan berbayar
  • Lihat email pengguna & nomor telepon
Tips pencarian
1
Search a precise keyword combination
senior backend php
If the number of the search result is not enough, you can remove the less important keywords
2
Use quotes to search for an exact phrase
"business development"
3
Use the minus sign to eliminate results containing certain words
UI designer -UX
Hanya CV publik yang tersedia dengan paket gratis.
Upgrade ke paket lanjutan untuk melihat semua hasil pencarian, termasuk 10.000 lebih CV eksklusif di Cake Resume.

Definition of Reputation Credits

Technical Skills
Specialized knowledge and expertise within the profession (e.g. familiar with SEO and use of related tools).
Problem-Solving
Ability to identify, analyze, and prepare solutions to problems.
Adaptability
Ability to navigate unexpected situations; and keep up with shifting priorities, projects, clients, and technology.
Communication
Ability to convey information effectively and is willing to give and receive feedback.
Time Management
Ability to prioritize tasks based on importance; and have them completed within the assigned timeline.
Teamwork
Ability to work cooperatively, communicate effectively, and anticipate each other's demands, resulting in coordinated collective action.
Leadership
Ability to coach, guide, and inspire a team to achieve a shared goal or outcome effectively.
Lebih dari satu tahun
AI & Embedded Systems Consultant @ Self Employed
Self Employed
2021 ~ Sekarang
Ahmedabad, Gujarat, India
Latar Belakang Profesional
Status sekarang
Sudah bekerja
Tahap pencarian kerja
Profesi
Lainnya
Bidang Pekerjaan
Software
Pengalaman Kerja
4-6 tahun
Management
Keterampilan
Deep Learning
machine learning
aws
Google cloud
Docker
Networking
Bahasa
English
Profesional
Preferensi Pencarian Pekerjaan
Jabatan
Deep Learning Engineer
Tipe Pekerjaan
Full-time
Lokasi
Pune, Maharashtra, India
Bekerja jarak jauh
Tertarik bekerja jarak jauh
Freelance
Tidak
Pendidikan
Institusi Pendidikan
Charotar University of Science & Technology
Jurusan
Electronics & Communication
Cetak

Kishan Gondaliya

Experienced embedded software engineer working on Embedded Systems and Deep Learning to enable vision and voice-based machine learning algorithms on low-power FPGA and edge embedded devices. ~8 years of experience consists in writing, debugging, and optimizing software/firmware for embedded devices.

+91 9409 24 93 94
[email protected]   Ahmedabad, Gujarat, India      

Skillset

Languages:

Frameworks:

Dev Tools:

HW Platform:

Cloud (GCP):

Cloud (AWS):

Other:


C, Python, C++

Tensorflow (TFlite, TFmicro), Keras, Caffe, Darknet

Anaconda, Git, Gerrit, Perforce, Pycharm, CVS, Jira, Confluence

Google Coral TPU, Lattice ECP5, U+, Crosslin-NX FPGA, Raspberry Pi, Intel Movidius, NVIDIA GPU

Compute Engine, App Engine, Vision API, Auto-ML, Container Registry, Kubernetes Engine

Sagemaker, DeepLens, Lambda, Rekognition API, Reko API custom labels

Docker, OpenCV, Machine Learning, Deep Learning, Computer Vision, Convolution Neural Nets (CNN), LSTM, Networking, Model Optimization, Quantization, Pruning, Linux Kernel, OpenWRT

Work Experience

Work Experience

AI & Embedded Systems Consultant

Self-Employed  •  February 2021 - Present

Working with companies to blend AI with embedded systems specifically to enable AI on edge devices, including the device ecosystem.

Staff Engineer

Softnautics  •  September 2016 - February 2021

  • Architectured a Dockerized ML training framework and led the team for bug-free releases
  • Led Machine Learning COE team and completed 9+ projects successfully based on edge devices and cloud services
  • Worked on different DL model architectures and customized them for small footprint edge FPGA devices with techniques like quantization and pruning
  • Worked on OpenWRT firmware customization for mobility solution, network utilization monitoring and controlling

Associate Engineer

Sibridge Technologies  •  May 2015 - August 2016

  • Worked as a developer in critical 32-bit Tensile core based audio processor firmware development
  • Implemented multi-radio feature for mesh networks in the Linux kernel and improved HWMP to get a 7% throughput increment
  • Contributed to several projects as an individual contributor

Projects

Omnivision Camera driver for OpenQ2500 platform and DL model integration

  • OpenQ2500 is a wearable SOC designed mainly for small devices like trackers, smart watches, smart eyewear etc.
  • Work involved camera driver development and fine-tuning the camera with parameters that can be changed from user space.
  • Later with a camera feed, DL model was developed to identify multiple custom objects based on wearable application of the client

Linux Driver for I2S on iMX8

  • Work involved developing an I2S driver to stream audio from/to the DSP core
  • Controlling parameters of of audio stream were controlled through I2C bus and part of driver work

Microchip WLSom1 WiFi support

  • Driver porting, specifically backporting, was done for Microchip's WLSOM1 target chip SAMA5D27 for OpenWRT operating system

802.11s mesh network for 802.11ac radios with multi-radio multi-channel support

  • The IEEE 802.11s Mesh standard has defined Hybrid Wireless Mesh Protocol (HWMP) as the default routing protocol and Airtime Link

    metric (ALM) as the default metric for path selection.

  • The project involves enhancing the existing HWMP routing protocol for more efficient working in different environmental conditions and considering other important wireless parameters other than ALM in link cost calculation for better path selection.

  • Add support for multiple Mesh Points with different channels MIMC (Multi Mesh Interface Multi Channel) for better n/w connectivity and performance by avoiding issues of interference due to the same channel in SISC (Single Mesh Interface Single Channel).

  • Define both user interfaces of command line and GUI for individual
    and central management of the Mesh network

  • All implementations are on the Linux-based open source code of 802.11s

  • Development includes understanding of mac80211, nl80211, and cfg80211 drivers as well as utilities like iw, iwconfig, ifconfig, and iwlist.

  • Integrate power-saving mechanism for multi-radio support in
    Linux kernel.

Audio processor firmware development for Tensilica-based DSP

  • This project was about the maintenance of voice processor firmware, which included bug fixing, feature enhancement, and functional testing.
  • The voice processor is based on a customized 32-bit Tensilica core running a single-threaded custom OS, which has various IO peripherals like I2C, PDM, I2S/PCM, SLIMBus etc

Dockerized ML training framework

  • Containerized Machine learning training framework by which users can create, train, debug and freeze the ML model
  • Architect whole framework from scratch and created plug and use components
  • Generated various docker images for the different training environments
  • Added generic base code component along with a detector which can support any object detection or classification model architecture
  • Enabled automated data augmentation, splitting, and performance matrix generation

Neural Network compiler development

  • Development/Enhancement of Neural network compiler tool written in Python for FPGA manufacturers
  • Tool code optimization for 2x speed of simulation
  • Dynamic fixed-point calculations implementation
  • Development of a part of a tool that handles debugging hardware through USB by reading and writing DRAM by doing bulk & control transfer
  • On top of the UMDF driver for windows and libusb for linux, wrapper library was developed.

Shoulder Surfing detection

  • Manually annotated OID v6 dataset of person class images with front and non-front looking classes
  • Automated class distribution and augmentation flow using python scripts
  • Customized SqeezeDet network architecture to fit into the small footprint of Lattice iCE40 FPGA
  • Developed C# windows GUI to communicate with FPGA through UART com port to display input images to the CNN engine and detection results

Intelligent parking slot allocation system

  • CNRPark-2 used as the base dataset
  • Used AWS rekognition custom label service at the POC stage
  • Automated pipeline on AWS to trigger training when a new dataset is added to the S3 bucket
  • Trained 2 different models due to available dataset, first to detect parking slots, second to detect if it is free or busy 
  • Generated dataset with augmentation operations like to fake weather conditions
  • Designed final model to accommodate both functionality and trained with custom dataset

Human Counting on low power FPGA

  • Developed human counting optimised model for FPGAs like Lattice ECP5, Crosslink-NX, Crosslink-NX Voice & Vision, iCE40
  • Customised training code based on SqueezeDet detector which can accommodate architectures like VGG, MobileNet V1 & V2, ResNet etc
  • Quantization and model pruning

Keyphrase detection

  • Develop a CNN that can recognize a keyword from its audio spectrum that runs on Lattice iCE40 FPGA.
  • Added support in NN compiler to generate filter binary to convert audio data into image like data
  • Audio data augmentation

Face Recognition

  • Developed face recognition model compatible with Lattice ECP5 FPGA

  • Cleaned VGGFace2 with the help of dlib to remove images that could confuse our network

  • The trained model with the VGGFace2 dataset and custom-added images to give a 128 feature map that can be used to recognize a person’s face

Analog gauge reader

  • Design a system for an industrial analog gauge reading
  • Synthetic dataset generation & augmentation for different gauges
  • Train model with Google AutoML and use TFLite model with Google Coral stick as POC
  • Design a custom VGG type model for speed and performance optimization with quantization techniques

Gesture Recognition

  • Lattice iCE40 FPGA with IR transmitter-based solution
  • Configured camera for enhanced IR sensitivity in RTL to mimic IR sensor-based input
  • Generated dataset by capturing actual images from the hardware itself for better accuracy and performance. Developed C# Windows app
  • Customized SqeezeDet network architecture to fit into the small footprint of Lattice iCE40 FPGA

AWS DeepLens

  • Deployed models based on Face analytics, clothing style detection, logo detection & scene detection
  • Developed lambda function for all the models for inference output processing
  • Developed ML IOT quiz based on pre-trained MobileNet SSD object detection model and node-red based service

POC Projects (Deep Learning)

  • Age & gender detection (Targeted advertisement)
  • Driver distraction alert
  • Face mask detection
  • Social distancing alert
  • Facial expression recognition

Education

Charotar University of Science & Technology

B.Tech (Electronics & Communication)  2011 – 2015

CV
Profil

Kishan Gondaliya

Experienced embedded software engineer working on Embedded Systems and Deep Learning to enable vision and voice-based machine learning algorithms on low-power FPGA and edge embedded devices. ~8 years of experience consists in writing, debugging, and optimizing software/firmware for embedded devices.

+91 9409 24 93 94
[email protected]   Ahmedabad, Gujarat, India      

Skillset

Languages:

Frameworks:

Dev Tools:

HW Platform:

Cloud (GCP):

Cloud (AWS):

Other:


C, Python, C++

Tensorflow (TFlite, TFmicro), Keras, Caffe, Darknet

Anaconda, Git, Gerrit, Perforce, Pycharm, CVS, Jira, Confluence

Google Coral TPU, Lattice ECP5, U+, Crosslin-NX FPGA, Raspberry Pi, Intel Movidius, NVIDIA GPU

Compute Engine, App Engine, Vision API, Auto-ML, Container Registry, Kubernetes Engine

Sagemaker, DeepLens, Lambda, Rekognition API, Reko API custom labels

Docker, OpenCV, Machine Learning, Deep Learning, Computer Vision, Convolution Neural Nets (CNN), LSTM, Networking, Model Optimization, Quantization, Pruning, Linux Kernel, OpenWRT

Work Experience

Work Experience

AI & Embedded Systems Consultant

Self-Employed  •  February 2021 - Present

Working with companies to blend AI with embedded systems specifically to enable AI on edge devices, including the device ecosystem.

Staff Engineer

Softnautics  •  September 2016 - February 2021

  • Architectured a Dockerized ML training framework and led the team for bug-free releases
  • Led Machine Learning COE team and completed 9+ projects successfully based on edge devices and cloud services
  • Worked on different DL model architectures and customized them for small footprint edge FPGA devices with techniques like quantization and pruning
  • Worked on OpenWRT firmware customization for mobility solution, network utilization monitoring and controlling

Associate Engineer

Sibridge Technologies  •  May 2015 - August 2016

  • Worked as a developer in critical 32-bit Tensile core based audio processor firmware development
  • Implemented multi-radio feature for mesh networks in the Linux kernel and improved HWMP to get a 7% throughput increment
  • Contributed to several projects as an individual contributor

Projects

Omnivision Camera driver for OpenQ2500 platform and DL model integration

  • OpenQ2500 is a wearable SOC designed mainly for small devices like trackers, smart watches, smart eyewear etc.
  • Work involved camera driver development and fine-tuning the camera with parameters that can be changed from user space.
  • Later with a camera feed, DL model was developed to identify multiple custom objects based on wearable application of the client

Linux Driver for I2S on iMX8

  • Work involved developing an I2S driver to stream audio from/to the DSP core
  • Controlling parameters of of audio stream were controlled through I2C bus and part of driver work

Microchip WLSom1 WiFi support

  • Driver porting, specifically backporting, was done for Microchip's WLSOM1 target chip SAMA5D27 for OpenWRT operating system

802.11s mesh network for 802.11ac radios with multi-radio multi-channel support

  • The IEEE 802.11s Mesh standard has defined Hybrid Wireless Mesh Protocol (HWMP) as the default routing protocol and Airtime Link

    metric (ALM) as the default metric for path selection.

  • The project involves enhancing the existing HWMP routing protocol for more efficient working in different environmental conditions and considering other important wireless parameters other than ALM in link cost calculation for better path selection.

  • Add support for multiple Mesh Points with different channels MIMC (Multi Mesh Interface Multi Channel) for better n/w connectivity and performance by avoiding issues of interference due to the same channel in SISC (Single Mesh Interface Single Channel).

  • Define both user interfaces of command line and GUI for individual
    and central management of the Mesh network

  • All implementations are on the Linux-based open source code of 802.11s

  • Development includes understanding of mac80211, nl80211, and cfg80211 drivers as well as utilities like iw, iwconfig, ifconfig, and iwlist.

  • Integrate power-saving mechanism for multi-radio support in
    Linux kernel.

Audio processor firmware development for Tensilica-based DSP

  • This project was about the maintenance of voice processor firmware, which included bug fixing, feature enhancement, and functional testing.
  • The voice processor is based on a customized 32-bit Tensilica core running a single-threaded custom OS, which has various IO peripherals like I2C, PDM, I2S/PCM, SLIMBus etc

Dockerized ML training framework

  • Containerized Machine learning training framework by which users can create, train, debug and freeze the ML model
  • Architect whole framework from scratch and created plug and use components
  • Generated various docker images for the different training environments
  • Added generic base code component along with a detector which can support any object detection or classification model architecture
  • Enabled automated data augmentation, splitting, and performance matrix generation

Neural Network compiler development

  • Development/Enhancement of Neural network compiler tool written in Python for FPGA manufacturers
  • Tool code optimization for 2x speed of simulation
  • Dynamic fixed-point calculations implementation
  • Development of a part of a tool that handles debugging hardware through USB by reading and writing DRAM by doing bulk & control transfer
  • On top of the UMDF driver for windows and libusb for linux, wrapper library was developed.

Shoulder Surfing detection

  • Manually annotated OID v6 dataset of person class images with front and non-front looking classes
  • Automated class distribution and augmentation flow using python scripts
  • Customized SqeezeDet network architecture to fit into the small footprint of Lattice iCE40 FPGA
  • Developed C# windows GUI to communicate with FPGA through UART com port to display input images to the CNN engine and detection results

Intelligent parking slot allocation system

  • CNRPark-2 used as the base dataset
  • Used AWS rekognition custom label service at the POC stage
  • Automated pipeline on AWS to trigger training when a new dataset is added to the S3 bucket
  • Trained 2 different models due to available dataset, first to detect parking slots, second to detect if it is free or busy 
  • Generated dataset with augmentation operations like to fake weather conditions
  • Designed final model to accommodate both functionality and trained with custom dataset

Human Counting on low power FPGA

  • Developed human counting optimised model for FPGAs like Lattice ECP5, Crosslink-NX, Crosslink-NX Voice & Vision, iCE40
  • Customised training code based on SqueezeDet detector which can accommodate architectures like VGG, MobileNet V1 & V2, ResNet etc
  • Quantization and model pruning

Keyphrase detection

  • Develop a CNN that can recognize a keyword from its audio spectrum that runs on Lattice iCE40 FPGA.
  • Added support in NN compiler to generate filter binary to convert audio data into image like data
  • Audio data augmentation

Face Recognition

  • Developed face recognition model compatible with Lattice ECP5 FPGA

  • Cleaned VGGFace2 with the help of dlib to remove images that could confuse our network

  • The trained model with the VGGFace2 dataset and custom-added images to give a 128 feature map that can be used to recognize a person’s face

Analog gauge reader

  • Design a system for an industrial analog gauge reading
  • Synthetic dataset generation & augmentation for different gauges
  • Train model with Google AutoML and use TFLite model with Google Coral stick as POC
  • Design a custom VGG type model for speed and performance optimization with quantization techniques

Gesture Recognition

  • Lattice iCE40 FPGA with IR transmitter-based solution
  • Configured camera for enhanced IR sensitivity in RTL to mimic IR sensor-based input
  • Generated dataset by capturing actual images from the hardware itself for better accuracy and performance. Developed C# Windows app
  • Customized SqeezeDet network architecture to fit into the small footprint of Lattice iCE40 FPGA

AWS DeepLens

  • Deployed models based on Face analytics, clothing style detection, logo detection & scene detection
  • Developed lambda function for all the models for inference output processing
  • Developed ML IOT quiz based on pre-trained MobileNet SSD object detection model and node-red based service

POC Projects (Deep Learning)

  • Age & gender detection (Targeted advertisement)
  • Driver distraction alert
  • Face mask detection
  • Social distancing alert
  • Facial expression recognition

Education

Charotar University of Science & Technology

B.Tech (Electronics & Communication)  2011 – 2015