Yogi Eye: Building a MedTech Solution for Ayurvedic Plants Recognition

Recently my college organised an inner hackathon to select teams for SIH (Smart India Hackathon). The motto of the hackathon was that each team would solve a problem from the problems mentioned in SIH's problem list and would build a solution to it in in 24-hour hack period. My team selected a problem from the Ministry of Ayush which focused on building a MedTech to identify different medical plants/raw materials used in Ayurvedic medicines/studies

Yogi eye

The idea we came up with was to build a backend API that would have a web client, as well as a mobile client (in the future when we build for SIH), connecting to it. API will have pre-trained weights trained on a custom dataset of Ayurvedic plants and a prediction route containing a prediction function. Also, we wanted to make it a Wikipedia for Ayurvedic plants containing all kinds of information about that plant/ material becasuse as we know our Indian culture is very enriched with Ayurvedic knowledge but most of it is going to be lost in the upcoming generation. So the idea was to have an expert as well as local people add info about that material and we would have some certification to both so each material would have its page divided into two parts expert part and local people theory on how to use that material. so once you upload the image, the API will identify which material it is then you will be directed to its "Yogi Wiki ". There will also be a forum where experts will be able to create posts and have discussions over them (posts will have two categories experts only (only experts can take part in this post) and a community post.

TECH STACK

BACKEND API: We chose FastAPI because I think it is a better alternative to Flask
DATABASE: POSTGRESQL & PgAdmin is just a personal choice also relational database made sense to our problem rather than a nonrelational
ORM: SQLAlchemy best choice in my opinion if you are using a SQL database and want your app to be SQL injection proof
ML ENGINE: PyTorch
ML MODEL: Efficient Net V_1 (pre-trained) It is a state-of-the-art model for image classification based on mobile net having 12M params
WEB CLIENT: React
APP CLIENT: Swift(IOS Version) & ReactNative for Android
DEPLOYMENT: deployed on AWS using docker container and ec2 instance

We also used Alembic to create automatic migration with the help of the Docker command. Axios to link React frontend to FastAPI backend. Roboflow for processing and hosting the dataset for training.

PROBLEMS FACED BY US BUILDING YOGI EYE

Unavailability of datasets for Ayurvedic plants especially Indian Ayurvedic Plants :
I read 8 to 9 research papers preparing for this hack most of them had data on how to design the CNN or which pre-trained model produces the best result in the case of plants but none of them had a link to a dataset they used. I also emailed a few researchers (some I know some I didn't) but either didn't have the dataset or they didn't reply. In the end, I had to make my own custom dataset, so I did what any normal person would do. I downloaded It from Google using this Chrome extension called Download All Images, but pre-processing each image one by one was impossible so I wrote some scripts to do it but couldn't achieve the required level I was expecting, then I stumbled upon this amazing website Robflow not only it preprocessed the image (Augmentation, segmentation, adding noise, Inverasion), split the dataset into Test/Validation/Train according to custom ratio given by me and hosted it and made it available using a simple API call.
DOCKER:
Before this project, I had never used docker (deep regret) and none of my teammates knew either, So I as the team leader had to learn it. Damn, it's the most high-tech stuff I have learned in recent years. It's so awesome that it looks magical when it works. Implementing docker first time was a very tedious task as our application was almost a production prototype (annh maybe). I will explain it in depth in my future blogs (#follow lol).
Preprocessing the uploaded image for the prediction function :
The image uploaded through the API route travels to backend in bytes but we need the image as ( 3, 256, 256 ) tensor for prediction function. So the code turned out to be like this :-

https://gist.github.com/ce272c8b723aae90dcb7d1f49e57c36d

The prediction route looked like this in - - >

https://gist.github.com/ce7be37ef06c271b617994fe4cdfbdb3

We had a few more issues but most of them got resolved pretty easily like none us knew how to link FastAPI to our React client app since I'm not good with frontend and was also busy deploying the docker container so one of team actually did small project tutorial in the last 3hr remaining and made it work. This is the most thrilling process I enjoy about Hackthons is where people learn things in such a short time that it would have taken atleast few weeks but was done in a few hours with practical confidence of applying it.

Repo for the done project is at Yogi-v1 free to clone it or contribute to it as my college will most probably not select my team, Open for any developer advice on it.

Zero - Docker - Hackthon

Table of contents

TECH STACK

PROBLEMS FACED BY US BUILDING YOGI EYE

Feel free reach me at any of my handels