Hi, I'm Sujit Soni

AI/ML Engineer with 1+ years of experience in Generative AI, LLMs, Computer Vision, and Data Engineering. Passionate about solving real-world technical challenges.

Introduction

Overview.

AIML Engineer with 1+ years of experience in Generative AI, LLMs API Integration, Data Engineering, Computer Vision, Python Web Development, and Google Apps Script automations.


Passionate about achieving deep domain expertise in Artificial Intelligence and Machine Learning by continuously adapting to emerging technologies and mastering advanced concepts to solve real-world technical challenges that create measurable impact.

Full Stack Developer

AI/ML Engineer

Data Analyst

Problem Solver

What I Work With

Skills & Technologies.

Programming Languages

Python JavaScript SQL

AI/ML & Computer Vision

YOLO MediaPipe Scikit-learn PyTorch TensorFlow OpenCV Ollama Gemini AI Studio

Web & Tools

Flask Pandas Playwright N8N Tesseract OCR Git & GitHub cPanel

Concepts

Generative AI RAG Prompt Engineering Web Scraping OCR

My Work

Featured Projects.

Database Chatbot

AI-powered database chatbot using Flask and Google Gemini API that translates natural language queries into executable MongoDB queries and returns structured results in Excel/JSON formats.

Google Gemini API Flask MongoDB
View Project

Multilingual Chatbot

Multilingual document Q&A chatbot using Flask, Tesseract OCR, and Gemma LLM (via Ollama), with hybrid RAG retrieval powered by LaBSE embeddings and FAISS vector search.

Tesseract OCR Ollama FAISS
View Project

Emergency Call Priority System

Flask-based app that processes emergency audio using Whisper for transcription, Pyannote for speaker diarization, and a fine-tuned Gemma LLM for automated urgency assessment (0-5 scale).

PyTorch Whisper Hugging Face
View Project

My Expertise

AI/ML Lab.

Python
YOLO
MediaPipe
PyTorch
Gemini AI
Ollama

Machine Learning

Building predictive models using supervised and unsupervised learning techniques for real-world applications.

Regression Classification Clustering Random Forest

Deep Learning

Designing neural network architectures for complex pattern recognition and feature extraction tasks.

CNN RNN LSTM Transfer Learning

NLP

Processing and analyzing text data for sentiment analysis, text classification, and language understanding.

NLTK Transformers TF-IDF Word2Vec

Computer Vision

Image processing and object detection using advanced computer vision algorithms and neural networks.

OpenCV YOLO Image Segmentation

Knowledge Sharing

Latest Insights.

Dec 2024 5 min read

Building a Movie Recommendation System from Scratch

A deep dive into content-based filtering using cosine similarity and CountVectorizer for personalized recommendations.

Machine Learning Python
Read Article
Nov 2024 7 min read

NLP in Action: Detecting Spam with Machine Learning

Exploring text preprocessing, TF-IDF vectorization, and classifier fine-tuning for spam detection systems.

NLP NLTK
Read Article
Oct 2024 6 min read

Exploratory Data Analysis: Best Practices for ML Projects

Essential EDA techniques using Pandas, Matplotlib, and Seaborn to uncover patterns before model training.

Data Science EDA
Read Article

What I've Done

Work Experience.

AI/ML Engineer

Capermint Technologies | Iskcon, Ahmedabad

March 2025 - Present
  • Developed "Airzoy," a computer vision-based gaming product utilizing MediaPipe/YOLO to enable real-time gameplay through body gesture recognition.
  • Engineered a "Text-to-Database" chatbot for a Ludo fantasy game, enabling non-technical stakeholders to query MongoDB analytics using plain English.
  • Architected data pipelines for "Brokerzoy," a real estate Flutter app, ensuring accurate aggregation of residential and commercial property data.
  • Managed end-to-end data engineering lifecycle for "Inside the Jersey," scraping and structuring coach data to connect American athletes with recruitment opportunities.

Technologies: Python, MediaPipe, YOLO, MongoDB, Flask, Gemini API, Playwright

SDE Intern

BMV System Integration | Vastrapur, Ahmedabad

Nov 2024 - March 2025
  • Designed and deployed Google Apps Script automations to streamline complex workflows for high-profile media clients (e.g., Dhar Mann Studio).
  • Built Python-based web solutions, including a Multilingual ChatPDF tool using RAG architecture.
  • Developed robust web scrapers to support internal data needs and integrated various Google Workspace tools.

Technologies: Python, Google Apps Script, RAG, Flask, Web Scraping

My Education

Academic Background.

B.Tech in Computer Science Engineering

LJ University, Sarkhej, Ahmedabad

2022 - 2026

Major Coursework: Python, AI/ML, Java, Database, DevOps, DSA, OS, Web Development

Average CGPA: 7.18/10

Higher Secondary Education

Swaminarayan English School, New Ranip, Ahmedabad

2020 - 2022

HSC Class 12th (2022):

81.51%

SSC Class 10th (2020):

80.33%

Certifications

Get In Touch

Contact Me.

Let's Connect!

I'm always open to new opportunities and collaborations.