Abhimanyu Singh - GenAI & Computer Vision Engineer

About Me

Education

M.Tech - Automation & Robotics Engineering

Defence Institute Of Advanced Technology (DIAT), DRDO

CGPA: 8.47

B.Tech - Mechanical Engineering

LNM Institute Of Information Technology

CGPA: 8.16

Specialization

Focused on building systems that combine Generative AI with Computer Vision:

Multimodal AI Architectures
Real-time Inference Systems
Robust Vision Pipelines
End-to-end Data Systems

Professional Summary

Software Engineer & Data Scientist with expertise in:

Designing and implementing data pipelines
Developing conversational AI systems
Building vision-based analytics
Optimizing AI performance on edge devices

Professional Experience

Software Engineer (Gen AI & Data Science)

06/2024 – Present

Alphadroid India Ltd, Noida

Designed data pipelines using FastAPI, SQL, and NoSQL for analytics from autonomous robots
Developed multilingual conversational AI systems with LangChain and Llama 3.2 on AWS
Built vision-based data acquisition app for Beam Suntory using Flask and OpenCV
Implemented facial recognition using MediaPipe and TFLite with efficient vector search

FastAPI, AWS, LangChain, OpenCV, MongoDB

Research Fellow (AI & Computer Vision)

06/2023 – 06/2024

Technology Innovation and Development Foundation, IIT Guwahati

Developed automated data annotation pipelines for underwater surveillance
Created real-time AI training and inference dashboards using FastAPI and D3.js
Optimized end-to-end pipelines with ONNX Runtime and TFLite
Presented research at international conferences on computer vision

Computer Vision, FastAPI, ONNX, TFLite, Data Annotation

Technical Skills

Programming Languages

Python, JavaScript, SQL, Dart, C++, HTML, CSS, NoSQL, C, Shell/Bash

Technologies & Frameworks

FastAPI, Flask, LangChain, OpenCV, Transformers, YOLO, Scikit-learn, Pandas, NumPy, MongoDB, PostgreSQL, AWS EC2/SageMaker, PyTorch, TensorFlow, ONNX Runtime, TFLite, D3.js

Technical Expertise

Computer Vision

Object Detection/Recognition
Image Enhancement
Facial Recognition
Underwater Vision

Generative AI

Multimodal Models
Conversational AI
LLM Integration
RAG Systems

Data Engineering

ETL Pipelines
Real-time Analytics
Data Quality Assurance
Dashboard Creation

Featured Projects

Resume JD Matcher

AI Job Matching Tool | Python, NLTK, LLMs, Streamlit

LangChain and Gemini 2.5 for skill extraction
TF-IDF scoring for candidate-job relevance
Streamlit dashboard with PDF parsing and Plotly visualizations

Python, LLMs, Streamlit, NLP
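
A minimal sketch of how TF-IDF scoring can rank resumes against a job description, assuming a plain-Python implementation with smoothed IDF and cosine similarity. The function names (tokenize, tfidf_vectors, cosine, rank_resumes) are hypothetical and illustrative only; they are not taken from the project's code.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, using smoothed IDF."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1
        vectors.append({
            term: (count / total)
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_resumes(job_description, resumes):
    """Return (resume_index, score) pairs, most relevant first."""
    vectors = tfidf_vectors([job_description] + resumes)
    jd_vec, resume_vecs = vectors[0], vectors[1:]
    scores = [cosine(jd_vec, v) for v in resume_vecs]
    return sorted(enumerate(scores), key=lambda p: p[1], reverse=True)
```

Calling rank_resumes with a job description string and a list of resume strings returns the resume indices ordered by similarity score, which could then feed a dashboard table or chart.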

cryptoTracker

Wallet Dashboard | Flask, LSTM, SQLite, WebSockets

Real-time Ethereum wallet dashboard with Flask
LSTM models for price trend forecasting
WebSocket pipelines for low-latency updates

Flask, LSTM, WebSockets, SQLite

Langchain One

Image-Based Chatbot | GenAI, Tabular Data, OCR

Multimodal GenAI chatbot with LangChain and YOLOv8
EasyOCR for tabular data extraction
Hybrid RAG pipeline with Few-Shot Prompting

LangChain, YOLOv8, OCR, RAG

Publications & Presentations

Article

"The Evolution of Object Detection: YOLO & NAS Revolutionizing Computer Vision"

Published on IndiaAI.gov.in

Object Detection, YOLO, Computer Vision

Conference Presentation

"Subaqueous Crack Detection using Fusion-Based Image Enhancement for Dam Structure"

4th International Conference on River Corridor Research and Management (RCRM) 2024

Underwater Vision, Crack Detection, Image Enhancement

Let's Connect

Have an interesting project or want to discuss AI opportunities? Feel free to reach out!

Email

abhimanyus1997@gmail.com

LinkedIn

linkedin.com/in/abhimanyus1997

GitHub

github.com/abhimanyus1997
