Hi, I'm Abhimanyu Singh

I build intelligent systems that see, understand, and interact with the world through computer vision and generative AI.

About Me

Education

  • M.Tech - Automation & Robotics Engineering

    Defence Institute of Advanced Technology (DIAT), DRDO

    CGPA: 8.47

  • B.Tech - Mechanical Engineering

    LNM Institute of Information Technology

    CGPA: 8.16

Specialization

I build systems that combine generative AI with computer vision, with a focus on:

  • Multimodal AI Architectures
  • Real-time Inference Systems
  • Robust Vision Pipelines
  • End-to-end Data Systems

Professional Summary

Software Engineer & Data Scientist with expertise in:

  • Designing and implementing data pipelines
  • Developing conversational AI systems
  • Building vision-based analytics
  • Optimizing AI performance on edge devices

Professional Experience

Software Engineer (Gen AI & Data Science)

06/2024 – Present

Alphadroid India Ltd, Noida

  • Designed data pipelines using FastAPI, SQL, and NoSQL for analytics from autonomous robots
  • Developed multilingual conversational AI systems with LangChain and Llama 3.2 on AWS
  • Built a vision-based data acquisition app for Beam Suntory using Flask and OpenCV
  • Implemented facial recognition using MediaPipe and TFLite with efficient vector search
FastAPI, AWS, LangChain, OpenCV, MongoDB

Research Fellow (AI & Computer Vision)

06/2023 – 06/2024

Technology Innovation and Development Foundation, IIT Guwahati

  • Developed automated data annotation pipelines for underwater surveillance
  • Created real-time AI training and inference dashboards using FastAPI and D3.js
  • Optimized end-to-end pipelines with ONNX Runtime and TFLite
  • Presented research at international conferences on computer vision
Computer Vision, FastAPI, ONNX, TFLite, Data Annotation

Technical Skills

Programming Languages

Python, JavaScript, SQL, Dart, C++, HTML, CSS, NoSQL, C, Shell/Bash

Technologies & Frameworks

FastAPI, Flask, LangChain, OpenCV, Transformers, YOLO, scikit-learn, pandas, NumPy, MongoDB, PostgreSQL, AWS EC2/SageMaker, PyTorch, TensorFlow, ONNX Runtime, TFLite, D3.js

Technical Expertise

Computer Vision

  • Object Detection/Recognition
  • Image Enhancement
  • Facial Recognition
  • Underwater Vision

Generative AI

  • Multimodal Models
  • Conversational AI
  • LLM Integration
  • RAG Systems

Data Engineering

  • ETL Pipelines
  • Real-time Analytics
  • Data Quality Assurance
  • Dashboard Creation

Featured Projects

Resume JD Matcher

AI Job Matching Tool | Python, NLTK, LLMs, Streamlit

  • LangChain and Gemini 2.5 for skill extraction
  • TF-IDF scoring for candidate-job relevance
  • Streamlit dashboard with PDF parsing and Plotly visualizations
Python, LLMs, Streamlit, NLP

cryptoTracker

Wallet Dashboard | Flask, LSTM, SQLite, WebSockets

  • Real-time Ethereum wallet dashboard with Flask
  • LSTM models for price trend forecasting
  • WebSocket pipelines for low-latency updates
Flask, LSTM, WebSockets, SQLite

Langchain One

Image-Based Chatbot | GenAI, Tabular Data, OCR

  • Multimodal GenAI chatbot with LangChain and YOLOv8
  • EasyOCR for tabular data extraction
  • Hybrid RAG pipeline with few-shot prompting
LangChain, YOLOv8, OCR, RAG

Publications & Presentations

Article

"The Evolution of Object Detection: YOLO & NAS Revolutionizing Computer Vision"

Published on IndiaAI.gov.in

Object Detection, YOLO, Computer Vision

Conference Presentation

"Subaqueous Crack Detection using Fusion-Based Image Enhancement for Dam Structure"

4th International Conference on River Corridor Research and Management (RCRM) 2024

Underwater Vision, Crack Detection, Image Enhancement

Let's Connect

Have an interesting project or want to discuss AI opportunities? Feel free to reach out!